Main Articles

Fresh thinking for builders

Explore practical articles, tutorials, and field notes on AI, software engineering, machine learning, and modern development.

Artificial Intelligence

I Profiled Bad GEMM Kernels. Shared Memory Wasn’t the First Win.

I broke CUDA matrix multiplication on purpose, fixed one bottleneck at a time, and measured which optimizations actually moved performance. CUDA optimization advice is everywhere: use shared memory, improve occupancy, coalesce memory accesses, unroll loops, reduce synchronization, avoid bank conflicts. All of that advice can be true, but it is not equally important at every stage. […]


Power Your AI & CS Journey

Explore cutting-edge AI, Machine Learning, and Computer Science insights, tutorials, and innovations. From deep learning to systems programming, we’re your gateway to the future of technology.

500+ Articles
100+ Tutorials
50+ Projects

Deep Learning

Neural networks, transformers, and computer vision

Software Engineering

System design, architecture, and full-stack development

LLMs

Fairness, transparency, and responsible innovation

Automation & Robotics

Intelligent systems and autonomous agents

Newsletter

Stay Connected with Lusera

Get the latest AI, Machine Learning, and Computer Science insights, tutorials, and industry news delivered straight to your inbox. Join thousands of tech enthusiasts staying ahead of the curve.

Weekly insights, practical tutorials, and zero spam.

Weekly Articles

Get our latest technical deep-dives and AI/ML research insights every week.

Exclusive Tutorials

Access subscriber-only AI/ML coding tutorials, model-building walkthroughs, and more.

Industry News

Stay informed about the latest trends in artificial intelligence and computer science.

No spam, unsubscribe at any time. Your email is safe with us.

Trusted by readers across AI, ML, and software engineering.