14 Live Sessions · Starts April 27 · Every Day 9 AM IST (Mon–Fri) · 2–3 hrs/day

MasterLLM▌Engineering.

A 4-week intensive workshop taught live by Dr. Raj Dandekar (MIT PhD) with 9 industry experts from Anthropic, NVIDIA, Apple & more.Can't attend live? All sessions are recorded for lifetime access.

See Plans View Curriculum

Core Lectures

Guest Lectures

Hardware Labs

Capstone Projects

Research Publication Track

200+

Engineers Enrolled

Scroll

Curriculum

Two phases.
One complete education.

14 lectures across 4 weeks. Each phase is self-contained — take one or both.

Phase 1 — Foundations & Optimization

Apr 27 – May 10, 2026

7 core lectures

Hardware lab sessions included

Daily Colab labs & visual guides

Live Zoom sessions + recordings

Lifetime access

Phase 2 — Production & Edge Deployment

May 11 – May 25, 2026

7 core lectures

Hardware lab sessions included

Daily Colab labs & visual guides

Live Zoom sessions + recordings

Lifetime access

Frameworks

The tools that power
production inference.

You won't just learn theory — you'll build with the same frameworks used at Anthropic, NVIDIA, and Apple.

vLLM

High-throughput serving with PagedAttention, continuous batching & scheduler internals

SGLang

Fast inference with RadixAttention, structured generation & compiler-driven optimization

Ray Serve

Distributed serving & batch processing at scale — covered by Suman Debnath (AnyScale)

Megatron-LM

Large-scale model parallelism, distributed training & GRPO for online RL

FlashAttention

IO-aware attention kernels — memory-efficient tiling & online softmax

TensorRT-LLM

NVIDIA's optimized inference engine with INT4/INT8 quantization & kernel fusion

Capstone Projects

Build something
you can actually ship.

Each phase culminates in a hands-on capstone project that ties together everything you've learned.

Phase 1 Capstone

Build a Speed-Optimized LLM Inference Server

Combine every optimization from L1–L7 into one deployable pipeline

Take a 7B model from raw weights to a fully optimized inference server — then benchmark every layer of the stack live.

Phase 2 Capstone

OpenClaw-RL: Self-Improving WhatsApp AI Assistant

A full RL pipeline where your everyday messages become training data

Build and deploy a personal AI assistant that improves from every conversation using reinforcement learning — no labeling, no datasets.

Hardware Labs

Don't just learn it.

Run it on real hardware.

Dedicated lab days included in every phase. Every device has a different bottleneck — and you'll benchmark each one live.

Any OS · Lab Day 1

Your Own Laptop / PC

Set up llama.cpp, run your first inference, benchmark tok/s across model sizes on your own machine.

ARM · Lab Day 2

Raspberry Pi 4

Quantization experiments on ARM. Compare INT4 vs INT8 latency. Power-aware inference on a 1.5GHz Quad-core Cortex-A72.

Mobile · Lab Day 3

Android Device

SmolChat-Android live session with Shubham Panchal. Deploy a real LLM on your phone.

NVIDIA CUDA · Demo (To Be Decided)

Jetson Orin Nano

CUDA inference on edge GPU. TensorRT-LLM on Jetson. GPU vs CPU throughput battle. Demo by Dr. Raj — not yet confirmed.

Lab Day 3 (Android · Shubham Panchal) is a confirmed 3-hour workshop. Jetson Orin Nano demo is to be decided. All other labs conducted by Dr. Raj Dandekar.

Hardware Guide

What you need to get started

Most labs run on hardware you already own. Only the Raspberry Pi 4 needs to be purchased separately. The Jetson Orin Nano is optional — Dr. Raj will demo it live if the session is confirmed.

Your Own Laptop / PC

Required

Required · Any OS

Lab Day 1 uses llama.cpp and benchmarking tools that run on any modern laptop or desktop — macOS, Windows, or Linux.

You already have this

Raspberry Pi 4 Model B

Required

Required · 1.5GHz Quad-core, up to 8GB RAM

Used in Lab Day 2 for ARM inference and quantization experiments. Broadcom BCM2711 SoC, dual-band Wi-Fi, Bluetooth 5.0, Gigabit Ethernet, 2x USB 3.0, 2x USB 2.0.

Buy on Amazon.in →

Android Phone

Required

Required · Any Android 10+

Lab Day 3 with Shubham Panchal. Any Android 10+ phone with at least 6GB RAM works. You almost certainly already own one.

You likely already own this

Jetson Orin Nano

Optional

Optional · Demo to be decided

This is a bit expensive, so if you don't have it, it's fine — Dr. Raj will demo this live in the workshop. NVIDIA Ampere GPU, 8GB RAM, 1024-core CUDA. Demo session not yet confirmed.

Buy on Amazon.in →

Hardware labs are included in Phase 1 and Phase 2 — no extra fee for lab access. Hardware devices must be purchased separately. Prices and availability vary by region.

Who is this for?

Built for engineers
who want to go deep.

Engineers transitioning into ML infrastructure or AI engineering

Students targeting roles at Anthropic, NVIDIA, Microsoft, Apple, Amazon

Engineers who want to go beyond using LLMs — to building inference systems

Researchers who need production engineering depth alongside theory

What you will achieve

Leave
interview-ready.

Top-company interview question:

"Design a low-latency, high-throughput LLM inference system handling millions of requests. Walk me through the engineering trade-offs."

Asked at Anthropic, NVIDIA, Microsoft, Meta, Google DeepMind. You will have a complete answer.

Answer end-to-end inference system design questions in any ML interview

Explain low-latency, high-throughput, cost-optimised LLM serving at scale

Deploy real LLMs on your own laptop, Raspberry Pi 4, Android & Jetson Orin Nano

Build industry-level portfolio projects from hands-on hardware lab days

Get career insights directly from engineers at Anthropic, NVIDIA & Microsoft

Guest Lecturers

Learn from engineers
at the frontier.

9 industry experts from Anthropic, NVIDIA, Microsoft, Apple, AnyScale, Red Hat, Amazon and more. Sessions at Anthropic, NVIDIA, Microsoft and Apple include a dedicated career insights segment.

Research Starter Kit

Start your research
with a head start.

Don't start from scratch. Tell us your topic of interest and we'll generate a personalised research roadmap and an initial version of your research paper — delivered asynchronously, so you can hit the ground running from day one.

What's in the kit

Personalised Research Roadmap (PDF)

You tell us your topic of interest. We generate an 8-week structured plan with milestones, deliverables, and acceptance criteria — tailored to your specific research area. Includes literature review scope, data pipeline design, experiment matrix, and manuscript timeline. Delivered asynchronously.

Initial Research Paper Draft

We generate an initial version of your research paper — research questions framed, methodology outlined, related work surveyed, and experiment setup defined. You don't start with a blank page — you start with a 6–8 page scaffold ready to build on. Delivered asynchronously based on your topic.

Curated Paper Reading List

12–15 handpicked papers relevant to your topic with reading order, key takeaways, and connections between papers. Includes a literature matrix template for systematic tracking.

Starter Code Template

A clean, documented codebase scaffold for your research project — data loading, training loop, evaluation pipeline, and experiment config. Ready to run on day one.

Example research topics

Your roadmap is personalised to your background and goals. Here are some topics our students have worked on:

Vision-Language Planning for Autonomous Navigation with Nano-Scale Models

Knowledge Distillation for Edge LLM Deployment on Jetson

Efficient Speculative Decoding for On-Device Inference

RT-DETR for Real-Time BEV Perception in Driving Simulators

KV Cache Compression for Memory-Constrained Serving

Multimodal Inference Pipelines with Sub-200M Parameter Models

Quantization-Aware Training for Mobile LLM Deployment

Cache-Aware Routing for Multi-Model Inference Systems

1:1 Research Mentorship

Personal guidance from
industry & research leaders.

Two months of 1:1 mentorship with Yash Dixit and Dr. Raj Dandekar. One live call every two weeks — where they review your progress, guide your next steps, and help you work towards a publishable research paper. Get both industrial and research exposure from mentors at Apple, McKinsey, and MIT.

Yash Dixit

AI/ML Product Manager · Apple

View LinkedIn Profile

Apple

AI/ML Product Manager — on-device intelligence, ML product strategy, CoreML

McKinsey

Management Consultant — data-driven strategy for Fortune 500 clients

MIT

Graduate research in AI/ML systems and applied machine learning

IIT

Undergraduate engineering — top-tier technical foundation

Dr. Raj Dandekar

Founder, Vizuara AI Labs · MIT PhD

View LinkedIn Profile

MIT

PhD in AI/ML — scientific machine learning, neural ODEs, physics-informed models

Vizuara AI Labs

Founder — building AI education and inference infrastructure

Published Researcher

Multiple publications in top ML venues — NeurIPS, AAAI, Nature

What mentorship includes

1:1 Call Every Two Weeks

4 live sessions over 2 months. Yash and Dr. Raj personally review your progress, give feedback, and set the direction for your next two-week sprint.

Target: Publishable Paper

The goal is a research paper. Your mentors guide you from topic selection through experiments to a publication-ready manuscript.

Every Step Guided

Literature review, experiment design, ablation studies, writing — your mentors walk you through every step of the research process so you never feel stuck.

Industry + Research Exposure

Get career strategy from Yash (Apple, McKinsey) and deep research guidance from Dr. Raj (MIT PhD, published researcher). Both perspectives in one mentorship.

Paper Reading Guidance

Curated reading lists, paper discussion, and feedback on how to extract and apply insights from the literature.

Actionable Next Steps

Every session ends with clear deliverables and deadlines. You always know exactly what to do next.

Enroll

Choose your workshop.

Select what you need. Everything adjusts instantly.

Step 1 — Choose your base

Phase 1

Foundations & Optimization

Apr 27 – May 10, 2026 · 7 lectures

₹45,000

Hardware lab sessions included

Colab labs & visual guides

Live Zoom + recordings

Lifetime access

Phase 2

Production & Edge Deployment

May 11 – May 25, 2026 · 7 lectures

₹55,000

Hardware lab sessions included

Colab labs & visual guides

Live Zoom + recordings

Lifetime access

Take both phases for ₹1,00,000 combined. Want the best deal? Get the entire bundle and save 15%.

Step 2 — Add-ons (optional)

Guest Speaker Pass

All 9 sessions — Anthropic, NVIDIA, Microsoft, Apple, AnyScale, Red Hat, Amazon and more

+₹50,000

Research Roadmap + Code Starter

Personalised roadmap PDF + starter code template for your research project

+₹15,000

1:1 Research Mentorship — 2 Months

with Yash Dixit, AI/ML Product Manager at Apple · 4 bi-weekly sessions

+₹70,000

Your Workshop

Select a phase to get started.

Total—

Select a phase first

Add Phase 1 or Phase 2 to continue.

EMI available · No refunds

Dr. Raj Dandekar

MIT PhD · Vizuara AI Labs

Your Instructor

Dr. Raj Dandekar

MIT PhD · Co-founder & Director, Vizuara AI Labs

Dr. Raj holds a PhD from MIT and is the co-founder and director of Vizuara AI Labs. He has built a 50,000+ subscriber YouTube channel dedicated to teaching LLMs from first principles, and has taught 200+ engineers across previous workshop cohorts.

His teaching philosophy: visual intuition first, mathematical rigour second, hands-on implementation always. Every concept is taught from scratch — no hand-waving.

All 15 core lectures personally delivered

Visual guides + Colab notebooks for every topic

MIT PhD — rigorous technical foundation

50K+ YouTube subscribers · 200+ engineers taught

FAQ

Common questions.

About the Workshop

Research Starter Kit

Research Mentorship with Yash Dixit

Guest Speakers

MasterLLM▌Engineering.

Two phases.One complete education.

The tools that powerproduction inference.

Build somethingyou can actually ship.

Build a Speed-Optimized LLM Inference Server

OpenClaw-RL: Self-Improving WhatsApp AI Assistant

Don't just learn it.

Run it on real hardware.

Your Own Laptop / PC

Raspberry Pi 4

Android Device

Jetson Orin Nano

What you need to get started

Built for engineerswho want to go deep.

Leaveinterview-ready.

Learn from engineersat the frontier.

Start your researchwith a head start.

What's in the kit

Example research topics

Personal guidance fromindustry & research leaders.

Yash Dixit

Dr. Raj Dandekar

What mentorship includes

Choose your workshop.

Foundations & Optimization

Production & Edge Deployment

Dr. Raj Dandekar

Common questions.

Two phases.
One complete education.

The tools that power
production inference.

Build something
you can actually ship.

Built for engineers
who want to go deep.

Leave
interview-ready.

Learn from engineers
at the frontier.

Start your research
with a head start.

Personal guidance from
industry & research leaders.