Understand Internal Architecture
Curtain Raiser Webinar
Free Registration
Master ML Fundamentals for Model Training
Cohort Starts in April
Understand Internal Architecture
Curtain Raiser Webinar
Free Registration
Master ML Fundamentals for Model Training
Cohort Starts in April
Understand Internal Architecture
Curtain Raiser Webinar
Free Registration
Master ML Fundamentals for Model Training
Cohort Starts in April
Most ComprehensiveFor Engineers ready to go Deep

LLMs, Agentic AI & Deep Learning for Engineers

Who want to go from deep learning first principles to building production-grade AI systems.

20,000+Engineers taught
20 WeeksBecome an AI-ML Engineer
2Successful Past Cohorts
What will you learn inside the cohort:
Deep Neural Network
Deep Neural Network
Optimization methods
Optimization methods
Transformer internals
Transformer internals
Reinforcement Learning
Reinforcement Learning
ChatGPT Lineage
ChatGPT Lineage
Deepseek Architecture
Deepseek Architecture
Huggingface
Huggingface
Tool Calling in Action
Tool Calling in Action
RAG & Vector Search
RAG & Vector Search
Eval Frameworks
Eval Frameworks
Model Context Protocol
Model Context Protocol
Finetuning and Distillation
Finetuning and Distillation
Deep Neural Network
Deep Neural Network
Optimization methods
Optimization methods
Transformer internals
Transformer internals
Reinforcement Learning
Reinforcement Learning
ChatGPT Lineage
ChatGPT Lineage
Deepseek Architecture
Deepseek Architecture
Huggingface
Huggingface
Tool Calling in Action
Tool Calling in Action
RAG & Vector Search
RAG & Vector Search
Eval Frameworks
Eval Frameworks
Model Context Protocol
Model Context Protocol
Finetuning and Distillation
Finetuning and Distillation
Deep Neural Network
Deep Neural Network
Optimization methods
Optimization methods
Transformer internals
Transformer internals
Reinforcement Learning
Reinforcement Learning
ChatGPT Lineage
ChatGPT Lineage
Deepseek Architecture
Deepseek Architecture
Huggingface
Huggingface
Tool Calling in Action
Tool Calling in Action
RAG & Vector Search
RAG & Vector Search
Eval Frameworks
Eval Frameworks
Model Context Protocol
Model Context Protocol
Finetuning and Distillation
Finetuning and Distillation
Post cohort completion
Deep Neural Network
Deep Neural Network
Optimization methods
Optimization methods
Transformer internals
Transformer internals
Reinforcement Learning
Reinforcement Learning
ChatGPT Lineage
ChatGPT Lineage
Deepseek Architecture
Deepseek Architecture
Huggingface
Huggingface
Tool Calling in Action
Tool Calling in Action
RAG & Vector Search
RAG & Vector Search
Eval Frameworks
Eval Frameworks
Model Context Protocol
Model Context Protocol
Finetuning and Distillation
Finetuning and Distillation
Deep Neural Network
Deep Neural Network
Optimization methods
Optimization methods
Transformer internals
Transformer internals
Reinforcement Learning
Reinforcement Learning
ChatGPT Lineage
ChatGPT Lineage
Deepseek Architecture
Deepseek Architecture
Huggingface
Huggingface
Tool Calling in Action
Tool Calling in Action
RAG & Vector Search
RAG & Vector Search
Eval Frameworks
Eval Frameworks
Model Context Protocol
Model Context Protocol
Finetuning and Distillation
Finetuning and Distillation
Deep Neural Network
Deep Neural Network
Optimization methods
Optimization methods
Transformer internals
Transformer internals
Reinforcement Learning
Reinforcement Learning
ChatGPT Lineage
ChatGPT Lineage
Deepseek Architecture
Deepseek Architecture
Huggingface
Huggingface
Tool Calling in Action
Tool Calling in Action
RAG & Vector Search
RAG & Vector Search
Eval Frameworks
Eval Frameworks
Model Context Protocol
Model Context Protocol
Finetuning and Distillation
Finetuning and Distillation

Companies Hiring AI-ML Engineers

Your Transformation Path

From First Principles to Production Grade AI Systems, 2 milestones 1 Journey

Software Engineers

0–10+ years of experience

Whether you are looking to move from a service company to a product company or AI lab, exploring ML for the first time, or a lead who needs architectural depth to make better decisions.

Technical Professionals

Data Scientists, Data Engineers, Quant ML, Researchers & Others

Who are comfortable with programming and want to go deeper into the mathematics and architecture, building Agentic AI systems powering next generation of softwares.

Starting

Learning Journey

Track 1 - ML Engineer

Deep Learning from First Principles

Reinforcement LearningNeural NetworksCNNs & RNNsTraining at ScaleDeepSeek Architecture
tick

Outcome on completing Track 1

ML Engineer

Build, train & evaluate production ML models from scratch

Companies that hire ML Engineers

company logo
company logo
company logo
company logo
company logo
company logo
company logo
company logo

Track 2 - Agentic AI

Building Production-Grade AI Systems

LLMs & PromptingRAG SystemAgent ArchitecturesTools use & planningMulti-Agent SystemDeployment & Ops
tick

Outcome on completing Track 2

AI-ML Engineer

Design & ship full stack Agentic AI system end-to-end

Companies that hire AI-ML Engineers

company logo
company logo
company logo
company logo
company logo
company logo
company logo
company logo

Ecosystem Designed for Learning

Move from just calling APIs to understanding the architectures behind them.

Deep Learning by Dr. Kshitiz Verma
2.5 MONTHS · LIVE · APRIL 2026
📅 MON-THURS, SUN
9:00PM–11:00PM IST
Hands-On Projects You'll Create
Linear Regression — Three Ways

Linear Regression — Three Ways

Optimizer Benchmarking Lab

Optimizer Benchmarking Lab

MNIST Digit Classifier

MNIST Digit Classifier

Large-Scale Image Classification

Large-Scale Image Classification

Object Detection System

Object Detection System

Face Recognition System

Face Recognition System

Image Segmentation

Image Segmentation

Sentiment Analysis

Sentiment Analysis

Translation & Summarisation

Translation & Summarisation

GPT, Built From Scratch

GPT, Built From Scratch

Linear regression for one/two variables
Cost/loss/error function: Mean squared and cross entropy
Back propagation & Gradient descent
Vectorization & Generalized linear models
Regularization - L1 and L2
Advanced optimizers - Stochastic gradient descent, Momentum, RMSProp, Adam.
Introduction to neural networks
Computing the number of parameters in a neural network
Multiclass classification
Backpropagation in neural networks
Vanishing and exploding gradients
Computer vision overview
Image classification
CNN architectures - AlexNet, VGG, ResNet, etc
Object detection (YOLO)
Image segmentation
Face Recognition
History of NLP
Wordvectors - word2vec, GloVe, Fasttext
Recurrent Neural Networks
Vanishing and exploding gradients in RNNs
GRUs, LSTMs
Language modeling, translation, etc
Attention mechanism
Transformer architecture
GPT architecture
Scaling laws
Pre-training and post training RLHF
Introduction to reinforcement learning
PPO, DPO, GRPO
The understanding of inference. Latency and memory.
Detailed dive in KV cache
Inference optimization. Model compression
Evaluation of LLMs
EU AI Act
India DPDP Act
Responsible AI guidelines.

Capstone Projects You'll Build

Apply everything you've learned to build production-grade AI systems.

Build an LLM from Scratch
CAPSTONE PROJECT 1

Build an LLM from Scratch

Understand transformers deeply by implementing tokenization, attention, and training loops from the ground up.

Transformers
Self-Attention
PyTorch
Deep Learning
Multi-Agent Personal Productivity Assistant
CAPSTONE PROJECT 2

Multi-Agent Personal Productivity Assistant

An intelligent system of AI agents that plans, schedules, executes tasks, and integrates with tools like calendar, email, and workspace apps.

Multi-Agent Systems
Tool Calling
Automation

Meet the Mentors

Learn from the top 0.1% of tech mentors

Program Curators & Instructors

LinkedIn
Kshitiz Verma

Kshitiz Verma

ML INSTRUCTOR
PROGRAM CURATOR
IIT KANPUR CSE
MS+PHD UCSM MADRID
LinkedIn
Vivekanand Vivek

Vivekanand Vivek

AGENTIC AI INSTRUCTOR
PROGRAM CURATOR
IIT BHU CSE
EX-AMAZON SDE

Advisors & Guest Speakers

LinkedIn
Harsh Gugale

Harsh Gugale

ARMUT AUSTINIIT BHU
LinkedIn
Deepak Gupta

Deepak Gupta

FOUND. ENGINEER DEVREVNIT KKR
LinkedIn
Anoop Garg

Anoop Garg

UNIVERSITY OF WARWICKIIT BHU
LinkedIn
Biren Goyal

Biren Goyal

ATLASSIANNIT KURUKSHETRA (KKR)

Impact Stories from Past Cohorts

10,000+

Engineers in our alumni cohorts

40%

Students transitioned into AI Engineering post completion

8 weeks

Average time to ship a working AI Project from scratch

Krishan Kumar Pareek

Krishan Kumar Pareek

Senior Analyst @NAB

MCP was my favorite, it showed how simple yet powerful automation can be. It boosted my confidence that complex systems are achievable with the right setup. Prompt engineering was eye-opening; I learned strategies like examples, step-by-step, and “think” prompts. Practical tips, like splitting chats to manage context and cost, changed my AI habits. Breakout rooms and live interactive sessions enriched the learning experience.

Tushar Mahajan

Tushar Mahajan

Software Engineer III @ Walmart

I enrolled in this program to learn how to apply AI models in real projects. While I had theory, I lacked confidence in execution. The blend of AI and software engineering made it practical and doable. Thanks to this, I built an OnCall Agent at production scale, something I never imagined before!

Gagan Agrawal

Gagan Agrawal

Engineering @ CommerceIQ

I joined the program for a roadmap on learning the fundamentals, as keeping up with new AI tools becomes impossible. I loved the concept of breakout rooms, hands-on opportunities while sticking to fundamentals. I now feel more confident in experimenting and creating something using AI, while I still keep learning new things

Tashvik Shrivastava

Tashvik Shrivastava

SDEINFRA MARKETIIT MADRAS

My intention behind joining the program was to strengthen my AI foundations and learn structured ways to build real-world systems. Clear sessions with Vivek Sir and hands-on projects made theory practical. The focus on engineering principles and best practices gave me clarity, skills, and confidence.

Chetan Verma

Chetan Verma

AI INNOVATION SPECIALIST @ TRILOGY

Thanks to the remarkable course of AI Engineering, I am now skilled at navigating complex architectural challenges, evaluating multiple solutions with precision, and making informed decisions by weighing their respective advantages and disadvantages.

Krishan Kumar Pareek

Krishan Kumar Pareek

Senior Analyst @NAB

MCP was my favorite, it showed how simple yet powerful automation can be. It boosted my confidence that complex systems are achievable with the right setup. Prompt engineering was eye-opening; I learned strategies like examples, step-by-step, and “think” prompts. Practical tips, like splitting chats to manage context and cost, changed my AI habits. Breakout rooms and live interactive sessions enriched the learning experience.

Tushar Mahajan

Tushar Mahajan

Software Engineer III @ Walmart

I enrolled in this program to learn how to apply AI models in real projects. While I had theory, I lacked confidence in execution. The blend of AI and software engineering made it practical and doable. Thanks to this, I built an OnCall Agent at production scale, something I never imagined before!

Gagan Agrawal

Gagan Agrawal

Engineering @ CommerceIQ

I joined the program for a roadmap on learning the fundamentals, as keeping up with new AI tools becomes impossible. I loved the concept of breakout rooms, hands-on opportunities while sticking to fundamentals. I now feel more confident in experimenting and creating something using AI, while I still keep learning new things

Tashvik Shrivastava

Tashvik Shrivastava

SDEINFRA MARKETIIT MADRAS

My intention behind joining the program was to strengthen my AI foundations and learn structured ways to build real-world systems. Clear sessions with Vivek Sir and hands-on projects made theory practical. The focus on engineering principles and best practices gave me clarity, skills, and confidence.

Chetan Verma

Chetan Verma

AI INNOVATION SPECIALIST @ TRILOGY

Thanks to the remarkable course of AI Engineering, I am now skilled at navigating complex architectural challenges, evaluating multiple solutions with precision, and making informed decisions by weighing their respective advantages and disadvantages.

Krishan Kumar Pareek

Krishan Kumar Pareek

Senior Analyst @NAB

MCP was my favorite, it showed how simple yet powerful automation can be. It boosted my confidence that complex systems are achievable with the right setup. Prompt engineering was eye-opening; I learned strategies like examples, step-by-step, and “think” prompts. Practical tips, like splitting chats to manage context and cost, changed my AI habits. Breakout rooms and live interactive sessions enriched the learning experience.

Tushar Mahajan

Tushar Mahajan

Software Engineer III @ Walmart

I enrolled in this program to learn how to apply AI models in real projects. While I had theory, I lacked confidence in execution. The blend of AI and software engineering made it practical and doable. Thanks to this, I built an OnCall Agent at production scale, something I never imagined before!

Gagan Agrawal

Gagan Agrawal

Engineering @ CommerceIQ

I joined the program for a roadmap on learning the fundamentals, as keeping up with new AI tools becomes impossible. I loved the concept of breakout rooms, hands-on opportunities while sticking to fundamentals. I now feel more confident in experimenting and creating something using AI, while I still keep learning new things

Tashvik Shrivastava

Tashvik Shrivastava

SDEINFRA MARKETIIT MADRAS

My intention behind joining the program was to strengthen my AI foundations and learn structured ways to build real-world systems. Clear sessions with Vivek Sir and hands-on projects made theory practical. The focus on engineering principles and best practices gave me clarity, skills, and confidence.

Chetan Verma

Chetan Verma

AI INNOVATION SPECIALIST @ TRILOGY

Thanks to the remarkable course of AI Engineering, I am now skilled at navigating complex architectural challenges, evaluating multiple solutions with precision, and making informed decisions by weighing their respective advantages and disadvantages.

Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story
Success Story

Why Programming Pathshala?

Most Programs optimise for breadth, placement numbers and surface level tools; we optimise for depth, real outcomes & first principles

Other Programs

"Learn to build AI Applications using pre-built models and popular frameworks, learn AI-ML the traditional way."

Programming Pathshala

"We dive deep into that section of AI which truly matters at this point of time, and is powering almost every model out there. We don't scratch the surface — rather we dissect the architecture and build them on our own."

Dimension

AI

ML

PPA

Intricate Maths for ML

partial

CNNs, RNNs, Attention mechanisms

partial

Transformer Architecture Internals

partial

GPT lineage; GPT 1 - InstructGPT

Scaling Laws & RLHF

AI Agents- ReAct, Tool Use, Memory

MCP - Model context protocol

RAG Pipelines & Multi Agent Systems

partial

Frameworks - Langgraph, Langchain

Finetuning & Distillation

Duration

4-8 weeks

~12-14 months

5 Months

In today's AI economy, calling APIs might get you 2X, but understanding the architecture is what gets you 10X.

Why going deep in AI-ML frameworks matter?

DeepSeek R1 rivaling GPT-4 was built for a fraction of cost while the industry spent a fortune.

671B

Total Parameters

Full model size- comparable to GPT-4 class models

37B

Active per forward pass

only the relevant subset activates per query- this is mixture of experts

Deepseek R1 didn't beat the Industry on compute, but on internal architecture. With Mixture of Experts and improvements in RLHF, it made sure only the relevant parts of the model activate per query.

" They understood transformers internally, finding efficiencies others had missed "

DeepSeek model reasoning example

But I'm just calling the API…

Until you aren't. Here's why every serious engineer ends up needing what's inside this cohort.

WHEN YOU REACH FOR CLASSICAL ML

icon
icon
icon

KNN, lean models

Latency
~1ms
Cost/query
~$0
Dependency
None
VS

WHEN YOU REACH FOR AN LLM

icon
icon
icon

GPT-4, Claude, Gemini

Latency
~800ms
Cost/query
$0.01+
Dependency
High

WALL 01

icon

Fine-tuning breaks without internals

"Why is the model still drifting after fine-tuning"

Transformer internals

WALL 02

icon

LLM bill won't clear finance

"10M queries/day — we can't afford this."

Distillation · LLMOps

WALL 03

icon

Agent works in staging, breaks in production

"It's looping. Tool calls failing. I don't know why."

Agents · MCP · RAG

ML TRACK

tick

Fine-tune with confidence, not guesswork

tick

Distil models to cut cost at scale

tick

Read scaling laws, not just benchmarks

AGENTIC AI TRACK

tick

Debug agents that break in production

tick

Build RAG pipelines that don't hallucinate

tick

Ship multi-agent systems that scale

Career Opportunities

AI native companies don’t hire engineers who wrap APIs. They hire engineers who can reason about what’s inside them

ML ENGINEER BASED ROLES

tick

ML Engineer

tick

ML Scientist

tick

Predictive Modeler

tick

Quantitative Analyst

tick

Computer Vision Engineer

AI ENGINEER / RESEARCHER

tick

AI Engineer / Researcher

tick

NLP Engineer

tick

AI Research Scientist

tick

Chatbot Developer

tick

Data Scientist

Salary gap — traditional SDE vs ML/ AI engineer, India 2025

Across three career stages, mid-point of reported ranges

₹6 LPA

0-2

₹13 LPA

3-6

₹28 LPA

7+

Traditional SDE (Experience wise)

₹9 LPA

0-2

₹22 LPA

3-6

₹50+ LPA

7+

AI-ML Engineer (Experience wise)

Skills you will develop

star

Computer Vision

star

Deep Learning

star

Generative AI

star

Large Language Models

star

Machine Learning Algorithms

star

Model Training

star

Reinforcement Learning

star

Natural Language Processing

Early Bird Pricing Active

Limited seats. The price goes up when the timer hits zero.

EARLY BIRD ACTIVE
₹1,19,999₹59,999
arrow50% Early Bird Discount

EMI Options available — starting at ₹2,109/month

Applied

Early Bird Ends In

07
DAYS
:
14
HRS
:
28
MIN
:
35
SEC
Course Features

5 Live Classes Every Week

Evening sessions 9–11PM IST, designed for working professionals

1 Year Access to Recordings

10,000+ Alumni Community

Access to a network of engineers who have transitioned to AI Engineering

Course Outcomes
1.

Crack AI/ML Engineering Interviews — with architectural depth that stands out in technical rounds

2.

Build an AI Product or Startup Independently — from model selection to deployment, without relying on off-the-shelf wrappers

3.

Deep Insight into Model Architecture — understand what's actually happening inside the system, not just what it returns

Cut your API bills. Build engineers who think in systems.

Teams that understand model internals make better architecture calls, ship more resilient AI, and reduce vendor dependency.

Product & SaaS Companies
IT Services & Consulting
Fintech/Healthtech & Data Heavy Orgs
AI-First Startups
E-Commerce & Operations Teams
Enterprise L&D & Engineering Leaders
College Professors & Faculties

Need a fully custom training program designed around your team's existing stack, or want to run this as an internal bootcamp with your branding? We'll design it from scratch.

Webinar

FAQs

Common Questions

Is this a beginner-friendly course, or do I need prior ML knowledge ?

No prior ML knowledge needed. We start from the fundamentals and go all the way to building a GPT from scratch. Knowing Python fundamentals is recommended. As long as you're comfortable with programming and basic maths, you're good to go.

Do I need a math background to enroll?

Not a strong one. We cover everything you need, backpropagation, gradient descent, linear algebra, from scratch. High school-level maths is sufficient to get started.

Why is learning ML even relevant for a software engineer right now?

AI is no longer a separate discipline; it's becoming part of how software is built. Engineers who understand what's inside models will make better architecture decisions, debug faster, and build systems that are resilient.

What's the difference between an AI Engineer, ML Engineer, and a traditional Software Engineer?

A traditional software engineer builds systems and products. An AI Engineer integrates and deploys models, often working at the application layer. An ML Engineer goes the deepest, understanding model internals well enough to train, optimise, and improve AI systems from the ground up.

What will I be able to build by the end of the program?

11 end-to-end projects, including a GPT built from scratch in PyTorch, a full RAG pipeline, and an AI Agent. No API calls, no wrappers, everything built from fundamentals.

What kind of roles can I target post completion of the cohort?

The program prepares you for a wide range of high-demand roles, ML Engineer, AI Engineer, Research Engineer, Data Scientist, Data Engineer, MLOps Engineer, LLM Engineer, and senior SDE roles at companies actively building AI systems.

When does Cohort 3 start, and how long does it run?

Cohort 3 starts in April 2026 and runs for 5 months. Exact dates are shared post-enrollment.

What are the class timings and how many sessions per week?

5 live sessions per week, Monday to Thursday and Sunday, 9–11 PM IST. All sessions are recorded and accessible for 1 year.

About us

Contact Us

Social Links

social-icon
social-icon
social-icon
social-icon

All Right Reserved with @Programming Pathshala