VIT Workshop — March 11, 2026

Beyond ChatGPT

Build an AI That Works For You

PG
Prateek Gupta
VIT '16  |  EIR @ noon  |  Ex-CEO CustomerGlu

12 Months That Changed Everything

The AI landscape moved faster than anyone predicted

Feb '25
Claude 3.5 Sonnet
coding crown
Jun '25
GPT-5
1M context
Sep '25
Open-source
catches up
Nov '25
Claude
Opus 4
Feb '26
Grok 4.20
multi-agent
Mar '26
GPT-5.4
unified
282
models on leaderboard
12
major models in 1 week
(March 2026)
182
open-weight models
"More changed in 12 months than in the previous 12 years."

Three Eras of AI

Prediction

2012 — 2020

ML, classification, recommendations, search ranking

Generation

2020 — 2024

ChatGPT, content creation, co-pilots, text & image gen

Agency

2025 →

Memory, tools, initiative, autonomous work, multi-agent

WE ARE HERE ▶
AI isn't just answering questions anymore. It's doing work.

ChatGPT vs AI Agent

ChatGPTAI Agent
MemoryResets every chatRemembers everything
ToolsCan only generate textEmail, calendar, code, browser
InitiativeWaits for youWorks proactively
ContextStarts from zeroKnows your history
ActionsSuggests what to doActually DOES it
👤 User AGENT LLM Memory Tools Think → Act → Learn → Repeat Email Code Search Calendar Result → Memory
"ChatGPT is a conversation. An agent is a collaboration."

Vibe Coding → The Early Singularity

Vibe Coding coined by Andrej Karpathy

  • Building software by talking to AI — not typing code
  • Karpathy now says we're in the "Early Singularity" era (March 2026)
  • "Programming is unrecognizable" — Business Insider

Autoresearch Mar 7, 2026

  • 630-line Python script — AI runs autonomous ML experiments overnight
  • Human writes prompt.md, AI edits train.py
  • Runs hundreds of experiments on a single GPU
  • Entire codebase fits in one LLM context window
"AI isn't just writing code. It's writing the AI that writes code."
Karpathy: X/Twitter · Autoresearch: MarkTechPost, Mar 7 2026

Context Engineering > Prompt Engineering

Prompt Engineering

Crafting the perfect question

Optimizing words, few-shot examples, chain-of-thought tricks

Context Engineering

Designing what the AI knows before it thinks

9,649 experiments proved: context matters more than prompts

The Evolution

1. RAG — retrieve docs
2. Agentic RAG — auto search & refine
3. Knowledge Graphs — relationships
4. Observational Memory — 10× cheaper than RAG
"The skill isn't asking better questions. It's building better context."

Computer Use — AI Sees Your Screen

  • GPT-5.4 + Claude: can literally see your screen and control your computer
  • OSWorld benchmark: navigate real desktop environments
  • WebArena: interact with real websites, fill forms, submit data
CAPTURE
Screenshot pixels
VISION
Understand UI
DECIDE
Plan next step
ACT
Click, type, scroll
DONE
Task complete
"Your AI can now do what a human intern does — click, type, navigate."

The Numbers

38%
startups solo-founded
(was 22%)
84%
developers using
AI tools daily
$52.6B
AI agent market
by 2030
$2B
Cursor ARR
(doubled in 3 months)
41%
of all code
now AI-generated
$1T+
wiped in
SaaSpocalypse
Sources: Bloomberg, MarketsAndMarkets, index.dev (2026)
Part 2: Real Stories

How I Operate with Zero Employees

CustomerGlu

  • Gamification SaaS — 250M+ devices, 3 regions
  • Raised $1.9M (Better Capital + Techstars)
  • YourStory Tech50 2021

The Transition

  • Software dev → Claude Code, Codex
  • Deployment → automated pipelines, agent-monitored
  • Client ops → agent drafts emails, preps meetings
"For the last 6 months, I've managed development, deployment, and client operations entirely through AI agents — operating with 0 employees."

Not zero employees from day one. But the transition to zero happened — and it works.

What My Agent Does Daily

This Morning (real examples)

  • Scanned 12 emails, flagged 2 urgent deal replies
  • Prepped briefing docs for 3 meetings automatically
  • Added 10 new AI signals to signals.prateek.sh
  • Built this slide deck (yes, this one you're watching)

On-Demand (yesterday)

  • Researched 5 LinkedIn profiles before a sales call
  • Deployed code updates to 3 cloud regions in 4 minutes
  • Drafted & sent 8 personalized outreach emails
  • Negotiated meeting times across 3 timezones
One agent. Zero busywork. 24/7 availability.

Building at noon — AI-Powered Hiring

noon: Middle East's largest e-commerce — $1.5B revenue, 4M DAU

Proto: AI hiring pipeline built by a 4-person team in 2 weeks

CV Upload
AI Assessment
Build Challenge
Auto-Evaluation
  • 3,000 candidates processed — AI evaluates each one
"Small teams + AI agents = impossibly powerful."

The SaaSpocalypse

$285B
vanished in 24 hours
$1T+
wiped in a week
  • Forrester: "SaaS as we know it is dead"
  • Per-seat pricing dying — AI replaces the users of software
  • The software you're learning to build may not exist in 2 years
"But the people who build WITH AI will be the most valuable people in any room."
Part 3: The Agentic Ecosystem

Agent Frameworks

Which one for what?

OpenClaw Personal

Personal AI agent — runs on your machine, connects to everything

LangGraph Production

Stateful workflows, complex multi-step pipelines

CrewAI Teams

Multi-agent teams with role-based collaboration

Google ADK Cloud

MCP-native agent development kit

OpenAI SDK OpenAI

Agents SDK — tool-use native, async

Mastra Memory

Observational memory — 10× cheaper than RAG

Coding Agents — The New IDE

Cursor + Automations

$2B ARR · Event-driven autonomous coding

Claude Code

Terminal-native · Thinks before it codes

Codex (OpenAI)

Cloud sandboxed · Async coding agent

Cline / Aider / Roo Code

Open-source alternatives · Self-hostable

41%
of all code now written by AI
index.dev, 2026
"The IDE is dead. Long live the agent."
Cursor ARR: Bloomberg, Mar 2 2026 · Code stats: index.dev

Running Models Locally

Ollama — no data center needed

What you needWhat you can runCost
8GB RAM (any laptop)Qwen 3.5 35B-A3B (MoE)Free
16GB RAM (MacBook)14-27B parameter modelsFree
64GB Mac Mini70B models₹1.8L
128GB Mac Studio122B models₹3.5L
256GB Mac Studio UltraKimi K2.5 (1T params)₹7L+
# Install Ollama and run in 30 seconds
curl -fsSL https://ollama.com/install.sh | sh
ollama pull qwen3.5:35b-a3b
"You don't need a data center. You need Ollama + a laptop."

Memory Architectures for Agents

No Memory
ChatGPT style
Forgets everything
RAG
Retrieve docs
Knows your files
Agentic RAG
Auto search
& refine
Knowledge Graphs
Understands
relationships
Observational
Remembers what
it's seen · 10× cheaper

My Agent's 5-Layer Memory

Working (what I'm doing now) → Episodic (daily logs) → Entity (people tracking) → Causal (why decisions were made) → Semantic (curated knowledge)

"Your agent gets smarter the longer you use it. Mine has 6 months of context."

Coding Is Commoditized — What's Next?

The Old World

  • Coding skill = career moat
  • Memorize syntax & APIs
  • Build everything from scratch
  • 10 years to become "senior"

The New World

  • Coding is table stakes (like typing)
  • AI writes 41% of all code — and rising fast
  • Value = directing AI
  • Context engineering, system design, product thinking

AI Operator

Manages fleets of AI agents to run business operations

AI-First Builder

Builds products by orchestrating AI, not writing every line

"The world doesn't need more coders. It needs more AI operators and AI-first builders."

Building Agentic Workflows

A systematic approach to AI automation

1

Do it manually first

Understand the full process end-to-end before automating. If you can't do it, you can't automate it.

2

Map the cycle

Identify repetitive, cyclic tasks that follow patterns. AI thrives on repetition and pattern.

3

Identify decision points

What context does each decision need? Missing context = hallucination. Map the inputs.

4

Design the prompt chain

Break into discrete prompt steps, each with clear input/output. Small steps > one giant prompt.

5

Build evaluation loops

How do you know it worked? Evals > vibes. Automate your quality checks too.

Choosing the Right Model

Not every task needs GPT-5

Task Model Tier Why
Classification, routing Small Haiku · 4o-mini Fast, cheap, good enough
Summarization, extraction Medium Sonnet · 4o Balance of quality and cost
Complex reasoning, coding Large Opus · o3 / GPT-5 Worth the premium
Creative, open-ended Large + high temp Needs breadth and surprise
3-4
models in a smart toolkit
40×
cheaper: Haiku vs Opus
90%
saved with prompt caching
"The best engineers use 3-4 models. Match model to task."

The Intelligence Staircase

Where we are and where we're going — verified claims from lab leaders

ANI

Narrow AI WE ARE HERE

Superhuman at specific tasks. GPT-5.4, Claude Opus, AlphaFold. Beats humans at coding, chess, protein folding — but can't do everything.

AGI

Artificial General Intelligence

Human-level across ALL cognitive tasks. Can learn anything a human can. No consensus on when — estimates range from 2027 to 2040+.

ASI

Artificial Superintelligence

Surpasses all human intelligence combined. Self-improving. The "intelligence explosion" scenario. Timeline: unknown — possibly years after AGI, possibly days.

"We're building narrow AI that already outperforms humans at tasks. The question is how fast it generalizes."

Beyond LLMs — World Models

Yann LeCun leaves Meta

Turing Award winner, ex-Meta AI Chief — launched AMI Labs

Raised $1.03B (WIRED, TechCrunch — Mar 9, 2026)

JEPA Architecture

Joint-Embedding Predictive Architecture — learns from video & sensor data, not just text

LLMs vs World Models

  • LLMs predict the next word
  • World models predict what happens next in the physical world
  • LLMs hallucinate because they only know language
  • World models understand physics, causality, space

"Meta had to catch up with the industry on LLMs... which is not my interest" — LeCun to WIRED

"LLMs are not the end. They might just be the beginning."

What This Means For You

Whether AGI arrives in 2027 or 2035 — the implications are the same

1

Learn to work WITH AI, not against it

84% of developers now use AI tools daily. 41% of all code is AI-generated. Those who don't adapt will fall behind — not in years, in months.

2

Invest in judgment, not just knowledge

AI can look up anything. It can't decide what matters. System design, product thinking, and taste become the differentiators.

3

Build your agent NOW — compound advantage

An agent that knows you for 6 months is 10x more useful than a fresh one. Start today, and by graduation, you have a serious advantage.

4

The $710B bet says this isn't hype

OpenAI spending $115B through 2029. Anthropic raising $30B. Google, Meta, Microsoft all-in.

"The future belongs to people who can direct intelligence — human or artificial."

Compute Is the New Electricity

From luxury to utility in 10 years

Then (Electricity)

1900s: Own Power Plants

Every factory built its own generator

1920s: Power Grid

Standardized → plug in, pay per kWh

Now (Compute)

2020s: Own GPU Clusters

Companies run massive on-prem infrastructure

2026+: Serverless Inference

Pay per token, zero infrastructure

Three Predictions

1. Inference 10× cheaper by 2028

Moore's Law for AI — costs halving every 12 months

2. Compute credits = electricity bills

Operational expense, not capital investment

3. Value moves UP the stack

Infrastructure → Apps → Workflows → Outcomes

"Don't invest in GPUs. Invest in knowing what to DO with compute."
Part 4: Live Demos

Demo Time

My Agent on Telegram
Email, calendar, memory, proactive alerts

🌐

signals.prateek.sh
Website maintained daily by AI

Vibe Coding Live
Build something by just talking

Local Model
Ollama running Qwen on a laptop

Demo: Agent in Action

What's on my calendar today?
AI
You have 3 meetings: 10 AM standup, 2 PM VIT workshop, 4 PM Capillary call. I've prepped briefing docs for each. ✅
Summarize my last 3 emails
AI
1. Capillary — deal meeting confirmed Thu 11 AM
2. noon — Proto V2 shipped, Faraz wants demo
3. Newsletter — skip, nothing urgent
"This is not a script. This is my actual daily workflow."
Part 5: Hands-On

Your Turn — Free Setup Options

Gemini Free

  • aistudio.google.com
  • Click "Get API Key"
  • No credit card needed
  • 1,500 requests/day

Ollama (Local)

  • curl install — 1 command
  • ollama pull qwen3:8b
  • No internet needed
  • 100% private & free

OpenRouter

  • openrouter.ai
  • Sign in with Google
  • Access free models
  • Multiple providers

Pick ONE and set it up in the next 3 minutes

Install in 3 Minutes

# Install OpenClaw
curl -fsSL https://get.openclaw.ai | bash

# Start
openclaw init
# → Choose: Gemini (free) or Ollama (local)

# Connect Telegram
# Scan QR code → your agent is live

Raise your hand if stuck

Make It Yours

SOUL.md — personality

You are Jarvis, a sharp AI
assistant for a VIT CS student.
Be concise. Be proactive.
Remember everything.

MEMORY.md — context

- Name: [Your Name]
- Year: 3rd Year B.Tech CS
- Interests: ML, web dev
- Targets: Google, Microsoft
"This is what makes YOUR agent different from everyone else's."

What to Build

Research Copilot

Feed it your thesis topic → daily paper summaries, auto-built literature review

Placement Hunter

Auto-research companies before interviews, mock Q&A with real Glassdoor questions

Inbox Zero Agent

Reads every email, drafts replies, follows up — you just approve & send

Code Guardian

Reviews your PRs before push, catches bugs, writes missing tests automatically

Daily Briefer

Morning: weather + calendar + deadlines. Evening: what you accomplished + tomorrow's plan

Weekend Shipper

Describe what you want → agent scaffolds, codes, deploys. Ship a project per weekend.

Token Economics

Every AI interaction has a price tag

Input vs Output Tokens

Input (your prompt) vs Output (AI response) — output costs 3-5x more

GPT-4o pricing

~$2.50/M input · ~$10/M output

Claude Sonnet

~$3/M input · ~$15/M output

Haiku: $0.25/$1.25

40x cheaper than Opus — same task, different scale

The Skill: Value Per Token

  • Cache repeated context → 90% savings
  • Small models for routing, big for thinking
  • Batch similar requests together
  • Get it right FIRST TIME — retries = 2x cost
$1M+
companies spending per month on tokens by 2027
"Token efficiency is the new cloud cost optimization. The winners extract more value per token."
Pricing: OpenAI & Anthropic official docs, Mar 2026

India's Moment

170+
AI startups in India
$2B+
AI funding raised
10×
output with AI agents
at 1/10th the cost

Indian engineers + AI agents = global competitive edge. Sovereign LLMs (Sarvam, Krutrim) are already here. The cost advantage is real.

You're graduating into the biggest tech shift since the internet. Remote work + AI means location doesn't matter. VIT alumni are building the future — that includes you.

"Zero risk, maximum learning. Start while you're still in university."

Resources

Do This TODAY

1. Get your API key (Gemini free) → 2. Install OpenClaw → 3. Build your first agent → 4. Share what you built

Learn & Build

  • signals.prateek.sh — AI signals updated daily
  • docs.openclaw.ai — OpenClaw docs
  • github.com/openclaw/openclaw — Star it ⭐

Free Tools

  • aistudio.google.com — Free Gemini API
  • ollama.comRun models locally
  • openrouter.ai — Free model access

discord.com/invite/clawd — Join the community   |   linkedin.com/in/prateek1gupta — Connect with me

The Future Belongs to Builders

You're not just learning to code. You're learning to direct intelligence.

These Slides
aislides.prateek.sh
AI Signals
signals.prateek.sh
Connect
linkedin.com/in/prateek1gupta
"10 years from now, you'll either be the person who started building with AI in college... or the person who wishes they had."

Thank you!   Questions?

Prateek Gupta  ·  VIT '16  ·  @neo17th

Appendix

Bonus Slides

For Q&A and deep dives

The Model Landscape Today

GPT-5.4

1M context · Reasoning + coding + computer use unified

Claude Opus 4.6

Best coding model · Deep reasoning · Agentic workflows

Gemini 3 Pro

Native multimodal · 2M context · Google ecosystem

Grok 4.20

4-agent debate system · Real-time X integration

Qwen 3.5 open

35B-A3B MoE · Runs on any laptop · Free

DeepSeek R1 open

671B MoE · Open reasoning rival · Chain-of-thought specialist

Open-source is 6 months behind closed models. Soon it'll be 6 weeks.

MCP — The USB-C of AI

Before MCP

N × M

N models × M tools = N×M integrations
Every model, every tool, custom code

After MCP

N + M

N + M connections via one standard
Universal connector for all AI tools

  • Model Context Protocol: JSON-RPC based, tool discovery, streaming
  • Adopted by: Google ADK, Cursor, GitHub, Figma
  • One standard: databases, file systems, APIs, SaaS tools
Agent MCP Hub GitHub Figma Slack Database Browser File System

Claude Code Review — AI Reviewing AI

Anthropic · March 9, 2026 · TechCrunch, The New Stack

Multi-Agent Code Review

  • Dispatches teams of AI agents to review every pull request
  • Flags logic errors, classifies severity
  • Recommends fixes with reasoning

"Modeled on the review system we run at Anthropic" — claude.com/blog

The AI Code Loop

AI writes code
AI reviews code
AI fixes code
Human approves
"AI writes more code → need AI to review that code → multi-agent quality assurance."

Multi-Agent Systems

Grok 4.20 — Debate System

4 agents: coordinator, fact-checker, technical expert, creative thinker — all debate to reach better answers

CrewAI

Define roles & goals, agents collaborate on complex tasks autonomously

OpenClaw Subagents

Spawn workers for parallel tasks — research, code, analyze simultaneously

RentAHuman

518K humans hired BY AI agents for physical-world tasks AI can't do

"The future isn't one AI. It's a team of AIs — sometimes hiring humans."

What the Lab Leaders Say

Verified public statements from the people building these systems

Sam Altman

CEO, OpenAI

"We built AGIs... AGI kinda went whooshing by with less impact than expected. The field should move on to defining superintelligence."

Demis Hassabis

CEO, Google DeepMind

"We're approaching a threshold moment with agentic AI. But true AGI — creativity, continual learning, robust understanding — is still 5-8 years away."

Dario Amodei

CEO, Anthropic

"We will see some form of AGI in 2026." Anthropic raised $30B, building safety-first systems.

Elon Musk

CEO, xAI

"AGI by end of 2026." Built Grok 4.20 with multi-agent debate. Track record: often early on predictions.

Note: "AGI" means different things to different people. The debate isn't IF — it's WHEN and HOW we define it.

Agentic AI — The Real Revolution

Forget AGI debates. Agentic AI is transforming work RIGHT NOW.

Tasks Already Automated

  • Code generation & review (41% of code AI-generated)
  • Email triage, drafting, & follow-ups
  • Data analysis & report generation
  • Customer support (L1/L2)
  • Content creation & social media
  • Research & competitive analysis

Being Automated in 2026

  • Multi-step web tasks (booking, forms, shopping)
  • Sales outreach & lead qualification
  • Legal document review
  • Financial analysis & trading
  • Hiring & candidate evaluation
  • DevOps & deployment pipelines

Still Needs Humans (for now)

  • Creative strategy & vision
  • Complex negotiations
  • Physical tasks & hardware
  • Ethical judgments & policy
  • Novel scientific discovery
AI doesn't replace jobs — it replaces tasks. The winners use agents to amplify what's left.

The One-Person Company

$200M
Midjourney ARR
<15 employees
36%
of new ventures
are solo-founded
340%
avg revenue increase
with AI agents
creative vision is
the new bottleneck
"Every one of you could be running a company while still in college."