VIT Workshop — March 11, 2026

Beyond ChatGPT

Build an AI That Works For You

Prateek Gupta

VIT '16 | EIR @ noon | Ex-CEO CustomerGlu

12 Months That Changed Everything

Feb '25

Claude 3.5 Sonnet
coding crown

Jun '25

GPT-5
1M context

Sep '25

Open-source
catches up

Nov '25

Claude
Opus 4

Feb '26

Grok 4.20
multi-agent

Mar '26

GPT-5.4
unified

282

models on leaderboard

major models in 1 week

182

open-weight models

"More changed in 12 months than in the previous 12 years."

"Computer" Was a Human Job

1940s — The Original "Computers"

Women at NASA (NACA) performed trajectory calculations by hand. "Computer" was their job title — not a machine.

Katherine Johnson, Hidden Figures — NASA History

1960s — Machines Replace Humans

Electronic computers arrived. The job title vanished. "Using computers" became the new job.

2020s — Coding Is the Job

60 years of "programmer" as a career. Writing instructions for machines. The golden age of software.

2025+ — Computers Use Computers

AI agents write code, deploy servers, browse the web. The cycle completes.

"Every generation automates the last generation's job. Now it's our turn."

Three Eras of AI

Prediction

2012 — 2020

ML, classification, recommendations, search ranking

Generation

2020 — 2024

ChatGPT, content creation, co-pilots, text & image gen

Agency

2025 →

Memory, tools, initiative, autonomous work, multi-agent

WE ARE HERE ▶

AI isn't just answering questions anymore. It's doing work.

ChatGPT vs AI Agent

	ChatGPT	AI Agent
Memory	Resets every chat	Remembers everything
Tools	Can only generate text	Email, calendar, code, browser
Initiative	Waits for you	Works proactively
Context	Starts from zero	Knows your history
Actions	Suggests what to do	Actually DOES it

ChatGPT is a conversation. An agent is a collaboration."

Vibe Coding → The Early Singularity

Vibe Coding coined by Andrej Karpathy

Building software by talking to AI — not typing code
Karpathy now says we're in the "Early Singularity" era (March 2026)
"Programming is unrecognizable" — Business Insider

Autoresearch Mar 7, 2026

630-line Python script — AI runs autonomous ML experiments overnight
Human writes prompt.md, AI edits train.py
Runs hundreds of experiments on a single GPU
Entire codebase fits in one LLM context window

"AI isn't just writing code. It's writing the AI that writes code."

Karpathy: X/Twitter · Autoresearch: MarkTechPost, Mar 7 2026

Beyond Code — What Comes Next?

The abstraction ladder keeps climbing

1950s: Machine Code

Humans write binary: 01001000 01100101

1970s: High-Level Languages

C, then Java, Python — compilers handle the binary

2024: Vibe Coding

Describe what you want → AI writes the code → compiler makes binary

2026+: Prompt → Binary Directly?

Elon Musk (Feb 2026): "AI will generate compiled binary, bypassing code entirely"

+ Neuralink = imagination-to-software

The Pattern

Every decade, we move one level up:

Binary → Assembly → C → Python → English → ???

Each layer abstracts the one below. Programming languages may become like assembly — still there, but rarely touched by humans.

Other Unlocking Trends

Computer Use — AI operates GUIs, not just APIs
Multi-Agent — AI teams, not solo assistants
Observational Memory — AI learns from watching you
World Models — AI understands physics, not just text

"The question isn't whether code disappears. It's what you build when it does."

Musk: MoneyControl, Feb 2026 · Pattern: historical computing evolution

Context Engineering > Prompt Engineering

Prompt Engineering

Crafting the perfect question

Optimizing words, few-shot examples, chain-of-thought tricks

Context Engineering

Designing what the AI knows before it thinks

9,649 experiments proved: context matters more than prompts

The Evolution

1. RAG — retrieve docs
2. Agentic RAG — auto search & refine
3. Knowledge Graphs — relationships
4. Observational Memory — 10× cheaper than RAG

"The skill isn't asking better questions. It's building better context."

Computer Use — AI Sees Your Screen

GPT-5.4 + Claude: can literally see your screen and control your computer
OSWorld benchmark: navigate real desktop environments
WebArena: interact with real websites, fill forms, submit data

CAPTURE

Screenshot pixels

VISION

Understand UI

DECIDE

Plan next step

ACT

Click, type, scroll

DONE

Task complete

"Your AI can now do what a human intern does — click, type, navigate."

The Numbers

38%

startups solo-founded
(was 22%)

84%

developers using
AI tools daily

$52.6B

AI agent market
by 2030

$2B

Cursor ARR
(doubled in 3 months)

41%

of all code
now AI-generated

Sources: Bloomberg, MarketsAndMarkets, index.dev (2026)

Part 2: Real Stories

How I Operate with Zero Employees

CustomerGlu

Gamification SaaS — 250M+ devices, 3 regions
Raised $1.9M (Better Capital + Techstars)
YourStory Tech50 2021

The Transition

Software dev → Claude Code, Codex
Deployment → automated pipelines, agent-monitored
Client ops → agent drafts emails, preps meetings

"For the last 6 months, I've managed development, deployment, and client operations entirely through AI agents — operating with 0 employees."

Not zero employees from day one. But the transition to zero happened — and it works.

What My Agent Does Daily

This Morning (real examples)

Scanned 12 emails, flagged 2 urgent deal replies
Prepped briefing docs for 3 meetings automatically
Added 10 new AI signals to signals.prateek.sh
Built this slide deck (yes, this one you're watching)

On-Demand (yesterday)

Researched 5 LinkedIn profiles before a sales call
Deployed code updates to 3 cloud regions in 4 minutes
Drafted & sent 8 personalized outreach emails
Negotiated meeting times across 3 timezones

One agent. Zero busywork. 24/7 availability.

Building at noon — AI-Powered Hiring

noon: Middle East's largest e-commerce — $1.5B revenue, 4M DAU

Proto: AI hiring pipeline built by a 4-person team in 2 weeks

CV Upload

→

AI Assessment

→

Build Challenge

→

Auto-Evaluation

3,000 candidates processed — AI evaluates each one

"Small teams + AI agents = impossibly powerful."

Part 3: The Agentic Ecosystem

Agent Frameworks

Which one for what?

OpenClaw Personal

Personal AI agent — runs on your machine, connects to everything

LangGraph Production

Stateful workflows, complex multi-step pipelines

CrewAI Teams

Multi-agent teams with role-based collaboration

Google ADK Cloud

MCP-native agent development kit

OpenAI SDK OpenAI

Agents SDK — tool-use native, async

Mastra Memory

Observational memory — 10× cheaper than RAG

Coding Agents — The New IDE

Cursor + Automations

$2B ARR · Event-driven autonomous coding

Claude Code

Terminal-native · Thinks before it codes

Codex (OpenAI)

Cloud sandboxed · Async coding agent

Cline / Aider / Roo Code

Open-source alternatives · Self-hostable

41%

of all code now written by AI
index.dev, 2026

"The IDE is dead. Long live the agent."

Cursor ARR: Bloomberg, Mar 2 2026 · Code stats: index.dev

Running Models Locally

Ollama — no data center needed

What you need	What you can run	Cost
8GB RAM (any laptop)	Qwen 3.5 35B-A3B (MoE)	Free
16GB RAM (MacBook)	14-27B parameter models	Free
64GB Mac Mini	70B models	₹1.8L
128GB Mac Studio	122B models	₹3.5L
256GB Mac Studio Ultra	Kimi K2.5 (1T params)	₹7L+

# Install

Ollama and run in 30 seconds
curl -fsSL https://ollama.com/install.sh | sh
ollama pull qwen3.5:35b-a3b

"You don't need a data center. You need

Ollama + a laptop."

Memory Architectures for Agents

No Memory

ChatGPT style
Forgets everything

→

RAG

Retrieve docs
Knows your files

→

Agentic RAG

Auto search
& refine

→

Knowledge Graphs

Understands
relationships

→

Observational

Remembers what
it's seen · 10× cheaper

My Agent's 5-Layer Memory

Working (what I'm doing now) → Episodic (daily logs) → Entity (people tracking) → Causal (why decisions were made) → Semantic (curated knowledge)

"Your agent gets smarter the longer you use it. Mine has 6 months of context."

Coding Is Commoditized — What's Next?

The Old World

Coding skill = career moat
Memorize syntax & APIs
Build everything from scratch
10 years to become "senior"

The New World

Coding is table stakes (like typing)
AI writes 41% of all code — and rising fast
Value = directing AI
Context engineering, system design, product thinking

AI Operator

Manages fleets of AI agents to run business operations

AI-First Builder

Builds products by orchestrating AI, not writing every line

"The world doesn't need more coders. It needs more AI operators and AI-first builders."

Building Agentic Workflows

A systematic approach to AI automation

Do it manually first

Understand the full process end-to-end before automating. If you can't do it, you can't automate it.

Map the cycle

Identify repetitive, cyclic tasks that follow patterns. AI thrives on repetition and pattern.

Identify decision points

What context does each decision need? Missing context = hallucination. Map the inputs.

Design the prompt chain

Break into discrete prompt steps, each with clear input/output. Small steps > one giant prompt.

Build evaluation loops

How do you know it worked? Evals > vibes. Automate your quality checks too.

Choosing the Right Model

Not every task needs GPT-5

Task	Model Tier	Why
Classification, routing	Small Haiku · 4o-mini	Fast, cheap, good enough
Summarization, extraction	Medium Sonnet · 4o	Balance of quality and cost
Complex reasoning, coding	Large Opus · o3 / GPT-5	Worth the premium
Creative, open-ended	Large + high temp	Needs breadth and surprise

3-4

models in a smart toolkit

40×

cheaper: Haiku vs Opus

90%

saved with prompt caching

"The best engineers use 3-4 models. Match model to task."

The Intelligence Staircase

Where we are and where we're going — verified claims from lab leaders

ANI

Narrow AI WE ARE HERE

Superhuman at specific tasks. GPT-5.4, Claude Opus, AlphaFold. Beats humans at coding, chess, protein folding — but can't do everything.

AGI

Artificial General Intelligence

Human-level across ALL cognitive tasks. Can learn anything a human can. No consensus on when — estimates range from 2027 to 2040+.

ASI

Artificial Superintelligence

Surpasses all human intelligence combined. Self-improving. The "intelligence explosion" scenario. Timeline: unknown — possibly years after AGI, possibly days.

"We're building narrow AI that already outperforms humans at tasks. The question is how fast it generalizes."

Beyond LLMs — World Models

Yann LeCun leaves Meta

Turing Award winner, ex-Meta AI Chief — launched AMI Labs

Raised $1.03B (WIRED, TechCrunch — Mar 9, 2026)

JEPA Architecture

Joint-Embedding Predictive Architecture — learns from video & sensor data, not just text

LLMs vs World Models

LLMs predict the next word
World models predict what happens next in the physical world
LLMs hallucinate because they only know language
World models understand physics, causality, space

"Meta had to catch up with the industry on LLMs... which is not my interest" — LeCun to WIRED

"LLMs are not the end. They might just be the beginning."

What This Means For You

Whether AGI arrives in 2027 or 2035 — the implications are the same

Learn to work WITH AI, not against it

84% of developers now use AI tools daily. 41% of all code is AI-generated. Those who don't adapt will fall behind — not in years, in months.

Invest in judgment, not just knowledge

AI can look up anything. It can't decide what matters. System design, product thinking, and taste become the differentiators.

Build your agent NOW — compound advantage

An agent that knows you for 6 months is 10x more useful than a fresh one. Start today, and by graduation, you have a serious advantage.

The $710B bet says this isn't hype

OpenAI spending $115B through 2029. Anthropic raising $30B. Google, Meta, Microsoft all-in.

"The future belongs to people who can direct intelligence — human or artificial."

Compute Is the New Electricity

From luxury to utility in 10 years

Then (Electricity)

1900s: Own Power Plants

Every factory built its own generator

1920s: Power Grid

Standardized → plug in, pay per kWh

Now (Compute)

2020s: Own GPU Clusters

Companies run massive on-prem infrastructure

2026+: Serverless Inference

Pay per token, zero infrastructure

Three Predictions

1. Inference 10× cheaper by 2028

Moore's Law for AI — costs halving every 12 months

2. Compute credits = electricity bills

Operational expense, not capital investment

3. Value moves UP the stack

Infrastructure → Apps → Workflows → Outcomes

"Don't invest in GPUs. Invest in knowing what to DO with compute."

Part 4: Live Demos

Demo Time

My Agent on Telegram
Email, calendar, memory, proactive alerts

🌐

signals.prateek.sh
Website maintained daily by AI

Vibe Coding Live
Build something by just talking

Local Model
Ollama running Qwen on a laptop

Demo: Agent in Action

What's on my calendar today?

You have 3 meetings: 10 AM standup, 2 PM VIT workshop, 4 PM Capillary call. I've prepped briefing docs for each. ✅

Summarize my last 3 emails

1. Capillary — deal meeting confirmed Thu 11 AM
2. noon — Proto V2 shipped, Faraz wants demo
3. Newsletter — skip, nothing urgent

"This is not a script. This is my actual daily workflow."

Part 5: Hands-On

Your Turn — Free Setup Options

Gemini Free

aistudio.google.com
Click "Get API Key"
No credit card needed
1,500 requests/day

Ollama (Local)

curl install — 1 command
ollama pull qwen3:8b
No internet needed
100% private & free

OpenRouter

openrouter.ai
Sign in with Google
Access free models
Multiple providers

Pick ONE and set it up in the next 3 minutes

Install in 3 Minutes

    # Install OpenClaw

    curl -fsSL https://get.openclaw.ai | bash

    # Start

    openclaw init

    # → Choose: Gemini (free) or Ollama (local)

    # Connect Telegram

    # Scan QR code → your agent is live

Raise your hand if stuck

Make It Yours

SOUL.md — personality

        You are Jarvis, a sharp AI

        assistant for a VIT CS student.

        Be concise. Be proactive.

        Remember everything.

MEMORY.md — context

        - Name: [Your Name]

        - Year: 3rd Year B.Tech CS

        - Interests: ML, web dev

        - Targets: Google, Microsoft

"This is what makes YOUR agent different from everyone else's."

What to Build

Research Copilot

Feed it your thesis topic → daily paper summaries, auto-built literature review

Placement Hunter

Auto-research companies before interviews, mock Q&A with real Glassdoor questions

Inbox Zero Agent

Reads every email, drafts replies, follows up — you just approve & send

Code Guardian

Reviews your PRs before push, catches bugs, writes missing tests automatically

Daily Briefer

Morning: weather + calendar + deadlines. Evening: what you accomplished + tomorrow's plan

Weekend Shipper

Describe what you want → agent scaffolds, codes, deploys. Ship a project per weekend.

Token Economics

Every AI interaction has a price tag

Input vs Output Tokens

Input (your prompt) vs Output (AI response) — output costs 3-5x more

GPT-4o pricing

~$2.50/M input · ~$10/M output

Claude Sonnet

~$3/M input · ~$15/M output

Haiku: $0.25/$1.25

40x cheaper than Opus — same task, different scale

The Skill: Value Per Token

Cache repeated context → 90% savings
Small models for routing, big for thinking
Batch similar requests together
Get it right FIRST TIME — retries = 2x cost

$1M+

companies spending per month on tokens by 2027

"Token efficiency is the new cloud cost optimization. The winners extract more value per token."

Pricing: OpenAI & Anthropic official docs, Mar 2026

India's Moment

170+

AI startups in India

$2B+

AI funding raised

10×

output with AI agents
at 1/10th the cost

Indian engineers + AI agents = global competitive edge. Sovereign LLMs (Sarvam, Krutrim) are already here. The cost advantage is real.

You're graduating into the biggest tech shift since the internet. Remote work + AI means location doesn't matter. VIT alumni are building the future — that includes you.

"Zero risk, maximum learning. Start while you're still in university."

Resources

Do This TODAY

1. Get your API key (Gemini free) → 2. Install OpenClaw → 3. Build your first agent → 4. Share what you built

Learn & Build

signals.prateek.sh — AI signals updated daily
docs.openclaw.ai — OpenClaw docs
github.com/openclaw/openclaw — Star it ⭐

Free Tools

aistudio.google.com — Free Gemini API
ollama.com — Run models locally
openrouter.ai — Free model access

discord.com/invite/clawd — Join the community | linkedin.com/in/prateek1gupta — Connect with me

The Future Belongs to Builders

You're not just learning to code. You're learning to direct intelligence.

These Slides

aislides.prateek.sh

AI Signals

signals.prateek.sh

Connect

linkedin.com/in/prateek1gupta

"10 years from now, you'll either be the person who started building with AI in college... or the person who wishes they had."

Thank you! Questions?

Prateek Gupta · VIT '16 · @neo17th

Appendix

Bonus Slides

For Q&A and deep dives

The Model Landscape Today

GPT-5.4

1M context · Reasoning + coding + computer use unified

Claude Opus 4.6

Best coding model · Deep reasoning · Agentic workflows

Gemini 3 Pro

Native multimodal · 2M context · Google ecosystem

Grok 4.20

4-agent debate system · Real-time X integration

Qwen 3.5 open

35B-A3B MoE · Runs on any laptop · Free

DeepSeek R1 open

671B MoE · Open reasoning rival · Chain-of-thought specialist

Open-source is 6 months behind closed models. Soon it'll be 6 weeks.

MCP — The USB-C of AI

Before MCP

N × M

N models × M tools = N×M integrations
Every model, every tool, custom code

After MCP

N + M

N + M connections via one standard
Universal connector for all AI tools

Model Context Protocol: JSON-RPC based, tool discovery, streaming
Adopted by: Google ADK, Cursor, GitHub, Figma
One standard: databases, file systems, APIs, SaaS tools

Claude Code Review — AI Reviewing AI

Anthropic · March 9, 2026 · TechCrunch, The New Stack

Multi-Agent Code Review

Dispatches teams of AI agents to review every pull request
Flags logic errors, classifies severity
Recommends fixes with reasoning

"Modeled on the review system we run at Anthropic" — claude.com/blog

The AI Code Loop

AI writes code →

AI reviews code →

AI fixes code →

Human approves ✅

"AI writes more code → need AI to review that code → multi-agent quality assurance."

Multi-Agent Systems

Grok 4.20 — Debate System

4 agents: coordinator, fact-checker, technical expert, creative thinker — all debate to reach better answers

CrewAI

Define roles & goals, agents collaborate on complex tasks autonomously

OpenClaw Subagents

Spawn workers for parallel tasks — research, code, analyze simultaneously

RentAHuman

518K humans hired BY AI agents for physical-world tasks AI can't do

"The future isn't one AI. It's a team of AIs — sometimes hiring humans."

What the Lab Leaders Say

Verified public statements from the people building these systems

Sam Altman

CEO, OpenAI

"We built AGIs... AGI kinda went whooshing by with less impact than expected. The field should move on to defining superintelligence."

Demis Hassabis

CEO, Google DeepMind

"We're approaching a threshold moment with agentic AI. But true AGI — creativity, continual learning, robust understanding — is still 5-8 years away."

Dario Amodei

CEO, Anthropic

"We will see some form of AGI in 2026." Anthropic raised $30B, building safety-first systems.

Elon Musk

CEO, xAI

"AGI by end of 2026." Built Grok 4.20 with multi-agent debate. Track record: often early on predictions.

Note: "AGI" means different things to different people. The debate isn't IF — it's WHEN and HOW we define it.

Agentic AI — The Real Revolution

Forget AGI debates. Agentic AI is transforming work RIGHT NOW.

Tasks Already Automated

Code generation & review (41% of code AI-generated)
Email triage, drafting, & follow-ups
Data analysis & report generation
Customer support (L1/L2)
Content creation & social media
Research & competitive analysis

Being Automated in 2026

Multi-step web tasks (booking, forms, shopping)
Sales outreach & lead qualification
Legal document review
Financial analysis & trading
Hiring & candidate evaluation
DevOps & deployment pipelines

Still Needs Humans (for now)

Creative strategy & vision
Complex negotiations
Physical tasks & hardware
Ethical judgments & policy
Novel scientific discovery

AI doesn't replace jobs — it replaces tasks. The winners use agents to amplify what's left.

The One-Person Company

$200M

Midjourney ARR
<15 employees

36%

of new ventures
are solo-founded

340%

avg revenue increase
with AI agents

∞

creative vision is
the new bottleneck

"Every one of you could be running a company while still in college."

Let's Connect

Prateek Gupta · VIT '16 · EIR @ noon

✉

[email protected]

💬

+91 9626937819