Research · Frontier Lab & Model News

Back to sweep

Research sweep · shallow · 2025 – present

The Karpathy Loop — AI Agents Running Autonomous Training Experiments

The "Karpathy loop" — autonomous AI agent research cycles that run and evaluate ML training experiments to discover improvements, April 2025–April 19 2026, including Karpathy's own explanations, independent commentary, and real-world implementations

  • frontier
  • blogs
  • tech

Synthesised 2026-04-19

Narrative

In March 2026, Andrej Karpathy—independent researcher and former OpenAI/Tesla executive—released autoresearch, catalyzing mainstream coverage of what the AI community terms 'the Karpathy loop': an autonomous agent that iteratively reads training code, proposes modifications (learning rate, architecture depth, hyperparameters), executes time-boxed experiments (typically 5 minutes on GPU), evaluates outcomes, and repeats. Karpathy's proof-of-concept ran 700 experiments in 2 days on a small language model, discovering 20 optimizations that yielded ~11% training speedup when applied to larger models. Critically, Karpathy disclosed a personal inflection point: since December 2025, he has stopped writing code directly, instead directing AI agents for 16 hours daily—a shift he frames as entry into a 'loopy era' where autonomous systems conduct research without humans in the loop. The concept proved immediately portable: Shopify CEO Tobias Lütke ran autoresearch overnight on internal data and achieved 19% performance gains from 37 automated experiments. Fortune, The New Stack, Latent Space, and NextBigFuture all positioned autoresearch as foundational to how frontier ML labs will conduct research, though concrete adoption announcements from official labs (OpenAI, Google DeepMind, Anthropic, Meta) have not yet appeared in public coverage.


Sources

ID Title Outlet Date Significance
t1 Why everyone is talking about Andrej Karpathy's autonomous AI research agent Fortune 2026-03 Business publication's major coverage of March 2026 autoresearch announcement, detailing 700 experiments in 2 days and implications for frontier ML labs.
t2 GitHub - karpathy/autoresearch: AI agents running research on single-GPU nanochat training automatically GitHub 2026-03 The official open-source autoresearch repository demonstrating the autonomous loop in practice with runnable 630-line agent code.
t3 Andrej Karpathy's 630-line Python script ran 50 experiments overnight without any human input The New Stack 2026-03 Technical deep-dive explaining the agent loop mechanics: code modification, time-boxed execution, evaluation, and iteration without human intervention.
t4 Andrej Karpathy on Code Agents, AutoResearch and the Self Improvement Loopy Era of AI NextBigFuture 2026-03 Karpathy's canonical explanation of the concept, including his December 2025 transition to directing agents full-time and framing of the 'loopy era'.
t5 [AINews] Autoresearch: Sparks of Recursive Self Improvement Latent Space 2026-03 AI-community analysis positioning autoresearch within autonomous research agents, self-improving loops, and implications for frontier research methodology.

We use analytics cookies to understand site usage and improve the service. We do not use marketing cookies.