Latent Space - Videos
Back to ChannelThe $15B Physical AI Company: Simulation, Autonomy OS, Neural Sim, & 1K Engineers—Applied Intuition
From building Applied Intuition from YC-era autonomy tooling into a $15B physical AI company, Qasar Younis and Peter Ludwig have spent the last decade living through the full arc of autonomy: from ...
AI-Native Engineering: 100% adoption, 5x search throughput, unlimited tokens — Mikhail Parakhin
From running one of the most aggressive internal AI rollouts in tech to building systems that simulate customers, optimize ML pipelines, and rethink search latency from first principles, Mikhail Pa...
🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik
TL;DR: 95% of cancer treatments fail to pass clinical trials, but it may be a matching problem — if we better understood what patients have which tumors which will respond to which treatments, succ...
⚡️ How to turn Documents into Knowledge: Graphs in Modern AI — Emil Eifrem, CEO Neo4J
The core argument: AI systems need more than top-K chunks. They need structured context about entities, relationships, permissions, authorship, provenance, and history. GraphRAG combines vector sea...
⚡️ Competing with ChatGPT and Sierra, building a $10M ARR company — Yasser Elsaid, Founder, Chatbase
Resources: https://x.com/yasser_elsaid_/status/1632401916771590144 https://x.com/yasser_elsaid_/status/1878899512443379918 https://x.com/yasser_elsaid_/status/2039038515019997210 https://x.co...
Notion’s Sarah Sachs & Simon Last on Custom Agents, Evals, and the Future of Work
Sarah Sachs and Simon Last of Notion join us for a deep dive into how Notion built Custom Agents, why it took years and multiple rebuilds to get right, and what it means to turn a productivity tool...
⚡️ The best engineers don't write the most code. They delete the most code. — Stay Sassy
Anonymous tech writers Stay Sassy join swyx to talk about AI budgets, per-person token spend, build vs buy, and why code review matters more, not less, in the age of AI coding tools. https://stays...
Extreme Harness Engineering: 1M LOC, 1B toks/day, 0% human code or review — Ryan Lopopolo, OpenAI
We’re proud to release this ahead of Ryan’s keynote at AIE Europe. Hit the bell, get notified when it is live! Attendees: come prepped for Ryan’s AMA with Vibhu after. Move over, context engineeri...
Marc Andreessen introspects on Death of the Browser, Pi + OpenClaw, and Why "This Time Is Different"
From Mosaic and Netscape to cofounding @a16z, Marc Andreessen has lived through multiple computing platform shifts firsthand. In this episode, Marc joins swyx and Alessio in a16z’s original office ...
Moonlake: Interactive, Multimodal World Models — with Chris Manning and Fan-yun Sun
We’ve been on a bit of a mini World Models series over the last quarter: from introducing the topic with Yi Tay, to exploring Marble with World Labs’ Fei-Fei Li and Justin Johnson, to previewing Wo...
The Stove Guy: Sam D'Amico Shows New AI Cooking Features on America's Most Powerful Stove at Impulse
In this episode, we visit Impulse with founder and CEO Sam D’Amico for a tour of the office, a live demo of the most powerful induction stove on the market, and a cooking session with new AI featur...
Mistral: Voxtral TTS, Forge, Leanstral, & Mistral 4 — w/ Pavan Kumar Reddy & Guillaume Lample
Mistral is one of the world's leading frontier model labs, and has just raised $900m to build their European data center hub. Last year marked their first ventures into multimodal models, with Pixt...
🔬There Is No AlphaFold for Materials — AI for Materials Discovery with Heather Kulik
Materials science is the unsung hero of the science world. Behind every physical product you interact was decades of research into getting the properties of materials just right. Your gym clothes c...
Dreamer: the Agent OS for Everyone — David Singleton
David Singleton, the longtime former CTO of Stripe, has now launched Dreamer, a new consumer-focused platform to discover, build, and use AI agents and “agentic apps,” centered on a customizable pe...
Anthropic’s Felix Rieseberg on AI Coworkers, Local-First Agents, and the Future of Knowledge Work
From building Electron and helping ship the Slack desktop app to now shaping Claude Cowork at Anthropic, Felix Rieseberg has spent years working at the interface layer. In this episode, Felix joins...
⚡️Monty: the ultrafast Python interpreter by Agents for Agents — Samuel Colvin, Pydantic
https://github.com/pydantic/monty
Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Eskildsen of Turbopuffer
From spending nearly a decade scaling Shopify’s infrastructure to doing “angel engineering” for companies like Readwise and Replicate, Simon Hørup Eskildsen went through the arc of getting haunted ...
⚡️ OpenClaw's Memory Sucks and the fix is simple — Dhravya Shah, Supermemory
https://github.com/supermemoryai/openclaw-supermemory https://x.com/DhravyaShah/status/2023630749065228364 Timestamps 0:00 - The Evolution of Supermemory: Dhravya explains how his project starte...
Agent Inference at the "Speed of Light" — How NVIDIA moves like a $4.3 Trillion Startup
Swyx and Vibhu chat with Nader Khalil (https://x.com/naderlikeladder) and Kyle Kranen (https://x.com/KranenKyle) from NVIDIA about NVIDIA'S DX mission - Brev’s origin as a one-click way to access G...
Why Your AI Agents Don’t Work with Dex Horthy of HumanLayer | In-Context Cooking
In this episode, we have Dex Horthy, founder and CEO of Humanlayer and the mind behind “context engineering,” joining us to recreate restaurant-style Dan Dan noodles. From his first internship at N...
Cursor's Third Era: Cloud Agents — ft. Sam Whitmore, Jonas Nelle, Cursor
https://cursor.com/blog/third-era Cursor Cloud Agents: Tested PRs, Demo Videos, Parallel Model Swarms, and the Future of Agentic Coding swyx catches up with Jonas and Sam in the beautiful new Cur...
Why Every Agent Needs a Box — Aaron Levie, Box
swyx and guest host Jeff Huber (CEO of Chroma, past guest!) chat with Aaron Levie, CEO of Box, on how AI agents will transform enterprise knowledge work and why companies must adapt workflows to ma...
⚡️ Polsia: Solo Founder Tiny Team from 0 to 1m ARR in 1 month & the future of Self-Running Companies
https://x.com/bencera_/status/2027825976261111966?s=20
Measuring Exponential Trends Rising (in AI) — Joel Becker, METR
Joel Becker explains METR’s focus on Model Evaluation and Threat Research to assess whether AI could pose enormous or catastrophic risks. Becker discusses METR’s publicized work such as the time ho...
Dylan Patel Explains the AI War While Cooking | In-Context Cooking
In this episode, we have Dylan Patel founder and CEO of SemiAnalysis joining us to recreate restaurant-style chicken fried rice. From semiconductor bottlenecks and Nvidia’s paranoia, to $200B hyper...
‘You guys are so inefficient’ #substack #shorts
This is a clip from https://www.latent.space/p/paid-anthropic-distillation-and-how?utm_source=youtube_shorts See the full video: https://www.youtube.com/watch?v=7EBQ04OL-is #shorts #substack
Privacy or policing? #substack #shorts
This is a clip from https://www.latent.space/p/paid-anthropic-distillation-and-how?utm_source=youtube_shorts See the full video: https://www.youtube.com/watch?v=7EBQ04OL-is #shorts #substack
How models memorize from one pass #substack #shorts
This is a clip from https://www.latent.space/p/paid-anthropic-distillation-and-how?utm_source=youtube_shorts See the full video: https://www.youtube.com/watch?v=7EBQ04OL-is #shorts #substack
🔬Max Welling: Materials Underlie Everything
In this episode recorded at NeurIPS 2025, Max Welling traces the intellectual thread connecting quantum gravity, equivariant neural networks, diffusion models, and climate-focused materials discove...
Claude Code for Finance + The Global Memory Shortage: Doug O'Laughlin, SemiAnalysis
A special double pod on the 1 year anniversary of Claude Code: we chat with one of its most vocal fans, who thinks it will write 25-50% of all code on GitHub, plus get a breakdown on the memory cru...
The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier Evals
Olivia Watkins (Frontier Evals team) and Mia Glaese (VP of Research at OpenAI, leading the Codex, human data, and alignment teams) discuss a new blog post (https://openai.com/index/why-we-no-longer...
Inside AI’s $10B+ Capital Flywheel — Martin Casado & Sarah Wang of a16z
From pioneering software-defined networking to backing many of the most aggressive AI model companies of this cycle, Martin Casado and Sarah Wang sit at the center of the capital, compute, and tale...
The AI Frontier: from Gemini 3 Deep Think distilling to Flash — Jeff Dean
From rewriting Google’s search stack in the early 2000s to reviving sparse trillion-parameter models and co-designing TPUs with frontier ML research, Jeff Dean has quietly shaped nearly every layer...
🔬Generating Molecules, Not Just Models
This episode traces the remarkable journey from AlphaFold2’s landmark achievement in protein structure prediction to the broader landscape of molecular interaction modeling and protein design. The ...
⚡️ Reverse Engineering OpenAI's Training Data — Pratyush Maini, Datology
Should you add reasoning traces to your pretraining data? There’s been a surge of academic work speculating its advantages. But do frontier labs actually do this? Turns out, we can answer confident...
Goodfire AI’s Bet: Interpretability as the Next Frontier of Model Design — Myra Deng & Mark Bissell
From Palantir and Two Sigma to building Goodfire into the poster-child for actionable mechanistic interpretability, Mark Bissell (Member of Technical Staff) and Myra Deng (Head of Product) are tryi...
⚡️Context Graphs: according to the authors — Jaya Gupta, Ashu Garg, Foundation Capital
In this Lightning pod, swyx hosts Jaya Gupta and Ashu Garg from Foundation Capital to discuss the emergence of context graphs. They define this new framework as the institutional memory of "the why...
🔬 From Red Teaming GPT-4 to Automating Drug Discovery: The Future of AI in Science — Andrew White
_Editor’s note: Welcome to our new AI for Science pod, with your new hosts RJ and Brandon! See the writeup on __Latent.Space_ (http://Latent.Space)_ for more details on why we’re launching 2 new po...
⚡️ Prism: OpenAI's LaTeX "Cursor for Scientists" — Kevin Weil & Victor Powell, OpenAI for Science
“2026 in AI for Science is going to look a lot like 2025 for Software Engineering” — Kevin Weil From building *Crixet* in stealth (so stealthy Kevin had to hunt down Victor on Reddit to explore an...
Captaining IMO Gold, Deep Think, On-Policy RL, Feeling the AGI in Singapore — Yi Tay
From shipping *Gemini Deep Think* and *IMO Gold* to launching the *Reasoning and AGI team in Singapore,* *Yi Tay* has spent the last 18 months living through the full arc of Google DeepMind's pivot...
Brex’s AI Hail Mary — With CTO James Reggio (acquired for $5B by Capital One!)
From building internal AI labs to becoming CTO of Brex, James Reggio has helped lead one of the most disciplined AI transformations inside a real financial institution where compliance, auditabilit...
Artificial Analysis: The Independent LLM Analysis House — with George Cameron and Micah Hill-Smith
don’t miss George’s AIE talk: https://www.youtube.com/watch?v=sRpqPgKeXNk —- From launching a side project in a Sydney basement to becoming the *independent gold standard for AI benchmarking*—trust...
[State of Research Funding] Beyond NSF, Slingshots, Open Frontiers — Andy Konwinski, Laude Institute
From co-founding *Databricks* and *Perplexity* to launching the *Laude Institute*—a dual venture fund and nonprofit designed to turbocharge the path from *research breakthrough to breakout company*...
[State of Code Evals] After SWE-bench, Code Clash & SOTA Coding Benchmarks recap — John Yang
From creating *SWE-bench* in a Princeton basement to shipping *CodeClash,* *SWE-bench Multimodal,* and *SWE-bench Multilingual,* *John Yang* has spent the last year and a half watching his benchmar...
[State of MechInterp] SAEs in Production, Circuit Tracing, AI4Science, "Pragmatic" Interp — Goodfire
From PhD research on grounding and language models to shipping interpretability tools in production at *Goodfire,* *Jack Merullo* and *Mark Bissell* are building the infrastructure to crack open th...
[State of AI Papers 2025] Fixing Research with Social Signals, OCR & Implementation — Team AlphaXiv
From late-night dorm-room hacking sessions at Stanford to building the most-used platform for navigating AI research, the founders of *AlphaXiv* have spent the last few years watching the archive f...
[NeurIPS Best Paper] 1000 Layer Networks for Self-Supervised RL — Kevin Wang et al, Princeton
From undergraduate research seminars at Princeton to winning *Best Paper award at NeurIPS 2025,* *Kevin Wang, Ishaan Javali, Michał Bortkiewicz, Tomasz Trzcinski, Benjamin Eysenbach* defied convent...
[State of Context Engineering] Agentic RAG, Context Rot, MCP, Subagents — Nina Lopatina, Contextual
From neuroscience PhD research on reward learning and decision making to building the infrastructure for *context engineering at scale,* *Nina Lopatina* has spent the last year watching a brand-new...
[State of Evals] LMArena's $1.7B Vision — Anastasios Angelopoulos, LMArena
_We are reupping this episode after LMArena announced their fresh Series A (_https://www.theinformation.com/articles/ai-evaluation-startup-lmarena-valued-1-7-billion-new-funding-round?rc=luxwz4_), ...
[State of Post-Training] From GPT-4.1 to 5.1: RLVR, Agent & Token Efficiency — Josh McGrath, OpenAI
From pre-training data curation to shipping *GPT-4o,* *o1,* *o3,* and now *GPT-5 thinking* and the *shopping model,* *Josh McGrath* has lived through the full arc of OpenAI's post-training evolutio...