Latent Space - Videos
Back to Channel🔬 From Red Teaming GPT-4 to Automating Drug Discovery: The Future of AI in Science — Andrew White
_Editor’s note: Welcome to our new AI for Science pod, with your new hosts RJ and Brandon! See the writeup on __Latent.Space_ (http://Latent.Space)_ for more details on why we’re launching 2 new po...
⚡️ Prism: OpenAI's LaTeX "Cursor for Scientists" — Kevin Weil & Victor Powell, OpenAI for Science
“2026 in AI for Science is going to look a lot like 2025 for Software Engineering” — Kevin Weil From building *Crixet* in stealth (so stealthy Kevin had to hunt down Victor on Reddit to explore an...
Captaining IMO Gold, Deep Think, On-Policy RL, Feeling the AGI in Singapore — Yi Tay
From shipping *Gemini Deep Think* and *IMO Gold* to launching the *Reasoning and AGI team in Singapore,* *Yi Tay* has spent the last 18 months living through the full arc of Google DeepMind's pivot...
Brex’s AI Hail Mary — With CTO James Reggio (acquired for $5B by Capital One!)
From building internal AI labs to becoming CTO of Brex, James Reggio has helped lead one of the most disciplined AI transformations inside a real financial institution where compliance, auditabilit...
Artificial Analysis: The Independent LLM Analysis House — with George Cameron and Micah Hill-Smith
don’t miss George’s AIE talk: https://www.youtube.com/watch?v=sRpqPgKeXNk —- From launching a side project in a Sydney basement to becoming the *independent gold standard for AI benchmarking*—trust...
[State of Research Funding] Beyond NSF, Slingshots, Open Frontiers — Andy Konwinski, Laude Institute
From co-founding *Databricks* and *Perplexity* to launching the *Laude Institute*—a dual venture fund and nonprofit designed to turbocharge the path from *research breakthrough to breakout company*...
[State of Code Evals] After SWE-bench, Code Clash & SOTA Coding Benchmarks recap — John Yang
From creating *SWE-bench* in a Princeton basement to shipping *CodeClash,* *SWE-bench Multimodal,* and *SWE-bench Multilingual,* *John Yang* has spent the last year and a half watching his benchmar...
[State of MechInterp] SAEs in Production, Circuit Tracing, AI4Science, "Pragmatic" Interp — Goodfire
From PhD research on grounding and language models to shipping interpretability tools in production at *Goodfire,* *Jack Merullo* and *Mark Bissell* are building the infrastructure to crack open th...
[State of AI Papers 2025] Fixing Research with Social Signals, OCR & Implementation — Team AlphaXiv
From late-night dorm-room hacking sessions at Stanford to building the most-used platform for navigating AI research, the founders of *AlphaXiv* have spent the last few years watching the archive f...
[NeurIPS Best Paper] 1000 Layer Networks for Self-Supervised RL — Kevin Wang et al, Princeton
From undergraduate research seminars at Princeton to winning *Best Paper award at NeurIPS 2025,* *Kevin Wang, Ishaan Javali, Michał Bortkiewicz, Tomasz Trzcinski, Benjamin Eysenbach* defied convent...
[State of Context Engineering] Agentic RAG, Context Rot, MCP, Subagents — Nina Lopatina, Contextual
From neuroscience PhD research on reward learning and decision making to building the infrastructure for *context engineering at scale,* *Nina Lopatina* has spent the last year watching a brand-new...
[State of Evals] LMArena's $1.7B Vision — Anastasios Angelopoulos, LMArena
_We are reupping this episode after LMArena announced their fresh Series A (_https://www.theinformation.com/articles/ai-evaluation-startup-lmarena-valued-1-7-billion-new-funding-round?rc=luxwz4_), ...
[State of Post-Training] From GPT-4.1 to 5.1: RLVR, Agent & Token Efficiency — Josh McGrath, OpenAI
From pre-training data curation to shipping *GPT-4o,* *o1,* *o3,* and now *GPT-5 thinking* and the *shopping model,* *Josh McGrath* has lived through the full arc of OpenAI's post-training evolutio...
[State of RL/Reasoning] IMO/IOI Gold, OpenAI o3/GPT-5, and Cursor Composer — Ashvin Nair, Cursor
From Berkeley robotics and OpenAI's 2017 Dota-era internship to shipping RL breakthroughs on GPT-4o, o1, and o3, and now leading model development at *Cursor,* *Ashvin Nair* has done it all. We cau...
[State of AI Startups] Memory/Learning, RL Envs & DBT-Fivetran — Sarah Catanzaro, Amplify
From investing through the modern data stack era (DBT, Fivetran, and the analytics explosion) to now investing at the frontier of AI infrastructure and applications at *Amplify Partners,* *Sarah Ca...
One Year of MCP — with David Soria Parria and AAIF leads from OpenAI, Goose, Linux Foundation
One year ago, Anthropic launched the *Model Context Protocol (MCP)*—a simple, open standard to connect AI applications to the data and tools they need. Today, MCP has exploded from a local-only exp...
Steve Yegge's Vibe Coding Manifesto: Why Claude Code Isn't It & What Comes After the IDE
Note: Steve and Gene’s talk on Vibe Coding and the post IDE world was one of the top talks of AIE CODE: https://www.youtube.com/watch?v=7Dtu2bilcFs&t=1019s&pp=0gcJCU0KAYcqIYzv From building legend...
⚡️GPT5-Codex-Max: Training Agents with Personality, Tools & Trust — Brian Fioca + Bill Chen, OpenAI
From the frontlines of OpenAI's Codex and GPT-5 training teams, *Bryan* and *Bill* are building the future of AI-powered coding—where agents don't just autocomplete, they architect, refactor, and s...
Neural Nets My AI Pill Moment with Chris Olah
Unlocking the Immune System A Deep Dive into Biology
IIT India's Elite Engineering Schools Explained
Future of Doctors AI's Role in Healthcare
SAM 3: The Eyes for AI — Nikhila & Pengchuan (Meta Superintelligence), ft. Joseph Nelson (Roboflow)
_as with all demo-heavy and especially vision AI podcasts, we encourage watching along on our YouTube (and tossing us an upvote/subscribe if you like!)_ From SAM 1's 11-million-image data engine to...
Supporting Python in AI Cloud
⚡️Jailbreaking AGI: Pliny the Liberator & John V on Red Teaming, BT6, and the Future of AI Security
*Note: this is Pliny and John’s first major podcast. Voices have been changed for opsec.* From jailbreaking every frontier model and turning down Anthropic's Constitutional AI challenge to leading ...
Agent Driven Anomaly Detection
Figma's LLM Design Bridge
AI to AE's: Grit, Glean, and Kleiner Perkins' next Enterprise AI hit — Joubin Mirzadegan, Roadrunner
Glean started as a Kleiner Perkins incubation and is now a $7B, $200m ARR Enterprise AI leader. Now KP has tapped its own podcaster to lead it’s next big swing. From building go-to-market the hard...
Vercel's Open Source Business Model
The Future of Email: Superhuman CTO on Your Inbox As the Real AI Agent (Not ChatGPT) — Loïc Houssier
From applied cryptography and offensive security in France’s defense industry to optimizing nuclear submarine workflows, then selling his e-signature startup to Docusign (https://www.docusign.com/c...
AI's Impact on Developer Roles
Secure Apps Despite Incompetent Developers
The Great Evals Debate — Ankur Goyal & Malte Ubl
Ankur Goyal and Malte Ubl, co-founders of Braintrust and Vercel respectively, join swyx for a spirited debate about the role of evaluations (evals) in building AI coding agents. Sparked by a viral ...
World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI
From building Medal into a 12M-user game clipping platform with 3.8B highlight moments to turning down a reported $500M offer from OpenAI (https://www.theinformation.com/articles/openai-offered-pay...
NextGen Developer 4 Key Attributes
Vercel Supports Python on Resell
AI's Role in Future UI Design
Hiring Trends Senior vs Junior
Claude Code A General Purpose Agent SDK
AI's Exponential Growth & Speed Imperative
Investor Types - Early vs Late Stage
AI's Shift from Novelty to Utility
AI's Impact on Human Craft Value
Terminal Bench: A Pure Model Test
#TerminalBench #aiengineering #aidevelopment
After LLMs: Spatial Intelligence and World Models — Fei-Fei Li & Justin Johnson, World Labs
Fei-Fei Li and Justin Johnson are cofounders of World Labs, who have recently launched Marble (https://marble.worldlabs.ai/), a new kind of generative “world model” that can create editable 3D envi...
Barak Lenz: Interviewing for Problem Solvers
#Jamba #aiengineering #aidevelopment
Hybrid LLMs: The Future of Attention
#Jamba #aiengineering #aidevelopment
⚡️ Building the AI Hardware Engineer with Matthias Wagner, Co-founder of Flux
In this episode of Latent Space, Matthias Wagner, CEO & co-founder of Flux, reveals how they're revolutionizing hardware design with AI agents that can transform product briefs into manufacturable ...
⚡️ 10x AI Engineers with $1m Salaries — Alex Lieberman & Arman Hezarkhani, Tenex
*Alex Lieberman* and *Arman Hezarkani,* co-founders of Tenex, reveal how they're revolutionizing software consulting by compensating AI engineers for output rather than hours—enabling some engineer...
Long Context on Edge Devices
#Jamba #aiengineering #aidevelopment