AI Engineer - Videos
Back to ChannelHacking Subagents Into Codex CLI — Brian John, Betterup
Subagents are amazing tools for managing context, among other things. But Codex CLI doesn't have them. Let's change that! Brian John is a Principal Full Stack Engineer with over a decade of experi...
Context Engineering: Connecting the Dots with Graphs — Stephen Chin, Neo4j
AI systems need more than intelligence; they need context. Without it, even the most advanced models can misinterpret information, lose track of details, or arrive at conclusions that don’t hold up...
Developing Taste in Coding Agents: Applied Meta Neuro-Symbolic RL — Ahmad Awais, CommandCode
Your coding agent writes code like an LLM bot. CommandCode writes code like me. Every developer has a coding agent now. What if your coding agent actually had taste? What if it understood not just...
Infra that fixes itself, thanks to coding agents — Mahmoud Abdelwahab, Railway
This talk shows how we built Railway Autofix, a plug-in template you can drop into any Railway project to monitor your infrastructure, and open PRs with fixes when issues are detected. We use OpenC...
AI changes *Nothing* — Dax Raad, OpenCode
Everyone says AI changes everything. Dax Raad argues that when it comes to building a winning product, AI changes nothing. In this contrarian talk, Dax breaks down why the fundamental challenges o...
Z.ai GLM 4.6: What We Learned From 100 Million Open Source Downloads — Yuxuan Zhang, Z.ai
GLM 4.6 is the only open-source model currently tied for #1 on the LMSYS Chatbot Arena, standing shoulder-to-shoulder with GPT-4o and Claude 3.5 Sonnet. In this talk, Zhang Yuxuan from zAI breaks d...
AIE CODE Day 2: ft Google Deepmind, Anthropic, Cursor, Netflix, Cline, OpenAI, Meta, and METR
NOVEMBER 21 - all times in EST * 9:06 am – Opening Remarks Swyx | Organizer, AI Engineer * 9:11 am – Stop Building Agents Barry Zhang | AI Engineer, Anthropic Mahesh Murag | AI Engineer, An...
AIE CODE 2025: AI Leadership ft Anthropic, OpenAI, McKinsey, Bloomberg, Google Deepmind, and Tenex
Alex Lieberman – Co-founder, @MorningBrew & @tenex_labs – Your Leadership Track MC! Kath Korevec – Engineering, @GoogleLabs – Proactive Agents Katelyn Lesse – Head of Engineering, Claude Developer ...
AI Engineer Paris 2025 (Day 2)
Full schedule at https://www.ai.engineer/paris#schedule - Emil Eifrem, Co-Founder and CEO, Neo4j, “The State of^H^Hin AI Engineering” - Tushar Jain, President, Product & Engineering, Docker, “Demo...
Opening Keynotes - AIE Paris 2025 (Day 1)
The opening welcome reception is all about the hallway track and the expo -- meeting and mingling with other founders and engineers who are (mostly) based in Europe. However, for those who can't ma...
Rishabh Garg, Tesla Optimus — Challenges in High Performance Robotics Systems
A robot's behavior is influenced by the control policy, the software configuration, and electrical characteristics of the communication protocol. When unexpected behaviors arise, it is not straigh...
Building an Agentic Platform — Ben Kus, CTO Box
Explore the technical evolution of metadata extraction at Box and how it shaped the foundation of our AI platform. We’ll walk through our transition to an agentic-first design—why it was necessary,...
Five hard earned lessons about Evals — Ankur Goyal, Braintrust
The main thesis of the video is that building successful AI applications requires a sophisticated engineering approach that goes beyond simply writing good prompts. The speaker argues for the impor...
Perceptual Evaluations: Evals for Aesthetics — Diego Rodriguez, Krea.ai
Special session with KREA.ai's cofounder Diego Rodriguez on how evals for aesthetics and image/generative media work — the hardest kinds of evals. linkedin.com/in/asciidiego/ Timestamps 00:15 I...
How BlackRock Builds Custom Knowledge Apps at Scale — Vaibhav Page & Infant Vasanth, BlackRock
Investment Operations teams are the backbone of asset and investment management firms. Their day-to-day work not only enables portfolio managers to respond swiftly to market events but also ensures...
Form factors for your new AI coworkers — Craig Wattrus, Flatfile
Designing user experiences for AI means moving beyond traditional interfaces. Designers are grappling with how to create intuitive and effective interactions for these new AI capabilities, while g...
Fuzzing in the GenAI Era — Leonard Tang, Haize Labs
"Evaluation" is one of those concepts that every AI practitioner vaguely knows is important, but few practitioners truly understand. Is "eval" the dataset for measuring the quality of your AI syste...
Multi Agent AI and Network Knowledge Graphs for Change — Ola Mabadeje, Cisco
Traditional ticketing and testing workflows for change management and network operations often operate independently and lack critical real-world context and adaptive decision making capabilities. ...
Wisdom-Driven Knowledge Augmented Generation at Scale - Chin Keong Lam, Patho AI
The main thesis of the video is that by using a Wisdom-Driven Knowledge Graph, we can significantly enhance the quantitative analysis capabilities of Knowledge-Augmented Generation (KAG) systems. T...
The Next Unicorns: 7 Top AI startups from the HF0 Residency
HF0's Demo Days are usually hilariously oversubscribed and have never before been aired publicly. For the first time, they are joining the AIE stage to pitch AI Engineers. https://www.hf0.com/ Ti...
#define AI Engineer - Greg Brockman, OpenAI (ft. Jensen Huang)
Greg Brockman's career and advice for AI Engineers Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our newsletter here: htt...
The Future of Evals - Ankur Goyal, Braintrust
About Ankur Ankur Goyal is the founder & CEO of Braintrust—the developer platform that companies like Zapier, Notion, Instacart, Airtable, and more use to evaluate, log, and ship reliable AI produc...
Designing AI-Intensive Applications - swyx
Whether you call it a workflow or an agent, AI engineered applications are seeing user-input:LLM-call ratios go from 1:1 (ChatGPT) to 1:100 (Deep Research, Codex) and even 0:n (Ambient/Proactive ag...
How to look at your data — Jeff Huber (Chroma) + Jason Liu (567)
By the end of this talk, you'll understand what it takes to apply clustering techniques and data analysis to understand what is the valuable work that your AI application is doing through analyzing...
On Engineering AI Systems that Endure The Bitter Lesson - Omar Khattab, DSPy & Databricks
Will discuss the principles for building AI software that underpin DSPy, highlighting the differences between conventional prompting (or finetuning/RL) versus the design and programming of truly mo...
Evals Are Not Unit Tests — Ido Pesok, Vercel v0
How to think about evaluating a non-deterministic system — and how to actually succeed at it. About Ido Pesok Ido Pesok is an engineer and researcher at Vercel, working on the AI behind v0 and foc...
2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI
AI is getting deployed without guardrails, without governance, without due diligence. Surely this is the year we’ll see a Fortune 500 CEO fired because of a preventable AI incident. Surely this i...
Vibe Coding with Confidence — Itamar Friedman, Qodo
Everyone wants to do Vibe Code, even large Enterprises. But how can we ensure that the generated code is well-grounded with the dev team's code and software development standards? In this talk, Ita...
AI Automation that actually works: $100M, messy data, zero surprises - Tanmai Gopal, Hasura/PromptQL
We will review the different kinds of automation use-cases, and the approach we used, that will drive over a $100M of expected annual impact by deploying AI for business critical initiatives. We w...
Full Workshop: Realtime Voice AI — Mark Backman, Daily
Voice AI agents today can conduct natural, human-like conversations and perform a wide variety of tasks: customer support, lead qualification, healthcare patient intake, market research, and more. ...
Vision AI in 2025 — Peter Robicheaux, Roboflow
Attendee-Only and Attendee-Led 10min lightning talks: see https://crowdcomms.com/aiengineer25/qanda/41445 Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming...
Practical tactics to build reliable AI apps — Dmitry Kuchin, Multinear
[last round of Attendee-Led 10min lightning talks] Practical tactics to build reliable AI apps. Reverse engineering real-world evals with o3. Nobody does it this way. Companies pay me $500/h for th...
How to Improve your Vibe Coding — Ian Butler
[last round of Attendee-Led 10min lightning talks] Are your vibes immaculate? - Vibe coding is the new hotness but everyone has a story of AI making really dumb choices. Let's talk about how you ca...
Vibes won't cut it — Chris Kelly, Augment Code
What's the role of vibe coding in a production-grade applications? Join Augment Code's Chris Kelly as he talks about the role of context in software engineering, not code. About Chris Kelly Chris ...
Real World Development with GitHub Copilot and VS Code — Harald Kirschner, Christopher Harrison
Join us to see how VS Code and GitHub Copilot's expanding suite of AI features can match or even surpasses the benefits of other popular AI developer tools. We'll focus on practical scenarios to e...
Building Agents at Cloud Scale — Antje Barth, AWS
Let's explore practical strategies for building and scaling agents in production. Discover how to move from local MCP implementations to cloud-scale architectures and how engineering teams lever...
State of Startups and AI 2025 - Sarah Guo, Conviction
Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our newsletter here: https://www.ai.engineer/newsletter
Useful General Intelligence — Danielle Perszyk, Amazon AGI
We’re all hearing that AI agents will enable AGI, but they can’t yet reliably perform even basic computer tasks. It turns out that getting AI to click, type, and scroll is more challenging than get...
The 2025 AI Engineering Report — Barr Yaron, Amplify
Come hear the results of the 2025 State of AI Engineering: https://www.amplifypartners.com/blog-posts/the-2025-ai-engineering-report About Barr Yaon Barr is a data scientist turned investment part...
Agents vs Workflows: Why Not Both? — Sam Bhagwat, Mastra.ai
One current hot debate is should you make your top-level abstraction a ReAct type agent running in a loop? or should you make it a structured workflow graph? OpenAI is launching their new framewor...
Why We Don’t Need More Data Centers - Dr. Jasper Zhang, Hyperbolic
AI infrastructure today is caught in an endless cycle: build more data centers, deploy more GPUs, repeat. But this approach is fundamentally flawed—expensive, inefficient, and environmentally unsu...
Infrastructure for the Singularity — Jesse Han, Morph
We're at an inflection point where AI agents are transitioning from experimental tools to practical coworkers. This new world will demand new infrastructure for RL training, test-time scaling, and ...
Hacking the Inference Pareto Frontier - Kyle Kranen, NVIDIA
Your model works! It aces the evals! It even passes the vibe check! All that’s required is inference, right? Oops, you’ve just stepped into a minefield: -Not low-latency enough? Choppy experience....
Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily
Voice AI agents today can conduct natural, human-like conversations and perform a wide variety of tasks: customer support, lead qualification, healthcare patient intake, market research, and more. ...
[Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs
In this workshop you will learn how to build multilingual Conversational AI agents that can automatically detect your user's spoken language and can seamlessly switch to their preferred language. ...
From Self-driving to Autonomous Voice Agents — Brooke Hopkins, Coval
The reliability challenges facing voice & chat AI deployment today mirror those that the autonomous vehicle industry confronted years ago. This talk explores how evaluation methodologies developed ...
Your realtime AI is ngmi — Sean DuBois (OpenAI), Kwindla Kramer (Daily)
Sean DuBois of OpenAI and Pion, and Kwindla Hultman Kramer of Daily and Pipecat, will talk about why you have to design realtime AI systems from the network layer up. Most people who build realtim...
Why ChatGPT Keeps Interrupting You — Dr. Tom Shapland, LiveKit
ChatGPT Advanced Voice Mode isn’t interrupting just you. Interruptions, and turn-taking in general, are unsolved problems for all Voice AI agents. Nobody likes being cut short – and people have muc...
Serving Voice AI at $1/hr: Open-source, LoRAs, Latency, Load Balancing - Neil Dwyer, Gabber
This is a talk that goes over our experience deploying Orpheus (Emotive, Realtime TTS) to production. It will cover topics: - Latency and optimizations - High fidelity voice clones w/ examples - L...
How to defend your sites from AI bots — David Mytton, Arcjet
Constantly seeing CAPTCHAs? It used to be easy to detect the humans from the droids, but what else can we do when synthetic clients make up nearly half of all web requests. Rotating IPs, spoofed br...