AI Engineer - Videos
Back to ChannelAI Engineer Paris 2025 (Day 2)
Full schedule at https://www.ai.engineer/paris#schedule - Emil Eifrem, Co-Founder and CEO, Neo4j, “The State of^H^Hin AI Engineering” - Tushar Jain, President, Product & Engineering, Docker, “Demo...
Opening Keynotes - AIE Paris 2025 (Day 1)
The opening welcome reception is all about the hallway track and the expo -- meeting and mingling with other founders and engineers who are (mostly) based in Europe. However, for those who can't ma...
Rishabh Garg, Tesla Optimus — Challenges in High Performance Robotics Systems
A robot's behavior is influenced by the control policy, the software configuration, and electrical characteristics of the communication protocol. When unexpected behaviors arise, it is not straigh...
Building an Agentic Platform — Ben Kus, CTO Box
Explore the technical evolution of metadata extraction at Box and how it shaped the foundation of our AI platform. We’ll walk through our transition to an agentic-first design—why it was necessary,...
Five hard earned lessons about Evals — Ankur Goyal, Braintrust
The main thesis of the video is that building successful AI applications requires a sophisticated engineering approach that goes beyond simply writing good prompts. The speaker argues for the impor...
Perceptual Evaluations: Evals for Aesthetics — Diego Rodriguez, Krea.ai
Special session with KREA.ai's cofounder Diego Rodriguez on how evals for aesthetics and image/generative media work — the hardest kinds of evals. linkedin.com/in/asciidiego/ Timestamps 00:15 I...
How BlackRock Builds Custom Knowledge Apps at Scale — Vaibhav Page & Infant Vasanth, BlackRock
Investment Operations teams are the backbone of asset and investment management firms. Their day-to-day work not only enables portfolio managers to respond swiftly to market events but also ensures...
Form factors for your new AI coworkers — Craig Wattrus, Flatfile
Designing user experiences for AI means moving beyond traditional interfaces. Designers are grappling with how to create intuitive and effective interactions for these new AI capabilities, while g...
Fuzzing in the GenAI Era — Leonard Tang, Haize Labs
"Evaluation" is one of those concepts that every AI practitioner vaguely knows is important, but few practitioners truly understand. Is "eval" the dataset for measuring the quality of your AI syste...
Multi Agent AI and Network Knowledge Graphs for Change — Ola Mabadeje, Cisco
Traditional ticketing and testing workflows for change management and network operations often operate independently and lack critical real-world context and adaptive decision making capabilities. ...
Wisdom-Driven Knowledge Augmented Generation at Scale - Chin Keong Lam, Patho AI
The main thesis of the video is that by using a Wisdom-Driven Knowledge Graph, we can significantly enhance the quantitative analysis capabilities of Knowledge-Augmented Generation (KAG) systems. T...
The Next Unicorns: 7 Top AI startups from the HF0 Residency
HF0's Demo Days are usually hilariously oversubscribed and have never before been aired publicly. For the first time, they are joining the AIE stage to pitch AI Engineers. https://www.hf0.com/ Ti...
#define AI Engineer - Greg Brockman, OpenAI (ft. Jensen Huang)
Greg Brockman's career and advice for AI Engineers Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our newsletter here: htt...
The Future of Evals - Ankur Goyal, Braintrust
About Ankur Ankur Goyal is the founder & CEO of Braintrust—the developer platform that companies like Zapier, Notion, Instacart, Airtable, and more use to evaluate, log, and ship reliable AI produc...
Designing AI-Intensive Applications - swyx
Whether you call it a workflow or an agent, AI engineered applications are seeing user-input:LLM-call ratios go from 1:1 (ChatGPT) to 1:100 (Deep Research, Codex) and even 0:n (Ambient/Proactive ag...
How to look at your data — Jeff Huber (Chroma) + Jason Liu (567)
By the end of this talk, you'll understand what it takes to apply clustering techniques and data analysis to understand what is the valuable work that your AI application is doing through analyzing...
On Engineering AI Systems that Endure The Bitter Lesson - Omar Khattab, DSPy & Databricks
Will discuss the principles for building AI software that underpin DSPy, highlighting the differences between conventional prompting (or finetuning/RL) versus the design and programming of truly mo...
Evals Are Not Unit Tests — Ido Pesok, Vercel v0
How to think about evaluating a non-deterministic system — and how to actually succeed at it. About Ido Pesok Ido Pesok is an engineer and researcher at Vercel, working on the AI behind v0 and foc...
2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI
AI is getting deployed without guardrails, without governance, without due diligence. Surely this is the year we’ll see a Fortune 500 CEO fired because of a preventable AI incident. Surely this i...
Vibe Coding with Confidence — Itamar Friedman, Qodo
Everyone wants to do Vibe Code, even large Enterprises. But how can we ensure that the generated code is well-grounded with the dev team's code and software development standards? In this talk, Ita...
AI Automation that actually works: $100M, messy data, zero surprises - Tanmai Gopal, Hasura/PromptQL
We will review the different kinds of automation use-cases, and the approach we used, that will drive over a $100M of expected annual impact by deploying AI for business critical initiatives. We w...
Full Workshop: Realtime Voice AI — Mark Backman, Daily
Voice AI agents today can conduct natural, human-like conversations and perform a wide variety of tasks: customer support, lead qualification, healthcare patient intake, market research, and more. ...
Vision AI in 2025 — Peter Robicheaux, Roboflow
Attendee-Only and Attendee-Led 10min lightning talks: see https://crowdcomms.com/aiengineer25/qanda/41445 Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming...
Practical tactics to build reliable AI apps — Dmitry Kuchin, Multinear
[last round of Attendee-Led 10min lightning talks] Practical tactics to build reliable AI apps. Reverse engineering real-world evals with o3. Nobody does it this way. Companies pay me $500/h for th...
How to Improve your Vibe Coding — Ian Butler
[last round of Attendee-Led 10min lightning talks] Are your vibes immaculate? - Vibe coding is the new hotness but everyone has a story of AI making really dumb choices. Let's talk about how you ca...
Vibes won't cut it — Chris Kelly, Augment Code
What's the role of vibe coding in a production-grade applications? Join Augment Code's Chris Kelly as he talks about the role of context in software engineering, not code. About Chris Kelly Chris ...
Real World Development with GitHub Copilot and VS Code — Harald Kirschner, Christopher Harrison
Join us to see how VS Code and GitHub Copilot's expanding suite of AI features can match or even surpasses the benefits of other popular AI developer tools. We'll focus on practical scenarios to e...
Building Agents at Cloud Scale — Antje Barth, AWS
Let's explore practical strategies for building and scaling agents in production. Discover how to move from local MCP implementations to cloud-scale architectures and how engineering teams lever...
State of Startups and AI 2025 - Sarah Guo, Conviction
Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our newsletter here: https://www.ai.engineer/newsletter
Useful General Intelligence — Danielle Perszyk, Amazon AGI
We’re all hearing that AI agents will enable AGI, but they can’t yet reliably perform even basic computer tasks. It turns out that getting AI to click, type, and scroll is more challenging than get...
The 2025 AI Engineering Report — Barr Yaron, Amplify
Come hear the results of the 2025 State of AI Engineering: https://www.amplifypartners.com/blog-posts/the-2025-ai-engineering-report About Barr Yaon Barr is a data scientist turned investment part...
Agents vs Workflows: Why Not Both? — Sam Bhagwat, Mastra.ai
One current hot debate is should you make your top-level abstraction a ReAct type agent running in a loop? or should you make it a structured workflow graph? OpenAI is launching their new framewor...
Why We Don’t Need More Data Centers - Dr. Jasper Zhang, Hyperbolic
AI infrastructure today is caught in an endless cycle: build more data centers, deploy more GPUs, repeat. But this approach is fundamentally flawed—expensive, inefficient, and environmentally unsu...
Infrastructure for the Singularity — Jesse Han, Morph
We're at an inflection point where AI agents are transitioning from experimental tools to practical coworkers. This new world will demand new infrastructure for RL training, test-time scaling, and ...
Hacking the Inference Pareto Frontier - Kyle Kranen, NVIDIA
Your model works! It aces the evals! It even passes the vibe check! All that’s required is inference, right? Oops, you’ve just stepped into a minefield: -Not low-latency enough? Choppy experience....
Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily
Voice AI agents today can conduct natural, human-like conversations and perform a wide variety of tasks: customer support, lead qualification, healthcare patient intake, market research, and more. ...
[Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs
In this workshop you will learn how to build multilingual Conversational AI agents that can automatically detect your user's spoken language and can seamlessly switch to their preferred language. ...
From Self-driving to Autonomous Voice Agents — Brooke Hopkins, Coval
The reliability challenges facing voice & chat AI deployment today mirror those that the autonomous vehicle industry confronted years ago. This talk explores how evaluation methodologies developed ...
Your realtime AI is ngmi — Sean DuBois (OpenAI), Kwindla Kramer (Daily)
Sean DuBois of OpenAI and Pion, and Kwindla Hultman Kramer of Daily and Pipecat, will talk about why you have to design realtime AI systems from the network layer up. Most people who build realtim...
Why ChatGPT Keeps Interrupting You — Dr. Tom Shapland, LiveKit
ChatGPT Advanced Voice Mode isn’t interrupting just you. Interruptions, and turn-taking in general, are unsolved problems for all Voice AI agents. Nobody likes being cut short – and people have muc...
Serving Voice AI at $1/hr: Open-source, LoRAs, Latency, Load Balancing - Neil Dwyer, Gabber
This is a talk that goes over our experience deploying Orpheus (Emotive, Realtime TTS) to production. It will cover topics: - Latency and optimizations - High fidelity voice clones w/ examples - L...
How to defend your sites from AI bots — David Mytton, Arcjet
Constantly seeing CAPTCHAs? It used to be easy to detect the humans from the droids, but what else can we do when synthetic clients make up nearly half of all web requests. Rotating IPs, spoofed br...
The Unofficial Guide to Apple’s Private Cloud Compute - Jmo, CONFSEC
In October 2024, Apple released a new private AI technology onto millions of devices called “Private Cloud Compute”. It brings the same level of privacy and security a local device offers but on an...
How to Secure Agents using OAuth — Jared Hanson (Keycard, Passport.js)
We all know sharing passwords is bad (unless you want free TV), so why are we sharing API keys with AI? We shouldn't, and that’s why we need to talk about OAuth. In this talk, we will give a brie...
How we hacked YC Spring 2025 batch’s AI agents — Rene Brandel, Casco
We hacked 7 of the16 publicly-accessible YC X25 AI agents. This allowed us to leak user data, execute code remotely, and take over databases. All within 30 minutes each. In this session, we'll walk...
OpenAI on Securing Code-Executing AI Agents — Fouad Matin (Codex, Agent Robustness)
Code is the lingua franca for both software engineers and highly capable AI models. As we give agents the ability to build, test, and run code that they generate, the command line becomes their can...
Evaluating AI Search: A Practical Framework for Augmented AI Systems — Quotient AI + Tavily
AI search is becoming the front door to information, whether through Retrieval-Augmented Generation (RAG), Search-Augmented Generation (SAG), or custom agents that synthesize answers on top of inde...
Scaling Enterprise-Grade RAG: Lessons from Legal Frontier - Calvin Qi (Harvey), Chang She (Lance)
In domains like law, compliance, and tax, building enterprise-grade RAG means very large scale, spikey workloads, a focus on accuracy, and non-negotiable privacy. In this talk, we'll share war stor...
Building Alice’s Brain: an AI Sales Rep that Learns Like a Human - Sherwood & Satwik, 11x
AI agents are becoming essential tools for teams of all sizes and industries - but training them to become experts in your product, business, and customerbase remains a challenge. What if onboardi...
Layering every technique in RAG, one query at a time - David Karam, Pi Labs (fmr. Google Search)
Start with the simplest Search - in-memory embeddings with relevance ranking. End with the most complex planet-scale Search - 70+ corpus mix of token, embeddings, and knowledge graphs, all jointly ...