AI Engineer - Videos

Back to Channel

Z.ai GLM 4.6: What We Learned From 100 Million Open Source Downloads — Yuxuan Zhang, Z.ai

GLM 4.6 is the only open-source model currently tied for #1 on the LMSYS Chatbot Arena, standing shoulder-to-shoulder with GPT-4o and Claude 3.5 Sonnet. In this talk, Zhang Yuxuan from zAI breaks d...

5,662 views • 112 likes • 4 comments • November 22, 2025

AIE CODE Day 2: ft Google Deepmind, Anthropic, Cursor, Netflix, Cline, OpenAI, Meta, and METR

NOVEMBER 21 - all times in EST * 9:06 am – Opening Remarks Swyx | Organizer, AI Engineer * 9:11 am – Stop Building Agents Barry Zhang | AI Engineer, Anthropic Mahesh Murag | AI Engineer, An...

62,243 views • 651 likes • 22 comments • November 21, 2025

AIE CODE 2025: AI Leadership ft Anthropic, OpenAI, McKinsey, Bloomberg, Google Deepmind, and Tenex

Alex Lieberman – Co-founder, @MorningBrew & @tenex_labs – Your Leadership Track MC! Kath Korevec – Engineering, @GoogleLabs – Proactive Agents Katelyn Lesse – Head of Engineering, Claude Developer ...

26,264 views • 420 likes • 16 comments • November 20, 2025

AI Engineer Paris 2025 (Day 2)

Full schedule at https://www.ai.engineer/paris#schedule - Emil Eifrem, Co-Founder and CEO, Neo4j, “The State of^H^Hin AI Engineering” - Tushar Jain, President, Product & Engineering, Docker, “Demo...

12,434 views • 189 likes • 10 comments • September 24, 2025

Opening Keynotes - AIE Paris 2025 (Day 1)

The opening welcome reception is all about the hallway track and the expo -- meeting and mingling with other founders and engineers who are (mostly) based in Europe. However, for those who can't ma...

6,750 views • 132 likes • 4 comments • September 24, 2025

Rishabh Garg, Tesla Optimus — Challenges in High Performance Robotics Systems

A robot's behavior is influenced by the control policy, the software configuration, and electrical characteristics of the communication protocol. When unexpected behaviors arise, it is not straigh...

8,012 views • 180 likes • 11 comments • August 25, 2025

Building an Agentic Platform — Ben Kus, CTO Box

Explore the technical evolution of metadata extraction at Box and how it shaped the foundation of our AI platform. We’ll walk through our transition to an agentic-first design—why it was necessary,...

26,323 views • 438 likes • 9 comments • August 24, 2025

Five hard earned lessons about Evals — Ankur Goyal, Braintrust

The main thesis of the video is that building successful AI applications requires a sophisticated engineering approach that goes beyond simply writing good prompts. The speaker argues for the impor...

17,778 views • 367 likes • 3 comments • August 23, 2025

Perceptual Evaluations: Evals for Aesthetics — Diego Rodriguez, Krea.ai

Special session with KREA.ai's cofounder Diego Rodriguez on how evals for aesthetics and image/generative media work — the hardest kinds of evals. linkedin.com/in/asciidiego/ Timestamps 00:15 I...

2,867 views • 50 likes • 5 comments • August 23, 2025

How BlackRock Builds Custom Knowledge Apps at Scale — Vaibhav Page & Infant Vasanth, BlackRock

Investment Operations teams are the backbone of asset and investment management firms. Their day-to-day work not only enables portfolio managers to respond swiftly to market events but also ensures...

18,375 views • 240 likes • 21 comments • August 23, 2025

Form factors for your new AI coworkers — Craig Wattrus, Flatfile

Designing user experiences for AI means moving beyond traditional interfaces. Designers are grappling with how to create intuitive and effective interactions for these new AI capabilities, while g...

4,538 views • 75 likes • 4 comments • August 22, 2025

Fuzzing in the GenAI Era — Leonard Tang, Haize Labs

"Evaluation" is one of those concepts that every AI practitioner vaguely knows is important, but few practitioners truly understand. Is "eval" the dataset for measuring the quality of your AI syste...

3,924 views • 76 likes • 10 comments • August 22, 2025

Multi Agent AI and Network Knowledge Graphs for Change — Ola Mabadeje, Cisco

Traditional ticketing and testing workflows for change management and network operations often operate independently and lack critical real-world context and adaptive decision making capabilities. ...

7,925 views • 113 likes • 5 comments • August 22, 2025

Wisdom-Driven Knowledge Augmented Generation at Scale - Chin Keong Lam, Patho AI

The main thesis of the video is that by using a Wisdom-Driven Knowledge Graph, we can significantly enhance the quantitative analysis capabilities of Knowledge-Augmented Generation (KAG) systems. T...

3,212 views • 64 likes • 4 comments • August 22, 2025

The Next Unicorns: 7 Top AI startups from the HF0 Residency

HF0's Demo Days are usually hilariously oversubscribed and have never before been aired publicly. For the first time, they are joining the AIE stage to pitch AI Engineers. https://www.hf0.com/ Ti...

8,758 views • 183 likes • 11 comments • August 21, 2025

#define AI Engineer - Greg Brockman, OpenAI (ft. Jensen Huang)

Greg Brockman's career and advice for AI Engineers Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our newsletter here: htt...

60,562 views • 1,165 likes • 60 comments • August 10, 2025

The Future of Evals - Ankur Goyal, Braintrust

About Ankur Ankur Goyal is the founder & CEO of Braintrust—the developer platform that companies like Zapier, Notion, Instacart, Airtable, and more use to evaluate, log, and ship reliable AI produc...

8,373 views • 93 likes • 10 comments • August 09, 2025

Designing AI-Intensive Applications - swyx

Whether you call it a workflow or an agent, AI engineered applications are seeing user-input:LLM-call ratios go from 1:1 (ChatGPT) to 1:100 (Deep Research, Codex) and even 0:n (Ambient/Proactive ag...

27,309 views • 432 likes • 14 comments • August 09, 2025

How to look at your data — Jeff Huber (Chroma) + Jason Liu (567)

By the end of this talk, you'll understand what it takes to apply clustering techniques and data analysis to understand what is the valuable work that your AI application is doing through analyzing...

9,321 views • 203 likes • 4 comments • August 06, 2025

On Engineering AI Systems that Endure The Bitter Lesson - Omar Khattab, DSPy & Databricks

Will discuss the principles for building AI software that underpin DSPy, highlighting the differences between conventional prompting (or finetuning/RL) versus the design and programming of truly mo...

18,246 views • 415 likes • 9 comments • August 06, 2025

Evals Are Not Unit Tests — Ido Pesok, Vercel v0

How to think about evaluating a non-deterministic system — and how to actually succeed at it. About Ido Pesok Ido Pesok is an engineer and researcher at Vercel, working on the AI behind v0 and foc...

13,889 views • 282 likes • 16 comments • August 06, 2025

2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI

AI is getting deployed without guardrails, without governance, without due diligence. Surely this is the year we’ll see a Fortune 500 CEO fired because of a preventable AI incident. Surely this i...

4,954 views • 72 likes • 3 comments • August 06, 2025

Vibe Coding with Confidence — Itamar Friedman, Qodo

Everyone wants to do Vibe Code, even large Enterprises. But how can we ensure that the generated code is well-grounded with the dev team's code and software development standards? In this talk, Ita...

6,189 views • 125 likes • 6 comments • August 06, 2025

AI Automation that actually works: $100M, messy data, zero surprises - Tanmai Gopal, Hasura/PromptQL

We will review the different kinds of automation use-cases, and the approach we used, that will drive over a $100M of expected annual impact by deploying AI for business critical initiatives. We w...

2,890 views • 45 likes • 2 comments • August 06, 2025

Full Workshop: Realtime Voice AI — Mark Backman, Daily

Voice AI agents today can conduct natural, human-like conversations and perform a wide variety of tasks: customer support, lead qualification, healthcare patient intake, market research, and more. ...

14,107 views • 319 likes • 11 comments • August 03, 2025

Vision AI in 2025 — Peter Robicheaux, Roboflow

Attendee-Only and Attendee-Led 10min lightning talks: see https://crowdcomms.com/aiengineer25/qanda/41445 Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming...

11,866 views • 303 likes • 17 comments • August 03, 2025

Practical tactics to build reliable AI apps — Dmitry Kuchin, Multinear

[last round of Attendee-Led 10min lightning talks] Practical tactics to build reliable AI apps. Reverse engineering real-world evals with o3. Nobody does it this way. Companies pay me $500/h for th...

5,431 views • 123 likes • 16 comments • August 03, 2025

How to Improve your Vibe Coding — Ian Butler

[last round of Attendee-Led 10min lightning talks] Are your vibes immaculate? - Vibe coding is the new hotness but everyone has a story of AI making really dumb choices. Let's talk about how you ca...

2,815 views • 43 likes • 6 comments • August 03, 2025

Vibes won't cut it — Chris Kelly, Augment Code

What's the role of vibe coding in a production-grade applications? Join Augment Code's Chris Kelly as he talks about the role of context in software engineering, not code. About Chris Kelly Chris ...

86,777 views • 2,388 likes • 197 comments • August 03, 2025

Real World Development with GitHub Copilot and VS Code — Harald Kirschner, Christopher Harrison

Join us to see how VS Code and GitHub Copilot's expanding suite of AI features can match or even surpasses the benefits of other popular AI developer tools. We'll focus on practical scenarios to e...

15,642 views • 244 likes • 7 comments • August 03, 2025

Building Agents at Cloud Scale — Antje Barth, AWS

Let's explore practical strategies for building and scaling agents in production. Discover how to move from local MCP implementations to cloud-scale architectures and how engineering teams lever...

5,298 views • 123 likes • 9 comments • August 02, 2025

State of Startups and AI 2025 - Sarah Guo, Conviction

Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our newsletter here: https://www.ai.engineer/newsletter

58,542 views • 1,442 likes • 55 comments • August 02, 2025

Useful General Intelligence — Danielle Perszyk, Amazon AGI

We’re all hearing that AI agents will enable AGI, but they can’t yet reliably perform even basic computer tasks. It turns out that getting AI to click, type, and scroll is more challenging than get...

8,121 views • 196 likes • 5 comments • August 02, 2025

The 2025 AI Engineering Report — Barr Yaron, Amplify

Come hear the results of the 2025 State of AI Engineering: https://www.amplifypartners.com/blog-posts/the-2025-ai-engineering-report About Barr Yaon Barr is a data scientist turned investment part...

7,958 views • 211 likes • 2 comments • August 01, 2025

Agents vs Workflows: Why Not Both? — Sam Bhagwat, Mastra.ai

One current hot debate is should you make your top-level abstraction a ReAct type agent running in a loop? or should you make it a structured workflow graph? OpenAI is launching their new framewor...

21,888 views • 410 likes • 34 comments • August 01, 2025

Why We Don’t Need More Data Centers - Dr. Jasper Zhang, Hyperbolic

AI infrastructure today is caught in an endless cycle: build more data centers, deploy more GPUs, repeat. But this approach is fundamentally flawed—expensive, inefficient, and environmentally unsu...

3,597 views • 64 likes • 7 comments • August 01, 2025

Infrastructure for the Singularity — Jesse Han, Morph

We're at an inflection point where AI agents are transitioning from experimental tools to practical coworkers. This new world will demand new infrastructure for RL training, test-time scaling, and ...

2,038 views • 52 likes • 6 comments • August 01, 2025

Hacking the Inference Pareto Frontier - Kyle Kranen, NVIDIA

Your model works! It aces the evals! It even passes the vibe check! All that’s required is inference, right? Oops, you’ve just stepped into a minefield: -Not low-latency enough? Choppy experience....

2,130 views • 46 likes • 3 comments • August 01, 2025

Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily

Voice AI agents today can conduct natural, human-like conversations and perform a wide variety of tasks: customer support, lead qualification, healthcare patient intake, market research, and more. ...

6,021 views • 149 likes • 5 comments • July 31, 2025

[Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs

In this workshop you will learn how to build multilingual Conversational AI agents that can automatically detect your user's spoken language and can seamlessly switch to their preferred language. ...

4,397 views • 94 likes • 1 comments • July 31, 2025

From Self-driving to Autonomous Voice Agents — Brooke Hopkins, Coval

The reliability challenges facing voice & chat AI deployment today mirror those that the autonomous vehicle industry confronted years ago. This talk explores how evaluation methodologies developed ...

1,748 views • 39 likes • 2 comments • July 31, 2025

Your realtime AI is ngmi — Sean DuBois (OpenAI), Kwindla Kramer (Daily)

Sean DuBois of OpenAI and Pion, and Kwindla Hultman Kramer of Daily and Pipecat, will talk about why you have to design realtime AI systems from the network layer up. Most people who build realtim...

2,205 views • 66 likes • 3 comments • July 31, 2025

Why ChatGPT Keeps Interrupting You — Dr. Tom Shapland, LiveKit

ChatGPT Advanced Voice Mode isn’t interrupting just you. Interruptions, and turn-taking in general, are unsolved problems for all Voice AI agents. Nobody likes being cut short – and people have muc...

3,597 views • 102 likes • 9 comments • July 31, 2025

Serving Voice AI at $1/hr: Open-source, LoRAs, Latency, Load Balancing - Neil Dwyer, Gabber

This is a talk that goes over our experience deploying Orpheus (Emotive, Realtime TTS) to production. It will cover topics: - Latency and optimizations - High fidelity voice clones w/ examples - L...

6,757 views • 189 likes • 6 comments • July 31, 2025

How to defend your sites from AI bots — David Mytton, Arcjet

Constantly seeing CAPTCHAs? It used to be easy to detect the humans from the droids, but what else can we do when synthetic clients make up nearly half of all web requests. Rotating IPs, spoofed br...

2,020 views • 56 likes • 6 comments • July 30, 2025

The Unofficial Guide to Apple’s Private Cloud Compute - Jmo, CONFSEC

In October 2024, Apple released a new private AI technology onto millions of devices called “Private Cloud Compute”. It brings the same level of privacy and security a local device offers but on an...

3,209 views • 57 likes • 4 comments • July 30, 2025

How to Secure Agents using OAuth — Jared Hanson (Keycard, Passport.js)

We all know sharing passwords is bad (unless you want free TV), so why are we sharing API keys with AI? We shouldn't, and that’s why we need to talk about OAuth. In this talk, we will give a brie...

7,864 views • 185 likes • 5 comments • July 30, 2025

How we hacked YC Spring 2025 batch’s AI agents — Rene Brandel, Casco

We hacked 7 of the16 publicly-accessible YC X25 AI agents. This allowed us to leak user data, execute code remotely, and take over databases. All within 30 minutes each. In this session, we'll walk...

2,491 views • 83 likes • 2 comments • July 30, 2025

OpenAI on Securing Code-Executing AI Agents — Fouad Matin (Codex, Agent Robustness)

Code is the lingua franca for both software engineers and highly capable AI models. As we give agents the ability to build, test, and run code that they generate, the command line becomes their can...

2,768 views • 72 likes • 1 comments • July 30, 2025

Evaluating AI Search: A Practical Framework for Augmented AI Systems — Quotient AI + Tavily

AI search is becoming the front door to information, whether through Retrieval-Augmented Generation (RAG), Search-Augmented Generation (SAG), or custom agents that synthesize answers on top of inde...

3,091 views • 58 likes • 3 comments • July 29, 2025