AI Engineer - Videos

Back to Channel

Practical tactics to build reliable AI apps — Dmitry Kuchin, Multinear

[last round of Attendee-Led 10min lightning talks] Practical tactics to build reliable AI apps. Reverse engineering real-world evals with o3. Nobody does it this way. Companies pay me $500/h for th...

5,374 views • 123 likes • 16 comments • August 03, 2025

How to Improve your Vibe Coding — Ian Butler

[last round of Attendee-Led 10min lightning talks] Are your vibes immaculate? - Vibe coding is the new hotness but everyone has a story of AI making really dumb choices. Let's talk about how you ca...

2,711 views • 44 likes • 6 comments • August 03, 2025

Vibes won't cut it — Chris Kelly, Augment Code

What's the role of vibe coding in a production-grade applications? Join Augment Code's Chris Kelly as he talks about the role of context in software engineering, not code. About Chris Kelly Chris ...

86,184 views • 2,378 likes • 200 comments • August 03, 2025

Real World Development with GitHub Copilot and VS Code — Harald Kirschner, Christopher Harrison

Join us to see how VS Code and GitHub Copilot's expanding suite of AI features can match or even surpasses the benefits of other popular AI developer tools. We'll focus on practical scenarios to e...

14,657 views • 236 likes • 7 comments • August 03, 2025

Building Agents at Cloud Scale — Antje Barth, AWS

Let's explore practical strategies for building and scaling agents in production. Discover how to move from local MCP implementations to cloud-scale architectures and how engineering teams lever...

5,117 views • 121 likes • 9 comments • August 02, 2025

State of Startups and AI 2025 - Sarah Guo, Conviction

Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our newsletter here: https://www.ai.engineer/newsletter

57,814 views • 1,442 likes • 55 comments • August 02, 2025

Useful General Intelligence — Danielle Perszyk, Amazon AGI

We’re all hearing that AI agents will enable AGI, but they can’t yet reliably perform even basic computer tasks. It turns out that getting AI to click, type, and scroll is more challenging than get...

7,784 views • 188 likes • 5 comments • August 02, 2025

The 2025 AI Engineering Report — Barr Yaron, Amplify

Come hear the results of the 2025 State of AI Engineering: https://www.amplifypartners.com/blog-posts/the-2025-ai-engineering-report About Barr Yaon Barr is a data scientist turned investment part...

7,845 views • 210 likes • 2 comments • August 01, 2025

Agents vs Workflows: Why Not Both? — Sam Bhagwat, Mastra.ai

One current hot debate is should you make your top-level abstraction a ReAct type agent running in a loop? or should you make it a structured workflow graph? OpenAI is launching their new framewor...

20,722 views • 380 likes • 34 comments • August 01, 2025

Why We Don’t Need More Data Centers - Dr. Jasper Zhang, Hyperbolic

AI infrastructure today is caught in an endless cycle: build more data centers, deploy more GPUs, repeat. But this approach is fundamentally flawed—expensive, inefficient, and environmentally unsu...

3,276 views • 58 likes • 7 comments • August 01, 2025

Infrastructure for the Singularity — Jesse Han, Morph

We're at an inflection point where AI agents are transitioning from experimental tools to practical coworkers. This new world will demand new infrastructure for RL training, test-time scaling, and ...

1,911 views • 50 likes • 6 comments • August 01, 2025

Hacking the Inference Pareto Frontier - Kyle Kranen, NVIDIA

Your model works! It aces the evals! It even passes the vibe check! All that’s required is inference, right? Oops, you’ve just stepped into a minefield: -Not low-latency enough? Choppy experience....

1,861 views • 41 likes • 3 comments • August 01, 2025

Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily

Voice AI agents today can conduct natural, human-like conversations and perform a wide variety of tasks: customer support, lead qualification, healthcare patient intake, market research, and more. ...

5,262 views • 132 likes • 5 comments • July 31, 2025

[Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs

In this workshop you will learn how to build multilingual Conversational AI agents that can automatically detect your user's spoken language and can seamlessly switch to their preferred language. ...

3,921 views • 90 likes • 1 comments • July 31, 2025

From Self-driving to Autonomous Voice Agents — Brooke Hopkins, Coval

The reliability challenges facing voice & chat AI deployment today mirror those that the autonomous vehicle industry confronted years ago. This talk explores how evaluation methodologies developed ...

1,638 views • 39 likes • 2 comments • July 31, 2025

Your realtime AI is ngmi — Sean DuBois (OpenAI), Kwindla Kramer (Daily)

Sean DuBois of OpenAI and Pion, and Kwindla Hultman Kramer of Daily and Pipecat, will talk about why you have to design realtime AI systems from the network layer up. Most people who build realtim...

2,131 views • 67 likes • 3 comments • July 31, 2025

Why ChatGPT Keeps Interrupting You — Dr. Tom Shapland, LiveKit

ChatGPT Advanced Voice Mode isn’t interrupting just you. Interruptions, and turn-taking in general, are unsolved problems for all Voice AI agents. Nobody likes being cut short – and people have muc...

3,332 views • 95 likes • 9 comments • July 31, 2025

Serving Voice AI at $1/hr: Open-source, LoRAs, Latency, Load Balancing - Neil Dwyer, Gabber

This is a talk that goes over our experience deploying Orpheus (Emotive, Realtime TTS) to production. It will cover topics: - Latency and optimizations - High fidelity voice clones w/ examples - L...

6,607 views • 186 likes • 6 comments • July 31, 2025

How to defend your sites from AI bots — David Mytton, Arcjet

Constantly seeing CAPTCHAs? It used to be easy to detect the humans from the droids, but what else can we do when synthetic clients make up nearly half of all web requests. Rotating IPs, spoofed br...

1,962 views • 57 likes • 6 comments • July 30, 2025

The Unofficial Guide to Apple’s Private Cloud Compute - Jmo, CONFSEC

In October 2024, Apple released a new private AI technology onto millions of devices called “Private Cloud Compute”. It brings the same level of privacy and security a local device offers but on an...

2,696 views • 50 likes • 4 comments • July 30, 2025

How to Secure Agents using OAuth — Jared Hanson (Keycard, Passport.js)

We all know sharing passwords is bad (unless you want free TV), so why are we sharing API keys with AI? We shouldn't, and that’s why we need to talk about OAuth. In this talk, we will give a brie...

6,940 views • 172 likes • 5 comments • July 30, 2025

How we hacked YC Spring 2025 batch’s AI agents — Rene Brandel, Casco

We hacked 7 of the16 publicly-accessible YC X25 AI agents. This allowed us to leak user data, execute code remotely, and take over databases. All within 30 minutes each. In this session, we'll walk...

2,381 views • 85 likes • 2 comments • July 30, 2025

OpenAI on Securing Code-Executing AI Agents — Fouad Matin (Codex, Agent Robustness)

Code is the lingua franca for both software engineers and highly capable AI models. As we give agents the ability to build, test, and run code that they generate, the command line becomes their can...

2,670 views • 71 likes • 1 comments • July 30, 2025

Evaluating AI Search: A Practical Framework for Augmented AI Systems — Quotient AI + Tavily

AI search is becoming the front door to information, whether through Retrieval-Augmented Generation (RAG), Search-Augmented Generation (SAG), or custom agents that synthesize answers on top of inde...

2,918 views • 58 likes • 3 comments • July 29, 2025

Scaling Enterprise-Grade RAG: Lessons from Legal Frontier - Calvin Qi (Harvey), Chang She (Lance)

In domains like law, compliance, and tax, building enterprise-grade RAG means very large scale, spikey workloads, a focus on accuracy, and non-negotiable privacy. In this talk, we'll share war stor...

4,947 views • 118 likes • 8 comments • July 29, 2025

Building Alice’s Brain: an AI Sales Rep that Learns Like a Human - Sherwood & Satwik, 11x

AI agents are becoming essential tools for teams of all sizes and industries - but training them to become experts in your product, business, and customerbase remains a challenge. What if onboardi...

6,131 views • 141 likes • 9 comments • July 29, 2025

Layering every technique in RAG, one query at a time - David Karam, Pi Labs (fmr. Google Search)

Start with the simplest Search - in-memory embeddings with relevance ranking. End with the most complex planet-scale Search - 70+ corpus mix of token, embeddings, and knowledge graphs, all jointly ...

15,800 views • 466 likes • 14 comments • July 29, 2025

Building a Smarter AI Agent with Neural RAG - Will Bryk, Exa.ai

RAG quality for AI agents is critical, and traditional keyword-based search engines consistently underperform in agentic or multi-step tasks, where semantic grounding and contextual nuance matter m...

18,624 views • 363 likes • 26 comments • July 29, 2025

[Full Workshop] Building Metrics that actually work — David Karam, Pi Labs (fmr Google Search)

One of the biggest challenges in building evals you can trust is building metrics that reliably measure goodness in your application; metrics that are highly accurate, rapid fast, and tunable to gr...

1,774 views • 37 likes • 1 comments • July 29, 2025

Make your LLM app a Domain Expert: How to Build an Expert System — Christopher Lovejoy, Anterior

Vertical AI is a multi-trillion-dollar opportunity. But you can't build a domain-expert application simply by grabbing the latest LLMs off-the-shelf: you need a system for codifying latent insights...

82,389 views • 1,762 likes • 39 comments • July 28, 2025

Shipping Products When You Don't Know What they Can Do — Ben Stein, Teammates

A customer recently asked me: “Hey, can I tag your AI agent in a Google Doc comment?” The honest answer: I have no idea! We never designed our agents to handle Google Doc comments, but we tried it...

1,642 views • 18 likes • 1 comments • July 28, 2025

Shipping something to someone always wins — Kenneth Auchenberg (ex. Stripe, VSCode)

Learnings from building products at Stripe and applying them in an AI native word. About Kenneth Auchenberg Partner at @alley_corp, investor focused on backing founders building for developers. P...

967 views • 23 likes • 1 comments • July 28, 2025

Why your product needs an AI product manager, and why it should be you — James Lowe, i.AI

So you've built another cool demo. Now what? You have hype, but not impact. You have kudos but no users. Ultimately you have a demo, but not a product. The unique uncertainty of AI technology dema...

5,986 views • 126 likes • 5 comments • July 28, 2025

Everything is ugly, so go build something that isn't — Raiza Martin, Huxe (ex NotebookLM)

We're in an awkward adolescent phase of AI product (design). But what if this chaotic moment is actually our greatest opportunity? Enter the rebuilding revolution. In this talk, we'll explore how ...

4,772 views • 104 likes • 11 comments • July 28, 2025

Building the platform for agent coordination — Tom Moor, Linear

Learn how we're evolving Linear into an operating system for engineering teams to ship product with agents as a first class citizen. About Tom Moor Tom Moor is the Head of Engineering at Linear, a...

4,923 views • 90 likes • 8 comments • July 28, 2025

What Is a Humanoid Foundation Model? An Introduction to GR00T N1 - Annika & Aastha

Foundation models don’t just write or draw anymore—they’re starting to move. GR00T N1 is NVIDIA’s open Vision-Language-Action (VLA) foundation model for humanoid robots. Built with a dual-system a...

8,345 views • 250 likes • 6 comments • July 28, 2025

Real-time Experiments with an AI Co-Scientist - Stefania Druga, fmr. Google Deepmind

The sheer volume of data and complexity of modern scientific challenges necessitate tools that go beyond mere analysis. The vision of an "AI Co-scientist" – a true collaborative partner in the lab ...

3,945 views • 105 likes • 2 comments • July 28, 2025

Scaling AI Agents Without Breaking Reliability — Preeti Somal, Temporal

As AI agents move from prototypes to production, developers are running into new challenges with orchestration, failure handling, and infrastructure. This session will unpack lessons from teams alr...

3,106 views • 72 likes • 6 comments • July 28, 2025

Government Agents: AI Agents vs Tough Regulations — Mark Myshatyn, Los Alamos National Laboratory

https://www.linkedin.com/in/markmyshatyn/

1,375 views • 51 likes • 3 comments • July 28, 2025

Ship Agents that Ship: A Hands-On Workshop - Kyle Penfound, Jeremy Adams, Dagger

Coding agents are transforming how software gets built, tested, and deployed, but engineering teams face a critical challenge: how to embrace this automation wave without sacrificing trust, control...

3,075 views • 61 likes • 2 comments • July 27, 2025

The AI Engineer’s Guide to Raising VC — Dani Grant (Jam), Chelcie Taylor (Notable)

A no fluff, all tactics discussion. More AI engineers should build startups, the world needs more software. But there’s a way to raise VC and it’s hard to do it if you’ve never seen it done. We are...

3,084 views • 84 likes • 5 comments • July 27, 2025

Strategies for LLM Evals (GuideLLM, lm-eval-harness, OpenAI Evals Workshop) — Taylor Jordan Smith

Accuracy scores and leaderboard metrics look impressive—but production-grade AI requires evals that reflect real-world performance, reliability, and user happiness. Traditional benchmarks rarely he...

11,185 views • 241 likes • 11 comments • July 27, 2025

Why you should care about AI interpretability - Mark Bissell, Goodfire AI

The goal of mechanistic interpretability is to reverse engineer neural networks. Having direct, programmable access to the internal neurons of models unlocks new ways for developers and users to in...

3,521 views • 115 likes • 5 comments • July 27, 2025

Information Retrieval from the Ground Up - Philipp Krenn, Elastic

Vector search is only a feature. Search engines and information retrieval have retaken their position as the foundation of RAG. This workshop takes you through decades of research, what has been wo...

4,601 views • 109 likes • 2 comments • July 27, 2025

Introduction to LLM serving with SGLang - Philip Kiely and Yineng Zhang, Baseten

Do you want to learn how to serve models like DeepSeek and Qwen with SOTA speeds on launch day? SGLang is an open-source fast serving framework for LLMs and VLMs that generates trillions of tokens ...

3,441 views • 63 likes • 4 comments • July 26, 2025

Waymo's EMMA: Teaching Cars to Think - Jyh Jing Hwang, Waymo

This session explores Waymo's latest research on the End-to-End Multimodal Model for Autonomous Driving (EMMA) and advanced sensor simulation techniques. Jyh-Jing Hwang will demonstrate how multimo...

4,930 views • 156 likes • 10 comments • July 26, 2025

Robotics: why now? - Quan Vuong and Jost Tobias Springberg, Physical Intelligence

Sharing recent progress from Physical Intelligence and why it is an exciting time to push the frontier in general purpose robotics About Quan Vuong Quan Vuong is co-founder at Physical Intelligenc...

40,040 views • 1,221 likes • 22 comments • July 26, 2025

A2A & MCP Workshop: Automating Business Processes with LLMs — Damien Murphy, Bench

Ever wished your webhooks could think for themselves? Join us to discover how A2A agents can transform passive webhook endpoints into intelligent workflow processors. In this session, we'll show y...

24,669 views • 461 likes • 24 comments • July 26, 2025

Piloting agents in GitHub Copilot - Christopher Harrison, Microsoft

The agent capabilities added to GitHub Copilot have enhanced its ability to act as a peer programmer. Copilot can now discover and generate code based on existing standards, run tests, recover from...

9,147 views • 107 likes • 7 comments • July 26, 2025

Ship Production Software in Minutes, Not Months — Eno Reyes, Factory

Planning, coding, testing, monitoring—the endless cycle that spans 10+ tools that fragment our focus and slows delivery to a crawl. Vibe coding doesn't work when you've got 10TB of code. If you jus...

5,559 views • 138 likes • 8 comments • July 25, 2025