AI Engineer - Videos

Back to Channel

Playground in Prod: Optimising Agents in Production Environments — Samuel Colvin, Pydantic

Deploying an agent is only the start. In this workshop, Samuel Colvin shows how to improve agents after they are already live, using Pydantic AI and Logfire to change prompts, models, and other par...

1,931 views • 45 likes • 0 comments • May 07, 2026

Vibe Engineering Effect Apps — Michael Arnaldi, Effectful

What if the best way to get coding agents to use a library well is not better prompts, but giving them the library's actual code? In this workshop, Michael Arnaldi walks through a practical approac...

4,773 views • 123 likes • 14 comments • May 07, 2026

Everything You Need To Know About Agent Observability — Danny Gollapalli and Ben Hylak, Raindrop

Agent failures do not look like normal software failures. In this workshop, the Raindrop team breaks down what it actually takes to monitor production agents, from explicit signals like tool errors...

2,682 views • 76 likes • 3 comments • May 07, 2026

Skills at Scale — Nick Nisi and Zack Proser, WorkOS

Chat interfaces are no longer limited to walls of text. In this talk, Liad Yosef and Ido Salomon explain how MCP Apps turn tools into interactive UI inside hosts like ChatGPT, Claude, VS Code, Curs...

6,500 views • 134 likes • 13 comments • May 06, 2026

The Multi-Agent Architecture That Actually Ships — Luke Alvoeiro, Factory

Everyone's building multi-agent systems, but nobody agrees on how. This talk proposes a taxonomy of five frontier multi-agent strategies and shows what happens when you compose them into a single a...

33,899 views • 969 likes • 81 comments • May 06, 2026

MCP UI: Extending the frontier — Liad Yosef and Ido Salomon, MCP Apps

Chat interfaces are no longer limited to walls of text. In this talk, Liad Yosef and Ido Salomon explain how MCP Apps turn tools into interactive UI inside hosts like ChatGPT, Claude, VS Code, Curs...

14,064 views • 350 likes • 32 comments • May 06, 2026

The Small Model Infrastructure Nobody Built (So We Did) — Filip Makraduli, Superlinked

Most embedding infrastructure assumes you know exactly which model you want ahead of time. This talk starts where that assumption breaks. Filip Makraduli walks through the real profiling mistakes, ...

12,463 views • 306 likes • 11 comments • May 05, 2026

Accelerating AI on Edge — Chintan Parikh and Weiyi Wang, Google DeepMind

As models get smaller and more capable, more AI workloads can move onto the device itself. In this talk, Chintan Parikh from Google DeepMind walks through what that looks like in practice, from Gem...

4,280 views • 119 likes • 7 comments • May 05, 2026

Demand-Driven Context: A Methodology for Coherent Knowledge Bases Through Agent Failure

Enterprise teams spend a lot of time trying to guess what AI agents need to know. This workshop flips that around. Instead of curating context top-down, Raj Navakoti shows how to build a demand-dri...

9,575 views • 184 likes • 16 comments • May 05, 2026

Training an LLM from Scratch, Locally — Angelos Perivolaropoulos, ElevenLabs

Training an LLM from scratch on a local machine sounds unreasonable, until it isn't. In this workshop, Angelos Perivolaropoulos from ElevenLabs walks through what it actually takes to train a langu...

10,162 views • 307 likes • 4 comments • May 04, 2026

Skill Issue: How We Used AI to Make Agents Actually Good at Supabase — Pedro Rodrigues, Supabase

Writing Agent Skills is easy. Writing ones that actually improve agent performance is not. In this hands-on workshop, you’ll build, test, and iterate on Agent Skills against real Supabase workflow...

7,948 views • 146 likes • 11 comments • May 04, 2026

Ralph Loops: Build Dumb AI Loops That Ship — Chris Parsons, Cherrypick

Dumb loops beat clever workflows. Most teams building with AI agents reach for multi-agent orchestration, planning graphs, and elaborate tool chains. Then they spend months debugging them. A single...

18,688 views • 367 likes • 41 comments • May 04, 2026

TLMs: Tiny LLMs and Agents on Edge Devices with LiteRT-LM — Cormac Brick, Google

Tiny LLMs are making on-device agents much more practical. In this workshop, Cormac Brick walks through how LiteRT-LM brings language models to edge devices, with a focus on Gemma, agent skills, an...

20,189 views • 546 likes • 36 comments • May 03, 2026

Mergeable by default: Building the context engine to save time and tokens — Peter Werry, Unblocked

Agents can generate code. The hard part is generating code that's right for your system, team conventions, and past decisions. That's a context problem that naive RAG, MCP servers, and bigger conte...

10,601 views • 226 likes • 19 comments • May 03, 2026

Context Is the New Code — Patrick Debois, Tessl

As AI coding agents become more capable, context is starting to matter as much as code. Yet while code has version control, review, testing, CI/CD, and production observability, the prompts, rules,...

52,198 views • 1,126 likes • 61 comments • May 03, 2026

Human-in-the-Loop Automation with n8n — Liam McGarrigle

What does it actually take to build an AI workflow that can do useful work without becoming a black box? This workshop shows how to build secure, human-in-the-loop automations in n8n, using a Gmail...

5,028 views • 123 likes • 2 comments • May 02, 2026

I Gave an AI Agent the Keys to My Life (Here's What Happened) — Radek Sienkiewicz (@velvetshark-com)

An honest look at what happens when a personal AI agent is allowed to operate around the clock. Over months, one permission at a time, it went from reading files to handling email, backing up its o...

9,313 views • 190 likes • 16 comments • May 02, 2026

Software Engineering Is Becoming Plan and Review — Louis Knight-Webb, Vibe Kanban

AI eats the middle, software engineers are spending all their time planning and reviewing the work of AI. If all humans are going to do is plan and review the work of AI, the biggest lever you have...

13,624 views • 382 likes • 21 comments • May 02, 2026

Mastering AI Pricing: Flexible & Agile Monetization — Mayank Pant, Stripe

Monetizing AI is hard. Rising GPU and inference costs are squeezing margins, and traditional SaaS pricing simply does not work for the unpredictable compute demands of new-age AI companies. With mo...

3,832 views • 100 likes • 4 comments • May 01, 2026

Agents on the Canvas in tldraw — Steve Ruiz, tldraw

At tldraw, we've been bringing agents to our infinite canvas. In December 2025, we ran a one-month experiment named Fairydraw where users could work with three fairies—virtual collaborators who wor...

9,171 views • 198 likes • 12 comments • May 01, 2026

Shipping complex AI applications — Braintrust & Trainline

Getting a prototype working is straightforward. Making it reliable in production, especially with multi-step agents, tool use, and real users is the hard part. In this hands-on workshop, you'll wor...

4,243 views • 96 likes • 7 comments • May 01, 2026

Agents for Everything Else — swyx

How we run AI Engineer with Agents like Cognition's Devin and Town Assistant Speaker info: - x.com/swyx - github.com/swyxio

4,621 views • 72 likes • 3 comments • May 01, 2026

Building Conversational Agents — Thor Schaeff and Philipp Schmid, Google DeepMind

Thor Schaeff and Philipp Schmid show how to build conversational agents with Google DeepMind's Gemini APIs, from tool-using coding agents to realtime voice interfaces. The session covers the new In...

5,428 views • 125 likes • 2 comments • April 30, 2026

LLM codegen fails and how to stop 'em — Danilo Campos, PostHog

Danilo Campos breaks down the most common failure modes in LLM code generation and the practical strategies PostHog uses to prevent them. Drawing from a system that helps 5,000+ users each month, h...

2,829 views • 80 likes • 1 comments • April 30, 2026

Replacing 12K LoC with a 200 LoC Skill — David Gomes, Cursor

David Gomes shows how Cursor replaced a heavyweight WorkTrees feature with a lightweight layer built from skills, commands, and subagents. He walks through how parallel coding workflows were recrea...

13,294 views • 240 likes • 19 comments • April 30, 2026

OpenAI Codex Masterclass — Vaibhav Srivastav & Katia Gil Guzman

Codex is no longer just a coding assistant in a terminal. In this workshop, Vaibhav and Katia show how it becomes a full software engineering system, combining frontier models, the Codex app and CL...

26,599 views • 585 likes • 23 comments • April 29, 2026

Build & deploy AI-powered apps — Paige Bailey, Google DeepMind

Got a massive idea but stuck in the "just talking about it" phase? This session cuts the fluff and dives straight into how to build and prototype at lightning speed using AI Studio Build and Antigr...

4,172 views • 105 likes • 8 comments • April 29, 2026

Everything I Learned Training Frontier Small Models — Maxime Labonne, Liquid AI

A new class of small models is emerging with the ability to reliably follow instructions and call tools while running on-device under 1 GB of memory. In this talk, we'll break down how to post-trai...

60,045 views • 1,705 likes • 58 comments • April 29, 2026

Building your own software factory — Eric Zakariasson, Cursor

Most of us are pair-programming with one agent and stopping there. There's a lot more on the table. This workshop is about going from one agent to many. We'll start with codebase setup, the foundat...

18,830 views • 319 likes • 19 comments • April 28, 2026

Why building eval platforms is hard — Phil Hetzel, Braintrust

An eval platform is not just a test runner. You are building shared definitions of "good," reliable data pipelines, labelling workflows, versioning, and trust in results across many teams and model...

7,557 views • 170 likes • 4 comments • April 28, 2026

One Login to Rule Them All: Cross-App Access for MCP — Garrett Galow, WorkOS

Connecting a coding agent to multiple services often means facing a dozen OAuth consent screens, a dozen token lifecycles, and a dozen chances for something to break. Despite having Single Sign-On,...

3,212 views • 54 likes • 5 comments • April 28, 2026

Gemma 4 Deep Dive — Cassidy Hardin, Researcher, Google DeepMind

Open models are getting smaller, faster, and far more capable. In this talk, Cassidy Hardin walks through the latest advances in the Gemma family, with a focus on Gemma 4 and what it enables for de...

26,268 views • 755 likes • 45 comments • April 27, 2026

Scaling GitHub for your Agents — Sam Morrow, GitHub

GitHub operates one of the most heavily-utilised MCP servers in the ecosystem, with over 4 million downloads of the stdio server alone. Discover the architectural decisions, technical challenges an...

2,751 views • 67 likes • 3 comments • April 27, 2026

Gateways are All You Need — Karan Sampath, Anthropic

MCPs are often flaky, face multiple security vulnerabilities, and are generally hard to scale. Most enterprises struggle to use more than single digit numbers of MCPs due to issues with security, o...

15,824 views • 298 likes • 17 comments • April 27, 2026

Collaborative AI Engineering: One Dev, Two Dozen Agents, Zero Alignment — Maggie Appleton, GitHub

Agentic engineering so far has been a solo story: one developer and a dozen agents moving at warp speed. But speed without thoughtful planning and team alignment is just wasting tokens. When everyo...

42,758 views • 1,197 likes • 81 comments • April 26, 2026

MCP = Mega Context Problem - Matt Carey

The best MCP server is the one you didn't have to build. At Cloudflare we have a lot of products. Our REST OpenAPI spec is over 2.3 million tokens. When teams started building MCP servers, they di...

19,809 views • 452 likes • 36 comments • April 25, 2026

AgentCraft: Putting the Orc in Orchestration — Ido Salomon

As we run more agents in parallel, it becomes clear: we are the bottleneck. Luckily, the skills we need for effective multi-agent orchestration aren’t entirely new, they’ve just been hiding in unex...

34,110 views • 1,045 likes • 48 comments • April 25, 2026

Full Walkthrough: Workflow for AI Coding — Matt Pocock

A hands-on workshop covering the full lifecycle of AI-assisted development, from turning ambiguous requirements into agent-ready plans to running autonomous coding agents that ship production featu...

483,083 views • 11,072 likes • 303 comments • April 24, 2026

What Do Models Still Suck At? - Peter Gostev, Arena.ai, BullshitBench

What type of real world model responses do users still hate? We get to see millions of user's prompts - and we let users 'dislike both' on the Arena. We'll show you trends and examples of the tasks...

8,560 views • 190 likes • 12 comments • April 24, 2026

The End of Apps — Kitze, Sizzy.co

Speaker info: - https://x.com/thekitze - https://www.linkedin.com/in/kitaborovskis/

9,363 views • 276 likes • 18 comments • April 23, 2026

"Software Fundamentals Matter More Than Ever" — Matt Pocock

AI coding tools are overhyped and powerful at the same time. Used well, they're extraordinary. Used badly, they'll bury you in spaghetti code faster than any human team could. The difference isn't ...

581,966 views • 22,377 likes • 449 comments • April 23, 2026

Agents need more than a chat - Jacob Lauritzen, CTO Legora

Jacob Lauritzen is CTO at Legora, the fastest growing legal tech startup in history. Speaker info: - https://www.linkedin.com/in/jacob-lauritzen/ - https://github.com/Jacse

17,759 views • 422 likes • 10 comments • April 22, 2026

AIE Miami Day 2 ft. Cerebras, OpenCode, Cursor, Arize AI, and more!

April 21, 2026 - all times in EST -- 9:00am - Welcome to Day 2 -- 9:10am - David House, G2i Transforming Programming Mindsets: Case Studies in Agentic Coding Adoption -- 9:35am - Sarah Chieng, C...

7,964 views • 125 likes • 3 comments • April 21, 2026

Building Generative Image & Video models at Scale - Sander Dieleman, Google DeepMind

https://sander.ai/2025/04/15/latents.html Speaker info: - https://sander.ai/ - https://github.com/benanne - https://www.linkedin.com/in/sanderdieleman - https://x.com/sedielem Timestamps 0:00 Int...

3,729 views • 124 likes • 2 comments • April 21, 2026

How AI is changing Software Engineering: A Conversation with Gergely Orosz, @pragmaticengineer

Gergely Orosz is a formar Uber and Skyscanner engineer and is the author of https://www.engguidebook.com/ and https://www.pragmaticengineer.com/ , the #1 software/AI engineering newsletter on Subst...

15,318 views • 271 likes • 11 comments • April 21, 2026

Taste & Craft: A Conversation with Tuomas Artman, CTO Linear & Gergely Orosz, @pragmaticengineer

Tuomas Artman is Cofounder and CTO of Linear. - https://x.com/artman - https://www.linkedin.com/in/tuomasartman/ Timestamps 0:00 Introduction 0:36 The danger of shipping features too quickly with...

9,348 views • 140 likes • 9 comments • April 21, 2026

Running LLMs on your iPhone: 40 tok/s Gemma 4 with MLX — Adrien Grondin, Locally AI

See more: https://x.com/adrgrondin/status/2040512861953270226 Speaker info: - https://x.com/adrgrondin

8,754 views • 235 likes • 6 comments • April 20, 2026

AIE Miami Keynote & Talks ft. OpenCode. Google Deepmind, OpenAI, and more!

April 20, 2026 - all times in EST -- 9:00am - Welcome to AI Engineer Miami -- 9:10am - Gabe Greenberg, G2i Opening Remarks -- 9:15am - Dax Raad, OpenCode Keynote -- 9:40am - Dexter Horthy, Huma...

16,214 views • 277 likes • 8 comments • April 20, 2026

Full Workshop: Build Your Own Deep Research Agents - Louis-François Bouchard, Paul Iusztin, Samridhi

Deep research is one of the best ways to learn how to build real AI systems because it forces you to combine reasoning, planning, autonomy, tools, grounding, and feedback loops in a single end-to-e...

19,369 views • 504 likes • 22 comments • April 20, 2026

The New Application Layer - Malte Ubl, CTO Vercel

AI engineering is the legitimate successor to web development and the mainstream discipline that will define the next decade. Drawing on Vercel's own experience, Malte explores what it means to bui...

21,512 views • 336 likes • 13 comments • April 20, 2026