AI Engineer - Videos

Back to Channel

Prompt Engineering and AI Red Teaming — Sander Schulhoff, HackAPrompt/LearnPrompting

Learn from the creator of Learn Prompting, the internet's 1st Prompt Engineering guide (released 2 months before ChatGPT), and HackAPrompt, the World's 1st AI Red Teaming competition. My talk will...

9,238 views • 228 likes • 8 comments • July 14, 2025

Survive the AI Knife Fight: Building Products That Win — Brian Balfour, Reforge

If you’ve ever been blocked by vague specs, shifting goals, or chasing “vibes,” things have only gotten messier in the age of AI. Everyone is obsessing over engineers doing PM work and PMs cranking...

14,194 views • 377 likes • 8 comments • July 14, 2025

Automating Escrow with USDC and AI - Corey Cooper, Circle

This workshop explores how USDC, AI, and smart contracts can streamline escrow by automating fund release based on task or process verification. By using AI to interpret off-chain signals such as d...

1,818 views • 47 likes • 2 comments • July 14, 2025

How LLMs work for Web Devs: GPT in 600 lines of Vanilla JS - Ishan Anand

Don't be intimidated. Modern AI can feel like magic, but underneath the hood are principles that web developers can understand, even if you don't have a machine learning background. In this worksho...

7,704 views • 261 likes • 6 comments • July 13, 2025

[Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai — Nick Nisi, Zack Proser

This hands-on workshop introduces Mastra.ai, a TypeScript framework that streamlines the development of agentic AI systems compared to traditional approaches using LangChain and vector databases. P...

7,634 views • 174 likes • 17 comments • July 12, 2025

AI Engineering with the Google Gemini 2.5 Model Family - Philipp Schmid, Google DeepMind

Hands on Workshop on learning to use Gemini 2.5 Pro in combination with Agentic tooling and MCP Servers. About Philipp Schmid Philipp Schmid is a Senior AI Developer Relations Engineer at Google...

4,559 views • 97 likes • 6 comments • July 11, 2025

The New Code — Sean Grove, OpenAI

In an era where AI transforms software development, the most valuable skill isn't writing code - it's communicating intent with precision. This talk reveals how specifications, not prompts or code,...

951,914 views • 17,596 likes • 1,727 comments • July 11, 2025

Production software keeps breaking and it will only get worse — Anish Agarwal, Traversal.ai

Software is eating the world. AI is eating software. AI-powered SWE means a whole lot more software is going to be written that powers mission critical systems in the coming years, with hardly any ...

3,807 views • 81 likes • 6 comments • July 10, 2025

Thinking Deeper in Gemini — Jack Rae, Google DeepMind

Progress towards general intelligence has been marked by identifying fundamental intelligence bottlenecks within existing models and developing solutions that improve the architecture or training o...

29,609 views • 597 likes • 33 comments • July 10, 2025

A year of Gemini progress + what comes next — Logan Kilpatrick, Google DeepMind

Over the last year, Google and Gemini models have shown rapid progress across all dimensions (model, product, etc). Let's highlight all the work that has happened, how we got the worlds best models...

14,922 views • 302 likes • 13 comments • July 10, 2025

The Wild World of AI: 6 Months That Changed Everything

From pelicans on bicycles to $600 billion market crashes - discover the most insane AI developments of the past 6 months! 🤖🚲 #AI #MachineLearning #LLM #TechNews #AIRevolution #OpenAI #DeepSeek #Te...

4,574 views • 87 likes • 2 comments • July 10, 2025

2025 in LLMs so far, illustrated by Pelicans on Bicycles — Simon Willison

What's changed in the world of LLMs since the AIE World's Fair last year? A lot! I'll be taking full advantage of my role as a fiercely independent researcher to review the past 12 months of advan...

154,391 views • 3,801 likes • 103 comments • July 09, 2025

Trends Across the AI Frontier — George Cameron, ArtificialAnalysis.ai

The entire AI stack is developing faster than ever - from chips to infrastructure to models. How do you sort the signal from the noise? Artificial Analysis an independent benchmarking and insights ...

13,280 views • 235 likes • 11 comments • July 08, 2025

Training Agentic Reasoners — Will Brown, Prime Intellect

This talk will be a technical deep dive into RL for agentic reasoning via multi-turn tool calling, similar to OpenAI's o3 and Deep Research. In particular, we'll cover: - When, why, and how - GRPO...

17,187 views • 427 likes • 19 comments • July 07, 2025

New York Times' Connections: A Case Study on NLP in Word Games — Shafik Quoraishee, NYT Games

This session will examine the interplay between human intuition and artificial intelligence in puzzle-solving, using the popular New York Times Connections game as a practical case study. ...

4,445 views • 108 likes • 6 comments • July 05, 2025

Claude Code & the evolution of agentic coding — Boris Cherny, Anthropic

A ten thousand foot view of the coding space, the UX of coding, and the Claude Code team's approach. About Boris Chemy Created Claude Code. Member of Technical Staff @Anthropic. Prev: Principal En...

124,593 views • 2,438 likes • 107 comments • July 04, 2025

12-Factor Agents: Patterns of reliable LLM applications — Dex Horthy, HumanLayer

Hi, I'm Dex. I've been hacking on AI agents for a while. I've tried every agent framework out there, from the plug-and-play crew/langchains to the "minimalist" smolagents of the world to t...

213,460 views • 5,425 likes • 161 comments • July 03, 2025

MCP Is Not Good Yet — David Cramer, Sentry

You’ve heard a lot about MCP, probably been given an AI mandate or two, and are trying to figure out what’s real and what’s make believe. This session will give practical advice for how you shoul...

8,045 views • 174 likes • 12 comments • July 03, 2025

Your Personal Open-Source Humanoid Robot for $8,999 — JX Mo, K-Scale Labs

Introducing developer ready robots that are open-source, affordable, and easy to use. https://www.kscale.dev/ About Jingxiang Mo Jingxiang Mo is a founding engineer at K-Scale Labs, where he lead...

35,422 views • 1,013 likes • 86 comments • July 02, 2025

The Build-Operate Divide: Bridging Product Vision and AI Operational Reality

Product leaders see AI possibilities. Operations teams see implementation chaos. That disconnect can kill promising AI features before they ever reach users. In this session, Chris Hernandez (Chim...

2,565 views • 49 likes • 0 comments • July 02, 2025

The New Lean Startup — Sid Bendre, Oleve

In this session, I will be presenting a case study of Oleve's journey, revealing how we've scaled a profitable multi-product portfolio with a tiny team. I'll walk you through the emergence of "tiny...

32,367 views • 977 likes • 28 comments • July 01, 2025

Optimizing inference for voice models in production - Philip Kiely, Baseten

How do you get time to first byte (TTFB) below 150 milliseconds for voice models -- and scale it in production? As it turns out, open-source TTS models like Orpheus have an LLM backbone that lets u...

2,648 views • 77 likes • 0 comments • July 01, 2025

Conquering Agent Chaos — Rick Blalock, Agentuity

Agent deployments can be dicey, especially at first. This session goes over all the things that cause headache with deployments from serverless issues to networking issues - and how we fix them. ...

1,226 views • 25 likes • 0 comments • July 01, 2025

[Evals Workshop] Mastering AI Evaluation: From Playground to Production

This hands-on workshop will guide participants through the complete AI evaluation lifecycle using Braintrust, from initial prompt testing to production monitoring. Attendees will learn to build eva...

9,777 views • 163 likes • 3 comments • July 01, 2025

Intro to GraphRAG — Zach Blumenfeld

Learn the foundations of GraphRAG, starting with knowledge graph construction and then common retrieval patterns. --- GraphRAG has gone from nice-to-have to essential as AI solutions have increased...

20,481 views • 424 likes • 11 comments • June 30, 2025

Securing Agents with Open Standards — Bobby Tiernay and Kam Sween, Auth0

Shipping AI agents that are safe for production means solving some tough identity and authorization challenges that are not always obvious at the prototype stage. In practice, this comes down to a ...

1,045 views • 20 likes • 0 comments • June 30, 2025

The emerging skillset of wielding coding agents — Beyang Liu, Sourcegraph / Amp

It's raining coding agents! But while many are saying they're feeling the AGI, others say they're not that useful for serious programming. How much is hype and how much is a skill issue? We'll shar...

21,020 views • 436 likes • 17 comments • June 30, 2025

Agents, Access, and the Future of Machine Identity — Nick Nisi (WorkOS) + Lizzie Siegle (Cloudflare)

AI agents are calling APIs, submitting forms, and sending emails—but how do you control what they’re allowed to do? As agents act on behalf of users or organizations, traditional patterns like OAut...

779 views • 20 likes • 2 comments • June 30, 2025

Turning Fails into Features: Zapier’s Hard-Won Eval Lessons — Rafal Willinski, Vitor Balocco, Zapier

Every agent failure can be a roadmap to your next breakthrough. This talk reveals how Zapier's evaluation system transforms frustrating user experiences into targeted improvements, creating a data ...

3,449 views • 78 likes • 4 comments • June 30, 2025

Building voice agents with OpenAI — Dominik Kundel, OpenAI

We'll walk through the differences between chained and speech-to-speech powered voice agents, how to approach them, best practices and transform a text-based agent into our first voice-enabled agen...

21,973 views • 533 likes • 13 comments • June 29, 2025

Containing Agent Chaos — Solomon Hykes, Dagger

AI agents promise breakthroughs but often deliver operational chaos. Building reliable, deployable systems with unpredictable LLMs feels like wrestling fog – testing outputs alone is insufficient w...

11,523 views • 268 likes • 27 comments • June 28, 2025

Evals 101 — Doug Guthrie, Braintrust

This hands-on workshop guides participants through the full AI evaluation lifecycle with Braintrust, from initial prompt testing to production monitoring. Attendees will build evaluation frameworks...

14,251 views • 240 likes • 15 comments • June 27, 2025

Why should anyone care about Evals? — Manu Goyal, Braintrust

An introduction to the evals track About Manu Goyal Manu Goyal is the founding engineer at Braintrust. Previously, he developed autonomous systems at Nuro. He has an 8 year old Pomeranian named He...

12,608 views • 106 likes • 7 comments • June 27, 2025

Engineering Better Evals: Scalable LLM Evaluation Pipelines That Work — Dat Ngo, Aman Khan, Arize

As LLM-powered products become more sophisticated, the need for scalable, reliable evaluation pipelines has never been more critical. This session dives deep into advanced LLM evaluation strategies...

3,557 views • 58 likes • 8 comments • June 27, 2025

To the moon! Navigating deep context in legacy code with Augment Agent — Forrest Brazeal, Matt Ball

Shortened presentation-only version of our Apollo 11 workshop! About Forrest Brazeal Forrest Brazeal is an author, tech educator, cartoonist, and Pwnie Award-winning songwriter. He left Google in ...

1,562 views • 37 likes • 0 comments • June 27, 2025

Serving Voice AI at Scale — Arjun Desai (Cartesia) & Rohit Talluri (AWS)

Real-Time Voice AI applications demand the lowest possible latencies to enhance user experiences with more advanced reasoning and agentic capabilities. AWS is hosting Arjun Desai, co-founder of Car...

1,364 views • 26 likes • 1 comments • June 27, 2025

Ship it! Building Production Ready Agents — Mike Chambers, AWS

Explore the practical challenges and solutions for deploying AI agents in real-world production environments. Through detailed technical analysis and practical examples, we'll examine strategies fo...

2,060 views • 28 likes • 5 comments • June 27, 2025

Introducing Strands Agents, an Open Source AI Agents SDK — Suman Debnath, AWS

Building AI agents used to require complex orchestration, extensive scaffolding, and months of tuning. With Strands Agents, an open source SDK from AWS. You can now build, test, and deploy intellig...

5,813 views • 107 likes • 7 comments • June 27, 2025

Data is Your Differentiator: Building Secure and Tailored AI Systems — Mani Khanuja, AWS

As organizations seek to harness their proprietary data while maintaining security and compliance, Amazon Bedrock provides a comprehensive framework for building tailored AI applications. Using ...

505 views • 14 likes • 3 comments • June 27, 2025

How to build world-class AI products — Sarah Sachs (AI lead @ Notion) & Carlos Esteban (Braintrust)

Join us for a hands-on workshop where you'll learn practical strategies to evaluate AI applications throughout their lifecycle—from initial testing of prompts to ongoing monitoring in production. W...

2,751 views • 45 likes • 5 comments • June 27, 2025

From Mixture of Experts to Mixture of Agents with Super Fast Inference - Daniel Kim & Daria Soboleva

Our hands-on workshop will walk you through how to build your own Mixture of Agents (MoA) system using the fastest, and most capable open models available: Qwen3-32B and Llama 3.3-70B. MoA is an em...

3,826 views • 80 likes • 7 comments • June 27, 2025

Forget RAG Pipelines—Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual

Want to take advantage of your data, but don't want to reinvent RAG infrastructure? Join our workshop and see how you can deploy Agentic RAG in minutes using Contextual AI's managed RAG solution. W...

9,743 views • 175 likes • 16 comments • June 27, 2025

Milliseconds to Magic: Real‑Time Workflows using the Gemini Live API and Pipecat

The Gemini Live API GA is now powered by Google's best cost-effective thinking model Gemini 2.5 Flash. We will do a deep dive on the capabilities that the Gemini Live API combined with Pipecat unl...

1,893 views • 33 likes • 10 comments • June 27, 2025

Realtime Conversational Video with Pipecat and Tavus — Chad Bailey and Brian Johnson, Daily & Tavus

Tavus shipped the world's first realtime video avatar platform last year. Developers use Tavus' conversational video APIs to create education, social, and customer support agents. The Tavus team bu...

1,411 views • 27 likes • 5 comments • June 27, 2025

Vector Search Benchmark[eting] - Philipp Krenn, Elastic

Every vector database out there is both faster and slower than any other competitor — if you believe all the benchmarketing out there. Let's turn the marketing into useful benchmarks that actually ...

706 views • 7 likes • 0 comments • June 27, 2025

Taming Rogue AI Agents with Observability-Driven Evaluation — Jim Bennett, Galileo

LLM agents often drift into failure when prompts, retrieval, external data, and policies interact in unpredictable ways. This session introduces a repeatable, metric-driven framework for detecting,...

1,051 views • 21 likes • 1 comments • June 27, 2025

Building agent fleet architectures your CISO doesn't hate — Lou Bichard, Gitpod

Security is the biggest blocker for agent orchestration adoption in regulated industries for SWE agents. Gitpod's agent orchestration went from an originally self-hosted kubernetes architecture to ...

280 views • 2 likes • 0 comments • June 27, 2025

Don’t get one-shotted: Use AI to test, review, merge, and deploy code — Tomas Reimers, Graphite

As AI tools like GitHub Copilot and ChatGPT help engineers generate code at an unprecedented rate, the “outer loop”—reviewing, testing, merging, and deploying—becomes more vital than ever. Studies ...

463 views • 3 likes • 0 comments • June 27, 2025

Effective agent design patterns in production — Laurie Voss, LlamaIndex

At LlamaIndex we see a lot of agents built every day, and we've got a sense of what works and what doesn't. We've distilled those learnings down into a series of patterns and best practices for bui...

12,700 views • 311 likes • 9 comments • June 27, 2025

Foundry Local: Cutting-Edge AI experiences on device with ONNX Runtime/Olive — Emma Ning, Microsoft

About Emma Ning Emma Ning is a Principal PM in the Microsoft AI Framework team, focusing on AI model operationalization and acceleration with ONNX Runtime/Olive for open and interoperable AI. She ...

416 views • 4 likes • 0 comments • June 27, 2025