AI Engineer - Videos

Back to Channel

How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe

Have you ever launched an awesome agentic demo, only to realize no amount of prompting will make it reliable enough to deploy in production? Agent reliability is a famously difficult problem to sol...

58,554 views • 1,474 likes • 27 comments • July 19, 2025

OpenThoughts: Data Recipes for Reasoning Models — Ryan Marten, Bespoke Labs

Peel back the curtain on state of the art model post-training through the story of OpenThinker, a SOTA small reasoning model (outperforming DeepSeek distill), built in the open. Learn about the dat...

3,373 views • 94 likes • 1 comments • July 19, 2025

Google Photos Magic Editor: GenAI Under the Hood of a Billion-User App - Kelvin Ma, Google Photos

Go behind the scenes of Google Photos' Magic Editor. Explore the engineering feats required to integrate complex CV and cutting-edge generative AI models into a seamless mobile experience. We'll di...

2,204 views • 55 likes • 4 comments • July 19, 2025

Dream Machine: Scaling to 1m users in 4 days — Keegan McCallum, Luma AI

Talking about Luma AI, our mission, and how our ML infrastructure enables SOTA multimodal model development About Keegan McCallum I'm Keegan McCallum, the Head of ML infrastructure at Luma AI. I ...

1,670 views • 58 likes • 6 comments • July 19, 2025

ComfyUI Full Workshop — first workshop from ComfyAnonymous himself!

Quick introduction to ComfyUI and what's new followed by a QA session. Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our ...

3,446 views • 89 likes • 13 comments • July 19, 2025

Design like Karpathy is watching — Zeke Sikelianos, Replicate

Legendary AI engineer and educator Andrej Karpathy recently blogged about his experiences building, deploying, and monetizing a vibe-coded web app called MenuGen. Let's dig into the challenges he f...

6,175 views • 162 likes • 6 comments • July 19, 2025

On Curiosity — Sharif Shameem, Lexica

Creating and sharing demos is the easiest way to influence the future. It gets people to think about what's possible. A good tech demo doesn't have to be fully fleshed out. It doesn't even have to ...

1,872 views • 63 likes • 5 comments • July 19, 2025

Real world MCPs in GitHub Copilot Agent Mode — Jon Peck, Microsoft

As developers, we don't spend most of our time vibe-coding prototypes. More often, we're adding features, squashing bugs, and building tests for existing apps across a wide variety of services and ...

4,927 views • 93 likes • 7 comments • July 19, 2025

The rise of the agentic economy on the shoulders of MCP — Jan Curn, Apify

Thanks to MCP and all the MCP server directories, agents can now autonomously discover new tools and other agents. This lays down the foundation for the future agentic economy, where businesses wil...

6,241 views • 132 likes • 9 comments • July 18, 2025

MCP is all you need — Samuel Colvin, Pydantic

Everyone is talking about agents, and right after that, they’re talking about agent-to-agent communications. Not surprisingly, various nascent, competing protocols are popping up to handle it. But...

63,481 views • 1,106 likes • 30 comments • July 18, 2025

Full Spec MCP: Hidden Capabilities of the MCP spec — Harald Kirschner, Microsoft/VSCode

The true power of Model Context Protocol emerges when clients and servers collaborate across the full spectrum of the specification. This talk presents practical examples of how VS Code's comprehen...

4,102 views • 61 likes • 11 comments • July 18, 2025

Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin

What does it take to go from blank page to live enterprise voice agent in 100 days? That’s the challenge we took on with Fin Voice at Intercom. Enterprise customer service demands high-quality, re...

3,843 views • 84 likes • 3 comments • July 18, 2025

The State of Generative Media - Gorkem Yurtseven, FAL

Generative AI is reshaping the creative landscape, enabling the production of images, audio, and video with unprecedented speed and sophistication. This session offers an in-depth exploration of th...

1,516 views • 38 likes • 2 comments • July 16, 2025

Teaching Gemini to Speak YouTube: Adapting LLMs for Video Recommendations to 2B+DAU - Devansh Tandon

YouTube recommendations drive the majority of video watch time for billions of daily users. Traditionally powered by large embedding models (LEMs), we're undertaking a fundamental shift: rebuilding...

15,425 views • 381 likes • 16 comments • July 16, 2025

Transforming search and discovery using LLMs — Tejaswi & Vinesh, Instacart

Learn how Instacart uses cutting-edge LLMs to redefine search and product discovery. - Explore innovative solutions overcoming traditional search engine limitations for grocery shopping. - Discove...

4,398 views • 72 likes • 2 comments • July 16, 2025

Netflix's Big Bet: One model to rule recommendations: Yesu Feng, Netflix

Discuss the foundation model strategy for personalization at Netflix based on this post https://netflixtechblog.com/foundation-model-for-personalized-recommendation-1a0bd8e02d39 and recent developm...

7,373 views • 171 likes • 6 comments • July 16, 2025

360Brew: LLM-based Personalized Ranking and Recommendation - Hamed and Maziar, LinkedIn AI

We will give a talk about our journey of building a foundation model for solving ranking and recommendation tasks About Hamed Firooz Principal AI Scientist at LinkedIn Core AI. With 15 years in la...

2,528 views • 43 likes • 1 comments • July 16, 2025

What We Learned from Using LLMs in Pinterest — Mukuntha Narayanan, Han Wang, Pinterest

Pinterest Search integrates Large Language Models (LLMs) to enhance relevance scoring by combining search queries with rich multimodal content, including visual captions, link-based text, and user ...

2,000 views • 43 likes • 1 comments • July 16, 2025

ARC AGI-3: Interactive Reasoning Benchmarks for Measuring AGI — Greg Kamradt, ARC Prize Foundation

ARC Prize Foundation is building the North Star for AGI—rigorous, open benchmarks that track reasoning progress in modern AI. We'll show why static AGI evaluations are useful, but fall short when c...

461 views • 13 likes • 0 comments • July 16, 2025

RL for Autonomous Coding — Aakanksha Chowdhery, Reflection.ai

The models and techniques to build fully autonomous coding agents - not just coding copilots - are already here. In this talk, former Google DeepMind staff research scientist, now CEO of Reflection...

7,034 views • 154 likes • 11 comments • July 16, 2025

Recsys Keynote: Improving Recommendation Systems & Search in the Age of LLMs - Eugene Yan, Amazon

Recommendation systems and search have long adopted advances in language modeling, from early adoption of Word2vec for embedding-based retrieval to the transformative impact of GRUs, Transformers, ...

15,866 views • 480 likes • 8 comments • July 16, 2025

Benchmarks Are Memes: How What We Measure Shapes AI—and Us - Alex Duffy, Every.to

Benchmarks shape more than just AI models—they shape our future. The things we choose to measure become self-fulfilling prophecies, guiding AI toward specific abilities and, ultimately, defining hu...

1,519 views • 41 likes • 6 comments • July 15, 2025

Small AI Teams with Huge Impact — Vik Paruchuri, Datalab

We scaled Datalab 5x this year - to 7-figure ARR, with customers that include tier 1 AI labs. We train custom models for document intelligence (OCR, layout), with popular repos surya and marker. I...

8,004 views • 158 likes • 10 comments • July 15, 2025

Rethinking Team Building: how a 30-person Startup serves 50 Million Users — Grant Lee, Gamma

The central thesis of this talk is that in the rapidly evolving age of AI, startups and tech companies should reject the traditional "blitzscaling" model of hyper-growth and specialized roles. Inst...

6,144 views • 113 likes • 2 comments • July 15, 2025

Building a 10 person unicorn - Max Brodeur-Urbas, Gumloop

An overview of how Gumloop is scaling automation across companies like Instacart, Webflow and Shopify with less than 10 people. About Max Brodeur-Urbas ex-microsoft engineer, started Gumloop in my...

5,956 views • 91 likes • 5 comments • July 15, 2025

Using OSS models to build AI apps with millions of users — Hassan El Mghari

In this talk, Hassan will go over how he builds open source AI apps that get millions of users like roomGPT.io 2.9 million users, restorePhotos.io 1.1 million users, Blinkshot.io 1 million visitors...

6,282 views • 200 likes • 10 comments • July 15, 2025

Bolt.new: How we scaled $0-20m ARR in 60 days, with 15 people — Eric Simons, Bolt

Tiny Teams are the future of how startups are built, and it all comes down to team culture, decision making, tooling choices, and endless grit. In this talk, Eric will share the high octane insigh...

5,739 views • 125 likes • 4 comments • July 15, 2025

Prompt Engineering and AI Red Teaming — Sander Schulhoff, HackAPrompt/LearnPrompting

Learn from the creator of Learn Prompting, the internet's 1st Prompt Engineering guide (released 2 months before ChatGPT), and HackAPrompt, the World's 1st AI Red Teaming competition. My talk will...

12,174 views • 327 likes • 9 comments • July 14, 2025

Survive the AI Knife Fight: Building Products That Win — Brian Balfour, Reforge

If you’ve ever been blocked by vague specs, shifting goals, or chasing “vibes,” things have only gotten messier in the age of AI. Everyone is obsessing over engineers doing PM work and PMs cranking...

14,751 views • 387 likes • 8 comments • July 14, 2025

Automating Escrow with USDC and AI - Corey Cooper, Circle

This workshop explores how USDC, AI, and smart contracts can streamline escrow by automating fund release based on task or process verification. By using AI to interpret off-chain signals such as d...

1,952 views • 51 likes • 3 comments • July 14, 2025

How LLMs work for Web Devs: GPT in 600 lines of Vanilla JS - Ishan Anand

Don't be intimidated. Modern AI can feel like magic, but underneath the hood are principles that web developers can understand, even if you don't have a machine learning background. In this worksho...

8,160 views • 273 likes • 5 comments • July 13, 2025

[Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai — Nick Nisi, Zack Proser

This hands-on workshop introduces Mastra.ai, a TypeScript framework that streamlines the development of agentic AI systems compared to traditional approaches using LangChain and vector databases. P...

8,791 views • 191 likes • 18 comments • July 12, 2025

AI Engineering with the Google Gemini 2.5 Model Family - Philipp Schmid, Google DeepMind

Hands on Workshop on learning to use Gemini 2.5 Pro in combination with Agentic tooling and MCP Servers. About Philipp Schmid Philipp Schmid is a Senior AI Developer Relations Engineer at Google...

4,756 views • 103 likes • 5 comments • July 11, 2025

The New Code — Sean Grove, OpenAI

In an era where AI transforms software development, the most valuable skill isn't writing code - it's communicating intent with precision. This talk reveals how specifications, not prompts or code,...

1,055,183 views • 19,126 likes • 2,102 comments • July 11, 2025

Production software keeps breaking and it will only get worse — Anish Agarwal, Traversal.ai

Software is eating the world. AI is eating software. AI-powered SWE means a whole lot more software is going to be written that powers mission critical systems in the coming years, with hardly any ...

3,922 views • 84 likes • 5 comments • July 10, 2025

Thinking Deeper in Gemini — Jack Rae, Google DeepMind

Progress towards general intelligence has been marked by identifying fundamental intelligence bottlenecks within existing models and developing solutions that improve the architecture or training o...

30,114 views • 609 likes • 32 comments • July 10, 2025

A year of Gemini progress + what comes next — Logan Kilpatrick, Google DeepMind

Over the last year, Google and Gemini models have shown rapid progress across all dimensions (model, product, etc). Let's highlight all the work that has happened, how we got the worlds best models...

15,355 views • 307 likes • 11 comments • July 10, 2025

The Wild World of AI: 6 Months That Changed Everything

From pelicans on bicycles to $600 billion market crashes - discover the most insane AI developments of the past 6 months! 🤖🚲 #AI #MachineLearning #LLM #TechNews #AIRevolution #OpenAI #DeepSeek #Te...

4,787 views • 87 likes • 1 comments • July 10, 2025

2025 in LLMs so far, illustrated by Pelicans on Bicycles — Simon Willison

What's changed in the world of LLMs since the AIE World's Fair last year? A lot! I'll be taking full advantage of my role as a fiercely independent researcher to review the past 12 months of advan...

157,424 views • 3,852 likes • 100 comments • July 09, 2025

Trends Across the AI Frontier — George Cameron, ArtificialAnalysis.ai

The entire AI stack is developing faster than ever - from chips to infrastructure to models. How do you sort the signal from the noise? Artificial Analysis an independent benchmarking and insights ...

13,865 views • 242 likes • 10 comments • July 08, 2025

Training Agentic Reasoners — Will Brown, Prime Intellect

This talk will be a technical deep dive into RL for agentic reasoning via multi-turn tool calling, similar to OpenAI's o3 and Deep Research. In particular, we'll cover: - When, why, and how - GRPO...

21,268 views • 500 likes • 19 comments • July 07, 2025

New York Times' Connections: A Case Study on NLP in Word Games — Shafik Quoraishee, NYT Games

This session will examine the interplay between human intuition and artificial intelligence in puzzle-solving, using the popular New York Times Connections game as a practical case study. ...

4,702 views • 111 likes • 3 comments • July 05, 2025

Claude Code & the evolution of agentic coding — Boris Cherny, Anthropic

A ten thousand foot view of the coding space, the UX of coding, and the Claude Code team's approach. About Boris Chemy Created Claude Code. Member of Technical Staff @Anthropic. Prev: Principal En...

131,622 views • 2,543 likes • 98 comments • July 04, 2025

12-Factor Agents: Patterns of reliable LLM applications — Dex Horthy, HumanLayer

Hi, I'm Dex. I've been hacking on AI agents for a while. I've tried every agent framework out there, from the plug-and-play crew/langchains to the "minimalist" smolagents of the world to t...

264,523 views • 6,545 likes • 173 comments • July 03, 2025

MCP Is Not Good Yet — David Cramer, Sentry

You’ve heard a lot about MCP, probably been given an AI mandate or two, and are trying to figure out what’s real and what’s make believe. This session will give practical advice for how you shoul...

8,325 views • 174 likes • 11 comments • July 03, 2025

Your Personal Open-Source Humanoid Robot for $8,999 — JX Mo, K-Scale Labs

Introducing developer ready robots that are open-source, affordable, and easy to use. https://www.kscale.dev/ About Jingxiang Mo Jingxiang Mo is a founding engineer at K-Scale Labs, where he lead...

40,635 views • 1,113 likes • 86 comments • July 02, 2025

The Build-Operate Divide: Bridging Product Vision and AI Operational Reality

Product leaders see AI possibilities. Operations teams see implementation chaos. That disconnect can kill promising AI features before they ever reach users. In this session, Chris Hernandez (Chim...

2,691 views • 52 likes • 0 comments • July 02, 2025

The New Lean Startup — Sid Bendre, Oleve

In this session, I will be presenting a case study of Oleve's journey, revealing how we've scaled a profitable multi-product portfolio with a tiny team. I'll walk you through the emergence of "tiny...

33,453 views • 1,000 likes • 27 comments • July 01, 2025

Conquering Agent Chaos — Rick Blalock, Agentuity

Agent deployments can be dicey, especially at first. This session goes over all the things that cause headache with deployments from serverless issues to networking issues - and how we fix them. ...

1,248 views • 26 likes • 0 comments • July 01, 2025

Optimizing inference for voice models in production - Philip Kiely, Baseten

How do you get time to first byte (TTFB) below 150 milliseconds for voice models -- and scale it in production? As it turns out, open-source TTS models like Orpheus have an LLM backbone that lets u...

3,017 views • 88 likes • 1 comments • July 01, 2025