AI Engineer - Videos

Back to Channel

RL for Autonomous Coding — Aakanksha Chowdhery, Reflection.ai

The models and techniques to build fully autonomous coding agents - not just coding copilots - are already here. In this talk, former Google DeepMind staff research scientist, now CEO of Reflection...

6,767 views • 148 likes • 11 comments • July 16, 2025

Recsys Keynote: Improving Recommendation Systems & Search in the Age of LLMs - Eugene Yan, Amazon

Recommendation systems and search have long adopted advances in language modeling, from early adoption of Word2vec for embedding-based retrieval to the transformative impact of GRUs, Transformers, ...

12,534 views • 381 likes • 7 comments • July 16, 2025

Benchmarks Are Memes: How What We Measure Shapes AI—and Us - Alex Duffy, Every.to

Benchmarks shape more than just AI models—they shape our future. The things we choose to measure become self-fulfilling prophecies, guiding AI toward specific abilities and, ultimately, defining hu...

1,421 views • 38 likes • 6 comments • July 15, 2025

Small AI Teams with Huge Impact — Vik Paruchuri, Datalab

We scaled Datalab 5x this year - to 7-figure ARR, with customers that include tier 1 AI labs. We train custom models for document intelligence (OCR, layout), with popular repos surya and marker. I...

7,819 views • 153 likes • 10 comments • July 15, 2025

Rethinking Team Building: how a 30-person Startup serves 50 Million Users — Grant Lee, Gamma

The central thesis of this talk is that in the rapidly evolving age of AI, startups and tech companies should reject the traditional "blitzscaling" model of hyper-growth and specialized roles. Inst...

5,992 views • 112 likes • 2 comments • July 15, 2025

Building a 10 person unicorn - Max Brodeur-Urbas, Gumloop

An overview of how Gumloop is scaling automation across companies like Instacart, Webflow and Shopify with less than 10 people. About Max Brodeur-Urbas ex-microsoft engineer, started Gumloop in my...

5,675 views • 85 likes • 5 comments • July 15, 2025

Using OSS models to build AI apps with millions of users — Hassan El Mghari

In this talk, Hassan will go over how he builds open source AI apps that get millions of users like roomGPT.io 2.9 million users, restorePhotos.io 1.1 million users, Blinkshot.io 1 million visitors...

6,180 views • 197 likes • 10 comments • July 15, 2025

Bolt.new: How we scaled $0-20m ARR in 60 days, with 15 people — Eric Simons, Bolt

Tiny Teams are the future of how startups are built, and it all comes down to team culture, decision making, tooling choices, and endless grit. In this talk, Eric will share the high octane insigh...

5,620 views • 123 likes • 4 comments • July 15, 2025

Prompt Engineering and AI Red Teaming — Sander Schulhoff, HackAPrompt/LearnPrompting

Learn from the creator of Learn Prompting, the internet's 1st Prompt Engineering guide (released 2 months before ChatGPT), and HackAPrompt, the World's 1st AI Red Teaming competition. My talk will...

10,390 views • 260 likes • 8 comments • July 14, 2025

Survive the AI Knife Fight: Building Products That Win — Brian Balfour, Reforge

If you’ve ever been blocked by vague specs, shifting goals, or chasing “vibes,” things have only gotten messier in the age of AI. Everyone is obsessing over engineers doing PM work and PMs cranking...

14,511 views • 383 likes • 8 comments • July 14, 2025

Automating Escrow with USDC and AI - Corey Cooper, Circle

This workshop explores how USDC, AI, and smart contracts can streamline escrow by automating fund release based on task or process verification. By using AI to interpret off-chain signals such as d...

1,868 views • 50 likes • 2 comments • July 14, 2025

How LLMs work for Web Devs: GPT in 600 lines of Vanilla JS - Ishan Anand

Don't be intimidated. Modern AI can feel like magic, but underneath the hood are principles that web developers can understand, even if you don't have a machine learning background. In this worksho...

7,888 views • 268 likes • 6 comments • July 13, 2025

[Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai — Nick Nisi, Zack Proser

This hands-on workshop introduces Mastra.ai, a TypeScript framework that streamlines the development of agentic AI systems compared to traditional approaches using LangChain and vector databases. P...

8,250 views • 183 likes • 18 comments • July 12, 2025

AI Engineering with the Google Gemini 2.5 Model Family - Philipp Schmid, Google DeepMind

Hands on Workshop on learning to use Gemini 2.5 Pro in combination with Agentic tooling and MCP Servers. About Philipp Schmid Philipp Schmid is a Senior AI Developer Relations Engineer at Google...

4,654 views • 99 likes • 6 comments • July 11, 2025

The New Code — Sean Grove, OpenAI

In an era where AI transforms software development, the most valuable skill isn't writing code - it's communicating intent with precision. This talk reveals how specifications, not prompts or code,...

1,008,402 views • 18,440 likes • 1,966 comments • July 11, 2025

Production software keeps breaking and it will only get worse — Anish Agarwal, Traversal.ai

Software is eating the world. AI is eating software. AI-powered SWE means a whole lot more software is going to be written that powers mission critical systems in the coming years, with hardly any ...

3,871 views • 81 likes • 6 comments • July 10, 2025

Thinking Deeper in Gemini — Jack Rae, Google DeepMind

Progress towards general intelligence has been marked by identifying fundamental intelligence bottlenecks within existing models and developing solutions that improve the architecture or training o...

29,907 views • 605 likes • 33 comments • July 10, 2025

A year of Gemini progress + what comes next — Logan Kilpatrick, Google DeepMind

Over the last year, Google and Gemini models have shown rapid progress across all dimensions (model, product, etc). Let's highlight all the work that has happened, how we got the worlds best models...

15,169 views • 307 likes • 12 comments • July 10, 2025

The Wild World of AI: 6 Months That Changed Everything

From pelicans on bicycles to $600 billion market crashes - discover the most insane AI developments of the past 6 months! 🤖🚲 #AI #MachineLearning #LLM #TechNews #AIRevolution #OpenAI #DeepSeek #Te...

4,661 views • 86 likes • 2 comments • July 10, 2025

2025 in LLMs so far, illustrated by Pelicans on Bicycles — Simon Willison

What's changed in the world of LLMs since the AIE World's Fair last year? A lot! I'll be taking full advantage of my role as a fiercely independent researcher to review the past 12 months of advan...

155,987 views • 3,826 likes • 104 comments • July 09, 2025

Trends Across the AI Frontier — George Cameron, ArtificialAnalysis.ai

The entire AI stack is developing faster than ever - from chips to infrastructure to models. How do you sort the signal from the noise? Artificial Analysis an independent benchmarking and insights ...

13,455 views • 239 likes • 11 comments • July 08, 2025

Training Agentic Reasoners — Will Brown, Prime Intellect

This talk will be a technical deep dive into RL for agentic reasoning via multi-turn tool calling, similar to OpenAI's o3 and Deep Research. In particular, we'll cover: - When, why, and how - GRPO...

19,237 views • 474 likes • 19 comments • July 07, 2025

New York Times' Connections: A Case Study on NLP in Word Games — Shafik Quoraishee, NYT Games

This session will examine the interplay between human intuition and artificial intelligence in puzzle-solving, using the popular New York Times Connections game as a practical case study. ...

4,592 views • 110 likes • 6 comments • July 05, 2025

Claude Code & the evolution of agentic coding — Boris Cherny, Anthropic

A ten thousand foot view of the coding space, the UX of coding, and the Claude Code team's approach. About Boris Chemy Created Claude Code. Member of Technical Staff @Anthropic. Prev: Principal En...

126,602 views • 2,468 likes • 98 comments • July 04, 2025

12-Factor Agents: Patterns of reliable LLM applications — Dex Horthy, HumanLayer

Hi, I'm Dex. I've been hacking on AI agents for a while. I've tried every agent framework out there, from the plug-and-play crew/langchains to the "minimalist" smolagents of the world to t...

236,166 views • 5,973 likes • 168 comments • July 03, 2025

MCP Is Not Good Yet — David Cramer, Sentry

You’ve heard a lot about MCP, probably been given an AI mandate or two, and are trying to figure out what’s real and what’s make believe. This session will give practical advice for how you shoul...

8,208 views • 173 likes • 12 comments • July 03, 2025

Your Personal Open-Source Humanoid Robot for $8,999 — JX Mo, K-Scale Labs

Introducing developer ready robots that are open-source, affordable, and easy to use. https://www.kscale.dev/ About Jingxiang Mo Jingxiang Mo is a founding engineer at K-Scale Labs, where he lead...

37,984 views • 1,058 likes • 88 comments • July 02, 2025

The Build-Operate Divide: Bridging Product Vision and AI Operational Reality

Product leaders see AI possibilities. Operations teams see implementation chaos. That disconnect can kill promising AI features before they ever reach users. In this session, Chris Hernandez (Chim...

2,637 views • 51 likes • 0 comments • July 02, 2025

The New Lean Startup — Sid Bendre, Oleve

In this session, I will be presenting a case study of Oleve's journey, revealing how we've scaled a profitable multi-product portfolio with a tiny team. I'll walk you through the emergence of "tiny...

32,921 views • 986 likes • 28 comments • July 01, 2025

Optimizing inference for voice models in production - Philip Kiely, Baseten

How do you get time to first byte (TTFB) below 150 milliseconds for voice models -- and scale it in production? As it turns out, open-source TTS models like Orpheus have an LLM backbone that lets u...

2,824 views • 77 likes • 1 comments • July 01, 2025

Conquering Agent Chaos — Rick Blalock, Agentuity

Agent deployments can be dicey, especially at first. This session goes over all the things that cause headache with deployments from serverless issues to networking issues - and how we fix them. ...

1,234 views • 25 likes • 0 comments • July 01, 2025

[Evals Workshop] Mastering AI Evaluation: From Playground to Production

This hands-on workshop will guide participants through the complete AI evaluation lifecycle using Braintrust, from initial prompt testing to production monitoring. Attendees will learn to build eva...

12,091 views • 196 likes • 5 comments • July 01, 2025

Intro to GraphRAG — Zach Blumenfeld

Learn the foundations of GraphRAG, starting with knowledge graph construction and then common retrieval patterns. --- GraphRAG has gone from nice-to-have to essential as AI solutions have increased...

24,172 views • 500 likes • 11 comments • June 30, 2025

Securing Agents with Open Standards — Bobby Tiernay and Kam Sween, Auth0

Shipping AI agents that are safe for production means solving some tough identity and authorization challenges that are not always obvious at the prototype stage. In practice, this comes down to a ...

1,084 views • 20 likes • 0 comments • June 30, 2025

The emerging skillset of wielding coding agents — Beyang Liu, Sourcegraph / Amp

It's raining coding agents! But while many are saying they're feeling the AGI, others say they're not that useful for serious programming. How much is hype and how much is a skill issue? We'll shar...

22,065 views • 455 likes • 12 comments • June 30, 2025

Agents, Access, and the Future of Machine Identity — Nick Nisi (WorkOS) + Lizzie Siegle (Cloudflare)

AI agents are calling APIs, submitting forms, and sending emails—but how do you control what they’re allowed to do? As agents act on behalf of users or organizations, traditional patterns like OAut...

800 views • 21 likes • 2 comments • June 30, 2025

Turning Fails into Features: Zapier’s Hard-Won Eval Lessons — Rafal Willinski, Vitor Balocco, Zapier

Every agent failure can be a roadmap to your next breakthrough. This talk reveals how Zapier's evaluation system transforms frustrating user experiences into targeted improvements, creating a data ...

3,567 views • 76 likes • 4 comments • June 30, 2025

Building voice agents with OpenAI — Dominik Kundel, OpenAI

We'll walk through the differences between chained and speech-to-speech powered voice agents, how to approach them, best practices and transform a text-based agent into our first voice-enabled agen...

24,440 views • 579 likes • 15 comments • June 29, 2025

Containing Agent Chaos — Solomon Hykes, Dagger

AI agents promise breakthroughs but often deliver operational chaos. Building reliable, deployable systems with unpredictable LLMs feels like wrestling fog – testing outputs alone is insufficient w...

11,737 views • 271 likes • 23 comments • June 28, 2025

Evals 101 — Doug Guthrie, Braintrust

This hands-on workshop guides participants through the full AI evaluation lifecycle with Braintrust, from initial prompt testing to production monitoring. Attendees will build evaluation frameworks...

17,947 views • 296 likes • 15 comments • June 27, 2025

Why should anyone care about Evals? — Manu Goyal, Braintrust

An introduction to the evals track About Manu Goyal Manu Goyal is the founding engineer at Braintrust. Previously, he developed autonomous systems at Nuro. He has an 8 year old Pomeranian named He...

12,820 views • 106 likes • 8 comments • June 27, 2025

Engineering Better Evals: Scalable LLM Evaluation Pipelines That Work — Dat Ngo, Aman Khan, Arize

As LLM-powered products become more sophisticated, the need for scalable, reliable evaluation pipelines has never been more critical. This session dives deep into advanced LLM evaluation strategies...

4,056 views • 71 likes • 8 comments • June 27, 2025

To the moon! Navigating deep context in legacy code with Augment Agent — Forrest Brazeal, Matt Ball

Shortened presentation-only version of our Apollo 11 workshop! About Forrest Brazeal Forrest Brazeal is an author, tech educator, cartoonist, and Pwnie Award-winning songwriter. He left Google in ...

1,686 views • 37 likes • 0 comments • June 27, 2025

Serving Voice AI at Scale — Arjun Desai (Cartesia) & Rohit Talluri (AWS)

Real-Time Voice AI applications demand the lowest possible latencies to enhance user experiences with more advanced reasoning and agentic capabilities. AWS is hosting Arjun Desai, co-founder of Car...

1,536 views • 30 likes • 1 comments • June 27, 2025

Ship it! Building Production Ready Agents — Mike Chambers, AWS

Explore the practical challenges and solutions for deploying AI agents in real-world production environments. Through detailed technical analysis and practical examples, we'll examine strategies fo...

2,155 views • 31 likes • 5 comments • June 27, 2025

Introducing Strands Agents, an Open Source AI Agents SDK — Suman Debnath, AWS

Building AI agents used to require complex orchestration, extensive scaffolding, and months of tuning. With Strands Agents, an open source SDK from AWS. You can now build, test, and deploy intellig...

6,895 views • 127 likes • 8 comments • June 27, 2025

Data is Your Differentiator: Building Secure and Tailored AI Systems — Mani Khanuja, AWS

As organizations seek to harness their proprietary data while maintaining security and compliance, Amazon Bedrock provides a comprehensive framework for building tailored AI applications. Using ...

519 views • 13 likes • 3 comments • June 27, 2025

How to build world-class AI products — Sarah Sachs (AI lead @ Notion) & Carlos Esteban (Braintrust)

Join us for a hands-on workshop where you'll learn practical strategies to evaluate AI applications throughout their lifecycle—from initial testing of prompts to ongoing monitoring in production. W...

2,941 views • 47 likes • 6 comments • June 27, 2025

From Mixture of Experts to Mixture of Agents with Super Fast Inference - Daniel Kim & Daria Soboleva

Our hands-on workshop will walk you through how to build your own Mixture of Agents (MoA) system using the fastest, and most capable open models available: Qwen3-32B and Llama 3.3-70B. MoA is an em...

3,923 views • 81 likes • 7 comments • June 27, 2025

Forget RAG Pipelines—Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual

Want to take advantage of your data, but don't want to reinvent RAG infrastructure? Join our workshop and see how you can deploy Agentic RAG in minutes using Contextual AI's managed RAG solution. W...

10,099 views • 183 likes • 16 comments • June 27, 2025