Yt Tracker

How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe

Have you ever launched an awesome agentic demo, only to realize no amount of prompting will make it reliable enough to deploy in production? Agent reliability is a famously difficult problem to sol...

56,086 views • 1,413 likes • 27 comments • July 19, 2025

OpenThoughts: Data Recipes for Reasoning Models — Ryan Marten, Bespoke Labs

Peel back the curtain on state of the art model post-training through the story of OpenThinker, a SOTA small reasoning model (outperforming DeepSeek distill), built in the open. Learn about the dat...

3,301 views • 95 likes • 1 comments • July 19, 2025

Google Photos Magic Editor: GenAI Under the Hood of a Billion-User App - Kelvin Ma, Google Photos

Go behind the scenes of Google Photos' Magic Editor. Explore the engineering feats required to integrate complex CV and cutting-edge generative AI models into a seamless mobile experience. We'll di...

2,152 views • 55 likes • 4 comments • July 19, 2025

Dream Machine: Scaling to 1m users in 4 days — Keegan McCallum, Luma AI

Talking about Luma AI, our mission, and how our ML infrastructure enables SOTA multimodal model development About Keegan McCallum I'm Keegan McCallum, the Head of ML infrastructure at Luma AI. I ...

1,646 views • 58 likes • 6 comments • July 19, 2025

ComfyUI Full Workshop — first workshop from ComfyAnonymous himself!

Quick introduction to ComfyUI and what's new followed by a QA session. Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our ...

3,358 views • 89 likes • 12 comments • July 19, 2025

Design like Karpathy is watching — Zeke Sikelianos, Replicate

Legendary AI engineer and educator Andrej Karpathy recently blogged about his experiences building, deploying, and monetizing a vibe-coded web app called MenuGen. Let's dig into the challenges he f...

6,158 views • 162 likes • 6 comments • July 19, 2025

On Curiosity — Sharif Shameem, Lexica

Creating and sharing demos is the easiest way to influence the future. It gets people to think about what's possible. A good tech demo doesn't have to be fully fleshed out. It doesn't even have to ...

1,828 views • 63 likes • 5 comments • July 19, 2025

Real world MCPs in GitHub Copilot Agent Mode — Jon Peck, Microsoft

As developers, we don't spend most of our time vibe-coding prototypes. More often, we're adding features, squashing bugs, and building tests for existing apps across a wide variety of services and ...

4,855 views • 92 likes • 7 comments • July 19, 2025

The rise of the agentic economy on the shoulders of MCP — Jan Curn, Apify

Thanks to MCP and all the MCP server directories, agents can now autonomously discover new tools and other agents. This lays down the foundation for the future agentic economy, where businesses wil...

6,180 views • 133 likes • 9 comments • July 18, 2025

MCP is all you need — Samuel Colvin, Pydantic

Everyone is talking about agents, and right after that, they’re talking about agent-to-agent communications. Not surprisingly, various nascent, competing protocols are popping up to handle it. But...

63,181 views • 1,100 likes • 30 comments • July 18, 2025

Full Spec MCP: Hidden Capabilities of the MCP spec — Harald Kirschner, Microsoft/VSCode

The true power of Model Context Protocol emerges when clients and servers collaborate across the full spectrum of the specification. This talk presents practical examples of how VS Code's comprehen...

4,058 views • 59 likes • 11 comments • July 18, 2025

Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin

What does it take to go from blank page to live enterprise voice agent in 100 days? That’s the challenge we took on with Fin Voice at Intercom. Enterprise customer service demands high-quality, re...

3,436 views • 80 likes • 3 comments • July 18, 2025

The State of Generative Media - Gorkem Yurtseven, FAL

Generative AI is reshaping the creative landscape, enabling the production of images, audio, and video with unprecedented speed and sophistication. This session offers an in-depth exploration of th...

1,502 views • 38 likes • 2 comments • July 16, 2025

Teaching Gemini to Speak YouTube: Adapting LLMs for Video Recommendations to 2B+DAU - Devansh Tandon

YouTube recommendations drive the majority of video watch time for billions of daily users. Traditionally powered by large embedding models (LEMs), we're undertaking a fundamental shift: rebuilding...

14,379 views • 353 likes • 16 comments • July 16, 2025

Transforming search and discovery using LLMs — Tejaswi & Vinesh, Instacart

Learn how Instacart uses cutting-edge LLMs to redefine search and product discovery. - Explore innovative solutions overcoming traditional search engine limitations for grocery shopping. - Discove...

4,211 views • 71 likes • 2 comments • July 16, 2025

Netflix's Big Bet: One model to rule recommendations: Yesu Feng, Netflix

Discuss the foundation model strategy for personalization at Netflix based on this post https://netflixtechblog.com/foundation-model-for-personalized-recommendation-1a0bd8e02d39 and recent developm...

6,839 views • 166 likes • 6 comments • July 16, 2025

360Brew: LLM-based Personalized Ranking and Recommendation - Hamed and Maziar, LinkedIn AI

We will give a talk about our journey of building a foundation model for solving ranking and recommendation tasks About Hamed Firooz Principal AI Scientist at LinkedIn Core AI. With 15 years in la...

2,267 views • 39 likes • 1 comments • July 16, 2025

What We Learned from Using LLMs in Pinterest — Mukuntha Narayanan, Han Wang, Pinterest

Pinterest Search integrates Large Language Models (LLMs) to enhance relevance scoring by combining search queries with rich multimodal content, including visual captions, link-based text, and user ...

1,864 views • 41 likes • 1 comments • July 16, 2025

ARC AGI-3: Interactive Reasoning Benchmarks for Measuring AGI — Greg Kamradt, ARC Prize Foundation

ARC Prize Foundation is building the North Star for AGI—rigorous, open benchmarks that track reasoning progress in modern AI. We'll show why static AGI evaluations are useful, but fall short when c...

438 views • 12 likes • 0 comments • July 16, 2025

RL for Autonomous Coding — Aakanksha Chowdhery, Reflection.ai

The models and techniques to build fully autonomous coding agents - not just coding copilots - are already here. In this talk, former Google DeepMind staff research scientist, now CEO of Reflection...

6,912 views • 151 likes • 11 comments • July 16, 2025

Recsys Keynote: Improving Recommendation Systems & Search in the Age of LLMs - Eugene Yan, Amazon

Recommendation systems and search have long adopted advances in language modeling, from early adoption of Word2vec for embedding-based retrieval to the transformative impact of GRUs, Transformers, ...

14,097 views • 429 likes • 8 comments • July 16, 2025

Benchmarks Are Memes: How What We Measure Shapes AI—and Us - Alex Duffy, Every.to

Benchmarks shape more than just AI models—they shape our future. The things we choose to measure become self-fulfilling prophecies, guiding AI toward specific abilities and, ultimately, defining hu...

1,459 views • 40 likes • 6 comments • July 15, 2025

Small AI Teams with Huge Impact — Vik Paruchuri, Datalab

We scaled Datalab 5x this year - to 7-figure ARR, with customers that include tier 1 AI labs. We train custom models for document intelligence (OCR, layout), with popular repos surya and marker. I...

7,884 views • 155 likes • 10 comments • July 15, 2025

Rethinking Team Building: how a 30-person Startup serves 50 Million Users — Grant Lee, Gamma

The central thesis of this talk is that in the rapidly evolving age of AI, startups and tech companies should reject the traditional "blitzscaling" model of hyper-growth and specialized roles. Inst...

6,044 views • 112 likes • 2 comments • July 15, 2025

Building a 10 person unicorn - Max Brodeur-Urbas, Gumloop

An overview of how Gumloop is scaling automation across companies like Instacart, Webflow and Shopify with less than 10 people. About Max Brodeur-Urbas ex-microsoft engineer, started Gumloop in my...

5,819 views • 88 likes • 5 comments • July 15, 2025

Using OSS models to build AI apps with millions of users — Hassan El Mghari

In this talk, Hassan will go over how he builds open source AI apps that get millions of users like roomGPT.io 2.9 million users, restorePhotos.io 1.1 million users, Blinkshot.io 1 million visitors...

6,231 views • 198 likes • 10 comments • July 15, 2025

Bolt.new: How we scaled $0-20m ARR in 60 days, with 15 people — Eric Simons, Bolt

Tiny Teams are the future of how startups are built, and it all comes down to team culture, decision making, tooling choices, and endless grit. In this talk, Eric will share the high octane insigh...

5,686 views • 124 likes • 4 comments • July 15, 2025

Prompt Engineering and AI Red Teaming — Sander Schulhoff, HackAPrompt/LearnPrompting

Learn from the creator of Learn Prompting, the internet's 1st Prompt Engineering guide (released 2 months before ChatGPT), and HackAPrompt, the World's 1st AI Red Teaming competition. My talk will...

11,276 views • 292 likes • 9 comments • July 14, 2025

Survive the AI Knife Fight: Building Products That Win — Brian Balfour, Reforge

If you’ve ever been blocked by vague specs, shifting goals, or chasing “vibes,” things have only gotten messier in the age of AI. Everyone is obsessing over engineers doing PM work and PMs cranking...

14,673 views • 388 likes • 8 comments • July 14, 2025

Automating Escrow with USDC and AI - Corey Cooper, Circle

This workshop explores how USDC, AI, and smart contracts can streamline escrow by automating fund release based on task or process verification. By using AI to interpret off-chain signals such as d...

1,917 views • 51 likes • 2 comments • July 14, 2025

How LLMs work for Web Devs: GPT in 600 lines of Vanilla JS - Ishan Anand

Don't be intimidated. Modern AI can feel like magic, but underneath the hood are principles that web developers can understand, even if you don't have a machine learning background. In this worksho...

8,048 views • 273 likes • 6 comments • July 13, 2025

[Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai — Nick Nisi, Zack Proser

This hands-on workshop introduces Mastra.ai, a TypeScript framework that streamlines the development of agentic AI systems compared to traditional approaches using LangChain and vector databases. P...

8,543 views • 189 likes • 18 comments • July 12, 2025

AI Engineering with the Google Gemini 2.5 Model Family - Philipp Schmid, Google DeepMind

Hands on Workshop on learning to use Gemini 2.5 Pro in combination with Agentic tooling and MCP Servers. About Philipp Schmid Philipp Schmid is a Senior AI Developer Relations Engineer at Google...

4,701 views • 102 likes • 6 comments • July 11, 2025

The New Code — Sean Grove, OpenAI

In an era where AI transforms software development, the most valuable skill isn't writing code - it's communicating intent with precision. This talk reveals how specifications, not prompts or code,...

1,041,429 views • 18,964 likes • 2,113 comments • July 11, 2025

Production software keeps breaking and it will only get worse — Anish Agarwal, Traversal.ai

Software is eating the world. AI is eating software. AI-powered SWE means a whole lot more software is going to be written that powers mission critical systems in the coming years, with hardly any ...

3,912 views • 83 likes • 6 comments • July 10, 2025

Thinking Deeper in Gemini — Jack Rae, Google DeepMind

Progress towards general intelligence has been marked by identifying fundamental intelligence bottlenecks within existing models and developing solutions that improve the architecture or training o...

30,036 views • 609 likes • 33 comments • July 10, 2025

A year of Gemini progress + what comes next — Logan Kilpatrick, Google DeepMind

Over the last year, Google and Gemini models have shown rapid progress across all dimensions (model, product, etc). Let's highlight all the work that has happened, how we got the worlds best models...

15,289 views • 306 likes • 12 comments • July 10, 2025

The Wild World of AI: 6 Months That Changed Everything

From pelicans on bicycles to $600 billion market crashes - discover the most insane AI developments of the past 6 months! 🤖🚲 #AI #MachineLearning #LLM #TechNews #AIRevolution #OpenAI #DeepSeek #Te...

4,723 views • 87 likes • 2 comments • July 10, 2025

2025 in LLMs so far, illustrated by Pelicans on Bicycles — Simon Willison

What's changed in the world of LLMs since the AIE World's Fair last year? A lot! I'll be taking full advantage of my role as a fiercely independent researcher to review the past 12 months of advan...

156,799 views • 3,844 likes • 105 comments • July 09, 2025

Trends Across the AI Frontier — George Cameron, ArtificialAnalysis.ai

The entire AI stack is developing faster than ever - from chips to infrastructure to models. How do you sort the signal from the noise? Artificial Analysis an independent benchmarking and insights ...

13,754 views • 241 likes • 11 comments • July 08, 2025

Training Agentic Reasoners — Will Brown, Prime Intellect

This talk will be a technical deep dive into RL for agentic reasoning via multi-turn tool calling, similar to OpenAI's o3 and Deep Research. In particular, we'll cover: - When, why, and how - GRPO...

20,623 views • 485 likes • 19 comments • July 07, 2025

New York Times' Connections: A Case Study on NLP in Word Games — Shafik Quoraishee, NYT Games

This session will examine the interplay between human intuition and artificial intelligence in puzzle-solving, using the popular New York Times Connections game as a practical case study. ...

4,656 views • 111 likes • 4 comments • July 05, 2025

Claude Code & the evolution of agentic coding — Boris Cherny, Anthropic

A ten thousand foot view of the coding space, the UX of coding, and the Claude Code team's approach. About Boris Chemy Created Claude Code. Member of Technical Staff @Anthropic. Prev: Principal En...

129,637 views • 2,512 likes • 97 comments • July 04, 2025

12-Factor Agents: Patterns of reliable LLM applications — Dex Horthy, HumanLayer

Hi, I'm Dex. I've been hacking on AI agents for a while. I've tried every agent framework out there, from the plug-and-play crew/langchains to the "minimalist" smolagents of the world to t...

252,879 views • 6,332 likes • 168 comments • July 03, 2025

MCP Is Not Good Yet — David Cramer, Sentry

You’ve heard a lot about MCP, probably been given an AI mandate or two, and are trying to figure out what’s real and what’s make believe. This session will give practical advice for how you shoul...

8,281 views • 175 likes • 12 comments • July 03, 2025

Your Personal Open-Source Humanoid Robot for $8,999 — JX Mo, K-Scale Labs

Introducing developer ready robots that are open-source, affordable, and easy to use. https://www.kscale.dev/ About Jingxiang Mo Jingxiang Mo is a founding engineer at K-Scale Labs, where he lead...

39,369 views • 1,084 likes • 87 comments • July 02, 2025

The Build-Operate Divide: Bridging Product Vision and AI Operational Reality

Product leaders see AI possibilities. Operations teams see implementation chaos. That disconnect can kill promising AI features before they ever reach users. In this session, Chris Hernandez (Chim...

2,662 views • 52 likes • 0 comments • July 02, 2025

The New Lean Startup — Sid Bendre, Oleve

In this session, I will be presenting a case study of Oleve's journey, revealing how we've scaled a profitable multi-product portfolio with a tiny team. I'll walk you through the emergence of "tiny...

33,244 views • 994 likes • 28 comments • July 01, 2025

Conquering Agent Chaos — Rick Blalock, Agentuity

Agent deployments can be dicey, especially at first. This session goes over all the things that cause headache with deployments from serverless issues to networking issues - and how we fix them. ...

1,244 views • 26 likes • 0 comments • July 01, 2025

Optimizing inference for voice models in production - Philip Kiely, Baseten

How do you get time to first byte (TTFB) below 150 milliseconds for voice models -- and scale it in production? As it turns out, open-source TTS models like Orpheus have an LLM backbone that lets u...

2,940 views • 82 likes • 1 comments • July 01, 2025

AI Engineer - Videos