AI Engineer - Videos

Back to Channel

3 ingredients for building reliable enterprise agents - Harrison Chase, LangChain/LangGraph

It's easy to build a prototype of an agent, but hard to put an agent in production - especially in an enterprise setting. In this section, will talk about three ingredients for building reliable ag...

47,008 views • 866 likes • 33 comments • July 23, 2025

From Hype to Habit: How We’re Building an AI-First SaaS Company—While Still Shipping the Roadmap

What does it really take to move a modern SaaS company from AI experimentation to becoming truly AI-first? At Sprout Social, we’re in the midst of that transformation—rearchitecting strategy, syst...

714 views • 14 likes • 0 comments • July 23, 2025

Machines of Buying and Selling Grace - Adam Behrens, New Generation

How to go beyond browser automation to truly agentic commerce, where AI can buy, sell and negotiate on behalf of users and merchants. About Adam Behrens Adam Behrens is the co-founder and CEO of N...

478 views • 10 likes • 0 comments • July 23, 2025

How to Build Planning Agents without losing control - Yogendra Miraje, Factset

LLMs are getting smarter—but Agents are still unpredictable, unreliable, and hard to control. In this talk, I’ll share practical lessons from building real-world plan-and-execute agents —covering ...

8,440 views • 199 likes • 4 comments • July 23, 2025

Building Agents (the hard parts!) - Rita Kozlov, Cloudflare

AI workloads are rapidly shifting from AI being used for augmentation (co-pilots), to AI becoming responsible for full, end-to-end automation (agents). But building effective agents, and even more ...

4,018 views • 77 likes • 1 comments • July 23, 2025

POC to PROD: Hard Lessons from 200+ Enterprise GenAI Deployments - Randall Hunt, Caylent

The transition from experimental GenAI demonstrations to robust, production-grade systems involves significant technical and organizational complexities. Humans provide a ceiling on the true ROI of...

38,399 views • 822 likes • 17 comments • July 23, 2025

From Copilot to Colleague: Trustworthy Agents for High-Stakes - Joel Hron, CTO Thomson Reuters

This keynote will explore what it takes to move from basic generative assistants to fully agentic AI—systems that don’t just suggest but plan, act, and adapt—all within the structured, high-trust e...

1,613 views • 31 likes • 3 comments • July 23, 2025

How to Hire AI Engineers when EVERYONE is cheating with AI — Beth Glenfield, DevDay

AI broke recruitment - how to think about hiring for AI-enabled engineers in the era of AI cheating agents and AI customised resumes. Recorded at the AI Engineer World's Fair in San Francisco. Sta...

7,137 views • 171 likes • 21 comments • July 22, 2025

Stateful environments for vertical agents — Josh Purtell, Synth Labs

Hey All - gave a talk on building stateful environments for vertical agents at AI tinkerers and ppl really liked it, happy to do again. Here's the repo - general code that endows environments like ...

1,341 views • 23 likes • 5 comments • July 22, 2025

Books reimagined: AI to create new experiences for things you know — Lukasz Gandecki, TheBrain.pro

[last round of Attendee-Led 10min lightning talks] I will showcase how I got tired of waiting for an AI assisted/no spoiler book reading experience and built my own. Check 30s video at https://yout...

1,947 views • 48 likes • 6 comments • July 22, 2025

AI powered entomology: Lessons from millions of AI code reviews — Tomas Reimers, Graphite

This talk will explore insights from millions of automated code reviews, revealing trends in bugs, vulnerabilities, and code health that Graphite’s AI code review agent have uncovered. This talk wi...

2,834 views • 46 likes • 1 comments • July 22, 2025

Do You Trust Your AI’s Inferences? — Sahil Yadav, Hariharan Ganesan, Telemetrak

Enterprise AI adoption is accelerating, but with it comes a hard question: Do we trust the model’s decisions? In this 18-minute talk, I’ll explore the invisible risks behind automated decision-maki...

622 views • 19 likes • 3 comments • July 22, 2025

How to run Evals at Scale: Thinking beyond Accuracy or Similarity — Muktesh Mishra, Adobe

https://www.linkedin.com/in/mukteshkrmishra/

749 views • 10 likes • 0 comments • July 22, 2025

Continuous Profiling for GPUs — Matthias Loibl, Polar Signals

Continuous Profiling for GPUs extends our industry-leading continuous profiling platform to provide deep, always-on visibility into your GPU workloads. Now you can see exactly how your GPUs are be...

300 views • 4 likes • 0 comments • July 22, 2025

Top Ten Challenges to Reach AGI — Stephen Chin, Andreas Kollegger

an opener to the GraphRAG track!

785 views • 10 likes • 1 comments • July 22, 2025

Practical GraphRAG: Making LLMs smarter with Knowledge Graphs — Michael, Jesus, and Stephen, Neo4j

RAG has become one standard architecture component for GenAI applications to address hallucinations and integrate factual knowledge. While vector search over text is common, knowledge graphs repres...

27,971 views • 604 likes • 25 comments • July 22, 2025

Knowledge Graphs in Litigation Agents — Tom Smoker, WhyHow

Structured Representations are pretty important in the law, where the relationships between clauses, documents, entities, and multiple parties matter. Structured Representation means Structured Con...

3,684 views • 116 likes • 3 comments • July 22, 2025

When Vectors Break Down: Graph-Based RAG for Dense Enterprise Knowledge - Sam Julien, Writer

Enterprise knowledge bases are filled with "dense mapping," thousands of documents where similar terms appear repeatedly, causing traditional vector retrieval to return the wrong version or irrelev...

29,784 views • 646 likes • 16 comments • July 22, 2025

HybridRAG: A Fusion of Graph and Vector Retrieval - Mitesh Patel, NVIDIA

Interpreting complex information from unstructured text data poses significant challenges to Large Language Models (LLM), with difficulties often arising from specialized terminology and the multif...

14,854 views • 374 likes • 19 comments • July 22, 2025

tldraw.computer - Steve Ruiz, tldraw

Learn about tldraw's latest experiments with AI on an infinite canvas. In 2024, we created tldraw computer, a loose visual programming environment where arrows and LLMs powered every step of a grap...

61,247 views • 2,435 likes • 98 comments • July 21, 2025

Excalidraw: AI and Human Whiteboarding Partnership - Christopher Chedeau

Covid sent everybody home and created the space of virtual whiteboards. At first the experience reused the physical constraints but soon it became better than a physical whiteboard thanks to using ...

3,183 views • 74 likes • 4 comments • July 21, 2025

The Bitter Layout or: How I Learned to Love the Model Picker — Maximillian Piras, Yutori

Are conversational interfaces the future or, as many designers have suggested, a lazy solution that is bottlenecking AI-HCI? Despite well-documented usability issues, the design of many AI applicat...

1,431 views • 21 likes • 1 comments • July 21, 2025

UX Design Principles for Semi Autonomous Multi Agent Systems — Victor Dibia, Microsoft

Autonomous or semi-autonomous multi-agent systems (MAS) involve exponentially complex configurations (system config, agent configs, task management and delegation, etc.). These present unique inter...

4,378 views • 106 likes • 5 comments • July 21, 2025

Agentic GraphRAG: AI’s Logical Edge — Stephen Chin, Neo4j

AI models are getting tasked to do increasingly complex and industry specific tasks where different retrieval approaches provide distinct advantages in accuracy, explainability, and cost to execute...

28,797 views • 591 likes • 26 comments • July 21, 2025

CIAM for AI: Authn/Authz for Agents — Michael Grinich, CEO of WorkOS

AI agents are changing the way modern SaaS products operate. Whether automating workflows, integrating with APIs, or acting on behalf of users, AI-driven assistants and autonomous systems are becom...

1,569 views • 43 likes • 3 comments • July 21, 2025

Good design hasn’t changed with AI — John Pham, SF Compute

Bad designs are still bad. AI doesn’t make it good. The novelty of AI makes the bad things tolerable, for a short time. Building great designs and experiences with AI have the same first principles...

3,557 views • 117 likes • 5 comments • July 21, 2025

Building Effective Voice Agents — Toki Sherbakov + Anoop Kotha, OpenAI

How to build production voice applications and learnings from working with customers along the way! https://x.com/tokisherbakov https://www.linkedin.com/in/akotha7/

9,553 views • 283 likes • 9 comments • July 20, 2025

What every AI engineer needs to know about GPUs — Charles Frye, Modal

Every programmer needs to know a few things about hardware, like processors, memory, and disks. Due to AI systems' extreme demand for mathematical processing power, AI engineers need to know a few ...

20,220 views • 531 likes • 15 comments • July 20, 2025

Robots as professional Chefs - Nikhil Abraham, CloudChef

How we converted a bimanual robot into a professional chef that works in novel kitchens and learn new recipes from a single demonstration About Nikhil Abraham Nikhil is the CEO of CloudChef - reim...

1,810 views • 25 likes • 4 comments • July 20, 2025

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han

Why is Reinforcement Learning (RL) suddenly everywhere, and is it truly effective? Have LLMs hit a plateau in terms of intelligence and capabilities, or is RL the breakthrough they need? In this w...

97,629 views • 3,358 likes • 126 comments • July 19, 2025

A Taxonomy for Next-gen Reasoning — Nathan Lambert, Allen Institute (AI2) & Interconnects.ai

Current AI models are extremely skilled, which was seen as the step change in evaluation scores across the industry in the first half of 2025, but often fail when presented with even medium time-ho...

14,903 views • 261 likes • 10 comments • July 19, 2025

How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe

Have you ever launched an awesome agentic demo, only to realize no amount of prompting will make it reliable enough to deploy in production? Agent reliability is a famously difficult problem to sol...

52,758 views • 1,346 likes • 24 comments • July 19, 2025

OpenThoughts: Data Recipes for Reasoning Models — Ryan Marten, Bespoke Labs

Peel back the curtain on state of the art model post-training through the story of OpenThinker, a SOTA small reasoning model (outperforming DeepSeek distill), built in the open. Learn about the dat...

3,215 views • 96 likes • 1 comments • July 19, 2025

Google Photos Magic Editor: GenAI Under the Hood of a Billion-User App - Kelvin Ma, Google Photos

Go behind the scenes of Google Photos' Magic Editor. Explore the engineering feats required to integrate complex CV and cutting-edge generative AI models into a seamless mobile experience. We'll di...

2,090 views • 55 likes • 4 comments • July 19, 2025

Dream Machine: Scaling to 1m users in 4 days — Keegan McCallum, Luma AI

Talking about Luma AI, our mission, and how our ML infrastructure enables SOTA multimodal model development About Keegan McCallum I'm Keegan McCallum, the Head of ML infrastructure at Luma AI. I ...

1,563 views • 56 likes • 6 comments • July 19, 2025

ComfyUI Full Workshop — first workshop from ComfyAnonymous himself!

Quick introduction to ComfyUI and what's new followed by a QA session. Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our ...

3,226 views • 88 likes • 12 comments • July 19, 2025

Design like Karpathy is watching — Zeke Sikelianos, Replicate

Legendary AI engineer and educator Andrej Karpathy recently blogged about his experiences building, deploying, and monetizing a vibe-coded web app called MenuGen. Let's dig into the challenges he f...

6,103 views • 161 likes • 6 comments • July 19, 2025

On Curiosity — Sharif Shameem, Lexica

Creating and sharing demos is the easiest way to influence the future. It gets people to think about what's possible. A good tech demo doesn't have to be fully fleshed out. It doesn't even have to ...

1,764 views • 64 likes • 4 comments • July 19, 2025

Real world MCPs in GitHub Copilot Agent Mode — Jon Peck, Microsoft

As developers, we don't spend most of our time vibe-coding prototypes. More often, we're adding features, squashing bugs, and building tests for existing apps across a wide variety of services and ...

4,742 views • 90 likes • 7 comments • July 19, 2025

The rise of the agentic economy on the shoulders of MCP — Jan Curn, Apify

Thanks to MCP and all the MCP server directories, agents can now autonomously discover new tools and other agents. This lays down the foundation for the future agentic economy, where businesses wil...

6,129 views • 131 likes • 9 comments • July 18, 2025

MCP is all you need — Samuel Colvin, Pydantic

Everyone is talking about agents, and right after that, they’re talking about agent-to-agent communications. Not surprisingly, various nascent, competing protocols are popping up to handle it. But...

62,660 views • 1,095 likes • 30 comments • July 18, 2025

Full Spec MCP: Hidden Capabilities of the MCP spec — Harald Kirschner, Microsoft/VSCode

The true power of Model Context Protocol emerges when clients and servers collaborate across the full spectrum of the specification. This talk presents practical examples of how VS Code's comprehen...

3,994 views • 58 likes • 11 comments • July 18, 2025

Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin

What does it take to go from blank page to live enterprise voice agent in 100 days? That’s the challenge we took on with Fin Voice at Intercom. Enterprise customer service demands high-quality, re...

2,910 views • 74 likes • 3 comments • July 18, 2025

The State of Generative Media - Gorkem Yurtseven, FAL

Generative AI is reshaping the creative landscape, enabling the production of images, audio, and video with unprecedented speed and sophistication. This session offers an in-depth exploration of th...

1,467 views • 36 likes • 2 comments • July 16, 2025

Teaching Gemini to Speak YouTube: Adapting LLMs for Video Recommendations to 2B+DAU - Devansh Tandon

YouTube recommendations drive the majority of video watch time for billions of daily users. Traditionally powered by large embedding models (LEMs), we're undertaking a fundamental shift: rebuilding...

13,245 views • 333 likes • 17 comments • July 16, 2025

Transforming search and discovery using LLMs — Tejaswi & Vinesh, Instacart

Learn how Instacart uses cutting-edge LLMs to redefine search and product discovery. - Explore innovative solutions overcoming traditional search engine limitations for grocery shopping. - Discove...

4,010 views • 69 likes • 2 comments • July 16, 2025

Netflix's Big Bet: One model to rule recommendations: Yesu Feng, Netflix

Discuss the foundation model strategy for personalization at Netflix based on this post https://netflixtechblog.com/foundation-model-for-personalized-recommendation-1a0bd8e02d39 and recent developm...

6,374 views • 147 likes • 6 comments • July 16, 2025

360Brew: LLM-based Personalized Ranking and Recommendation - Hamed and Maziar, LinkedIn AI

We will give a talk about our journey of building a foundation model for solving ranking and recommendation tasks About Hamed Firooz Principal AI Scientist at LinkedIn Core AI. With 15 years in la...

2,010 views • 34 likes • 1 comments • July 16, 2025

What We Learned from Using LLMs in Pinterest — Mukuntha Narayanan, Han Wang, Pinterest

Pinterest Search integrates Large Language Models (LLMs) to enhance relevance scoring by combining search queries with rich multimodal content, including visual captions, link-based text, and user ...

1,749 views • 37 likes • 1 comments • July 16, 2025

ARC AGI-3: Interactive Reasoning Benchmarks for Measuring AGI — Greg Kamradt, ARC Prize Foundation

ARC Prize Foundation is building the North Star for AGI—rigorous, open benchmarks that track reasoning progress in modern AI. We'll show why static AGI evaluations are useful, but fall short when c...

402 views • 12 likes • 0 comments • July 16, 2025