AI Engineer - Videos

Back to Channel

Stateful environments for vertical agents — Josh Purtell, Synth Labs

Hey All - gave a talk on building stateful environments for vertical agents at AI tinkerers and ppl really liked it, happy to do again. Here's the repo - general code that endows environments like ...

1,301 views • 24 likes • 5 comments • July 22, 2025

Books reimagined: AI to create new experiences for things you know — Lukasz Gandecki, TheBrain.pro

[last round of Attendee-Led 10min lightning talks] I will showcase how I got tired of waiting for an AI assisted/no spoiler book reading experience and built my own. Check 30s video at https://yout...

1,915 views • 47 likes • 6 comments • July 22, 2025

AI powered entomology: Lessons from millions of AI code reviews — Tomas Reimers, Graphite

This talk will explore insights from millions of automated code reviews, revealing trends in bugs, vulnerabilities, and code health that Graphite’s AI code review agent have uncovered. This talk wi...

2,477 views • 44 likes • 1 comments • July 22, 2025

Do You Trust Your AI’s Inferences? — Sahil Yadav, Hariharan Ganesan, Telemetrak

Enterprise AI adoption is accelerating, but with it comes a hard question: Do we trust the model’s decisions? In this 18-minute talk, I’ll explore the invisible risks behind automated decision-maki...

589 views • 19 likes • 3 comments • July 22, 2025

How to run Evals at Scale: Thinking beyond Accuracy or Similarity — Muktesh Mishra, Adobe

https://www.linkedin.com/in/mukteshkrmishra/

718 views • 9 likes • 0 comments • July 22, 2025

Continuous Profiling for GPUs — Matthias Loibl, Polar Signals

Continuous Profiling for GPUs extends our industry-leading continuous profiling platform to provide deep, always-on visibility into your GPU workloads. Now you can see exactly how your GPUs are be...

268 views • 4 likes • 0 comments • July 22, 2025

Top Ten Challenges to Reach AGI — Stephen Chin, Andreas Kollegger

an opener to the GraphRAG track!

755 views • 10 likes • 1 comments • July 22, 2025

Practical GraphRAG: Making LLMs smarter with Knowledge Graphs — Michael, Jesus, and Stephen, Neo4j

RAG has become one standard architecture component for GenAI applications to address hallucinations and integrate factual knowledge. While vector search over text is common, knowledge graphs repres...

20,417 views • 441 likes • 24 comments • July 22, 2025

Knowledge Graphs in Litigation Agents — Tom Smoker, WhyHow

Structured Representations are pretty important in the law, where the relationships between clauses, documents, entities, and multiple parties matter. Structured Representation means Structured Con...

3,446 views • 113 likes • 3 comments • July 22, 2025

When Vectors Break Down: Graph-Based RAG for Dense Enterprise Knowledge - Sam Julien, Writer

Enterprise knowledge bases are filled with "dense mapping," thousands of documents where similar terms appear repeatedly, causing traditional vector retrieval to return the wrong version or irrelev...

26,062 views • 546 likes • 16 comments • July 22, 2025

HybridRAG: A Fusion of Graph and Vector Retrieval - Mitesh Patel, NVIDIA

Interpreting complex information from unstructured text data poses significant challenges to Large Language Models (LLM), with difficulties often arising from specialized terminology and the multif...

11,902 views • 302 likes • 17 comments • July 22, 2025

tldraw.computer - Steve Ruiz, tldraw

Learn about tldraw's latest experiments with AI on an infinite canvas. In 2024, we created tldraw computer, a loose visual programming environment where arrows and LLMs powered every step of a grap...

59,790 views • 2,390 likes • 94 comments • July 21, 2025

Excalidraw: AI and Human Whiteboarding Partnership - Christopher Chedeau

Covid sent everybody home and created the space of virtual whiteboards. At first the experience reused the physical constraints but soon it became better than a physical whiteboard thanks to using ...

2,845 views • 65 likes • 4 comments • July 21, 2025

The Bitter Layout or: How I Learned to Love the Model Picker — Maximillian Piras, Yutori

Are conversational interfaces the future or, as many designers have suggested, a lazy solution that is bottlenecking AI-HCI? Despite well-documented usability issues, the design of many AI applicat...

1,316 views • 20 likes • 1 comments • July 21, 2025

UX Design Principles for Semi Autonomous Multi Agent Systems — Victor Dibia, Microsoft

Autonomous or semi-autonomous multi-agent systems (MAS) involve exponentially complex configurations (system config, agent configs, task management and delegation, etc.). These present unique inter...

4,183 views • 103 likes • 5 comments • July 21, 2025

Agentic GraphRAG: AI’s Logical Edge — Stephen Chin, Neo4j

AI models are getting tasked to do increasingly complex and industry specific tasks where different retrieval approaches provide distinct advantages in accuracy, explainability, and cost to execute...

24,718 views • 499 likes • 25 comments • July 21, 2025

CIAM for AI: Authn/Authz for Agents — Michael Grinich, CEO of WorkOS

AI agents are changing the way modern SaaS products operate. Whether automating workflows, integrating with APIs, or acting on behalf of users, AI-driven assistants and autonomous systems are becom...

1,483 views • 41 likes • 3 comments • July 21, 2025

Good design hasn’t changed with AI — John Pham, SF Compute

Bad designs are still bad. AI doesn’t make it good. The novelty of AI makes the bad things tolerable, for a short time. Building great designs and experiences with AI have the same first principles...

3,426 views • 113 likes • 5 comments • July 21, 2025

Building Effective Voice Agents — Toki Sherbakov + Anoop Kotha, OpenAI

How to build production voice applications and learnings from working with customers along the way! https://x.com/tokisherbakov https://www.linkedin.com/in/akotha7/

8,642 views • 257 likes • 9 comments • July 20, 2025

What every AI engineer needs to know about GPUs — Charles Frye, Modal

Every programmer needs to know a few things about hardware, like processors, memory, and disks. Due to AI systems' extreme demand for mathematical processing power, AI engineers need to know a few ...

19,432 views • 517 likes • 15 comments • July 20, 2025

Robots as professional Chefs - Nikhil Abraham, CloudChef

How we converted a bimanual robot into a professional chef that works in novel kitchens and learn new recipes from a single demonstration About Nikhil Abraham Nikhil is the CEO of CloudChef - reim...

1,657 views • 24 likes • 4 comments • July 20, 2025

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han

Why is Reinforcement Learning (RL) suddenly everywhere, and is it truly effective? Have LLMs hit a plateau in terms of intelligence and capabilities, or is RL the breakthrough they need? In this w...

88,913 views • 3,148 likes • 125 comments • July 19, 2025

A Taxonomy for Next-gen Reasoning — Nathan Lambert, Allen Institute (AI2) & Interconnects.ai

Current AI models are extremely skilled, which was seen as the step change in evaluation scores across the industry in the first half of 2025, but often fail when presented with even medium time-ho...

14,343 views • 253 likes • 10 comments • July 19, 2025

How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe

Have you ever launched an awesome agentic demo, only to realize no amount of prompting will make it reliable enough to deploy in production? Agent reliability is a famously difficult problem to sol...

47,777 views • 1,216 likes • 23 comments • July 19, 2025

OpenThoughts: Data Recipes for Reasoning Models — Ryan Marten, Bespoke Labs

Peel back the curtain on state of the art model post-training through the story of OpenThinker, a SOTA small reasoning model (outperforming DeepSeek distill), built in the open. Learn about the dat...

3,062 views • 93 likes • 1 comments • July 19, 2025

Google Photos Magic Editor: GenAI Under the Hood of a Billion-User App - Kelvin Ma, Google Photos

Go behind the scenes of Google Photos' Magic Editor. Explore the engineering feats required to integrate complex CV and cutting-edge generative AI models into a seamless mobile experience. We'll di...

2,013 views • 55 likes • 4 comments • July 19, 2025

Dream Machine: Scaling to 1m users in 4 days — Keegan McCallum, Luma AI

Talking about Luma AI, our mission, and how our ML infrastructure enables SOTA multimodal model development About Keegan McCallum I'm Keegan McCallum, the Head of ML infrastructure at Luma AI. I ...

1,530 views • 56 likes • 7 comments • July 19, 2025

ComfyUI Full Workshop — first workshop from ComfyAnonymous himself!

Quick introduction to ComfyUI and what's new followed by a QA session. Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our ...

3,083 views • 85 likes • 13 comments • July 19, 2025

Design like Karpathy is watching — Zeke Sikelianos, Replicate

Legendary AI engineer and educator Andrej Karpathy recently blogged about his experiences building, deploying, and monetizing a vibe-coded web app called MenuGen. Let's dig into the challenges he f...

6,030 views • 159 likes • 6 comments • July 19, 2025

On Curiosity — Sharif Shameem, Lexica

Creating and sharing demos is the easiest way to influence the future. It gets people to think about what's possible. A good tech demo doesn't have to be fully fleshed out. It doesn't even have to ...

1,640 views • 59 likes • 4 comments • July 19, 2025

Real world MCPs in GitHub Copilot Agent Mode — Jon Peck, Microsoft

As developers, we don't spend most of our time vibe-coding prototypes. More often, we're adding features, squashing bugs, and building tests for existing apps across a wide variety of services and ...

4,568 views • 88 likes • 7 comments • July 19, 2025

The rise of the agentic economy on the shoulders of MCP — Jan Curn, Apify

Thanks to MCP and all the MCP server directories, agents can now autonomously discover new tools and other agents. This lays down the foundation for the future agentic economy, where businesses wil...

6,035 views • 129 likes • 9 comments • July 18, 2025

MCP is all you need — Samuel Colvin, Pydantic

Everyone is talking about agents, and right after that, they’re talking about agent-to-agent communications. Not surprisingly, various nascent, competing protocols are popping up to handle it. But...

61,112 views • 1,059 likes • 34 comments • July 18, 2025

Full Spec MCP: Hidden Capabilities of the MCP spec — Harald Kirschner, Microsoft/VSCode

The true power of Model Context Protocol emerges when clients and servers collaborate across the full spectrum of the specification. This talk presents practical examples of how VS Code's comprehen...

3,884 views • 58 likes • 11 comments • July 18, 2025

Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin

What does it take to go from blank page to live enterprise voice agent in 100 days? That’s the challenge we took on with Fin Voice at Intercom. Enterprise customer service demands high-quality, re...

2,639 views • 72 likes • 3 comments • July 18, 2025

The State of Generative Media - Gorkem Yurtseven, FAL

Generative AI is reshaping the creative landscape, enabling the production of images, audio, and video with unprecedented speed and sophistication. This session offers an in-depth exploration of th...

1,412 views • 34 likes • 2 comments • July 16, 2025

Teaching Gemini to Speak YouTube: Adapting LLMs for Video Recommendations to 2B+DAU - Devansh Tandon

YouTube recommendations drive the majority of video watch time for billions of daily users. Traditionally powered by large embedding models (LEMs), we're undertaking a fundamental shift: rebuilding...

11,641 views • 301 likes • 17 comments • July 16, 2025

Transforming search and discovery using LLMs — Tejaswi & Vinesh, Instacart

Learn how Instacart uses cutting-edge LLMs to redefine search and product discovery. - Explore innovative solutions overcoming traditional search engine limitations for grocery shopping. - Discove...

3,678 views • 61 likes • 2 comments • July 16, 2025

Netflix's Big Bet: One model to rule recommendations: Yesu Feng, Netflix

Discuss the foundation model strategy for personalization at Netflix based on this post https://netflixtechblog.com/foundation-model-for-personalized-recommendation-1a0bd8e02d39 and recent developm...

5,450 views • 123 likes • 6 comments • July 16, 2025

360Brew: LLM-based Personalized Ranking and Recommendation - Hamed and Maziar, LinkedIn AI

We will give a talk about our journey of building a foundation model for solving ranking and recommendation tasks About Hamed Firooz Principal AI Scientist at LinkedIn Core AI. With 15 years in la...

1,340 views • 21 likes • 1 comments • July 16, 2025

What We Learned from Using LLMs in Pinterest — Mukuntha Narayanan, Han Wang, Pinterest

Pinterest Search integrates Large Language Models (LLMs) to enhance relevance scoring by combining search queries with rich multimodal content, including visual captions, link-based text, and user ...

1,517 views • 34 likes • 1 comments • July 16, 2025

ARC AGI-3: Interactive Reasoning Benchmarks for Measuring AGI — Greg Kamradt, ARC Prize Foundation

ARC Prize Foundation is building the North Star for AGI—rigorous, open benchmarks that track reasoning progress in modern AI. We'll show why static AGI evaluations are useful, but fall short when c...

328 views • 12 likes • 0 comments • July 16, 2025

RL for Autonomous Coding — Aakanksha Chowdhery, Reflection.ai

The models and techniques to build fully autonomous coding agents - not just coding copilots - are already here. In this talk, former Google DeepMind staff research scientist, now CEO of Reflection...

6,419 views • 141 likes • 11 comments • July 16, 2025

Recsys Keynote: Improving Recommendation Systems & Search in the Age of LLMs - Eugene Yan, Amazon

Recommendation systems and search have long adopted advances in language modeling, from early adoption of Word2vec for embedding-based retrieval to the transformative impact of GRUs, Transformers, ...

9,581 views • 277 likes • 5 comments • July 16, 2025

Benchmarks Are Memes: How What We Measure Shapes AI—and Us - Alex Duffy, Every.to

Benchmarks shape more than just AI models—they shape our future. The things we choose to measure become self-fulfilling prophecies, guiding AI toward specific abilities and, ultimately, defining hu...

1,357 views • 35 likes • 6 comments • July 15, 2025

Small AI Teams with Huge Impact — Vik Paruchuri, Datalab

We scaled Datalab 5x this year - to 7-figure ARR, with customers that include tier 1 AI labs. We train custom models for document intelligence (OCR, layout), with popular repos surya and marker. I...

7,604 views • 151 likes • 10 comments • July 15, 2025

Rethinking Team Building: how a 30-person Startup serves 50 Million Users — Grant Lee, Gamma

The central thesis of this talk is that in the rapidly evolving age of AI, startups and tech companies should reject the traditional "blitzscaling" model of hyper-growth and specialized roles. Inst...

5,825 views • 108 likes • 2 comments • July 15, 2025

Building a 10 person unicorn - Max Brodeur-Urbas, Gumloop

An overview of how Gumloop is scaling automation across companies like Instacart, Webflow and Shopify with less than 10 people. About Max Brodeur-Urbas ex-microsoft engineer, started Gumloop in my...

5,281 views • 83 likes • 5 comments • July 15, 2025

Using OSS models to build AI apps with millions of users — Hassan El Mghari

In this talk, Hassan will go over how he builds open source AI apps that get millions of users like roomGPT.io 2.9 million users, restorePhotos.io 1.1 million users, Blinkshot.io 1 million visitors...

6,061 views • 194 likes • 10 comments • July 15, 2025

Bolt.new: How we scaled $0-20m ARR in 60 days, with 15 people — Eric Simons, Bolt

Tiny Teams are the future of how startups are built, and it all comes down to team culture, decision making, tooling choices, and endless grit. In this talk, Eric will share the high octane insigh...

5,492 views • 124 likes • 4 comments • July 15, 2025