Yt Tracker

POC to PROD: Hard Lessons from 200+ Enterprise GenAI Deployments - Randall Hunt, Caylent

The transition from experimental GenAI demonstrations to robust, production-grade systems involves significant technical and organizational complexities. Humans provide a ceiling on the true ROI of...

39,449 views • 836 likes • 17 comments • July 23, 2025

From Copilot to Colleague: Trustworthy Agents for High-Stakes - Joel Hron, CTO Thomson Reuters

This keynote will explore what it takes to move from basic generative assistants to fully agentic AI—systems that don’t just suggest but plan, act, and adapt—all within the structured, high-trust e...

1,691 views • 33 likes • 3 comments • July 23, 2025

How to Hire AI Engineers when EVERYONE is cheating with AI — Beth Glenfield, DevDay

AI broke recruitment - how to think about hiring for AI-enabled engineers in the era of AI cheating agents and AI customised resumes. Recorded at the AI Engineer World's Fair in San Francisco. Sta...

7,228 views • 172 likes • 21 comments • July 22, 2025

Stateful environments for vertical agents — Josh Purtell, Synth Labs

Hey All - gave a talk on building stateful environments for vertical agents at AI tinkerers and ppl really liked it, happy to do again. Here's the repo - general code that endows environments like ...

1,367 views • 23 likes • 5 comments • July 22, 2025

Books reimagined: AI to create new experiences for things you know — Lukasz Gandecki, TheBrain.pro

[last round of Attendee-Led 10min lightning talks] I will showcase how I got tired of waiting for an AI assisted/no spoiler book reading experience and built my own. Check 30s video at https://yout...

2,008 views • 49 likes • 5 comments • July 22, 2025

AI powered entomology: Lessons from millions of AI code reviews — Tomas Reimers, Graphite

This talk will explore insights from millions of automated code reviews, revealing trends in bugs, vulnerabilities, and code health that Graphite’s AI code review agent have uncovered. This talk wi...

4,565 views • 60 likes • 1 comments • July 22, 2025

Do You Trust Your AI’s Inferences? — Sahil Yadav, Hariharan Ganesan, Telemetrak

Enterprise AI adoption is accelerating, but with it comes a hard question: Do we trust the model’s decisions? In this 18-minute talk, I’ll explore the invisible risks behind automated decision-maki...

676 views • 20 likes • 3 comments • July 22, 2025

How to run Evals at Scale: Thinking beyond Accuracy or Similarity — Muktesh Mishra, Adobe

https://www.linkedin.com/in/mukteshkrmishra/

786 views • 10 likes • 0 comments • July 22, 2025

Continuous Profiling for GPUs — Matthias Loibl, Polar Signals

Continuous Profiling for GPUs extends our industry-leading continuous profiling platform to provide deep, always-on visibility into your GPU workloads. Now you can see exactly how your GPUs are be...

357 views • 6 likes • 0 comments • July 22, 2025

Top Ten Challenges to Reach AGI — Stephen Chin, Andreas Kollegger

an opener to the GraphRAG track!

840 views • 11 likes • 1 comments • July 22, 2025

Practical GraphRAG: Making LLMs smarter with Knowledge Graphs — Michael, Jesus, and Stephen, Neo4j

RAG has become one standard architecture component for GenAI applications to address hallucinations and integrate factual knowledge. While vector search over text is common, knowledge graphs repres...

41,663 views • 849 likes • 35 comments • July 22, 2025

Knowledge Graphs in Litigation Agents — Tom Smoker, WhyHow

Structured Representations are pretty important in the law, where the relationships between clauses, documents, entities, and multiple parties matter. Structured Representation means Structured Con...

4,073 views • 121 likes • 3 comments • July 22, 2025

When Vectors Break Down: Graph-Based RAG for Dense Enterprise Knowledge - Sam Julien, Writer

Enterprise knowledge bases are filled with "dense mapping," thousands of documents where similar terms appear repeatedly, causing traditional vector retrieval to return the wrong version or irrelev...

36,502 views • 773 likes • 16 comments • July 22, 2025

HybridRAG: A Fusion of Graph and Vector Retrieval - Mitesh Patel, NVIDIA

Interpreting complex information from unstructured text data poses significant challenges to Large Language Models (LLM), with difficulties often arising from specialized terminology and the multif...

19,762 views • 511 likes • 24 comments • July 22, 2025

tldraw.computer - Steve Ruiz, tldraw

Learn about tldraw's latest experiments with AI on an infinite canvas. In 2024, we created tldraw computer, a loose visual programming environment where arrows and LLMs powered every step of a grap...

63,399 views • 2,481 likes • 100 comments • July 21, 2025

Excalidraw: AI and Human Whiteboarding Partnership - Christopher Chedeau

Covid sent everybody home and created the space of virtual whiteboards. At first the experience reused the physical constraints but soon it became better than a physical whiteboard thanks to using ...

3,877 views • 86 likes • 4 comments • July 21, 2025

The Bitter Layout or: How I Learned to Love the Model Picker — Maximillian Piras, Yutori

Are conversational interfaces the future or, as many designers have suggested, a lazy solution that is bottlenecking AI-HCI? Despite well-documented usability issues, the design of many AI applicat...

1,730 views • 28 likes • 1 comments • July 21, 2025

UX Design Principles for Semi Autonomous Multi Agent Systems — Victor Dibia, Microsoft

Autonomous or semi-autonomous multi-agent systems (MAS) involve exponentially complex configurations (system config, agent configs, task management and delegation, etc.). These present unique inter...

4,867 views • 111 likes • 5 comments • July 21, 2025

Agentic GraphRAG: AI’s Logical Edge — Stephen Chin, Neo4j

AI models are getting tasked to do increasingly complex and industry specific tasks where different retrieval approaches provide distinct advantages in accuracy, explainability, and cost to execute...

35,079 views • 693 likes • 30 comments • July 21, 2025

CIAM for AI: Authn/Authz for Agents — Michael Grinich, CEO of WorkOS

AI agents are changing the way modern SaaS products operate. Whether automating workflows, integrating with APIs, or acting on behalf of users, AI-driven assistants and autonomous systems are becom...

1,905 views • 46 likes • 3 comments • July 21, 2025

Good design hasn’t changed with AI — John Pham, SF Compute

Bad designs are still bad. AI doesn’t make it good. The novelty of AI makes the bad things tolerable, for a short time. Building great designs and experiences with AI have the same first principles...

3,716 views • 119 likes • 5 comments • July 21, 2025

Building Effective Voice Agents — Toki Sherbakov + Anoop Kotha, OpenAI

How to build production voice applications and learnings from working with customers along the way! https://x.com/tokisherbakov https://www.linkedin.com/in/akotha7/

11,106 views • 313 likes • 9 comments • July 20, 2025

What every AI engineer needs to know about GPUs — Charles Frye, Modal

Every programmer needs to know a few things about hardware, like processors, memory, and disks. Due to AI systems' extreme demand for mathematical processing power, AI engineers need to know a few ...

21,605 views • 558 likes • 15 comments • July 20, 2025

Robots as professional Chefs - Nikhil Abraham, CloudChef

How we converted a bimanual robot into a professional chef that works in novel kitchens and learn new recipes from a single demonstration About Nikhil Abraham Nikhil is the CEO of CloudChef - reim...

2,325 views • 29 likes • 4 comments • July 20, 2025

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han

Why is Reinforcement Learning (RL) suddenly everywhere, and is it truly effective? Have LLMs hit a plateau in terms of intelligence and capabilities, or is RL the breakthrough they need? In this w...

112,992 views • 3,753 likes • 133 comments • July 19, 2025

A Taxonomy for Next-gen Reasoning — Nathan Lambert, Allen Institute (AI2) & Interconnects.ai

Current AI models are extremely skilled, which was seen as the step change in evaluation scores across the industry in the first half of 2025, but often fail when presented with even medium time-ho...

15,762 views • 281 likes • 10 comments • July 19, 2025

How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe

Have you ever launched an awesome agentic demo, only to realize no amount of prompting will make it reliable enough to deploy in production? Agent reliability is a famously difficult problem to sol...

61,051 views • 1,527 likes • 27 comments • July 19, 2025

OpenThoughts: Data Recipes for Reasoning Models — Ryan Marten, Bespoke Labs

Peel back the curtain on state of the art model post-training through the story of OpenThinker, a SOTA small reasoning model (outperforming DeepSeek distill), built in the open. Learn about the dat...

3,489 views • 95 likes • 1 comments • July 19, 2025

Google Photos Magic Editor: GenAI Under the Hood of a Billion-User App - Kelvin Ma, Google Photos

Go behind the scenes of Google Photos' Magic Editor. Explore the engineering feats required to integrate complex CV and cutting-edge generative AI models into a seamless mobile experience. We'll di...

2,259 views • 55 likes • 4 comments • July 19, 2025

Dream Machine: Scaling to 1m users in 4 days — Keegan McCallum, Luma AI

Talking about Luma AI, our mission, and how our ML infrastructure enables SOTA multimodal model development About Keegan McCallum I'm Keegan McCallum, the Head of ML infrastructure at Luma AI. I ...

1,700 views • 58 likes • 6 comments • July 19, 2025

ComfyUI Full Workshop — first workshop from ComfyAnonymous himself!

Quick introduction to ComfyUI and what's new followed by a QA session. Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our ...

3,506 views • 91 likes • 13 comments • July 19, 2025

Design like Karpathy is watching — Zeke Sikelianos, Replicate

Legendary AI engineer and educator Andrej Karpathy recently blogged about his experiences building, deploying, and monetizing a vibe-coded web app called MenuGen. Let's dig into the challenges he f...

6,199 views • 161 likes • 6 comments • July 19, 2025

On Curiosity — Sharif Shameem, Lexica

Creating and sharing demos is the easiest way to influence the future. It gets people to think about what's possible. A good tech demo doesn't have to be fully fleshed out. It doesn't even have to ...

1,905 views • 64 likes • 5 comments • July 19, 2025

Real world MCPs in GitHub Copilot Agent Mode — Jon Peck, Microsoft

As developers, we don't spend most of our time vibe-coding prototypes. More often, we're adding features, squashing bugs, and building tests for existing apps across a wide variety of services and ...

4,994 views • 93 likes • 7 comments • July 19, 2025

The rise of the agentic economy on the shoulders of MCP — Jan Curn, Apify

Thanks to MCP and all the MCP server directories, agents can now autonomously discover new tools and other agents. This lays down the foundation for the future agentic economy, where businesses wil...

6,307 views • 132 likes • 10 comments • July 18, 2025

MCP is all you need — Samuel Colvin, Pydantic

Everyone is talking about agents, and right after that, they’re talking about agent-to-agent communications. Not surprisingly, various nascent, competing protocols are popping up to handle it. But...

65,282 views • 1,146 likes • 31 comments • July 18, 2025

Full Spec MCP: Hidden Capabilities of the MCP spec — Harald Kirschner, Microsoft/VSCode

The true power of Model Context Protocol emerges when clients and servers collaborate across the full spectrum of the specification. This talk presents practical examples of how VS Code's comprehen...

4,149 views • 61 likes • 11 comments • July 18, 2025

Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin

What does it take to go from blank page to live enterprise voice agent in 100 days? That’s the challenge we took on with Fin Voice at Intercom. Enterprise customer service demands high-quality, re...

4,194 views • 87 likes • 4 comments • July 18, 2025

The State of Generative Media - Gorkem Yurtseven, FAL

Generative AI is reshaping the creative landscape, enabling the production of images, audio, and video with unprecedented speed and sophistication. This session offers an in-depth exploration of th...

1,537 views • 39 likes • 2 comments • July 16, 2025

Teaching Gemini to Speak YouTube: Adapting LLMs for Video Recommendations to 2B+DAU - Devansh Tandon

YouTube recommendations drive the majority of video watch time for billions of daily users. Traditionally powered by large embedding models (LEMs), we're undertaking a fundamental shift: rebuilding...

16,490 views • 413 likes • 16 comments • July 16, 2025

Transforming search and discovery using LLMs — Tejaswi & Vinesh, Instacart

Learn how Instacart uses cutting-edge LLMs to redefine search and product discovery. - Explore innovative solutions overcoming traditional search engine limitations for grocery shopping. - Discove...

4,604 views • 73 likes • 2 comments • July 16, 2025

Netflix's Big Bet: One model to rule recommendations: Yesu Feng, Netflix

Discuss the foundation model strategy for personalization at Netflix based on this post https://netflixtechblog.com/foundation-model-for-personalized-recommendation-1a0bd8e02d39 and recent developm...

7,878 views • 177 likes • 6 comments • July 16, 2025

360Brew: LLM-based Personalized Ranking and Recommendation - Hamed and Maziar, LinkedIn AI

We will give a talk about our journey of building a foundation model for solving ranking and recommendation tasks About Hamed Firooz Principal AI Scientist at LinkedIn Core AI. With 15 years in la...

2,752 views • 46 likes • 1 comments • July 16, 2025

What We Learned from Using LLMs in Pinterest — Mukuntha Narayanan, Han Wang, Pinterest

Pinterest Search integrates Large Language Models (LLMs) to enhance relevance scoring by combining search queries with rich multimodal content, including visual captions, link-based text, and user ...

2,106 views • 45 likes • 1 comments • July 16, 2025

ARC AGI-3: Interactive Reasoning Benchmarks for Measuring AGI — Greg Kamradt, ARC Prize Foundation

ARC Prize Foundation is building the North Star for AGI—rigorous, open benchmarks that track reasoning progress in modern AI. We'll show why static AGI evaluations are useful, but fall short when c...

488 views • 13 likes • 0 comments • July 16, 2025

RL for Autonomous Coding — Aakanksha Chowdhery, Reflection.ai

The models and techniques to build fully autonomous coding agents - not just coding copilots - are already here. In this talk, former Google DeepMind staff research scientist, now CEO of Reflection...

7,264 views • 152 likes • 11 comments • July 16, 2025

Recsys Keynote: Improving Recommendation Systems & Search in the Age of LLMs - Eugene Yan, Amazon

Recommendation systems and search have long adopted advances in language modeling, from early adoption of Word2vec for embedding-based retrieval to the transformative impact of GRUs, Transformers, ...

17,675 views • 530 likes • 8 comments • July 16, 2025

Benchmarks Are Memes: How What We Measure Shapes AI—and Us - Alex Duffy, Every.to

Benchmarks shape more than just AI models—they shape our future. The things we choose to measure become self-fulfilling prophecies, guiding AI toward specific abilities and, ultimately, defining hu...

1,556 views • 42 likes • 6 comments • July 15, 2025

Small AI Teams with Huge Impact — Vik Paruchuri, Datalab

We scaled Datalab 5x this year - to 7-figure ARR, with customers that include tier 1 AI labs. We train custom models for document intelligence (OCR, layout), with popular repos surya and marker. I...

8,113 views • 161 likes • 10 comments • July 15, 2025

Rethinking Team Building: how a 30-person Startup serves 50 Million Users — Grant Lee, Gamma

The central thesis of this talk is that in the rapidly evolving age of AI, startups and tech companies should reject the traditional "blitzscaling" model of hyper-growth and specialized roles. Inst...

6,209 views • 112 likes • 2 comments • July 15, 2025

AI Engineer - Videos