Sam Witteveen - Videos

LangChain Reaches 1.0 - What's new?

In this video, I cover LangChain's recent announcements about raising money at a valuation of over $1.25 billion and the launch of LangChain and LangGraph 1.0. LangChain Is now a Unicorn: https://blog.langch...

6,381 views • 253 likes • 25 comments • October 26, 2025

Is Meta killing FAIR?

In this video, I go through some of the recent news about Meta laying off 600 people that worked for Facebook AI Research. Article: https://www.axios.com/2025/10/22/meta-superintelligence-tbd-ai-...

5,872 views • 203 likes • 45 comments • October 24, 2025

ChatGPT Atlas - The Battle for your Browser

In this video we look at OpenAI's new browser Atlas and what it can do. Blog: https://chatgpt.com/atlas Download: https://chatgpt.com/atlas/get-started/ For more tutorials on using LLMs and build...

15,476 views • 311 likes • 40 comments • October 21, 2025

DeepSeek OCR - More than OCR

In this video, I look at DeepSeek OCR and show that it's an experiment in using images to compress text representations better. DeepSeek OCR Paper: https://github.com/deepseek-ai/DeepSeek-OCR/blo...

194,187 views • 6,680 likes • 316 comments • October 20, 2025

Claude Skills - SOPs For Agents

In this video, I look at a new announcement from Anthropic called Claude Skills, but also more generally at the concept of how frontier labs are creating standard operating procedures for agents to...

39,975 views • 1,120 likes • 110 comments • October 17, 2025

Haiku 4.5 - Small Beats Big

In this video, I look at the latest model from Anthropic, Claude Haiku 4.5, and see how it stacks up both in intelligence and speed to its bigger brother and its previous versions. Blog: https://...

15,226 views • 426 likes • 48 comments • October 16, 2025

OpenAI's Agent Builder

In this video, I go through OpenAI's Agent Builder. Blog for Agent Kit: https://openai.com/index/introducing-agentkit/ For more tutorials on using LLMs and building agents, check out my Patreon ...

9,303 views • 240 likes • 27 comments • October 07, 2025

OpenAI DevDay 2025 - What Hit What Missed

In this video, I go through the key announcements from the OpenAI DevDay keynote. Blog for Agent Kit: https://openai.com/index/introducing-agentkit/ For more tutorials on using LLMs and building...

13,977 views • 306 likes • 28 comments • October 06, 2025

Sora 2 - OpenAI's TikTok

In this video, I look at the release of Sora 2 and how it goes beyond being just a model: it is also an app and a social network. Blog: https://openai.com/index/sora-2/ Sora 2 feed philosophy: https...

4,521 views • 98 likes • 16 comments • October 02, 2025

... there's more to Sonnet 4.5

In this video, I look at the release of Sonnet 4.5 in the context of Anthropic's plans for the Virtual Collaborator. Blog: https://www.anthropic.com/news/claude-sonnet-4-5 Devin blog: https://cognitio...

43,449 views • 981 likes • 118 comments • September 30, 2025

Meta's Code World Model

In this video I look at some new research out of Meta which is a code world model and is basically an LLM trained in a different way to try and get it to understand the tokens that it's generating....

21,281 views • 658 likes • 37 comments • September 26, 2025

The Qwen Avalanche

Blog: https://qwen.ai/research For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witteveen 🕵️ Int...

10,504 views • 336 likes • 28 comments • September 24, 2025

Google's NEW Agent Money Protocol

In this video, I look at AP2, Google's new agent payments protocol. We look at some of the key facts behind it, where it sits in comparison to A2A and MCP, and how you can get started. Blog: htt...

38,164 views • 991 likes • 86 comments • September 16, 2025

Qwen3 Next - Behind the Curtain

In this video, I go through Qwen3-Next and how it tests out a bunch of ideas. Blog: https://qwen.ai/blog?id=4074cca80393150c248e508aa629...

9,669 views • 337 likes • 32 comments • September 12, 2025

Kimi K2 0905 for Agents

In this video, I look at how the Kimi K2-0905 model has been updated to be better for agentic and tool-calling use cases. Model Card: https://huggingface.co/moonshotai/Kimi-K2-Instruct-0905 Colab...

7,950 views • 242 likes • 10 comments • September 05, 2025

EmbeddingGemma - Micro Embeddings for Mobile Devices

In this video, I look at the latest Gemma release which is EmbeddingGemma, a 300M model that can be used for doing embedding tasks like RAG and semantic similarity on phones and mobile devices at t...

11,200 views • 433 likes • 13 comments • September 04, 2025

The Future of AI Coding with Aja Hammerly

Recently at Google I/O Connect China, I sat down with Aja Hammerly, and we talked about the future of AI coding, how things are evolving and tips for getting better results. Firebase Studio: http...

3,939 views • 118 likes • 17 comments • September 02, 2025

Gemini 2.5 Flash Image is Nano Banana!!

In this video, I go through the latest Gemini 2.5 Flash Image model (also known as Nano Banana) and show what it can do when you combine reasoning and conversational input for really good image gen...

61,380 views • 1,298 likes • 135 comments • August 26, 2025

GPT 5 - What They Didn't Say

In this video, I look at the launch of GPT-5 and what we can work out about the system that they have released. Blog: https://openai.com/index/introducing-gpt-5/ For more tutorials on using LLMs...

99,938 views • 2,140 likes • 345 comments • August 07, 2025

OpenAI's New OPEN Models - GPT-OSS 120B & 20B

Blog: https://openai.com/index/introducing-gpt-oss/ Colab: https://dripl.ink/BLrkZ For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamW...

18,018 views • 522 likes • 79 comments • August 06, 2025

LangExtract - Google's New Library for NLP Tasks

In this video, I look at LangExtract, a library from Google that allows you to do old-world natural language processing tasks with ease using LLMs and structured outputs. Blog: https://developers...

88,634 views • 2,469 likes • 78 comments • August 04, 2025

Gemini Deep Think

In this video, we look at the latest Gemini release, Gemini Deep Think, and see what it can be used for and how it was able to reach gold-medal standard at the International Mathematical Olympiad. Blog: ht...

38,402 views • 742 likes • 74 comments • August 01, 2025

Ollama Gets a New App

To celebrate Ollama's 2nd birthday the cute llamas have got a new app!! Blog: https://ollama.com/blog/new-app For more tutorials on using LLMs and building agents, check out my Patreon Patreon: ...

34,419 views • 1,067 likes • 105 comments • July 31, 2025

Opal - Google Labs Killer NEW App

In this video, I look at the latest release from Google Labs, which is a new app called Opal. Opal allows you to create LLM and generative AI workflows using a drag-and-drop and description system....

141,647 views • 2,817 likes • 97 comments • July 29, 2025

SmolLMv3 - A Small Reasoner with Tool Use.

Blog: https://huggingface.co/blog/smollm3 Colab: https://dripl.ink/oFvSw For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Tw...

15,509 views • 506 likes • 34 comments • July 09, 2025

Kyutai STT & TTS - A Perfect Local Voice Solution?

Blog: https://kyutai.org/next/stt Blog: https://kyutai.org/next/tts GitHub: https://github.com/kyutai-labs/delayed-streams-modeling Colab: https://dripl.ink/QZevZ For more tutorials on using LLM...

28,811 views • 711 likes • 69 comments • July 04, 2025

GeminiCLI - The Deep Dive with MCPs

This time I do a deep dive into Gemini CLI and look at how you can use tools and MCPs with it, both to make your development faster and to do other tasks beyo...

20,738 views • 534 likes • 79 comments • June 27, 2025

Introducing Gemini CLI

Blog: https://blog.google/technology/developers/introducing-gemini-cli-open-source-ai-agent/ GitHub: https://github.com/google-gemini/gemini-cli/tree/main For more tutorials on using LLMs and buil...

144,902 views • 3,279 likes • 246 comments • June 25, 2025

NanoNets OCR-s

Blog: https://nanonets.com/research/nanonets-ocr-s/ Colab: https://dripl.ink/YQEpC For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWi...

22,160 views • 725 likes • 52 comments • June 20, 2025

Qwen 3 Embeddings & Rerankers

In this video, I look at the new Embedding and Reranking models from Qwen, which are state of the art and, most importantly, open-weights models. Blog: https://qwenlm.github.io/bl...

18,172 views • 642 likes • 37 comments • June 06, 2025

Building with Chatterbox TTS, Voice Cloning & Watermarking

In this video, I look at the new Chatterbox TTS from Resemble.AI and how it's improving open-source text-to-speech with its impressive voice cloning and emotion control capabilities. We explore its...

13,593 views • 386 likes • 41 comments • June 05, 2025

MedGemma - An Open Doctor Model?

Blog: https://medgemma.org/ Colab 4B: https://dripl.ink/WgA5X Colab 27B: https://dripl.ink/WRzFq Colab Finetuning: https://dripl.ink/IxsYs For more tutorials on using LLMs and building agents, che...

36,565 views • 1,171 likes • 72 comments • June 03, 2025

Mistral Agents API - The NEW Agent System

In this video, I look at the new Agents API from Mistral and how they are building an agentic story around their models. Blog: https://mistral.ai/news/agents-api Colab: https://dripl.ink/q7VoH Co...

18,734 views • 522 likes • 28 comments • May 29, 2025

Gemini TTS - Native Audio Out

In this video, I look at the Gemini TTS that was released at Google I/O last week and show you how you can use it to do various things with speech and dialogue. Blog: https://blog.google/technolo...

45,026 views • 891 likes • 99 comments • May 28, 2025

Google I/O 25 - Models vs Products

In this video, I cover the new models and products that were announced in the Google I/O keynote. Blog: https://blog.google/technology/ai/io-2025-keynote/ For more tutorials on using LLMs and bu...

7,054 views • 217 likes • 18 comments • May 21, 2025

NVIDIA beats Whisper with Parakeetv2

In this video, I look at the latest open-weight ASR system from NVIDIA. Colab: https://dripl.ink/Op9rY HF: https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2 HF Spaces: https://huggingface.co/spac...

18,235 views • 601 likes • 32 comments • May 14, 2025

Slash Your Gemini Bill Up To 75 %

In this video, I look at Google's new implicit caching for Gemini 2.5 models. Colab: https://dripl.ink/aLabu Blog: https://developers.googleblog.com/en/gemini-2-5-models-now-support-implicit-cach...

8,724 views • 290 likes • 69 comments • May 12, 2025

The Improved Gemini 2.5 Pro - A Coding Powerhouse

In this video, I test out a new and improved version of the Gemini 2.5 Pro model. This model is exceptionally good at coding tasks and can reason over large docs and videos for context. Blog: htt...

43,720 views • 1,040 likes • 114 comments • May 06, 2025

Phi-4 Reasoning - Microsoft Joins the Reasoning Race!!

In this video, I look at the new Phi-4 reasoning models from Microsoft, how the team created them, and how good they actually are. Blog: https://azure.microsoft.com/en...

7,873 views • 247 likes • 18 comments • May 02, 2025

Introducing the Qwen 3 Family

Blog: https://qwenlm.github.io/blog/qwen3/ For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witte...

10,937 views • 347 likes • 29 comments • April 29, 2025

Dia 1.6B TTS for NotebookLM Podcasts

In this video, I look at the new TTS system called Dia by Nari Labs and explore how it could be used to make podcasts similar to NotebookLM. Colab: https://dripl.ink/UQnVJ Hugging Face: https://hug...

18,100 views • 555 likes • 61 comments • April 24, 2025

GPT-4.1 - The Catchup Models

In this video I break down the recent release of the GPT-4.1 models from OpenAI and discuss where they fit in the OpenAI ecosystem and the overall LLM ecosystem. Blog: https://openai.com/index/gp...

7,003 views • 204 likes • 31 comments • April 16, 2025

Google's NEW Agent2Agent Protocol

In this video I cover Google's new Agent2Agent Protocol, what it can do, who is on board and who isn't. Blog: https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/ Github: ...

39,093 views • 889 likes • 63 comments • April 11, 2025

Google Launches an Agent SDK - Agent Development Kit

In this video, I look at the new Agent Development Kit from Google and how they are entering the Agent SDK market. Docs: https://google.github.io/adk-docs/ Github: https://github.com/google/adk-pyth...

69,705 views • 1,465 likes • 67 comments • April 09, 2025

Gemini 2.5 Pro for YouTube Analysis

In this video, I look at how to use the Gemini 2.5 Pro model for tasks that use YouTube videos. Colab: https://dripl.ink/GolWz For more tutorials on using LLMs and building agents, check out my...

17,214 views • 485 likes • 45 comments • April 08, 2025

Gemini 2.5 Pro for Audio Transcription

In this video, I go through using the new Gemini 2.5 Pro for audio transcription and audio analysis tasks and show you how to get the best results out. Colab: https://dripl.ink/mXQLh Pricing: htt...

39,026 views • 764 likes • 78 comments • April 06, 2025

OpenAI Needs YOU!!

In this video, I go through how OpenAI is looking for feedback on their new open-weights LLM. Feedback wanted: https://openai.com/open-model-feedback/ For more tutorials on using LLMs and...

9,587 views • 209 likes • 52 comments • April 01, 2025

Creating Mind Maps with OpenAI's Image Generation

In this video, I look at the latest model from OpenAI that can do a variety of different image generation tasks and look at how you can apply it to creating mind maps. Blog: https://openai.com/in...

14,198 views • 534 likes • 36 comments • March 30, 2025

Qwen 2.5 Omni - Your NEW Open Omni Powerhouse

In this video I look at the latest model out from Qwen, the Qwen 2.5 Omni model, which allows you to basically use the model for full multimodal input (text, images, video, audio) and get either te...

23,487 views • 825 likes • 82 comments • March 28, 2025

Gemini 2.5 - The Thinking Family of Models

In this video, we look at the Gemini 2.5 Pro model and how the new Gemini 2.5 family is becoming Google's new line of reasoning models. Blog: https://blog.google/technology/google-deepmind/gem...

13,397 views • 407 likes • 41 comments • March 26, 2025