Sam Witteveen - Videos
Back to ChannelFunctionGemma - Function Calling at the Edge
In this video, I look at the latest release from the Gemma team, FunctionGemma, which is all about having a super-small model that you can use to do function calling at the edge. In this video, we...
Gemini 3 Flash - Your Daily Workhorse Upgraded
In this video, I go through the latest Gemini 3 Flash model from Google. With a bump in intelligence, and yet still fast and cheap, this model is set to become many people's daily workhorse. Blo...
The Gemini Interactions API
In this video, we look at the new interactions API from the Gemini API team, and how you can use it to build and do various tasks with not only Gemini models but also agents. Blog: https://blog.g...
Junie The Anti-Vibe Coding IDE
In this video, I look at Junie from JetBrains a coding agent that is Anti vibe coding but is like a professional pair programming partner. It can run in the various JetBrains IDEs that are customiz...
Mistral 3: Europe's Answer to DeepSeek or Too Little, Too Late?
In this video, we look at the latest release from Mistral, a new Mistral-3 large model and 3 Minstral models. Blog: https://mistral.ai/news/mistral-3 HF: https://huggingface.co/mistralai/collecti...
Nano Banana Pro has arrived!!
In this video, I go through the new Nano Banana Pro aka Gemini 3 Pro Image Blog: https://blog.google/technology/developers/gemini-3-pro-image-developers/ https://blog.google/technology/ai/nano-ban...
Antigravity Google's Cursor Killer
In this video, I go through Antigravity the new Agentic Coding App from Google Blog: https://antigravity.google/blog/introducing-google-antigravity For more tutorials on using LLMs and building a...
Gemini 3 Pro - The Model You've Been Waiting For
In this video, I go through Gemini 3 Pro, the model that people have been waiting for. Blog: https://blog.google/products/gemini/gemini-3 For more tutorials on using LLMs and building agents, che...
Gemini RAG - File Search Tool
In this video, I go through the latest release from the Gemini API team, which is the file search tool, or basically a simple RAG system built right into the Gemini API Blog: https://blog.google/t...
NEW Top Open Model - Kimi K2 Thinking
In this video, I look at Kimi K2 Thinking from Moonshot AI, the most recent fully open reasoning model that scores higher than GPT-5 and Anthropic for multiple benchmarks. Blog: https://moonshota...
AgentHQ by Github
In this video, I cover AgentHQ which was launch by Github at the yearly conference and which had the goal of managing all your agents AgentHQ: https://github.blog/news-insights/company-news/welcom...
LangChain Reaches 1.0 - Whats new?
In this video, I cover LangChain's recent announcements about raising money at over a $1.25 valuation and the launch of LangChain and LangGraph 1.0. LangChain Is now a Unicorn: https://blog.langch...
Is Meta killing FAIR?
In this video, I go through some of the recent news about Meta laying off 600 people that worked for Facebook AI Research. Article: https://www.axios.com/2025/10/22/meta-superintelligence-tbd-ai-...
ChatGPT Atlas - The Battle for your Browser
In this video we look at OpenAI's new browser Atlas and what it can do. Blog: https://chatgpt.com/atlas Download: https://chatgpt.com/atlas/get-started/ For more tutorials on using LLMs and build...
DeepSeek OCR - More than OCR
In this video, I look at DeepSeek OCR and show that it's an experiment in using images to compress text representations better. DeepSeek OCR Paper: https://github.com/deepseek-ai/DeepSeek-OCR/blo...
Claude Skills - SOPs For Agents
In this video, I look at a new announcement from Anthropic called Claude Skills, but also more generally at the concept of how frontier labs are creating standard operating procedures for agents to...
Haiku 4.5 - Small Beats Big
In this video, I look at the latest model from Anthropic, Claude Haiku 4.5, and see how it stacks up both in intelligence and speed to its bigger brother and its previous versions. Blog: https://...
OpenAI's Agent Builder
In this video, I go through OpenAI's Agent Builder. Blog for Agent Kit: https://openai.com/index/introducing-agentkit/ For more tutorials on using LLMs and building agents, check out my Patreon ...
OpenAI DevDay 2025 - What Hit What Missed
In this video, I go through the key announcements from the OpenAI DevDay keynote. Blog for Agent Kit: https://openai.com/index/introducing-agentkit/ For more tutorials on using LLMs and building...
Sora 2 - OpenAI's TikTok
In this video I look at the release SORA 2 and how it goes beyond just being a model, but is also an app and a social network. Blog: https://openai.com/index/sora-2/ SORA 2 feed philosophy: https...
... there's more to Sonnet 4.5
In this video I look at the release of Sonnet 4.5 in context of Anthropic's plans for the Virtual Collaborator. Blog: https://www.anthropic.com/news/claude-sonnet-4-5 Devin blog: https://cognitio...
Meta's Code World Model
In this video I look at some new research out of Meta which is a code world model and is basically an LLM trained in a different way to try and get it to understand the tokens that it's generating....
The Qwen Avalanche
Blog: https://qwen.ai/research For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Twitter: https://x.com/Sam_Witteveen 🕵️ Int...
Google's NEW Agent Money Protocol
In this video, I look at A2P, Google's new agent payments protocol. We look at some of the key facts behind it, where it sits in comparison to A2A and MCP, and how you can get started. Blog: htt...
Qwen3 Next - Behind the Curtain
In this video I go through Quentin3 Next and how it tests out a bunch of ideas. In this video, I go through Qwen3-Next and how it tests. Blog: https://qwen.ai/blog?id=4074cca80393150c248e508aa629...
Kimi K2 0905 for Agents
In this video, I look at how the Kimi K2-0905 model has been updated to be better for agentic and tool calling use cases. Model Card:https://huggingface.co/moonshotai/Kimi-K2-Instruct-0905 Colab...
EmbeddingGemma - Micro Embeddings for Mobile Devices
In this video, I look at the latest Gemma release which is EmbeddingGemma, a 300M model that can be used for doing embedding tasks like RAG and semantic similarity on phones and mobile devices at t...
The Future of AI Coding with Aja Hammerly
Recently at Google I/O Connect China, I sat down with Aja Hammerly, and we talked about the future of AI coding, how things are evolving and tips for getting better results. Firebase Studio: http...
Gemini 2.5 Flash Image is Nano Banana!!
In this video, I go through the latest Gemini 2.5 flash image model (also known as Nano Banana) and show what it can do when you combine reasoning and conversational input for really good image gen...
GPT 5 - What They Didn't Say
In this video, I look at the launch of GPT-5 and what we can work out about the system that they have released. Blog: https://openai.com/index/introducing-gpt-5/ For more tutorials on using LLMs...
OpenAI's New OPEN Models - GPT-OSS 120B & 20B
Blog: https://openai.com/index/introducing-gpt-oss/ Colab: https://dripl.ink/BLrkZ For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamW...
LangExtract - Google's New Library for NLP Tasks
In this video, I look at LangExtract, a library from Google that allows you to do old-world natural language processing tasks with ease using LLMs and structured outputs. Blog: https://developers...
Gemini Deep Think
In this video, we look at the latest Gemini release, Gemini DeepThink, and see what it can be used for and how it was able to reach gold medal standard in the International Math Olympiad. Blog: ht...
Ollama Gets a New App
To celebrate Ollama's 2nd birthday the cute llamas have got a new app!! Blog: https://ollama.com/blog/new-app For more tutorials on using LLMs and building agents, check out my Patreon Patreon: ...
Opal - Google Labs Killer NEW App
In this video, I look at the latest release from Google Labs, which is a new app called Opal. Opal allows you to create LLM and generative AI workflows using a drag-and-drop and description system....
SmolLMv3 - A Small Reasoner with Tool Use.
Blog: https://huggingface.co/blog/smollm3 Colab: https://dripl.ink/oFvSw For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWitteveen Tw...
Kyutai STT & TTS - A Perfect Local Voice Solution?
Blog: https://kyutai.org/next/stt Blog: https://kyutai.org/next/tts GitHub: https://github.com/kyutai-labs/delayed-streams-modeling Colab: https://dripl.ink/QZevZ For more tutorials on using LLM...
GeminiCLI - The Deep Dive with MCPs
This time I do a deep dive into Gemini CLI and look at how you can use tools with it and how you can use MCPs with it to make both your development faster but also to be able to do other tasks beyo...
Introducing Gemini CLI
Blog: https://blog.google/technology/developers/introducing-gemini-cli-open-source-ai-agent/ GitHub: https://github.com/google-gemini/gemini-cli/tree/main For more tutorials on using LLMs and buil...
NanoNets OCR-s
Blog: https://nanonets.com/research/nanonets-ocr-s/ Colab: https://dripl.ink/YQEpC For more tutorials on using LLMs and building agents, check out my Patreon Patreon: https://www.patreon.com/SamWi...
Qwen 3 Embeddings & Rerankers
In this video I look at the new release from Qwen of their new Embedding and Reranking models which are start of the art and most importantly open weights models. Blog: https://qwenlm.github.io/bl...
Building with Chatterbox TTS, Voice Cloning & Watermarking
In this video, I look at the new Chatterbox TTS from Resemble.AI and how it's improving open-source text-to-speech with its impressive voice cloning and emotion control capabilities. We explore its...
MedGemma - An Open Doctor Model?
Blog: https://medgemma.org/ Colab 4B: https://dripl.ink/WgA5X Colab 27B: https://dripl.ink/WRzFq Colab Finetuning: https://dripl.ink/IxsYs For more tutorials on using LLMs and building agents, che...
Mistral Agents API - The NEW Agent System
In this video, I look at the new Agents API from Mistral and how they are building an agentic story around their models. Blog: https://mistral.ai/news/agents-api Colab: https://dripl.ink/q7VoH Co...
Gemini TTS - Native Audio Out
In this video, I look at the Gemini TTS that was released at Google I/O last week and show you how you can use it to do various things with speech and dialogue. Blog: https://blog.google/technolo...
Google I/O 25 - Models vs Products
In this video, I cover the new models and products that were announced in the Google I/O keynote. Blog: https://blog.google/technology/ai/io-2025-keynote/ For more tutorials on using LLMs and bu...
NVIDIA beats Whisper with Parakeetv2
In this video, I look at the latest open-weight ASR system from NVIDIA. Colab: https://dripl.ink/Op9rY HF: https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2 HF Spaces: https://huggingface.co/spac...
Slash Your Gemini Bill Up To 75 %
In this video, I look at Google's new implicit caching for Gemini 2.5 models. Colab: https://dripl.ink/aLabu Blog: https://developers.googleblog.com/en/gemini-2-5-models-now-support-implicit-cach...
The Improved Gemini 2.5 Pro - A Coding Powerhouse
In this video, I test out a new and improved version of the Gemini 2.5 Pro model. This model is exceptionally good at coding tasks and can reason over large docs and videos for context. Blog: htt...
Phi-4 Reasoning - Microsoft Joins the Reasoning Race!!
In this video, I look at the new 5-four reasoning models from Microsoft, and look at how the team created these models and actually how good these models are. Blog: https://azure.microsoft.com/en...