Prompt Engineering - Videos
AIStudio: Major upgrade with Auth and Database
Google AI Studio got a major upgrade. My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Signup for Ne...
Anthropic Made Their OpenClaw
Claude Dispatch is here. It lets you control your Desktop Claude instance in a persistent conversation. https://x.com/felixrieseberg/status/2034005731457044577 My Dictation App: www.whryte.com...
Anthropic Just Solved Long Context
Anthropic just made the 1M token context window generally available for Claude Opus 4.6 and Sonnet 4.6, and dropped the long-context pricing premium entirely. In this video, I break down why the pr...
Claude is taking over...
Anthropic didn't add image generation. They gave Claude the ability to build interactive software inside your conversation and throw it away when you're done. My Dictation App: www.whryte.com Websi...
Gemini Embedding 2 Is a Big Deal
Thanks to Chargebee for making this video possible, check them out: https://www.chargebee.com/?utm_source=youtube&utm_medium=social_media&utm_campaign=2026-02-da-global-developer-influencer-campaig...
Opus just got caught ...
Anthropic just published a paper showing Claude Opus 4.6 figured out it was being tested on BrowseComp, found the encrypted answer key on GitHub, wrote its own decryption code, and extracted the an...
The “I Could Build That” Illusion
Everyone says AI can build any app in a weekend now. But if software is becoming cheap, why are simple AI products still selling for millions? In this video, I break down the Cal AI story, why dis...
Claude Code 2.0: Massive Upgrade with Agent Loops
Scheduled tasks just landed in Claude Code/Desktop, and they cover most of what OpenClaw does. But there are actually 4 different scheduling surfaces, and picking the wrong one means your task silentl...
GPT-5.4: Everything You Need to Know
OpenAI skipped GPT-5.3 entirely and went straight to GPT-5.4, their first model with native computer use, scoring 75% on OSWorld and matching human professionals across 44 occupations. Here's eve...
A Sad Day for Open Source AI
The entire core team behind Qwen, the most downloaded open-source AI model family in the world, was just pushed out by Alibaba. Here's what happened, why it happened, and what it means for the futu...
This 30-Year-Old Pattern Fixes AI Agents
Arcade: One API for all your agent auth: https://arcade.dev.plug.dev/LLbk9in In this video, I show how to apply classic three-tier architecture to agent systems so you can separate concerns across...
Gemini 3.1 Flash-Lite: The Model You'll Actually Use...
Google just released Gemini 3.1 Flash-Lite, their fastest and most affordable model yet — and it's the third Gemini release in three weeks. In this video, I test its UI generation and reasoning cap...
The Problem Every AI Company Is Hiding
LINKS in the VIDEO: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/ https://github.com/google-gemini/gemini-cli/discussions/19724 https://discuss.ai.google....
Nano Banana 2 is Here - Faster and Cheaper
Thanks to Google DeepMind for the #EarlyAccess. We're going hands-on with Google's new Nano Banana 2 (Gemini 3.1 Flash Image) to see if this budget-friendly model can truly deliver Pro-level quali...
Mercury 2: The First Diffusion Model That 'Thinks'
In this video, I test Inception's new Mercury 2, a diffusion-based large language model that introduces reasoning capabilities and generates text at 1,000 tokens per second. I demonstrate its speed...
The AI Model Doesn't Matter Anymore
While the entire industry obsesses over whether GPT, Claude, or Gemini is the best model, they are completely missing the real reason AI agents keep failing. The actual bottleneck isn't the model i...
Open Source Backend for AI Agents
AI coding agents are incredible at building frontends, but they completely fall apart when you try to make them configure complex, production-ready backends. In this video, we explore Insforge, an ...
Gemini 3.1 Pro: The model no one expected
Google just dropped a massive upgrade with Gemini 3.1 Pro! In this video, we break down why this preview release is actually a huge leap forward for AI. We dive deep into the insane new benchmark s...
Anthropic Just Killed Tool Calling
Anthropic's latest Sonnet 4.6 release quietly introduced programmatic tool calling, a feature that lets AI agents write code instead of JSON to invoke tools, slashing token usage by up to 98% while...
Sonnet 4.6 Is Here—And It’s a Beast at Coding
Sonnet 4.6 is here! We are looking at Opus-level performance at Sonnet prices. https://www.anthropic.com/news/claude-sonnet-4-6 https://claude.com/blog/improved-web-search-with-dynamic-filtering ...
Exploration is All You Need!
Standard RAG lacks context, but full agentic scanning is too slow—so I designed a "Dual-Path" architecture to fix this. In this video, we build a hybrid system using DuckDB and Gemini that combines...
Why OpenAI Just "Acquired" The Biggest Open Source Agent
The biggest story in AI right now: Peter Steinberger, the creator of the viral OpenClaw project, is officially joining OpenAI. After a rollercoaster month involving Anthropic blocks, three name chang...
OpenClaw Testing - Can't Get Easier than this!
The Easiest Way to Set Up OpenClaw: https://app.emergent.sh/?via=engineerprompt OpenClaw is in the news. This is one of the easiest ways to set up OpenClaw and test it out. My Dictation App: www.w...
The 100x AI Breakthrough No One is Talking About
While the internet obsesses over Gemini 3's benchmark scores, the real revolution is hidden in the 100x reduction in inference compute and the new 'Aletheia' agent. This video breaks down why "thin...
Codex-Spark: OpenAI Just Broke the Speed Limit (1,000 Tokens/s)
First look at GPT-5.3-Codex-Spark! We also look at Gemini 3 Deep Think, MiniMax M2.5 and GLM-5. These are exciting releases. LINKS: https://openai.com/index/introducing-gpt-5-3-codex-spark/ https...
Minimax-Agent: The Ultimate Open-Source "Workhorse" Model
In this video, we explore the Minimax M2.1 open-weight model and its capabilities as a dedicated "workhorse" for coding tasks. It's a SOTA open-weight model for coding and agentic tasks. The MiniMax A...
How OpenClaw Works: The Real "Magic"
Let's talk about what makes "OpenClaw" so special. It's elegant and simple engineering! website: https://openclaw.ai/ My voice to text App: whryte.com Website: https://engineerprompt.ai/ RAG Beyon...
RAG is Dead? Introducing Agentic File Exploration
In this video we will look at file search exploration as a potential replacement for RAG. The system uses a local Qwen3 model running on DGX Spark. # Clone the Ollama/Qwen3 branch git clone -b feat/...
Claude Code's New Agents Team Are Absolutely Insane
Agent Teams in Claude Code is a new design primitive from Anthropic in which dedicated Claude instances act as agents. https://code.claude.com/docs/en/agent-teams https://x.com/lydia...
Opus 4.6 & GPT-5.3: Things Got Interesting!
Two major releases - GPT-5.3-Codex and Opus 4.6! Biggest release day of the year! My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-si...
Gemini’s Native Web Scraper: 100% "Free" & Multimodal
👉 Grab your free seat to the 2-Day AI Mastermind: This Saturday and Sunday https://link.outskill.com/PROMPTENG 🔐 100% Discount for the first 1000 people 💥 Dive deep into AI and Learn Automations, B...
OpenAI Codex App: Claude Cowork Killer?
Check out the new Codex app from OpenAI: https://openai.com/index/introducing-the-codex-app/ My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https:/...
Why Agents Are Ignoring Your Skills (Literally)
Vercel's latest evaluations reveal that AI agents ignore 'Skills' over 56% of the time, casting doubt on the effectiveness of progressive disclosure for current models. I break down why simple "con...
Stop Typing. (Seriously)
Get the App here: https://engineerly76.gumroad.com/l/pbzck Use code PROMPT for 25% off! I built the ultimate local-only voice transcription tool for macOS that lets you code and write faster than...
Kimi K2.5 - Its more than an LLM
Kimi K2.5 is here. It's a multimodal model specifically trained for agentic use. https://x.com/Kimi_Moonshot/status/2016019760308457649 https://www.kimi.com/blog/kimi-k2-5.html https://www.kimi.co...
Clawdbot (moltbot) Clearly Explained...
Clawdbot is blowing up on X/Twitter. Here is my take on it. https://clawd.bot/ My voice to text App: whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-sit...
Stop Using AI Agents Like Chatbots. (Do This Instead)
Check out Verdent here: https://www.verdent.ai/?id=700231 Use my personal code 4S62V4 for 60% off. Stop treating AI agents like chatbots: the secret to 'vibe coding' isn't the model, but the paralle...
Qwen3-TTS: The ElevenLabs Killer?
Qwen3-TTS is a groundbreaking open-source model that brings real-time voice cloning and custom voice design directly to your local machine. I break down the 1.7B and 0.6B models, demonstrating how t...
"Don’t Outsource Your Thinking" to your Agent
AI coding agents can drastically speed up your workflow, but without the right habits, they often lead to "context rot" and unmaintainable code. This video breaks down 10 essential lessons from a y...
Amazon’s New AI Agent is... Boring? (And that’s good)
Get started with Amazon Nova Act for free here: https://fandf.co/4rMkZYd Browser automation often breaks when you move from a demo to the real world. In this video, we explore Amazon Nova Act appr...
NVIDIA Nemotron ASR... The Whisper Killer?
NVIDIA’s new Nemotron Speech ASR uses cache-aware streaming to eliminate the latency drift found in sliding window models like Whisper. This video explains the architecture changes and demonstrates...
DeepSeek Engram: We’ve Been Building LLMs Wrong
In this video, I delve into a groundbreaking paper by DeepSeek called Engram that addresses the inefficiencies of transformer-based large language models (LLMs). Traditional LLMs use deep computati...
Anthropic Just Fixed MCP’s Biggest Problem
In this video, I discuss how Anthropic has addressed the issue of context window pollution caused by MCP server tool definitions. By using the new MCP tool search, you can dynamically load tools wh...
Claude Cowork: First General Agents from Anthropic
In this video I dive into @anthropic-ai's new "Cowork" agent, a desktop application that gives AI the power to access your local files and control your browser to automate complex workflows. In this...
RAG is Dead? Try Agentic File Search
In this video we will look at file search exploration as a potential replacement for RAG. This is built on top of the fs-explorer from LlamaIndex. LINK to the Repo: https://github.com/PromtEng...
The Holy Grail of Intelligence - Explained.
I break down the "5 Levels of Continual Learning" to explain why current AI models suffer from catastrophic forgetting and remain frozen after training. We also explore whether engineering hacks li...
Ralph Wiggum - Ship Code While you Sleep!
This video explains the 'Ralph Wiggum' pattern—a method for running Claude Code autonomously using a simple bash loop. I cover the theoretical concept, when to use this approach versus standard pro...
Claude’s Best Hidden Feature
In this video we look at Anthropic's "Ask User Question Tool" for spec-driven development. It interviews the user to collaboratively work out the specs for a project. My voice to text App: wh...
Open Source AI Voice Is Finally Good!
In this video, I test Chatterbox Turbo by Resemble AI. It's a permissive, MIT-licensed model that supports zero-shot voice cloning, multilingual output, and paralinguistic control (laughs, pauses, ...
"I have never felt this much behind as a programmer" - Andrej Karpathy
LINKS: https://x.com/karpathy/status/2004607146781278521 My voice to text App: whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/...