Prompt Engineering - Videos

Back to Channel

AIStudio: Major upgrade with Auth and Database

Google AI Studio got a major upgrade. My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Signup for Ne...

6,936 views • 179 likes • 15 comments • March 19, 2026

Anthropic Made Their OpenClaw

Claude Dispatch is here. It let's you control your Desktop Claude instance in a persistent conversation. https://x.com/felixrieseberg/status/2034005731457044577 My Dictation App: www.whryte.com...

32,204 views • 557 likes • 61 comments • March 18, 2026

Anthropic Just Solved Long Context

Anthropic just made the 1M token context window generally available for Claude Opus 4.6 and Sonnet 4.6; and dropped the long-context pricing premium entirely. In this video, I break down why the pr...

22,328 views • 383 likes • 38 comments • March 16, 2026

Claude is taking over...

nthropic didn't add image generation. They gave Claude the ability to build interactive software inside your conversation and throw it away when you're done. My Dictation App: www.whryte.com Websi...

15,011 views • 408 likes • 31 comments • March 13, 2026

Gemini Embedding 2 Is a Big Deal

Thanks to Chargebee for making this video possible, check them out: https://www.chargebee.com/?utm_source=youtube&utm_medium=social_media&utm_campaign=2026-02-da-global-developer-influencer-campaig...

8,425 views • 272 likes • 24 comments • March 12, 2026

Opus just got caught ...

Anthropic just published a paper showing Claude Opus 4.6 figured out it was being tested on BrowseComp, found the encrypted answer key on GitHub, wrote its own decryption code, and extracted the an...

6,317 views • 196 likes • 41 comments • March 11, 2026

The “I Could Build That” Illusion

Everyone says AI can build any app in a weekend now. But if software is becoming cheap, why are simple AI products still selling for millions? In this video, I break down the Cal AI story, why dis...

8,769 views • 401 likes • 44 comments • March 09, 2026

Claude Code 2.0: Massive Upgrade with Agent Loops

Scheduled tasks just landed in Claude Code/Desktop and they cover most of what OpenClaw does but there are actually 4 different scheduling surfaces and picking the wrong one means your task silentl...

15,227 views • 315 likes • 31 comments • March 08, 2026

GPT-5.4: Everything You Need to Know

OpenAI skipped GPT-5.3 entirely and went straight to GPT-5.4 — their first model with native computer use, scoring 75% on OS World and matching human professionals across 44 occupations. Here's eve...

20,901 views • 401 likes • 56 comments • March 05, 2026

A Sad Day for Open Source AI

The entire core team behind Qwen; the most downloaded open-source AI model family in the world, was just pushed out by Alibaba. Here's what happened, why it happened, and what it means for the futu...

35,909 views • 1,132 likes • 165 comments • March 05, 2026

This 30-Year-Old Pattern Fixes AI Agents

Arcade: One API for all your agent auth: https://arcade.dev.plug.dev/LLbk9in In this video, I show how to apply classic three-tier architecture to agent systems so you can separate concerns across...

8,426 views • 230 likes • 8 comments • March 04, 2026

Gemini 3.1 Flash-Lite: The Model You'll Actually Use...

Google just released Gemini 3.1 Flash-Lite, their fastest and most affordable model yet — and it's the third Gemini release in three weeks. In this video, I test its UI generation and reasoning cap...

16,003 views • 334 likes • 43 comments • March 03, 2026

The Problem Every AI Company Is Hiding

LINKS in the VIDEO: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/ https://github.com/google-gemini/gemini-cli/discussions/19724 https://discuss.ai.google....

8,956 views • 331 likes • 47 comments • February 28, 2026

Nano Banana 2 is Here - Faster and Cheaper

Thanks to Google DeepMind for the #EarlyAccess” We're going hands-on with Google's new Nano Banana 2 (Gemini 3.1 Flash Image) to see if this budget-friendly model can truly deliver Pro-level quali...

9,647 views • 240 likes • 24 comments • February 26, 2026

Mercury 2: The First Diffusion Model That 'Thinks'"

In this video, I test Inception's new Mercury 2, a diffusion-based large language model that introduces reasoning capabilities and generates text at 1,000 tokens per second. I demonstrate its speed...

58,122 views • 2,338 likes • 211 comments • February 24, 2026

The AI Model Doesn't Matter Anymore

While the entire industry obsesses over whether GPT, Claude, or Gemini is the best model, they are completely missing the real reason AI agents keep failing. The actual bottleneck isn't the model i...

30,374 views • 1,199 likes • 111 comments • February 23, 2026

Open Source Backend for AI Agents

AI coding agents are incredible at building frontends, but they completely fall apart when you try to make them configure complex, production-ready backends. In this video, we explore Insforge, an ...

8,640 views • 271 likes • 25 comments • February 22, 2026

Gemini 3.1 Pro: The model no one expected

Google just dropped a massive upgrade with Gemini 3.1 Pro! In this video, we break down why this preview release is actually a huge leap forward for AI. We dive deep into the insane new benchmark s...

34,647 views • 654 likes • 84 comments • February 19, 2026

Anthropic Just Killed Tool Calling

Anthropic's latest Sonnet 4.6 release quietly introduced programmatic tool calling; a feature that lets AI agents write code instead of JSON to invoke tools, slashing token usage by up to 98% while...

57,404 views • 1,616 likes • 108 comments • February 18, 2026

Sonnet 4.6 Is Here—And It’s a Beast at Coding

Sonnet 4.6 is here! We are looking at Opus level performance at Sonnet prices. https://www.anthropic.com/news/claude-sonnet-4-6 https://claude.com/blog/improved-web-search-with-dynamic-filtering ...

14,303 views • 321 likes • 31 comments • February 17, 2026

Exploration is All You Need!

Standard RAG lacks context, but full agentic scanning is too slow—so I designed a "Dual-Path" architecture to fix this. In this video, we build a hybrid system using DuckDB and Gemini that combines...

11,047 views • 344 likes • 34 comments • February 17, 2026

Why OpenAI Just "Acquired" The Biggest Open Source Agent

The biggest story in AI right now: Peter Steinberg, the creator of the viral OpenClaw project, is officially joining OpenAI. After a rollercoaster month involving Anthropic blocks, three name chang...

36,997 views • 998 likes • 225 comments • February 16, 2026

OpenClaw Testing - Can't Get Easier than this!

The Easiest Way to Setup OpenClaw: https://app.emergent.sh/?via=engineerprompt OpenClaw is in the news. This is one of the easiest way to setup OpenClaw and test it out. My Dictation App: www.w...

5,739 views • 102 likes • 11 comments • February 15, 2026

The 100x AI Breakthrough No One is Talking About

While the internet obsesses over Gemini 3's benchmark scores, the real revolution is hidden in the 100x reduction in inference compute and the new 'Aletheia' agent. This video breaks down why "thin...

45,323 views • 1,296 likes • 72 comments • February 14, 2026

Codex-Spark: OpenAI Just Broke the Speed Limit (1,000 Tokens/s)

First look at GPT-5.3-Codex-Spark! We also look at Gemini 3 Deep Think, MiniMax M2.5 and GLM-5. These are exciting releases. LINKS: https://openai.com/index/introducing-gpt-5-3-codex-spark/ https...

21,216 views • 403 likes • 70 comments • February 12, 2026

Minimax-Agent: The Ultimate Open-Source "Workhorse" Model

In this video, we explore the Minimax M2.1 open-weight model and its capabilities as a dedicated "workhorse" for coding tasks. Its SOTA open weight model for coding and agentic tasks. The MiniMax A...

6,528 views • 157 likes • 16 comments • February 12, 2026

How OpenClaw Works: The Real "Magic"

Let's talk about what makes "OpenClaw" so special. Its elegant and simple engineering! website: https://openclaw.ai/ My voice to text App: whryte.com Website: https://engineerprompt.ai/ RAG Beyon...

16,851 views • 513 likes • 58 comments • February 10, 2026

RAG is Dead? Introducing Agentic File Exploration

In this video we will look at file search exploration as a potential replacement to RAG. The system uses local Qwen3 model running on DGX Spark. # Clone the Ollama/Qwen3 branch git clone -b feat/...

19,375 views • 666 likes • 91 comments • February 09, 2026

Claude Code's New Agents Team Are Absolutely Insane

Agent Teams in Claude code is a new design primitive that Anthropic introduced where you have dedicated claude instances as agents. https://code.claude.com/docs/en/agent-teams https://x.com/lydia...

37,158 views • 724 likes • 53 comments • February 08, 2026

Opus 4.6 & GPT-5.3: Things Got Interesting!

Two major releases - GPT-5.3-Codex and Opus 4.6! Biggest release day of the year! My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-si...

26,260 views • 526 likes • 57 comments • February 05, 2026

Gemini’s Native Web Scraper: 100% "Free" & Multimodal

👉 Grab your free seat to the 2-Day AI Mastermind: This Saturday and Sunday https://link.outskill.com/PROMPTENG 🔐 100% Discount for the first 1000 people 💥 Dive deep into AI and Learn Automations, B...

18,966 views • 595 likes • 33 comments • February 04, 2026

OpenAI Codex App: Claude Cowork Killer?

Checkout the next codex app from openai: https://openai.com/index/introducing-the-codex-app/ My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https:/...

12,840 views • 189 likes • 14 comments • February 02, 2026

Why Agents Are Ignoring Your Skills (Literally)

Vercel's latest evaluations reveal that AI agents ignore 'Skills' over 56% of the time, casting doubt on the effectiveness of progressive disclosure for current models. I break down why simple "con...

6,263 views • 229 likes • 24 comments • January 30, 2026

Stop Typing. (Seriously)

Get the App here: https://engineerly76.gumroad.com/l/pbzck Use code PROMPT for 25% off! I built the ultimate local-only voice transcription tool for macOS that lets you code and write faster than...

5,469 views • 170 likes • 65 comments • January 29, 2026

Kimi K2.5 - Its more than an LLM

Kimi K2.5 is here. Its a multimodal model specifically trained for agentic use. https://x.com/Kimi_Moonshot/status/2016019760308457649 https://www.kimi.com/blog/kimi-k2-5.html https://www.kimi.co...

13,948 views • 326 likes • 21 comments • January 27, 2026

Clawdbot (moltbot) Clearly Explained...

Clawdbot is blowing up on X/Twitter. Here is my take on it. https://clawd.bot/ My voice to text App: whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-sit...

14,206 views • 392 likes • 49 comments • January 27, 2026

Stop Using AI Agents Like Chatbots. (Do This Instead)

Checkout Verdent here: https://www.verdent.ai/?id=700231 use my personal code: 4S62V4 for 60% off Stop treating AI agents like chatbots—the secret to 'vibe coding' isn't the model, but the paralle...

2,649 views • 72 likes • 2 comments • January 26, 2026

Qwen3-TTS: The ElevenLabs Killer?

Qwen3-TTS is a groundbreaking open-source model that brings real-time voice cloning and custom voice design directly to your local machine. I breakdown the 1.7B and 0.6B models, demonstrating how t...

17,380 views • 470 likes • 43 comments • January 23, 2026

"Don’t Outsource Your Thinking" to your Agent

AI coding agents can drastically speed up your workflow, but without the right habits, they often lead to "context rot" and unmaintainable code. This video breaks down 10 essential lessons from a y...

6,632 views • 243 likes • 26 comments • January 22, 2026

Amazon’s New AI Agent is... Boring? (And that’s good)

Get started with Amazon Nova Act for free here: https://fandf.co/4rMkZYd Browser automation often breaks when you move from a demo to the real world. In this video, we explore Amazon Nova Act appr...

2,158 views • 61 likes • 6 comments • January 21, 2026

NVIDIA Nemotron ASR... The Whisper Killer?

NVIDIA’s new Nemotron Speech ASR uses cache-aware streaming to eliminate the latency drift found in sliding window models like Whisper. This video explains the architecture changes and demonstrates...

4,992 views • 158 likes • 14 comments • January 20, 2026

DeepSeek Engram: We’ve Been Building LLMs Wrong

In this video, I delve into a groundbreaking paper by DeepSeek called Engram that addresses the inefficiencies of transformer-based large language models (LLMs). Traditional LLMs use deep computati...

22,898 views • 894 likes • 64 comments • January 19, 2026

Anthropic Just Fixed MCP’s Biggest Problem

In this video, I discuss how Anthropic has addressed the issue of context window pollution caused by MCP server tool definitions. By using the new MCP tool search, you can dynamically load tools wh...

20,810 views • 666 likes • 41 comments • January 15, 2026

Claude Cowork: First General Agents from Anthropic

In this video I dive into @anthropic-ai new "Cowork" agent, a desktop application that gives AI the power to access your local files and control your browser to automate complex workflows. In this...

10,570 views • 121 likes • 17 comments • January 13, 2026

RAG is Dead? Try Agentic File Search

In this video we will look at file search exploration as a potential replacement to RAG. This is built on top of the the fs-explorer from llamaIndex. LINK to the Repo: https://github.com/PromtEng...

21,535 views • 757 likes • 94 comments • January 11, 2026

The Holy Grail of Intelligence - Explained.

I break down the "5 Levels of Continual Learning" to explain why current AI models suffer from catastrophic forgetting and remain frozen after training. We also explore whether engineering hacks li...

4,486 views • 192 likes • 27 comments • January 08, 2026

Ralph Wiggum - Ship Code While you Sleep!

This video explains the 'Ralph Wiggum' pattern—a method for running Claude Code autonomously using a simple bash loop. I cover the theoretical concept, when to use this approach versus standard pro...

17,080 views • 428 likes • 39 comments • January 06, 2026

Claude’s Best Hidden Feature

In this video we look at Anthropic's "Ask User Question Tool" for specs driven development. It interviews the user for collaboratively working on the SPECS for a project. My voice to text App: wh...

17,232 views • 513 likes • 47 comments • January 02, 2026

Open Source AI Voice Is Finally Good!

In this video, I test Chatterbox Turbo by Resemble AI. It's a permissive, MIT-licensed model that supports zero-shot voice cloning, multilingual output, and paralinguistic control (laughs, pauses, ...

7,928 views • 330 likes • 27 comments • January 01, 2026

""I have never felt this much behind as a programmer" - Andrej Karpathy

LINKS: https://x.com/karpathy/status/2004607146781278521 My voice to text App: whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/...

13,661 views • 149 likes • 20 comments • December 30, 2025