Prompt Engineering - Videos

Back to Channel

Orchestration Over Architecture: What Stanford Found

Thanks to DataImpluse for sponsoring this video: https://dataimpulse.com/?utm_source=youtube&utm_medium=video&utm_campaign=engineerprompt Two new papers from Stanford and Tsinghua just put hard nu...

10,381 views • 475 likes • 30 comments • May 04, 2026

DeepSeek Just Killed Visual Reasoning (And It's 10× Cheaper)

Deepseek new paper "Thinking with Visual Primitives". They dropped the paper in their github repo and then removed it. Here is the paper: https://github.com/ailuntx/Thinking-with-Visual-Primitives...

11,751 views • 399 likes • 36 comments • May 02, 2026

What is an Agent Harness? and How to build a great one!

To apply 40% off 3 months of Coursera plus - https://imp.i384100.net/c/7245724/3880401/14726 Google AI Essentials - https://imp.i384100.net/1GW56D Prompt Engineering for ChatGPT - https://imp.i3841...

39,666 views • 1,219 likes • 74 comments • April 30, 2026

Save 98% on AI Agent Tokens With This One Trick

Thanks to BrightData for sponsoring this video. Checkout their new MCP server here: https://github.com/brightdata/brightdata-mcp MCP servers can burn through half your context window on tool defin...

7,370 views • 210 likes • 20 comments • April 28, 2026

DeepSeek Just Did It Again

DeepSeek V4 just dropped with two massive models, 1.6 trillion and 284 billion parameters, and the Pro version rivals closed source giants like GPT-5.5 and Opus 4.6 while using only 27% of the comp...

23,511 views • 531 likes • 79 comments • April 24, 2026

Claude Design Is Here — Figma Stock Crashed

Anthropic just released Claude Design, the new design partner for builders. https://www.anthropic.com/news/claude-design-anthropic-labs My Dictation App: www.whryte.com Website: https://enginee...

4,604 views • 91 likes • 22 comments • April 17, 2026

Codex Just Became the Everything App

OpenAI is betting on Codex to be the everything App! https://openai.com/index/codex-for-almost-everything/ My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics ...

22,998 views • 386 likes • 34 comments • April 16, 2026

Opus 4.7 is here... upgrade or downgrade?

Opus 4.7 is here and its the most interesting release from Anthropic https://www.anthropic.com/news/claude-opus-4-7 My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond...

13,929 views • 273 likes • 51 comments • April 16, 2026

Anthropic Is Building a Super App

Anthropic just redesigned Claude Desktop around Claude Code with parallel sessions, split panels, and a built-in browser preview, turning it into a unified dev environment that could replace your I...

8,218 views • 178 likes • 22 comments • April 15, 2026

Hermes Agent: The Self-Improving AI That Learns You

Open Router gives you access to 100+ models through a single API endpoint. Try it free and find the best model for your workflow at https://openrouter.plug.dev/SoSUEGl Hermes Agent is an open-sour...

13,721 views • 356 likes • 31 comments • April 14, 2026

New Claude Features For Developers

Anthropic's new advisor strategy lets you pair Opus with Sonnet for better results at lower cost, the monitor tool kills wasteful polling loops in Claude Code, and managed agents handle the infrast...

12,324 views • 258 likes • 14 comments • April 10, 2026

Gemma 4 Vision Agent | Object Detection + VLM Pipeline

Vision language models like Gemma 4 are great at understanding images but terrible at counting objects. In this video, I combine Gemma 4 with Falcon Perception, a tiny 300M parameter segmentation m...

14,608 views • 499 likes • 28 comments • April 07, 2026

Replit Agent 4: Parallel Agents for Vibe Coding

Checkout Agent 4 on Replit: https://replit.com/refer/engineerprompt Replit recently launched Agent 4, and it lets you ideate, design, and build in the same interface. I rebuilt my Google Hackathon...

2,390 views • 57 likes • 4 comments • April 06, 2026

The "Free Lunch" Is Over!

OpenClaw just got banned by Anthropic and the drama continues. https://pbs.twimg.com/media/HFBME5fa4AAUdIi?format=jpg&name=large https://x.com/bcherny/status/2040206440556826908 My Dictation App...

6,247 views • 172 likes • 35 comments • April 04, 2026

Qwen 3.6 Plus is Opus but Free?

Alibaba just released Qwen 3.6 Plus, and it's dangerously close to the frontier. In this video, I test it across multiple coding tasks and show you why the harness you choose matters more than the ...

39,516 views • 577 likes • 55 comments • April 02, 2026

OpenAI Eating Anthropic Lunch: Codex inside Claude Code

OpenAI just released an official plugin that brings Codex inside Claude Code, now letting you review, challenge, and delegate code to a completely different model without leaving your workflow. In ...

13,328 views • 264 likes • 39 comments • March 31, 2026

Chroma's New 20B Model Beats GPT-5 at Search

Chroma just released Context-1 — a 20B parameter self-editing search agent that matches frontier models like GPT-5 and Opus 4.5 on retrieval benchmarks at a fraction of the cost and 10x faster infe...

14,918 views • 439 likes • 23 comments • March 27, 2026

Self-Evolving AI Is Here — And It's Open Weight

MiniMax M2.7 is the first model showing real signs of self-evolution — it analyzes its own failures, modifies its harness, and iterates until performance improves. In this video, I break down exact...

11,872 views • 308 likes • 15 comments • March 25, 2026

Claude's New Computer Control Feature Is Insane

Computer use feature in Claude https://x.com/claudeai/status/2036195789601374705 https://support.claude.com/en/articles/14128542-let-claude-use-your-computer-in-cowork My Dictation App: www.whryt...

15,625 views • 262 likes • 11 comments • March 24, 2026

AIStudio: Major upgrade with Auth and Database

Google AI Studio got a major upgrade. My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Signup for Ne...

8,087 views • 195 likes • 17 comments • March 19, 2026

Anthropic Made Their OpenClaw

Claude Dispatch is here. It let's you control your Desktop Claude instance in a persistent conversation. https://x.com/felixrieseberg/status/2034005731457044577 My Dictation App: www.whryte.com...

35,240 views • 591 likes • 60 comments • March 18, 2026

Anthropic Just Solved Long Context

Anthropic just made the 1M token context window generally available for Claude Opus 4.6 and Sonnet 4.6; and dropped the long-context pricing premium entirely. In this video, I break down why the pr...

23,222 views • 398 likes • 37 comments • March 16, 2026

Claude is taking over...

nthropic didn't add image generation. They gave Claude the ability to build interactive software inside your conversation and throw it away when you're done. My Dictation App: www.whryte.com Websi...

15,556 views • 410 likes • 31 comments • March 13, 2026

Gemini Embedding 2 Is a Big Deal

Thanks to Chargebee for making this video possible, check them out: https://www.chargebee.com/?utm_source=youtube&utm_medium=social_media&utm_campaign=2026-02-da-global-developer-influencer-campaig...

10,057 views • 296 likes • 25 comments • March 12, 2026

Opus just got caught ...

Anthropic just published a paper showing Claude Opus 4.6 figured out it was being tested on BrowseComp, found the encrypted answer key on GitHub, wrote its own decryption code, and extracted the an...

6,455 views • 196 likes • 41 comments • March 11, 2026

The “I Could Build That” Illusion

Everyone says AI can build any app in a weekend now. But if software is becoming cheap, why are simple AI products still selling for millions? In this video, I break down the Cal AI story, why dis...

8,920 views • 406 likes • 44 comments • March 09, 2026

Claude Code 2.0: Massive Upgrade with Agent Loops

Scheduled tasks just landed in Claude Code/Desktop and they cover most of what OpenClaw does but there are actually 4 different scheduling surfaces and picking the wrong one means your task silentl...

15,666 views • 318 likes • 31 comments • March 08, 2026

GPT-5.4: Everything You Need to Know

OpenAI skipped GPT-5.3 entirely and went straight to GPT-5.4 — their first model with native computer use, scoring 75% on OS World and matching human professionals across 44 occupations. Here's eve...

22,536 views • 413 likes • 53 comments • March 05, 2026

A Sad Day for Open Source AI

The entire core team behind Qwen; the most downloaded open-source AI model family in the world, was just pushed out by Alibaba. Here's what happened, why it happened, and what it means for the futu...

36,211 views • 1,140 likes • 166 comments • March 05, 2026

This 30-Year-Old Pattern Fixes AI Agents

Arcade: One API for all your agent auth: https://arcade.dev.plug.dev/LLbk9in In this video, I show how to apply classic three-tier architecture to agent systems so you can separate concerns across...

8,670 views • 231 likes • 8 comments • March 04, 2026

Gemini 3.1 Flash-Lite: The Model You'll Actually Use...

Google just released Gemini 3.1 Flash-Lite, their fastest and most affordable model yet — and it's the third Gemini release in three weeks. In this video, I test its UI generation and reasoning cap...

16,860 views • 338 likes • 44 comments • March 03, 2026

The Problem Every AI Company Is Hiding

LINKS in the VIDEO: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/ https://github.com/google-gemini/gemini-cli/discussions/19724 https://discuss.ai.google....

9,027 views • 330 likes • 48 comments • February 28, 2026

Nano Banana 2 is Here - Faster and Cheaper

Thanks to Google DeepMind for the #EarlyAccess” We're going hands-on with Google's new Nano Banana 2 (Gemini 3.1 Flash Image) to see if this budget-friendly model can truly deliver Pro-level quali...

9,750 views • 241 likes • 24 comments • February 26, 2026

Mercury 2: The First Diffusion Model That 'Thinks'"

In this video, I test Inception's new Mercury 2, a diffusion-based large language model that introduces reasoning capabilities and generates text at 1,000 tokens per second. I demonstrate its speed...

59,589 views • 2,365 likes • 212 comments • February 24, 2026

The AI Model Doesn't Matter Anymore

While the entire industry obsesses over whether GPT, Claude, or Gemini is the best model, they are completely missing the real reason AI agents keep failing. The actual bottleneck isn't the model i...

30,840 views • 1,201 likes • 109 comments • February 23, 2026

Open Source Backend for AI Agents

AI coding agents are incredible at building frontends, but they completely fall apart when you try to make them configure complex, production-ready backends. In this video, we explore Insforge, an ...

9,018 views • 278 likes • 25 comments • February 22, 2026

Gemini 3.1 Pro: The model no one expected

Google just dropped a massive upgrade with Gemini 3.1 Pro! In this video, we break down why this preview release is actually a huge leap forward for AI. We dive deep into the insane new benchmark s...

34,828 views • 655 likes • 80 comments • February 19, 2026

Anthropic Just Killed Tool Calling

Anthropic's latest Sonnet 4.6 release quietly introduced programmatic tool calling; a feature that lets AI agents write code instead of JSON to invoke tools, slashing token usage by up to 98% while...

58,136 views • 1,626 likes • 108 comments • February 18, 2026

Sonnet 4.6 Is Here—And It’s a Beast at Coding

Sonnet 4.6 is here! We are looking at Opus level performance at Sonnet prices. https://www.anthropic.com/news/claude-sonnet-4-6 https://claude.com/blog/improved-web-search-with-dynamic-filtering ...

14,804 views • 324 likes • 31 comments • February 17, 2026

Exploration is All You Need!

Standard RAG lacks context, but full agentic scanning is too slow—so I designed a "Dual-Path" architecture to fix this. In this video, we build a hybrid system using DuckDB and Gemini that combines...

11,461 views • 353 likes • 34 comments • February 17, 2026

Why OpenAI Just "Acquired" The Biggest Open Source Agent

The biggest story in AI right now: Peter Steinberg, the creator of the viral OpenClaw project, is officially joining OpenAI. After a rollercoaster month involving Anthropic blocks, three name chang...

37,042 views • 997 likes • 224 comments • February 16, 2026

OpenClaw Testing - Can't Get Easier than this!

The Easiest Way to Setup OpenClaw: https://app.emergent.sh/?via=engineerprompt OpenClaw is in the news. This is one of the easiest way to setup OpenClaw and test it out. My Dictation App: www.w...

6,239 views • 101 likes • 11 comments • February 15, 2026

The 100x AI Breakthrough No One is Talking About

While the internet obsesses over Gemini 3's benchmark scores, the real revolution is hidden in the 100x reduction in inference compute and the new 'Aletheia' agent. This video breaks down why "thin...

45,681 views • 1,299 likes • 72 comments • February 14, 2026

Codex-Spark: OpenAI Just Broke the Speed Limit (1,000 Tokens/s)

First look at GPT-5.3-Codex-Spark! We also look at Gemini 3 Deep Think, MiniMax M2.5 and GLM-5. These are exciting releases. LINKS: https://openai.com/index/introducing-gpt-5-3-codex-spark/ https...

21,391 views • 403 likes • 68 comments • February 12, 2026

Minimax-Agent: The Ultimate Open-Source "Workhorse" Model

In this video, we explore the Minimax M2.1 open-weight model and its capabilities as a dedicated "workhorse" for coding tasks. Its SOTA open weight model for coding and agentic tasks. The MiniMax A...

7,060 views • 158 likes • 16 comments • February 12, 2026

How OpenClaw Works: The Real "Magic"

Let's talk about what makes "OpenClaw" so special. Its elegant and simple engineering! website: https://openclaw.ai/ My voice to text App: whryte.com Website: https://engineerprompt.ai/ RAG Beyon...

18,769 views • 553 likes • 58 comments • February 10, 2026

RAG is Dead? Introducing Agentic File Exploration

In this video we will look at file search exploration as a potential replacement to RAG. The system uses local Qwen3 model running on DGX Spark. # Clone the Ollama/Qwen3 branch git clone -b feat/...

20,146 views • 687 likes • 91 comments • February 09, 2026

Claude Code's New Agents Team Are Absolutely Insane

Agent Teams in Claude code is a new design primitive that Anthropic introduced where you have dedicated claude instances as agents. https://code.claude.com/docs/en/agent-teams https://x.com/lydia...

38,228 views • 728 likes • 51 comments • February 08, 2026

Opus 4.6 & GPT-5.3: Things Got Interesting!

Two major releases - GPT-5.3-Codex and Opus 4.6! Biggest release day of the year! My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-si...

26,304 views • 523 likes • 50 comments • February 05, 2026

Gemini’s Native Web Scraper: 100% "Free" & Multimodal

👉 Grab your free seat to the 2-Day AI Mastermind: This Saturday and Sunday https://link.outskill.com/PROMPTENG 🔐 100% Discount for the first 1000 people 💥 Dive deep into AI and Learn Automations, B...

19,936 views • 603 likes • 34 comments • February 04, 2026