Prompt Engineering - Videos
Back to ChannelLing 1T Model: Too Good to Be True?
Access Ling 1T: https://huggingface.co/inclusionAI/Ling-1T https://x.com/AntLingAGI/status/1975942293330018426 https://zenmux.ai/inclusionai/ling-1t?utm_source=hf_inclusionAI Let's look at Ling-1T...
OpenAI Just Confused Everyone... Again
OpenAI’s new Agent Kit has everyone confused! is it really building agents or just fancy workflows? In this video, I break down what OpenAI means by “agents,” how it differs from Anthropic’s defini...
Vibe Planning: The Smarter Way to Code with AI Agents
Get started with Traycer for free: : https://traycer.ai/ Claude Code is great at writing code, but it won’t save your project without a clear plan and verification—so we add Traycer’s Plan → Execu...
Claude Code 2.0: The Only Guide You Need
Checkout Snyk to find security vulnerabilities in your code: https://snyk.plug.dev/arcV6Qm Discover all the powerful new features in Claude Code 2.0, a complete rewrite that integrates seamlessly ...
OpenAI’s Sora 2 Can Talk—and Follow Physics
OpenAI just launched Sora 2, a next-gen AI video generation model delivering physics-accurate, cinematic visuals with synchronized dialogue and sound effects—watch real demos, strengths, and failur...
Sonnet 4.5 Is Here—And It’s a Beast at Coding
Very first look at the Claude Sonnet 4.5 release https://www.anthropic.com/news/claude-sonnet-4-5 https://docs.claude.com/en/docs/about-claude/models/whats-new-sonnet-4-5 https://docs.claude.com/e...
Can AI Make Better Slides Than You?
Checkout: https://www.gamma.app I this video I test Gamma 3.0 —the AI presentation agent. Gamma also recently released their API, which let's you use the same agent for buck content creation and a...
Building tools for agents — with agents
Checkout Browserbase for web browsing automation for AI agents: https://browserbase.plug.dev/fN72qF0 In this video we will learn how to design agent tools that actually work for AI agents. It will...
Finally a real VEO-3 competitor
Discover WAN 2.5, Alibaba’s latest AI video model that can generate both visuals and sound in sync. In this preview, I’ll show you how it stacks up with Veo 3 and why it could change the future of ...
Qwen 3 Omni — The Open AI Model That Does It ALL
In this video, I test out Qwen 3 Omni — Alibaba’s latest open-source multimodal model that can handle text, images, audio, and video in real time. From live demos to benchmarks, we’ll see if Qwen 3...
Claude Code Downgrade? Here’s What Actually Happened
If you noticed claude code degradation in last few weeks, you were not wrong. Anthropic just released a detailed blogpost on what caused it. https://www.anthropic.com/engineering/a-postmortem-of-...
How Good Is GPT-5 Codex? I Built an App
Sign-up for updates to Verbi - transcription app: https://tally.so/r/3y9bb0 I put GPT-5 Codex to a real test: I handed it a PRD in VS Code, ran it with Codex CLI, and timed how long it took to bui...
OpenAI Just Dropped A New Coding Model for Developers
GPT-5-codex was just released by OpenAI. Here we will at this new release: https://openai.com/index/introducing-upgrades-to-codex/ Website: https://engineerprompt.ai/ RAG Beyond Basics Course: h...
Agent Client Protocol : The “New MCP” for IDEs and Coding Agents
In this video. we're going t. look at agent-client protocol or ACP that enables communication between coding IDEs and coding agents (like Gemini CLI, Claude Code etc.). This is a standardized comm...
Super Agent: An Agent That Builds Its Own Tools
Subscribe to Skywork through this link to get up to 34% off.”: https://skywork.ai/p/zb7qts In this video, we will look at A Hierarchical Multi-Agent Framework for General-Purpose Task Solving and ...
GitHub Spec Kit: Can It FINALLY Fix “Vibe Coding”?
In this video I have a look at the Github's new Spec Kit, which is their opinionated implementation of Specification Driven Development. I will walk through all the steps that this new specificatio...
Did OpenAI Just FIX Hallucinations?
In this video I will look at why LLMs hallucinate. LLMs hallucinate not because they’re “broken,” but because today’s training and accuracy-only evaluations incentivize guessing. This is based on a...
Embedding Gemma: On-Device RAG Made Easy
In this video we learn how to use Google’s Embedding Gemma (300M) to build fast, on-device RAG with ≈200MB memory and support for 100+ languages. We will look at a RAG example. LINK: https://deve...
Claude for Chrome: Agentic Browsing is Here
Hands-on review of Claude for Chrome, Anthropic’s agentic browsing Chrome extension—with real demos (posting to X, Zillow search, research & shopping, W-9 download, form filling) to see what works ...
Can This FIX Context Loss in RAG?
Checkout Emergent: https://emergent.1stcollab.com/engineerprompt Chunking in RAG is broken! In this video we look at contextualized chunk embeddings that preserves document level global informatio...
Nano Banana is the NEW Gemini 2.5 Flash Image
Hands-on with Google’s Nano Banana (Gemini 2.5 Flash Image): I show how to access it in AI Studio and via the Gemini SDK, then demo precise text-guided edits, character/scene consistency, in/outpai...
Web Scrapping Made Easy with This FREE MCP
Get started with BrightData here: https://brdta.com/engineerprompt Learn how to use the free BrightData MCP server to collect information from multiple diverse sources including hard to scrape sit...
DeepSeek V3.1: Bigger Than You Think!
DeepSeek V3.1 is a unified hybrid reasoning open-weight model that powers agentic workflows—FP8 training, strong post-training for tool/function calling (non-thinking), Anthropic API support, and b...
Finally! A Standard for AI Coding Agents (Agents.md Explained)
Agents.md is a simple, open standard to replace the mess of agent-specific rule files. In this video, I explain how agents.md works, how to add it to your repo (even mono-repos), migration tips, an...
I Tested GPT-5 as a Coding Agent—Here’s What Happened
I put GPT-5 inside Cursor to build a real macOS speech-to-text app I actually use. Starting from a PRD, it coded a menu-bar recorder with hotkeys, a model picker (Whisper MLX + Qwen 1.7B), audio cu...
GPT-OSS Jailbreak with this Simple Trick
In this video, I show you how I managed to bypass GPT-OSS’s alignment with a single, simple tweak—no fine-tuning or complex hacks required. I walk through how the model’s prompt template works, why...
Not all models providers are equal
Deciphering GPT-OSS Performance: Why Inference Setup Matters We explore the performance variability across different API providers and benchmark tests, highlighting significant discrepancies in sp...
LangExtract + RAG: Smarter Retrieval with Metadata Filtering
In this video, I show you how to use LangExtract to generate high-quality metadata for your Retrieval Augmented Generation (RAG) system. By extracting structured data from unstructured documents, w...
GPT-5: The Most Polarizing Model
In this video, I take a deep dive into GPT-5—beyond the hype—to look at both its impressive capabilities and some glaring issues, including the “chart crimes.” I break down the real benchmarks, hid...
GPT-5 - A Good Coding Model?
Very first look at GPT-5 coding capabilities. This is the best coding model that OpenAI has released so far. GPT-5 is a next level model. @TheFeatureCrew Website: https://engineerprompt.ai/ RA...
OpenAI Finally Goes Open-Source: 120B & 20B Models
LINKS: https://openai.com/open-models/ https://cookbook.openai.com/articles/gpt-oss/run-locally-ollama https://openai.com/index/introducing-gpt-oss/ https://github.com/openai/harmony https://githu...
LangExtract: Turn Messy Text into Graph-RAG Insights
In this quick tutorial I show you how Google’s open-source LangExtract converts messy PDFs, HTML, and DOC files into clean knowledge graphs that plug straight into Retrieval-Augmented Generation (R...
Horizon: OpenAI’s Secret Open-Weight Model?
We will look at Horizon Beta, the alleged Open Weight Model from OpenAI. Its blazing 140 TPS throughput, huge 256K-token context window, and leaked 120B/20B MoE specs. https://openrouter.ai/openro...
Gemini Deep Think: Built for the Hardest Problems
Gemini Deep Think is the model for the hardest and most challenging problems. A version of this recently won gold in IMO 2025. Thanks to the Google DeepMind, I had early access to the model and her...
I Built a Voice Agent that Handles my Daily Tasks
Start building with Deepgram with $200 in credit. This can fuel a voice agent for up to 50 hours." https://dpgr.am/promptengxdg Docs: https://developers.deepgram.com/docs/voice-agent Starter Apps: ...
Master Claude Code Sub‑Agents in 10 Minutes
check out claude code: http://clau.de/prompteng In this video, you will learn about the new sub-agents feature in Claude Code. This addresses common issues related to context management and tool u...
Augment Code: Specs Driven Development For AI Coding Agents
Try Augment Code with Tasklist: https://www.augmentcode.com/ Tired of “vibe coding” that feels magical—right up to the moment everything breaks? In this video I show how switching to Specs‑Driven ...
NEW Qwen 3 Coder: Did the Benchmark Lie?
We are looking into Qwen 3 Coder, the first open weight model that is closer to Sonnet 4. https://qwenlm.github.io/blog/qwen3-coder/ https://github.com/QwenLM/qwen-code https://qwen.readthedocs.i...
NEW Qwen 3, Better than Kimi K2?
In this video, I compare the performance of two leading open weight AI models, Qwen3's latest non-reasoning model and KIMI K2, along with a few proprietary models, using the same set of prompts. We...
Developers’ Favorite AI Tools in 2025
Sources from the Pragmatic Engineer and A https://newsletter.pragmaticengineer.com/p/the-pragmatic-engineer-2025-survey https://artificialanalysis.ai/downloads/ai-adoption-survey/2025/Artificial-A...
ChatGPT Agent Is Here: Your All‑In‑One AI Worker?
Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Let's Connect: 🦾 Discord: https://discord.com/invite/t4eYQRUcXB ☕ Buy me a Coffee: h...
Kimi K2 — The Deep Researcher Agent
Kimi-K2 is a great coding model but they also have a great Deep researcher tool that is SOTA on a number of key benchmarks. In this video we explore how it was trained and how it compares to the ot...
localGPT 2.0 - Building the Best Private RAG System
I am releasing the new version of localGPT as a preview. This has a ton of enhancements you will not find in other rank systems. Check out the repo: https://github.com/PromtEngineer/localGPT/tree...
Kimi K2 - The DeepSeek Moment for Agentic Coding?
KIMI K2 is the new State of the Art Open Weight Coding model. https://www.kimi.com/ https://moonshotai.github.io/Kimi-K2/ https://huggingface.co/collections/moonshotai/kimi-k2-6871243b990f2af5ba6...
Grok 4—Possibly the Most Powerful Model in the World?
XAI just released Grok 4, the most powerful model in the world. Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Let's Connect: 🦾 D...
Secret Context Engineering Trick For RAG
I explain why re-ranking isn’t enough for RAG and show how sentence-level pruning strips out noisy tokens and cuts hallucinations. You’ll see the token savings, accuracy boost, and a quick setup yo...
Context Engineering — The Hottest Skill in AI Right Now
I unpack context engineering—why everyone’s talking about it, how it differs from classic prompt engineering, and where it actually matters for long-context LLMs. We’ll cover the big failure modes ...
The Only Embedding Model You Need for RAG
I walk you through a single, multimodal embedding model that handles text, images, tables —and even code —inside one vector space. In this short demo I show the install steps, run RAG retrieval ben...
I Gave Devin A Real World Coding Task, Here’s How it Cooked!
Get $20 in free credits (https://devin.ai/pricing: select Core plan) with promo code: PROMPTENGINEERING I put Devon AI, the “OG” coding agent, to the test by asking it to build a full RAG applicat...
Gemini CLI + ANY MCP Server — Step‑by‑Step Tutorial
To get started with BrightData get a $15 Credit with this link: https://brdta.com/engineerprompt In this video, I show you exactly how to connect Gemini CLI to any MCP server step by step. I’ll wa...