Prompt Engineering - Videos

Back to Channel

Ling 1T Model: Too Good to Be True?

Access Ling 1T: https://huggingface.co/inclusionAI/Ling-1T https://x.com/AntLingAGI/status/1975942293330018426 https://zenmux.ai/inclusionai/ling-1t?utm_source=hf_inclusionAI Let's look at Ling-1T...

17,064 views • 363 likes • 37 comments • October 11, 2025

OpenAI Just Confused Everyone... Again

OpenAI’s new Agent Kit has everyone confused! is it really building agents or just fancy workflows? In this video, I break down what OpenAI means by “agents,” how it differs from Anthropic’s defini...

4,958 views • 151 likes • 21 comments • October 09, 2025

Vibe Planning: The Smarter Way to Code with AI Agents

Get started with Traycer for free: : https://traycer.ai/ Claude Code is great at writing code, but it won’t save your project without a clear plan and verification—so we add Traycer’s Plan → Execu...

8,500 views • 252 likes • 16 comments • October 03, 2025

Claude Code 2.0: The Only Guide You Need

Checkout Snyk to find security vulnerabilities in your code: https://snyk.plug.dev/arcV6Qm Discover all the powerful new features in Claude Code 2.0, a complete rewrite that integrates seamlessly ...

16,667 views • 292 likes • 29 comments • October 01, 2025

OpenAI’s Sora 2 Can Talk—and Follow Physics

OpenAI just launched Sora 2, a next-gen AI video generation model delivering physics-accurate, cinematic visuals with synchronized dialogue and sound effects—watch real demos, strengths, and failur...

11,435 views • 104 likes • 21 comments • September 30, 2025

Sonnet 4.5 Is Here—And It’s a Beast at Coding

Very first look at the Claude Sonnet 4.5 release https://www.anthropic.com/news/claude-sonnet-4-5 https://docs.claude.com/en/docs/about-claude/models/whats-new-sonnet-4-5 https://docs.claude.com/e...

52,420 views • 1,089 likes • 122 comments • September 29, 2025

Can AI Make Better Slides Than You?

Checkout: https://www.gamma.app I this video I test Gamma 3.0 —the AI presentation agent. Gamma also recently released their API, which let's you use the same agent for buck content creation and a...

4,433 views • 117 likes • 2 comments • September 28, 2025

Building tools for agents — with agents

Checkout Browserbase for web browsing automation for AI agents: https://browserbase.plug.dev/fN72qF0 In this video we will learn how to design agent tools that actually work for AI agents. It will...

5,362 views • 160 likes • 11 comments • September 25, 2025

Finally a real VEO-3 competitor

Discover WAN 2.5, Alibaba’s latest AI video model that can generate both visuals and sound in sync. In this preview, I’ll show you how it stacks up with Veo 3 and why it could change the future of ...

10,374 views • 213 likes • 35 comments • September 24, 2025

Qwen 3 Omni — The Open AI Model That Does It ALL

In this video, I test out Qwen 3 Omni — Alibaba’s latest open-source multimodal model that can handle text, images, audio, and video in real time. From live demos to benchmarks, we’ll see if Qwen 3...

11,816 views • 373 likes • 35 comments • September 23, 2025

Claude Code Downgrade? Here’s What Actually Happened

If you noticed claude code degradation in last few weeks, you were not wrong. Anthropic just released a detailed blogpost on what caused it. https://www.anthropic.com/engineering/a-postmortem-of-...

8,798 views • 187 likes • 98 comments • September 18, 2025

How Good Is GPT-5 Codex? I Built an App

Sign-up for updates to Verbi - transcription app: https://tally.so/r/3y9bb0 I put GPT-5 Codex to a real test: I handed it a PRD in VS Code, ran it with Codex CLI, and timed how long it took to bui...

17,136 views • 276 likes • 39 comments • September 17, 2025

OpenAI Just Dropped A New Coding Model for Developers

GPT-5-codex was just released by OpenAI. Here we will at this new release: https://openai.com/index/introducing-upgrades-to-codex/ Website: https://engineerprompt.ai/ RAG Beyond Basics Course: h...

43,412 views • 200 likes • 27 comments • September 15, 2025

Agent Client Protocol : The “New MCP” for IDEs and Coding Agents

In this video. we're going t. look at agent-client protocol or ACP that enables communication between coding IDEs and coding agents (like Gemini CLI, Claude Code etc.). This is a standardized comm...

8,618 views • 198 likes • 22 comments • September 14, 2025

Super Agent: An Agent That Builds Its Own Tools

Subscribe to Skywork through this link to get up to 34% off.”: https://skywork.ai/p/zb7qts In this video, we will look at A Hierarchical Multi-Agent Framework for General-Purpose Task Solving and ...

8,913 views • 307 likes • 15 comments • September 12, 2025

GitHub Spec Kit: Can It FINALLY Fix “Vibe Coding”?

In this video I have a look at the Github's new Spec Kit, which is their opinionated implementation of Specification Driven Development. I will walk through all the steps that this new specificatio...

8,309 views • 200 likes • 34 comments • September 09, 2025

Did OpenAI Just FIX Hallucinations?

In this video I will look at why LLMs hallucinate. LLMs hallucinate not because they’re “broken,” but because today’s training and accuracy-only evaluations incentivize guessing. This is based on a...

6,723 views • 262 likes • 37 comments • September 07, 2025

Embedding Gemma: On-Device RAG Made Easy

In this video we learn how to use Google’s Embedding Gemma (300M) to build fast, on-device RAG with ≈200MB memory and support for 100+ languages. We will look at a RAG example. LINK: https://deve...

11,560 views • 312 likes • 18 comments • September 06, 2025

Claude for Chrome: Agentic Browsing is Here

Hands-on review of Claude for Chrome, Anthropic’s agentic browsing Chrome extension—with real demos (posting to X, Zillow search, research & shopping, W-9 download, form filling) to see what works ...

11,984 views • 227 likes • 18 comments • August 29, 2025

Can This FIX Context Loss in RAG?

Checkout Emergent: https://emergent.1stcollab.com/engineerprompt Chunking in RAG is broken! In this video we look at contextualized chunk embeddings that preserves document level global informatio...

9,198 views • 297 likes • 13 comments • August 27, 2025

Nano Banana is the NEW Gemini 2.5 Flash Image

Hands-on with Google’s Nano Banana (Gemini 2.5 Flash Image): I show how to access it in AI Studio and via the Gemini SDK, then demo precise text-guided edits, character/scene consistency, in/outpai...

12,819 views • 371 likes • 52 comments • August 26, 2025

Web Scrapping Made Easy with This FREE MCP

Get started with BrightData here: https://brdta.com/engineerprompt Learn how to use the free BrightData MCP server to collect information from multiple diverse sources including hard to scrape sit...

7,511 views • 235 likes • 19 comments • August 25, 2025

DeepSeek V3.1: Bigger Than You Think!

DeepSeek V3.1 is a unified hybrid reasoning open-weight model that powers agentic workflows—FP8 training, strong post-training for tool/function calling (non-thinking), Anthropic API support, and b...

21,208 views • 542 likes • 46 comments • August 22, 2025

Finally! A Standard for AI Coding Agents (Agents.md Explained)

Agents.md is a simple, open standard to replace the mess of agent-specific rule files. In this video, I explain how agents.md works, how to add it to your repo (even mono-repos), migration tips, an...

27,797 views • 892 likes • 84 comments • August 20, 2025

I Tested GPT-5 as a Coding Agent—Here’s What Happened

I put GPT-5 inside Cursor to build a real macOS speech-to-text app I actually use. Starting from a PRD, it coded a menu-bar recorder with hotkeys, a model picker (Whisper MLX + Qwen 1.7B), audio cu...

3,181 views • 109 likes • 12 comments • August 18, 2025

GPT-OSS Jailbreak with this Simple Trick

In this video, I show you how I managed to bypass GPT-OSS’s alignment with a single, simple tweak—no fine-tuning or complex hacks required. I walk through how the model’s prompt template works, why...

57,511 views • 1,999 likes • 143 comments • August 15, 2025

Not all models providers are equal

Deciphering GPT-OSS Performance: Why Inference Setup Matters We explore the performance variability across different API providers and benchmark tests, highlighting significant discrepancies in sp...

5,037 views • 215 likes • 32 comments • August 14, 2025

LangExtract + RAG: Smarter Retrieval with Metadata Filtering

In this video, I show you how to use LangExtract to generate high-quality metadata for your Retrieval Augmented Generation (RAG) system. By extracting structured data from unstructured documents, w...

23,931 views • 682 likes • 59 comments • August 12, 2025

GPT-5: The Most Polarizing Model

In this video, I take a deep dive into GPT-5—beyond the hype—to look at both its impressive capabilities and some glaring issues, including the “chart crimes.” I break down the real benchmarks, hid...

9,529 views • 231 likes • 47 comments • August 08, 2025

GPT-5 - A Good Coding Model?

Very first look at GPT-5 coding capabilities. This is the best coding model that OpenAI has released so far. GPT-5 is a next level model. @TheFeatureCrew Website: https://engineerprompt.ai/ RA...

17,423 views • 316 likes • 67 comments • August 07, 2025

OpenAI Finally Goes Open-Source: 120B & 20B Models

LINKS: https://openai.com/open-models/ https://cookbook.openai.com/articles/gpt-oss/run-locally-ollama https://openai.com/index/introducing-gpt-oss/ https://github.com/openai/harmony https://githu...

10,533 views • 266 likes • 47 comments • August 05, 2025

LangExtract: Turn Messy Text into Graph-RAG Insights

In this quick tutorial I show you how Google’s open-source LangExtract converts messy PDFs, HTML, and DOC files into clean knowledge graphs that plug straight into Retrieval-Augmented Generation (R...

36,092 views • 961 likes • 78 comments • August 04, 2025

Horizon: OpenAI’s Secret Open-Weight Model?

We will look at Horizon Beta, the alleged Open Weight Model from OpenAI. Its blazing 140 TPS throughput, huge 256K-token context window, and leaked 120B/20B MoE specs. https://openrouter.ai/openro...

4,161 views • 102 likes • 5 comments • August 02, 2025

Gemini Deep Think: Built for the Hardest Problems

Gemini Deep Think is the model for the hardest and most challenging problems. A version of this recently won gold in IMO 2025. Thanks to the Google DeepMind, I had early access to the model and her...

14,205 views • 261 likes • 31 comments • August 01, 2025

I Built a Voice Agent that Handles my Daily Tasks

Start building with Deepgram with $200 in credit. This can fuel a voice agent for up to 50 hours." https://dpgr.am/promptengxdg Docs: https://developers.deepgram.com/docs/voice-agent Starter Apps: ...

3,918 views • 139 likes • 16 comments • July 31, 2025

Master Claude Code Sub‑Agents in 10 Minutes

check out claude code: http://clau.de/prompteng In this video, you will learn about the new sub-agents feature in Claude Code. This addresses common issues related to context management and tool u...

39,555 views • 629 likes • 47 comments • July 29, 2025

Augment Code: Specs Driven Development For AI Coding Agents

Try Augment Code with Tasklist: https://www.augmentcode.com/ Tired of “vibe coding” that feels magical—right up to the moment everything breaks? In this video I show how switching to Specs‑Driven ...

7,792 views • 188 likes • 25 comments • July 28, 2025

NEW Qwen 3 Coder: Did the Benchmark Lie?

We are looking into Qwen 3 Coder, the first open weight model that is closer to Sonnet 4. https://qwenlm.github.io/blog/qwen3-coder/ https://github.com/QwenLM/qwen-code https://qwen.readthedocs.i...

7,429 views • 206 likes • 17 comments • July 23, 2025

NEW Qwen 3, Better than Kimi K2?

In this video, I compare the performance of two leading open weight AI models, Qwen3's latest non-reasoning model and KIMI K2, along with a few proprietary models, using the same set of prompts. We...

5,989 views • 149 likes • 20 comments • July 22, 2025

Developers’ Favorite AI Tools in 2025

Sources from the Pragmatic Engineer and A https://newsletter.pragmaticengineer.com/p/the-pragmatic-engineer-2025-survey https://artificialanalysis.ai/downloads/ai-adoption-survey/2025/Artificial-A...

3,576 views • 146 likes • 8 comments • July 20, 2025

ChatGPT Agent Is Here: Your All‑In‑One AI Worker?

Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Let's Connect: 🦾 Discord: https://discord.com/invite/t4eYQRUcXB ☕ Buy me a Coffee: h...

13,199 views • 202 likes • 18 comments • July 17, 2025

Kimi K2 — The Deep Researcher Agent

Kimi-K2 is a great coding model but they also have a great Deep researcher tool that is SOTA on a number of key benchmarks. In this video we explore how it was trained and how it compares to the ot...

7,804 views • 216 likes • 9 comments • July 17, 2025

localGPT 2.0 - Building the Best Private RAG System

I am releasing the new version of localGPT as a preview. This has a ton of enhancements you will not find in other rank systems. Check out the repo: https://github.com/PromtEngineer/localGPT/tree...

14,001 views • 475 likes • 73 comments • July 15, 2025

Kimi K2 - The DeepSeek Moment for Agentic Coding?

KIMI K2 is the new State of the Art Open Weight Coding model. https://www.kimi.com/ https://moonshotai.github.io/Kimi-K2/ https://huggingface.co/collections/moonshotai/kimi-k2-6871243b990f2af5ba6...

22,340 views • 509 likes • 67 comments • July 12, 2025

Grok 4—Possibly the Most Powerful Model in the World?

XAI just released Grok 4, the most powerful model in the world. Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Let's Connect: 🦾 D...

33,477 views • 499 likes • 84 comments • July 10, 2025

Secret Context Engineering Trick For RAG

I explain why re-ranking isn’t enough for RAG and show how sentence-level pruning strips out noisy tokens and cuts hallucinations. You’ll see the token savings, accuracy boost, and a quick setup yo...

11,665 views • 444 likes • 19 comments • July 07, 2025

Context Engineering — The Hottest Skill in AI Right Now

I unpack context engineering—why everyone’s talking about it, how it differs from classic prompt engineering, and where it actually matters for long-context LLMs. We’ll cover the big failure modes ...

39,087 views • 1,120 likes • 84 comments • July 04, 2025

The Only Embedding Model You Need for RAG

I walk you through a single, multimodal embedding model that handles text, images, tables —and even code —inside one vector space. In this short demo I show the install steps, run RAG retrieval ben...

37,687 views • 1,048 likes • 66 comments • July 02, 2025

I Gave Devin A Real World Coding Task, Here’s How it Cooked!

Get $20 in free credits (https://devin.ai/pricing: select Core plan) with promo code: PROMPTENGINEERING I put Devon AI, the “OG” coding agent, to the test by asking it to build a full RAG applicat...

4,888 views • 76 likes • 10 comments • July 01, 2025

Gemini CLI + ANY MCP Server — Step‑by‑Step Tutorial

To get started with BrightData get a $15 Credit with this link: https://brdta.com/engineerprompt In this video, I show you exactly how to connect Gemini CLI to any MCP server step by step. I’ll wa...

51,423 views • 650 likes • 31 comments • June 27, 2025