Alejandro AO - Videos

Back to Channel

Pi Agent – Crash Course | Minimal Coding Agent

Pi Agent is a lightweight terminal harness that gives you a powerful coding assistant without the bloat. Unlike traditional agents with built-in sub-agents, plan modes, and permission popups, Pi st...

1,092 views • 66 likes • 7 comments • May 06, 2026

DeepSeek V4: SOTA Coding Agent at 12x Lower Cost

DeepSeek V4 Pro matches GPT-5.5 and Opus 4.7 on agentic coding benchmarks at 3x lower cost. In this video, we compare Terminal-Bench, SWE Verified, and MCPAtlas scores across all SOTA models, analy...

7,599 views • 149 likes • 31 comments • April 28, 2026

GPT-5.5 - 25x More Expensive than Open Models

A quick overview of OpenAI's GPT-5.5 announcement—comparing it against open source models on benchmarks and pricing to see if it's actually worth using in your apps. 🤓 Topics Covered - GPT-5.5 ben...

2,277 views • 54 likes • 4 comments • April 23, 2026

Kimi K2.6: Open Source BEATS GPT-5.4 & Opus?

In this video, we take a deep dive into Kimi K2.6, the latest open-source foundation model from Moonshot AI. We explore how it stacks up against GPT-5.4 and Claude Opus 4.6, walk through key benchm...

17,178 views • 362 likes • 59 comments • April 22, 2026

MiniMax M2.7 vs GPT 5.4 (Real Coding Tasks)

In this video I take a look at MiniMax M2.7 right after the open weights dropped and compare it against GPT 5.4 on practical software tasks. I go through the benchmark context briefly, then test bo...

47,068 views • 186 likes • 39 comments • April 14, 2026

How to Evaluate and Test Agent Skills

This video walks through a practical workflow for evaluating and testing agent skills so you can improve them without breaking what already works. I use the Hugging Face `$model-trainer` skill as t...

3,935 views • 176 likes • 5 comments • March 19, 2026

Gemini Embedding 2: Multimodal Retrieval Demo with ChromaDB

Gemini Embedding 2 just dropped, and this release is a big step for multimodal retrieval workflows. In this video, I walk through what is new, key limits to know before you build, benchmark highli...

5,191 views • 168 likes • 9 comments • March 11, 2026

Best LLMs for UI design?

In this video I run a live UI beauty comparison across four models: Claude 4.6, GPT-5.2, MiniMax M2.5, and Kimi K2.5. Instead of benchmarking architecture or feature completeness, I focus on visual...

1,518 views • 65 likes • 7 comments • February 25, 2026

Fine-Tune an Open Source LLM with Claude Code/Codex (Hugging Face Model Trainer Skill)

In this video I show how to fine-tune a small open source model with Codex using the Hugging Face model trainer skill and HF Jobs. We train `Qwen/Qwen3-0.6B` into a customer-support ticket router o...

23,225 views • 860 likes • 47 comments • February 17, 2026

GPT-5.3 Codex vs Claude Opus 4.6: Real Coding Tasks

In this video, I compare GPT-5.3 Codex and Claude Opus 4.6 using a benchmark overview plus 4 real engineering tasks. Connect with me 🤗 - X: https://x.com/_alejandroao - LinkedIn: https://www.linke...

15,748 views • 333 likes • 68 comments • February 07, 2026

Kimi Code vs Claude Code vs Codex - NEW Best For Coding?

Hands-on comparison of Kimi K2.5 vs GPT 5.2 vs Opus 4.5 on coding tasks. I tested all three models on physics simulations: Maxwell's Demon thermodynamics experiment and a hand-gesture controlled ga...

13,074 views • 283 likes • 33 comments • January 30, 2026

Claude Code Skills & skills.sh - Crash Course

Learn everything about Agent Skills - the open standard for AI agent customization adopted by Anthropic, OpenAI, Microsoft, GitHub, and Cursor. This tutorial covers what skills are, how to install ...

68,028 views • 1,627 likes • 83 comments • January 28, 2026

MedGemma 1.5: Google's Open-Source Medical AI with 3D Imaging (Tutorial)

Run Google’s new MedGemma 1.5 locally for private, GPU‑accelerated medical AI. In this video, we break down what MedGemma 1.5 is, how its open‑weight 4B model runs on consumer hardware, and where i...

8,423 views • 248 likes • 19 comments • January 22, 2026

Create a Local Claude Cowork with Open Models

Tutorial to build an Open Cowork, a local, privacy-first alternative to Claude CoWork --- Connect with me 🤗 - X: https://x.com/_alejandroao - LinkedIn: https://www.linkedin.com/in/alejandro-ao/ -...

10,127 views • 248 likes • 18 comments • January 19, 2026

Create an Open Deep Research Multi-Agent in Python (Step by Step)

Build a complete multi-agent deep research system with open-source models: use Hugging Face Inference Providers, Firecrawl MCP tools for web search and scraping, and smolagents for agent coordinati...

10,812 views • 457 likes • 22 comments • December 11, 2025

Intro to Agents - Create an Agent from Scratch (No Frameworks)

Build an LLM agent from scratch in Python using Hugging Face Inference Providers. --- Connect with me 🤗 - X: https://x.com/_alejandroao - LinkedIn: https://www.linkedin.com/in/alejandro-ao/ --- L...

19,320 views • 806 likes • 48 comments • December 03, 2025

Intro to AgentKit - Create a RAG Agent

Build and deploy an OpenAI AgentKit workflow end-to-end: design a production-ready RAG agent in AgentBuilder with guardrails and the built-in File Search (Anna Karenina), preview with evals and tra...

5,321 views • 203 likes • 13 comments • October 22, 2025

Intro to ChatGPT Apps - The NEW App Store??

Here is intro to ChatGPT Apps with the OpenAI Apps SDK. We cover how ChatGPT apps extend the Model Context Protocol (MCP) with tools and UI schemas, handle OAuth 2.1, test locally, deploy a remote...

14,879 views • 216 likes • 16 comments • October 07, 2025

ChromaDB Crash Course - Intro to Vector Databases

Master vector databases and embeddings in Python: map text to high-dimensional vectors, spin up a local ChromaDB, run end-to-end CRUD on collections and points, search by meaning, and persist your ...

7,756 views • 249 likes • 15 comments • September 29, 2025

Python Project: AI Image Generator App

Build a Tinder-style AI image generator in Python with Streamlit and Google’s Gemini API: set up the project with uv, design a clean Streamlit UI (prompt, image uploader, generate button), and tran...

1,753 views • 54 likes • 13 comments • September 22, 2025

Image Generation with Nano Banana - Python Tutorial

This is an image generation tutorial with Google's new model Gemini 2.5 Flash (aka “Nano Banana”). We will learn how to set up the google‑genai Python SDK, get an API key from Google AI Studio, cal...

3,670 views • 120 likes • 19 comments • September 15, 2025

MCP Servers for VS Code (Crash Course)

Supercharge GitHub Copilot in VS Code with the Model Context Protocol (MCP): install and configure MCP servers (including Firecrawl), wire up custom tools over STDIO, and build a Python MCP server ...

6,330 views • 168 likes • 15 comments • September 10, 2025

MCP Security | Malicious MCP Servers (Protect Yourself)

We cover security concerns of MCP servers, especially those using the STDIO transport. You'll learn how to spot malicious MCP servers and protect yourself from them. You'll also learn how MCP agent...

2,620 views • 97 likes • 15 comments • September 08, 2025

RAG Project: Build an AI Onboarding Chatbot with Streamlit, LangChain, and ChromaDB

In this comprehensive video, you'll learn how to build a fully functional AI-powered onboarding chatbot tailored for new employees at a fictional company (Umbrella Corporation). The tutorial walks ...

8,036 views • 244 likes • 17 comments • September 03, 2025

#4 Allan Guo | 19-yo YC Founder - Willow Voice

Interview with Allan Guo: Building Willow Voice, the Next-Gen AI Voice Dictation App Backed by YC --- 🔗 Links - 📣 Willow Voice: https://willowvoice.com/?ref=AAO40 - 🤓 Allan Guo: https://www.linked...

1,373 views • 40 likes • 6 comments • September 01, 2025

Generate UI Tests with TestSprite MCP Server + TRAE

We’ll go step by step through the process of using some tools for generating Apps with AI (vibe coding): generating a clear Product Requirements Document (PRD) with TRAE, coding the app in Next.js ...

1,695 views • 47 likes • 8 comments • August 28, 2025

Tutorial: Auth for Remote MCP Servers (Step by Step) | OAuth 2.1 with ScaleKit

In this tutorial, I’ll show you how to secure your MCP (Model Context Protocol) servers with OAuth 2.1 authentication, using FastAPI and ScaleKit. You’ll learn step by step how to implement authent...

16,646 views • 347 likes • 33 comments • August 25, 2025

GPT-5 for Developers: API Changes, Pricing, Model Router & Security

GPT-5: New API Features, Pricing & Security Explained: In-depth exploration of everything you need to know about GPT-5, focusing on the technical changes, new API features, pricing models, and adva...

3,192 views • 102 likes • 13 comments • August 11, 2025

Deploy Remote MCP Servers in Python (Step by Step)

🚀 MCP Servers over Streamable HTTP (Step-by-Step Guide) Want to connect remote tools to your AI assistant like microservices? Meet MCP (Model Context Protocol) — the protocol that lets your LLM-bas...

28,534 views • 789 likes • 87 comments • June 23, 2025

Anthropic: How to Build Multi Agent Systems

How Anthropic Built a Multi‑Agent AI Research System + Tips for Your Own! Links - Anthropic’s article: https://www.anthropic.com/engineering/built-multi-agent-research-system - Writing Wikipedia...

9,682 views • 321 likes • 29 comments • June 19, 2025

Local MCP Servers for Cursor (Step by step)

In this video, you'll learn how to create a custom MCP (Model Context Protocol) server in Python, mock-integrate it with Confluence, and expose custom tools to the Cursor code editor. The tutorial ...

16,635 views • 223 likes • 24 comments • June 16, 2025

LlamaExtract Tutorial: Convert PDF & Images into JSON

How to Extract Structured Data from Unstructured Files Using Lama Extract (LlamaIndex) This video is a comprehensive tutorial on using LlamaExtract, a tool by LamaIndex, to automatically extract s...

9,103 views • 204 likes • 19 comments • June 09, 2025

Vibe Coding For Engineers (make it ACTUALLY work)

Here is a better way to do Vibe Coding with AI IDEs like Cursor, Copilot, Tree, and Windsurf. In this video, you’ll learn how to vibe code while building secure, useful, and maintainable code. We’...

3,449 views • 107 likes • 7 comments • June 05, 2025

How to Build an MCP Client GUI with Streamlit and FastAPI

In this video, you'll learn how to build a graphical user interface (GUI) for an MCP client using Streamlit in Python, and connect it to a FastAPI backend. The tutorial covers the full process—fr...

18,763 views • 397 likes • 33 comments • May 14, 2025

Create an MCP Client in Python - FastAPI Tutorial

In this tutorial we cover how to create an MCP client in Python using FastAPI. We will take inspiration from the Claude Desktop application to make our LLM call the MCP tools from our MCP server an...

39,451 views • 819 likes • 70 comments • April 29, 2025

Create MCP Clients in JavaScript - Tutorial

In this video, we cover how to create an MCP client that can connect to any MCP server. We will build this from scratch by showing you how to turn a simple chatbot into a chatbot that supports MCP ...

24,472 views • 641 likes • 107 comments • March 27, 2025

Learn MCP Servers with Python (EASY)

In this video, I explain what an MCP server is, how it works and how to create an MCP Server that we will use with Claude Desktop and Claude Code. 🔗 Links --- 🚀 Complete AI Engineer Bootcamp: http...

141,614 views • 3,758 likes • 182 comments • March 13, 2025

Advanced RAG with LlamaIndex - Metadata Extraction [2025]

Learn advanced Retrieval-Augmented Generation (RAG), focusing on techniques such as metadata augmentation and metadata filtering using LlamaIndex. This will help you improve your RAG models for bet...

37,850 views • 1,122 likes • 83 comments • February 28, 2025