1littlecoder - Videos
Back to ChannelApple's Latest OPEN SOURCE AI is FAST Vision!
FastVLM was introduced in FastVLM: Efficient Vision Encoding for Vision Language Models. (CVPR 2025) Accuracy vs latency figure. Highlights We introduce FastViTHD, a novel hybrid vision encoder d...
Voice to Voice GPT RealTime in 4 mins! 💥NEW OpenAI Voice Agents API💥
OpenAI has released a more advanced speech-to-speech model and new API capabilities including MCP server support, image input, and SIP phone calling support. 🔗 Links 🔗 https://openai.com/index/in...
NEW: What is OpenAI Codex in 3 mins!
OpenAI Codex is increasingly 🔗 Links 🔗 https://x.com/OpenAIDevs/status/1960809821122519470 ❤️ If you want to support the channel ❤️ Support here: Patreon - https://www.patreon.com/1littlecoder...
FREE Vibe Coding with Gemini AI Studio!
🔗 Links 🔗 Access AI Studio here https://ai.studio/ ❤️ If you want to support the channel ❤️ Support here: Patreon - https://www.patreon.com/1littlecoder/ Ko-Fi - https://ko-fi.com/1littlecoder ...
Gemini Nano Banana 2.5 Flash Image is here!!!
Today in the Gemini app, we're unveiling a new image editing model from Google DeepMind. People have been going bananas over it already in early previews — it's the top-rated image editing model in...
NotebookLM but OPEN SOURCE Text to Speech from Microsoft!
Microsoft's VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in tra...
The Official Deepseek V3.1 in 4 mins!
Deepseek v3.1 is officially here! 🔗 Links 🔗 access deepseek v3.1 here - https://chat.deepseek.com/ https://api-docs.deepseek.com/news/news250821 Related Video: NotebookLM but Open Source from ...
Don't sleep on Google's Nano-Banana AI!
Try Nano-Banana on LMArena.ai while it's available. It's possibly a new model that coincides with Google Pixel 10 Pro launch powering it! 🔗 Links 🔗 Nano Banana - https://lmarena.ai/c/17f0e6c8-028...
Deepseek Teases a Comeback!
Deepseek v3.1 Base - This is a new model uploaded by Deepseek with literally no information! 🔗 Links 🔗 https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Base ❤️ If you want to support the chan...
What's ChatGPT Go - OpenAI's NEW PLAN for India!!!
What is ChatGPT GO? https://help.openai.com/en/articles/11989085-what-is-chatgpt-go 🔗 Links 🔗 https://x.com/nickaturley/status/1957613818902892985 ChatGPT Pricing - https://chatgpt.com/#pricing...
This AI Designs and Builds FULL SaaS in *just Minutes*
Try it out here - https://chatllm.abacus.ai/ajm Your All-In-One AI Platform ChatLLM Teams + DeepAgent! Access all the state-of-the-art AI in one powerhouse package. Top LLMs, image and video gene...
AI News of the Week (Live Stream)
AI News Claude Opus 4 and 4.1 can now end a rare subset of conversations https://www.anthropic.com/research/end-subset-conversations The Hidden Drivers of HRM's Performance on ARC-AGI https:/...
Gemma 3 270M - Google's NEW Tiny LLM in 7 mins!!
Gemma 3 270M, a compact, 270-million parameter model designed from the ground up for task-specific fine-tuning with strong instruction-following and text structuring capabilities already trained in...
GPT-5 Update! 💥 How to get GPT 4o on ChatGPT back? 💥
This tutorial shows how to get back old models like GPT 4o and O3 back on ChatGPT and also quick GPT-5 update about which models should you use! https://chatgpt.com/ #chatgpt #chatgpt4 #gpt4o #gp...
GLM VISION a new VLM - full testing!
GLM-4.5V: a breakthrough in open-source visual reasoning GLM-4.5V delivers state-of-the-art performance among open-source models in its size class, dominating across 41 benchmarks. Built on the G...
GPT 5 CODING TEST! 💥 GPT-5 in Cursor Beginners Tutorial💥
In this video, I show How i built a Python CLI tool using GPT-5 and Cursor Here's the GPT-5 built Python package and CLI - https://pypi.org/project/confradar/ ❤️ If you want to support the cha...
GPT-OSS-20B Colab setup: Beginner's ultimate guide
If you have got a computer with not so great compute, This video shows how you can run GPT-OSS-20B on Free Colab. This is OpenAI's open source LLM! You can use this notebook for batching and als...
GPT-5 watch party (for littlecoders)
GPT-5 in 7 mins - Didn't feel the AGI!!
https://openai.com/index/introducing-gpt-5/ GPT‑5, our best AI system yet. GPT‑5 is a significant leap in intelligence over all our previous models, featuring state-of-the-art performance across c...
Quick update on "LEAKED" GPT-5 Variants! (Not the ACTUAL Model)
This video discusses about the leaked info on GPT-5 variants 🔗 Links 🔗 GPT-5 is generally available in github models (leaked page) https://archive.is/2025.08.07-035308/https://github.blog/change...
No GPU Needed! BEST FREE Text to Speech for CPU! 😻 KittenTTS 😻
Kitten TTS is an open-source realistic text-to-speech model with just 15 million parameters, designed for lightweight deployment and high-quality voice synthesis. ✨ Features Ultra-lightweight: Mod...
1600 tokens/s - The FASTEST ACCESS to gpt-oss-120b!!!
Watch my video on GPT-OSS-20B and GPT-OSS-120B - https://www.youtube.com/watch?v=D4RFytIOks4 This video highlights the fastest access to gpt-oss-120b 🔗 Links 🔗 https://openai.com/index/introduc...
OpenAI's OPEN SOURCE "GPT-OSS" in 8 mins!
We’re releasing gpt-oss-120b and gpt-oss-20b—two state-of-the-art open-weight language models that deliver strong real-world performance at low cost. Available under the flexible Apache 2.0 license...
Jack Dorsey's Goose + SECRET MODEL is so WILD!
This video shows the powerful combination of GOose and Secret Model (Horizon Beta) Horizon Alpha video OpenAI's NEW Secret Model?! Free Usage Today! https://www.youtube.com/watch?v=piOVejhZ9aY ...
VEO 3 is the NEW High for AI videos!💥Veo 3 Image to Videos 💥
VEO 3's image to video is here right now on FAL And this video shows how to do the same ! veo 3 fast - image to video on FAL https://fal.ai/models/fal-ai/veo3/fast/image-to-video This is probabl...
OpenAI's NEW Secret Model?! Free Usage Today!
256,000 context openrouter/horizon-alpha 🔗 Links 🔗 https://openrouter.ai/openrouter/horizon-alpha ❤️ If you want to support the channel ❤️ Support here: Patreon - https://www.patreon.com/1littl...
Reacting to Mark Zuckerberg's AI Vision 💥 Meta AI Super Intelligence Labs 💥
Today Mark shared Meta’s vision for the future of personal superintelligence for everyone. 🔗 Links 🔗 https://www.meta.com/superintelligence/ Main Video - https://x.com/AIatMeta/status/1950543458...
This Chinese AI one-shots a Full CODING PROJECT! (GLM 4.5 coding test)
GLM-4.5 and GLM-4.5-Air — our latest flagship models. GLM-4.5 is built with 355 billion total parameters and 32 billion active parameters, and GLM-4.5-Air with 106 billion total parameters and 12 b...
China's FREE Agentic AI 🔥 GLM-4.5 JUST DROPPED!!!
GLM-4.5 is built with 355 billion total parameters and 32 billion active parameters, and GLM-4.5-Air with 106 billion total parameters and 12 billion active parameters. Both are designed to unify r...
ChatGPT Agent vs Manus AI - AI Agents King ?!
In this war of Agents, I tested the newly launched ChatGPT Agent with Manus AI. The results are very interesting. Watch the video :) ❤️ If you want to support the channel ❤️ Support here: Patre...
NEW EMOTIONAL Text-to-Speech AI - New Best Voice Cloning?
Higgs Audio v2, a powerful audio foundation model pretrained on over 10 million hours of audio data and a diverse set of text data. Despite having no post-training or fine-tuning, Higgs Audio v2 ex...
China quietly drops a Coding MONSTER!
🔗 Links 🔗 https://qwenlm.github.io/blog/qwen3-coder/ https://huggingface.co/spaces/Qwen/Qwen3-Coder-WebDev https://chat.qwen.ai/ ❤️ If you want to support the channel ❤️ Support here: Patreon -...
AI Almost Won! 🥇OpenAI vs Deepmind Controversy! 🥇
This video talks about IMO 2025 Gold medal level performance by Google and OpenAI! Google Deepmind IMO 2025 solutions https://storage.googleapis.com/deepmind-media/gemini/IMO_2025.pdf ❤️ If yo...
Can Kimi K2 Outsmart AI Detectors? [Real‑World Test]
I spent a few hours testing the best Flagship AI that can escape AI Detection! The video has got the test and results ! Let me know if you are surprised! ❤️ If you want to support the chan...
Kimi K2 - The BEST Open Source LLM, right now!
Want to know more about Kimi K2? https://moonshotai.github.io/Kimi-K2/ https://huggingface.co/blog/fdaudens/moonshot-ai-kimi-k2-explained How to access Kimi K2? https://huggingface.co/moonshot...
ChatGPT Agent* - Not GPT-5, Not AGI, But REAL workhorse!
Introducing ChatGPT agent: bridging research and action ChatGPT now thinks and acts, proactively choosing from a toolbox of agentic skills to complete tasks for you using its own computer. 🔗 Link...
Grok 4 - Crazy Pricing and Crazy Politics!
🔗 Links 🔗 https://grok.com/ Grok ARC AGI 2 benchmark - https://x.com/arcprize/status/1943168950763950555/photo/1 Grok 4 Jeremy Howard Test - https://x.com/jeremyphoward/status/194344454969691771...
This is a SUPER Fast TTS that's FREE!!! ⚡️ How to run Kyutai TTS ⚡️
Kyutai TTS - model for streaming text-to-speech (TTS). Unlike offline text-to-speech, where the model needs the entire text to produce the audio, our model starts to output audio as soon as the fir...
This MIGHT be OpenAI's Open Source LLM!
Cypher Alpha is being specualated as OpenAI's open source LLM It's still a speculation but you could try this for free! Chat with Cypher Alpha here - https://openrouter.ai/chat?models=openrouter...
Better than MoE- Grouped Experts!
PANGU PRO MOE: MIXTURE OF GROUPED EXPERTS FOR EFFICIENT SPARSITY 🔗 Links 🔗 Pangu Pro Paper https://arxiv.org/pdf/2505.21411 Pangu Pro model can be downloaded here - https://gitcode.com/ascend-tr...
Massive Release from China AGAIN! 💥Ernie 4.5 💥
Ernie 4.5 release doc https://yiyan.baidu.com/blog/posts/ernie4.5/ Ernie 4.5 models! https://huggingface.co/collections/baidu/ernie-45-6861cd4c9be84540645f35c9 ❤️ If you want to support the ch...
LEAKED Memo: Meta Super Intelligence Labs!!!
Here Is Everyone Mark Zuckerberg Has Hired So Far for Meta’s ‘Superintelligence’ Team After a poaching frenzy that’s brought in talent from rival firms like OpenAI, Anthropic, and Google, Zuckerber...
STOP USING Veo 3, This is the Cheapest AI Audio for AI Videos!!!
MMAudio generates synchronized audio given video and/or text inputs. 🔗 Links 🔗 On HF - https://huggingface.co/spaces/hkchengrex/MMAudio On FAL - https://fal.ai/models/fal-ai/mmaudio-v2 Credits...
Gemini Music? Real-Time AI music is just here 💥 Google Magenta RT 💥
Magenta RealTime (Magenta RT), an open-weights live music model that allows you to interactively create, control and perform music in the moment. 🔗 Links 🔗 https://magenta.withgoogle.com/magenta-...
HUGE Gemini 2.5 FLASH update you won't LIKE!
Gemini 2.5 Flash and Pro are now generally available, and Google is introducing 2.5 Flash-Lite, the most cost-efficient and fastest 2.5 model yet. https://blog.google/products/gemini/gemini-2-5-m...
This SMALL OCR AI is FREE! 💥Nanonets OCR-S Explained 💥
Nanonets-OCR-s, a state-of-the-art image-to-markdown OCR model that goes far beyond traditional text extraction. This powerful model transforms documents into structured markdown with intelligent c...
This Voice CLONING is Insanity! 💥Commercial-use TTS x ElevenLabs Alternative💥
Chatterbox, Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistentl...
10 Wild Veo 3 examples in 2 mins!
Google Veo 3 just launched and it's already creating hollywood level videos! This compilation will blow your minds! The first Google model to make videos auto-synced with Audio #googleai #veo...
Real-Time VOICE Cloning 💥 The Best Low-latency AI Speech Engine 💥
UNMUTE by Kyutai - This is a cascaded system made by Kyutai: our speech-to-text transcribes what you say, an LLM (we use Gemma 3 12B) generates the text of the response, and we then use our text-to...