1littlecoder - Videos

Back to Channel

Apple's Latest OPEN SOURCE AI is FAST Vision!

FastVLM was introduced in FastVLM: Efficient Vision Encoding for Vision Language Models. (CVPR 2025) Accuracy vs latency figure. Highlights We introduce FastViTHD, a novel hybrid vision encoder d...

4,835 views • 173 likes • 26 comments • August 29, 2025

Voice to Voice GPT RealTime in 4 mins! 💥NEW OpenAI Voice Agents API💥

OpenAI has released a more advanced speech-to-speech model and new API capabilities including MCP server support, image input, and SIP phone calling support. 🔗 Links 🔗 https://openai.com/index/in...

4,057 views • 83 likes • 22 comments • August 29, 2025

NEW: What is OpenAI Codex in 3 mins!

OpenAI Codex is increasingly 🔗 Links 🔗 https://x.com/OpenAIDevs/status/1960809821122519470 ❤️ If you want to support the channel ❤️ Support here: Patreon - https://www.patreon.com/1littlecoder...

12,466 views • 220 likes • 18 comments • August 28, 2025

FREE Vibe Coding with Gemini AI Studio!

🔗 Links 🔗 Access AI Studio here https://ai.studio/ ❤️ If you want to support the channel ❤️ Support here: Patreon - https://www.patreon.com/1littlecoder/ Ko-Fi - https://ko-fi.com/1littlecoder ...

4,057 views • 126 likes • 14 comments • August 27, 2025

Gemini Nano Banana 2.5 Flash Image is here!!!

Today in the Gemini app, we're unveiling a new image editing model from Google DeepMind. People have been going bananas over it already in early previews — it's the top-rated image editing model in...

2,093 views • 92 likes • 18 comments • August 26, 2025

NotebookLM but OPEN SOURCE Text to Speech from Microsoft!

Microsoft's VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in tra...

3,250 views • 146 likes • 27 comments • August 25, 2025

The Official Deepseek V3.1 in 4 mins!

Deepseek v3.1 is officially here! 🔗 Links 🔗 access deepseek v3.1 here - https://chat.deepseek.com/ https://api-docs.deepseek.com/news/news250821 Related Video: NotebookLM but Open Source from ...

10,599 views • 230 likes • 17 comments • August 21, 2025

Don't sleep on Google's Nano-Banana AI!

Try Nano-Banana on LMArena.ai while it's available. It's possibly a new model that coincides with Google Pixel 10 Pro launch powering it! 🔗 Links 🔗 Nano Banana - https://lmarena.ai/c/17f0e6c8-028...

6,024 views • 167 likes • 19 comments • August 20, 2025

Deepseek Teases a Comeback!

Deepseek v3.1 Base - This is a new model uploaded by Deepseek with literally no information! 🔗 Links 🔗 https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Base ❤️ If you want to support the chan...

2,824 views • 99 likes • 12 comments • August 19, 2025

What's ChatGPT Go - OpenAI's NEW PLAN for India!!!

What is ChatGPT GO? https://help.openai.com/en/articles/11989085-what-is-chatgpt-go 🔗 Links 🔗 https://x.com/nickaturley/status/1957613818902892985 ChatGPT Pricing - https://chatgpt.com/#pricing...

15,400 views • 214 likes • 67 comments • August 19, 2025

This AI Designs and Builds FULL SaaS in *just Minutes*

Try it out here - https://chatllm.abacus.ai/ajm Your All-In-One AI Platform ChatLLM Teams + DeepAgent! Access all the state-of-the-art AI in one powerhouse package. Top LLMs, image and video gene...

7,133 views • 109 likes • 26 comments • August 18, 2025

AI News of the Week (Live Stream)

AI News Claude Opus 4 and 4.1 can now end a rare subset of conversations https://www.anthropic.com/research/end-subset-conversations The Hidden Drivers of HRM's Performance on ARC-AGI https:/...

768 views • 44 likes • 12 comments • August 17, 2025

AI News of the Week (kind of)

AI News

0 views • 0 likes • 0 comments • August 16, 2025

Gemma 3 270M - Google's NEW Tiny LLM in 7 mins!!

Gemma 3 270M, a compact, 270-million parameter model designed from the ground up for task-specific fine-tuning with strong instruction-following and text structuring capabilities already trained in...

14,644 views • 441 likes • 57 comments • August 14, 2025

GPT-5 Update! 💥 How to get GPT 4o on ChatGPT back? 💥

This tutorial shows how to get back old models like GPT 4o and O3 back on ChatGPT and also quick GPT-5 update about which models should you use! https://chatgpt.com/ #chatgpt #chatgpt4 #gpt4o #gp...

8,443 views • 116 likes • 58 comments • August 13, 2025

GLM VISION a new VLM - full testing!

GLM-4.5V: a breakthrough in open-source visual reasoning GLM-4.5V delivers state-of-the-art performance among open-source models in its size class, dominating across 41 benchmarks. Built on the G...

2,422 views • 108 likes • 20 comments • August 12, 2025

GPT 5 CODING TEST! 💥 GPT-5 in Cursor Beginners Tutorial💥

In this video, I show How i built a Python CLI tool using GPT-5 and Cursor Here's the GPT-5 built Python package and CLI - https://pypi.org/project/confradar/ ❤️ If you want to support the cha...

3,300 views • 68 likes • 18 comments • August 11, 2025

GPT-OSS-20B Colab setup: Beginner's ultimate guide

If you have got a computer with not so great compute, This video shows how you can run GPT-OSS-20B on Free Colab. This is OpenAI's open source LLM! You can use this notebook for batching and als...

5,169 views • 132 likes • 22 comments • August 08, 2025

GPT-5 watch party (for littlecoders)

853 views • 22 likes • 3 comments • August 08, 2025

GPT-5 in 7 mins - Didn't feel the AGI!!

https://openai.com/index/introducing-gpt-5/ GPT‑5, our best AI system yet. GPT‑5 is a significant leap in intelligence over all our previous models, featuring state-of-the-art performance across c...

21,878 views • 482 likes • 128 comments • August 07, 2025

Quick update on "LEAKED" GPT-5 Variants! (Not the ACTUAL Model)

This video discusses about the leaked info on GPT-5 variants 🔗 Links 🔗 GPT-5 is generally available in github models (leaked page) https://archive.is/2025.08.07-035308/https://github.blog/change...

1,577 views • 49 likes • 16 comments • August 07, 2025

No GPU Needed! BEST FREE Text to Speech for CPU! 😻 KittenTTS 😻

Kitten TTS is an open-source realistic text-to-speech model with just 15 million parameters, designed for lightweight deployment and high-quality voice synthesis. ✨ Features Ultra-lightweight: Mod...

5,309 views • 180 likes • 25 comments • August 06, 2025

1600 tokens/s - The FASTEST ACCESS to gpt-oss-120b!!!

Watch my video on GPT-OSS-20B and GPT-OSS-120B - https://www.youtube.com/watch?v=D4RFytIOks4 This video highlights the fastest access to gpt-oss-120b 🔗 Links 🔗 https://openai.com/index/introduc...

4,202 views • 102 likes • 22 comments • August 05, 2025

OpenAI's OPEN SOURCE "GPT-OSS" in 8 mins!

We’re releasing gpt-oss-120b and gpt-oss-20b—two state-of-the-art open-weight language models that deliver strong real-world performance at low cost. Available under the flexible Apache 2.0 license...

11,960 views • 218 likes • 39 comments • August 05, 2025

Jack Dorsey's Goose + SECRET MODEL is so WILD!

This video shows the powerful combination of GOose and Secret Model (Horizon Beta) Horizon Alpha video OpenAI's NEW Secret Model?! Free Usage Today! https://www.youtube.com/watch?v=piOVejhZ9aY ...

4,854 views • 176 likes • 39 comments • August 02, 2025

VEO 3 is the NEW High for AI videos!💥Veo 3 Image to Videos 💥

VEO 3's image to video is here right now on FAL And this video shows how to do the same ! veo 3 fast - image to video on FAL https://fal.ai/models/fal-ai/veo3/fast/image-to-video This is probabl...

2,401 views • 59 likes • 10 comments • August 01, 2025

OpenAI's NEW Secret Model?! Free Usage Today!

256,000 context openrouter/horizon-alpha 🔗 Links 🔗 https://openrouter.ai/openrouter/horizon-alpha ❤️ If you want to support the channel ❤️ Support here: Patreon - https://www.patreon.com/1littl...

5,590 views • 162 likes • 24 comments • July 31, 2025

Reacting to Mark Zuckerberg's AI Vision 💥 Meta AI Super Intelligence Labs 💥

Today Mark shared Meta’s vision for the future of personal superintelligence for everyone. 🔗 Links 🔗 https://www.meta.com/superintelligence/ Main Video - https://x.com/AIatMeta/status/1950543458...

4,159 views • 115 likes • 46 comments • July 30, 2025

This Chinese AI one-shots a Full CODING PROJECT! (GLM 4.5 coding test)

GLM-4.5 and GLM-4.5-Air — our latest flagship models. GLM-4.5 is built with 355 billion total parameters and 32 billion active parameters, and GLM-4.5-Air with 106 billion total parameters and 12 b...

9,697 views • 205 likes • 41 comments • July 29, 2025

China's FREE Agentic AI 🔥 GLM-4.5 JUST DROPPED!!!

GLM-4.5 is built with 355 billion total parameters and 32 billion active parameters, and GLM-4.5-Air with 106 billion total parameters and 12 billion active parameters. Both are designed to unify r...

6,911 views • 212 likes • 31 comments • July 28, 2025

ChatGPT Agent vs Manus AI - AI Agents King ?!

In this war of Agents, I tested the newly launched ChatGPT Agent with Manus AI. The results are very interesting. Watch the video :) ❤️ If you want to support the channel ❤️ Support here: Patre...

6,922 views • 121 likes • 25 comments • July 27, 2025

NEW EMOTIONAL Text-to-Speech AI - New Best Voice Cloning?

Higgs Audio v2, a powerful audio foundation model pretrained on over 10 million hours of audio data and a diverse set of text data. Despite having no post-training or fine-tuning, Higgs Audio v2 ex...

7,275 views • 200 likes • 45 comments • July 24, 2025

China quietly drops a Coding MONSTER!

🔗 Links 🔗 https://qwenlm.github.io/blog/qwen3-coder/ https://huggingface.co/spaces/Qwen/Qwen3-Coder-WebDev https://chat.qwen.ai/ ❤️ If you want to support the channel ❤️ Support here: Patreon -...

2,821 views • 124 likes • 27 comments • July 23, 2025

AI Almost Won! 🥇OpenAI vs Deepmind Controversy! 🥇

This video talks about IMO 2025 Gold medal level performance by Google and OpenAI! Google Deepmind IMO 2025 solutions https://storage.googleapis.com/deepmind-media/gemini/IMO_2025.pdf ❤️ If yo...

1,910 views • 79 likes • 11 comments • July 22, 2025

Can Kimi K2 Outsmart AI Detectors? [Real‑World Test]

I spent a few hours testing the best Flagship AI that can escape AI Detection! The video has got the test and results ! Let me know if you are surprised! ❤️ If you want to support the chan...

5,888 views • 175 likes • 31 comments • July 19, 2025

Kimi K2 - The BEST Open Source LLM, right now!

Want to know more about Kimi K2? https://moonshotai.github.io/Kimi-K2/ https://huggingface.co/blog/fdaudens/moonshot-ai-kimi-k2-explained How to access Kimi K2? https://huggingface.co/moonshot...

4,643 views • 158 likes • 28 comments • July 18, 2025

ChatGPT Agent* - Not GPT-5, Not AGI, But REAL workhorse!

Introducing ChatGPT agent: bridging research and action ChatGPT now thinks and acts, proactively choosing from a toolbox of agentic skills to complete tasks for you using its own computer. 🔗 Link...

5,466 views • 154 likes • 28 comments • July 17, 2025

Grok 4 - Crazy Pricing and Crazy Politics!

🔗 Links 🔗 https://grok.com/ Grok ARC AGI 2 benchmark - https://x.com/arcprize/status/1943168950763950555/photo/1 Grok 4 Jeremy Howard Test - https://x.com/jeremyphoward/status/194344454969691771...

2,327 views • 84 likes • 9 comments • July 11, 2025

This is a SUPER Fast TTS that's FREE!!! ⚡️ How to run Kyutai TTS ⚡️

Kyutai TTS - model for streaming text-to-speech (TTS). Unlike offline text-to-speech, where the model needs the entire text to produce the audio, our model starts to output audio as soon as the fir...

6,074 views • 209 likes • 30 comments • July 03, 2025

This MIGHT be OpenAI's Open Source LLM!

Cypher Alpha is being specualated as OpenAI's open source LLM It's still a speculation but you could try this for free! Chat with Cypher Alpha here - https://openrouter.ai/chat?models=openrouter...

4,383 views • 140 likes • 18 comments • July 02, 2025

Better than MoE- Grouped Experts!

PANGU PRO MOE: MIXTURE OF GROUPED EXPERTS FOR EFFICIENT SPARSITY 🔗 Links 🔗 Pangu Pro Paper https://arxiv.org/pdf/2505.21411 Pangu Pro model can be downloaded here - https://gitcode.com/ascend-tr...

1,676 views • 70 likes • 5 comments • July 02, 2025

Massive Release from China AGAIN! 💥Ernie 4.5 💥

Ernie 4.5 release doc https://yiyan.baidu.com/blog/posts/ernie4.5/ Ernie 4.5 models! https://huggingface.co/collections/baidu/ernie-45-6861cd4c9be84540645f35c9 ❤️ If you want to support the ch...

3,741 views • 130 likes • 12 comments • July 01, 2025

LEAKED Memo: Meta Super Intelligence Labs!!!

Here Is Everyone Mark Zuckerberg Has Hired So Far for Meta’s ‘Superintelligence’ Team After a poaching frenzy that’s brought in talent from rival firms like OpenAI, Anthropic, and Google, Zuckerber...

2,372 views • 64 likes • 10 comments • July 01, 2025

STOP USING Veo 3, This is the Cheapest AI Audio for AI Videos!!!

MMAudio generates synchronized audio given video and/or text inputs. 🔗 Links 🔗 On HF - https://huggingface.co/spaces/hkchengrex/MMAudio On FAL - https://fal.ai/models/fal-ai/mmaudio-v2 Credits...

2,167 views • 69 likes • 9 comments • June 23, 2025

Gemini Music? Real-Time AI music is just here 💥 Google Magenta RT 💥

Magenta RealTime (Magenta RT), an open-weights live music model that allows you to interactively create, control and perform music in the moment. 🔗 Links 🔗 https://magenta.withgoogle.com/magenta-...

3,920 views • 140 likes • 17 comments • June 21, 2025

HUGE Gemini 2.5 FLASH update you won't LIKE!

Gemini 2.5 Flash and Pro are now generally available, and Google is introducing 2.5 Flash-Lite, the most cost-efficient and fastest 2.5 model yet. https://blog.google/products/gemini/gemini-2-5-m...

5,040 views • 129 likes • 12 comments • June 18, 2025

This SMALL OCR AI is FREE! 💥Nanonets OCR-S Explained 💥

Nanonets-OCR-s, a state-of-the-art image-to-markdown OCR model that goes far beyond traditional text extraction. This powerful model transforms documents into structured markdown with intelligent c...

8,848 views • 376 likes • 62 comments • June 17, 2025

This Voice CLONING is Insanity! 💥Commercial-use TTS x ElevenLabs Alternative💥

Chatterbox, Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistentl...

6,924 views • 314 likes • 47 comments • May 31, 2025

10 Wild Veo 3 examples in 2 mins!

Google Veo 3 just launched and it's already creating hollywood level videos! This compilation will blow your minds! The first Google model to make videos auto-synced with Audio #googleai #veo...

9,671 views • 86 likes • 23 comments • May 24, 2025

Real-Time VOICE Cloning 💥 The Best Low-latency AI Speech Engine 💥

UNMUTE by Kyutai - This is a cascaded system made by Kyutai: our speech-to-text transcribes what you say, an LLM (we use Gemma 3 12B) generates the text of the response, and we then use our text-to...

8,084 views • 369 likes • 88 comments • May 23, 2025