Is This The Fastest ASR?

Sam Witteveen · Advanced · 🧠 Large Language Models · 3h ago

Skills: LLM Engineering 90% · ML Maths Basics 60%

In this video, I dive into IBM's newly released Granite Speech 4.1 models and explore what makes them interesting, particularly the three 2B variants they've dropped and how each one makes a different trade-off between accuracy, richness, and throughput that you'll actually care about for real applications.

🔗 Links:
IBM Research Blog → https://research.ibm.com/blog/granite-4-1-ai-foundation-models
Twitter: https://x.com/Sam_Witteveen

🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: https://drp.li/dIMes

👨‍💻 Github: https://github.com/samwit/llm-tutorials

⏱️ Time Stamps:
00:00 Intro
00:20 IBM Granite Collection
00:27 Granite Docling
00:46 Granite Speech 4.1
01:16 Granite 4.1 Blog
01:38 Granite Speech 4.1 2B
04:02 Granite Speech 4.1 2B Plus
06:15 Granite Speech 4.1 2B NAR
07:30 NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper
07:45 Architecture
09:45 Code Time
12:00 Granite Speech Model Github

#DellProPrecision #DellProMax #Delltech #localai #NVIDIA




Chapters (12)

0:00 Intro

0:20 IBM Granite Collection

0:27 Granite Docling

0:46 Granite Speech 4.1

1:16 Granite 4.1 Blog

1:38 Granite Speech 4.1 2B

4:02 Granite Speech 4.1 2B Plus

6:15 Granite Speech 4.1 2B NAR

7:30 NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper

7:45 Architecture

9:45 Code Time

12:00 Granite Speech Model Github
