New Open Audio Models 🤗 | Recap with Jeff

Hugging Face · Intermediate ·📐 ML Fundamentals ·3w ago

Skills: LLM Engineering90%

This video covers the latest wave of open audio tooling, from Mistral's Voxtral 4B text-to-speech model to Cohere Transcribe for speech recognition and the Hugging Face infrastructure used to run large-scale transcription workflows. It walks through live demos, browser-based transcription with Transformers.js, and a practical UV-script pipeline built on storage buckets, HF Mount, and HF Jobs. If you're building speech apps or batch transcription systems, this is a fast overview of the current open stack. --- Demo Links 👉 Voxtral TTS: https://huggingface.co/spaces/mistralai/voxtral-tts-demo 👉 Cohere Transcribe: https://huggingface.co/spaces/CohereLabs/Cohere-Transcribe-WebGPU 👉 UV scripts for transcription: https://huggingface.co/datasets/uv-scripts/transcription --- 🤓 Topics Covered - Voxtral 4B text-to-speech - Cohere Transcribe speech-to-text - Hugging Face audio pipelines --- ⏱️ Timestamps 0:00 Open audio models and demos 2:44 What Hugging Face storage buckets are 3:39 How HF Mount works 4:02 HF Jobs and wrap-up

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: LLM Engineering

View skill →

Build an LLM and RAG-based Chat Application using AlloyDB and LangChain

FULLY LOCAL Mistral AI PDF Processing [Hands-on Tutorial]

FULLY LOCAL Mistral AI PDF Processing [Hands-on Tutorial]

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

How to Make an Asteroids Game Bot (LIVE)

How to Make an Asteroids Game Bot (LIVE)

Using Claude Code + Nano Banana Pro To Create a Dataset of Engineering Drawings

Using Claude Code + Nano Banana Pro To Create a Dataset of Engineering Drawings

Automata Learning Lab

Advanced AI and Machine Learning Techniques and Capstone

Advanced AI and Machine Learning Techniques and Capstone

Related AI Lessons

Radiomics in Medical Imaging: Unlocking Hidden Patterns for Early Disease Detection

Learn how radiomics in medical imaging can unlock hidden patterns for early disease detection using machine learning techniques

Medium · Machine Learning

Generative AI From First Principles — Article 5 (Recurrent Neural Networks)

Learn the fundamentals of Recurrent Neural Networks (RNNs) and how they overcome limitations of basic neural networks

Medium · Machine Learning

Generative AI From First Principles — Article 5 (Recurrent Neural Networks)

Learn the fundamentals of Recurrent Neural Networks (RNNs) and how they overcome limitations of basic neural networks

Medium · Deep Learning

Why Data Quality is Becoming More Important Than Model Size in Modern AI Systems

Data quality is becoming more crucial than model size in modern AI systems, and here's why it matters for building reliable AI models

Dev.to · Vishal Uttam Mane

Chapters (4)

Open audio models and demos

2:44 What Hugging Face storage buckets are

3:39 How HF Mount works

4:02 HF Jobs and wrap-up

Introducing Storage Buckets