Personalization in the Era of LLMs - Shivam Verma, Spotify

AI Engineer · Intermediate ·🧠 Large Language Models ·1d ago
Spotify represents Ariana Grande and Bruno Mars as sequences of six tokens. The first two are shared because both are pop artists. The remaining tokens diverge to capture what makes each distinct. That is a Semantic ID, and it is how Spotify teaches open-weight LLMs to reason over a catalog of 100 million tracks the same way they reason over words. Shivam Verma from Spotify's AI foundation team walks through the three components they assembled to personalize LLMs at scale without full fine-tuning. User embeddings trained on streaming history across 750 million users form the base. Semantic IDs compress catalog vectors into tokens the model can autoregressively generate, predicting the next song or episode as the next token in a sequence. A soft tokenization layer projects a user's embedding directly into the LLM's token space, giving the frozen model a user-specific token to attend over. Podcast next-episode recommendations are already running on this stack in production. Speaker info: - https://x.com/kaffeinated - https://www.linkedin.com/in/shivam13verma
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

The RAG tool that auto-generates Q&A pairs from your documents
Learn to auto-generate Q&A pairs from documents using RAG tool and improve your document management
Dev.to · retrovirusretro
How to Build Secure AI: Implementing Guardrails for Enterprise LLM
Learn to build secure AI by implementing guardrails for enterprise LLMs, going beyond prompt engineering safety for production-ready defense-in-depth architecture
Medium · LLM
5 Chinese AI tools with 100K+ stars that the West is ignoring
Discover 5 Chinese AI tools with 100K+ stars on GitHub that the Western world is overlooking, and learn how to explore and utilize them
Dev.to AI
OpenAI claims it solved an 80-year-old math problem — for real this time
OpenAI's reasoning model claims to have solved an 80-year-old math problem, with mathematicians verifying its solution
TechCrunch AI
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch →