Personalization in the Era of LLMs - Shivam Verma, Spotify

Name: Personalization in the Era of LLMs - Shivam Verma, Spotify
Uploaded: 2026-05-19T13:00:06Z
Channel: AI Engineer
Description: Spotify represents Ariana Grande and Bruno Mars as sequences of six tokens. The first two are shared because both are pop artists. The remaining tokens ...

AI Engineer · Intermediate ·🧠 Large Language Models ·1d ago

Skills: LLM Foundations80%Prompt Craft50%

Spotify represents Ariana Grande and Bruno Mars as sequences of six tokens. The first two are shared because both are pop artists. The remaining tokens diverge to capture what makes each distinct. That is a Semantic ID, and it is how Spotify teaches open-weight LLMs to reason over a catalog of 100 million tracks the same way they reason over words. Shivam Verma from Spotify's AI foundation team walks through the three components they assembled to personalize LLMs at scale without full fine-tuning. User embeddings trained on streaming history across 750 million users form the base. Semantic IDs compress catalog vectors into tokens the model can autoregressively generate, predicting the next song or episode as the next token in a sequence. A soft tokenization layer projects a user's embedding directly into the LLM's token space, giving the frozen model a user-specific token to attend over. Podcast next-episode recommendations are already running on this stack in production. Speaker info: - https://x.com/kaffeinated - https://www.linkedin.com/in/shivam13verma

Watch on YouTube ↗ (saves to browser)