10 articles

📰 Jay Alammar's Blog

Articles from Jay Alammar's Blog · 10 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (9111) ArXiv cs.AIDev.to · FORUM WEBForbes InnovationOpenAI NewsDev.to AIHugging Face Blog
Jay Alammar's Blog 4y ago
Applying massive language models in the real world with Cohere
A little less than a year ago, I joined the awesome Cohere team. The company trains massive language models (both GPT-like and BERT-like) and offers them as an
Jay Alammar's Blog 4y ago
The Illustrated Retrieval Transformer
Discussion: Discussion Thread for comments, corrections, or any feedback. Translations: Korean, Russian Summary: The latest batch of language models can be much
Jay Alammar's Blog 4y ago
Explainable AI Cheat Sheet
Introducing the Explainable AI Cheat Sheet, your high-level guide to the set of tools and methods that helps humans understand AI/ML models and their prediction
Jay Alammar's Blog 5y ago
Finding the Words to Say: Hidden State Visualizations for Language Models
By visualizing the hidden state between a model's layers, we can get some clues as to the model's "thought process". Figure: Finding the words to say After a la
Jay Alammar's Blog 5y ago
Interfaces for Explaining Transformer Language Models
Interfaces for exploring transformer language models by looking at input saliency and neuron activation. Explorable #1: Input saliency of a list of countries ge
Jay Alammar's Blog 5y ago
How GPT3 Works - Visualizations and Animations
Discussions: Hacker News (397 points, 97 comments), Reddit r/MachineLearning (247 points, 27 comments) Translations: German, Korean, Chinese (Simplified), Russi