📰 Jay Alammar's Blog

Jay Alammar's Blog 🧠 Large Language Models ⚡ AI Lesson 1y ago

Moving To Substack

I’m freezing this blog and starting to post on my Substack instead. The authoring experience is much more convenient for me there. Please follow me there, and c

Jay Alammar's Blog ⚡ AI Lesson 3y ago

Generative AI and AI Product Moats

Here are eight observations I’ve shared recently on the Cohere blog and videos that go over them. Article: What’s the big deal with Generative AI? Is it the fu

Jay Alammar's Blog 🎨 Image & Video AI ⚡ AI Lesson 3y ago

Remaking Old Computer Graphics With AI Image Generation

Can AI Image generation tools make re-imagined, higher-resolution versions of old video game graphics? Over the last few days, I used AI image generation to rep

Jay Alammar's Blog ⚡ AI Lesson 3y ago

The Illustrated Stable Diffusion

Translations: Chinese, Vietnamese. (V2 Nov 2022: Updated images for more precise description of forward diffusion. A few more images in this version) AI image g

Jay Alammar's Blog 4y ago

Applying massive language models in the real world with Cohere

A little less than a year ago, I joined the awesome Cohere team. The company trains massive language models (both GPT-like and BERT-like) and offers them as an

Jay Alammar's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

The Illustrated Retrieval Transformer

Discussion: Discussion Thread for comments, corrections, or any feedback. Translations: Korean, Russian Summary: The latest batch of language models can be much

Jay Alammar's Blog 🛡️ AI Safety & Ethics ⚡ AI Lesson 5y ago

Explainable AI Cheat Sheet

Introducing the Explainable AI Cheat Sheet, your high-level guide to the set of tools and methods that helps humans understand AI/ML models and their prediction

Jay Alammar's Blog 5y ago

Finding the Words to Say: Hidden State Visualizations for Language Models

By visualizing the hidden state between a model's layers, we can get some clues as to the model's "thought process". Figure: Finding the words to say After a la

Jay Alammar's Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Interfaces for Explaining Transformer Language Models

Interfaces for exploring transformer language models by looking at input saliency and neuron activation. Explorable #1: Input saliency of a list of countries ge

Jay Alammar's Blog 5y ago

How GPT3 Works - Visualizations and Animations

Discussions: Hacker News (397 points, 97 comments), Reddit r/MachineLearning (247 points, 27 comments) Translations: German, Korean, Chinese (Simplified), Russi