Inspecting Neural Networks with CCA - A Gentle Intro (Explainable AI for Deep Learning)

Jay Alammar · Beginner · 🧠 Large Language Models · 4y ago

Canonical Correlation Analysis (CCA) is one of the methods used to explore deep neural networks. Methods like CKA and SVCCA give us insight into how a neural network processes its inputs. This is often done by using CKA and SVCCA as a similarity measure for different activation matrices. In this video, we look at a number of papers that compare different neural networks with one another. We also look at papers that compare the representations of the various layers of a neural network.

Contents:
Introduction (0:00)
Correlation (0:54)
How CCA is used to compare representations (2:50)
SVCCA and Computer Vision models (4:40)
Examining NLP language models with SVCCA: LSTM (9:01)
PWCCA - Projection Weighted Canonical Correlation Analysis (10:22)
How multilingual BERT represents different languages (10:43)
CKA: Centered Kernel Alignment (15:25)
BERT, GPT2, ELMo similarity analysis with CKA (16:07)
Convnets, Resnets, deep nets and wide nets (17:35)
Conclusion (18:59)

Explainable AI Cheat Sheet: https://ex.pegg.io/
1) Explainable AI Intro: https://www.youtube.com/watch?v=Yg3q5x7yDeM&t=0s
2) Neural Activations & Dataset Examples: https://www.youtube.com/watch?v=y0-ISRhL4Ks
3) Probing Classifiers: A Gentle Intro (Explainable AI for Deep Learning): https://www.youtube.com/watch?v=HJn-OTNLnoE

-----

Papers:
SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability
https://arxiv.org/pdf/1706.05806.pdf
Understanding Learning Dynamics Of Language Models with SVCCA
https://arxiv.org/pdf/1811.00225.pdf
Insights on representational similarity in neural networks with canonical correlation
https://arxiv.org/pdf/1806.05759.pdf
BERT is Not an Interlingua and the Bias of Tokenization
https://www.aclweb.org/anthology/D19-6106.pdf
Similarity of Neural Network Representations Revisited
http://proceedings.mlr.press/v97/kornblith19a/kornblith19a.pdf
Similarity Analysis of Contextual Word Representation Models
https://arxiv.org/pdf/2005.01172.pdf
Do Wi
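The description above mentions using CKA as a similarity measure between activation matrices. As a rough illustration, here is a minimal NumPy sketch of linear CKA, the simplest variant described in "Similarity of Neural Network Representations Revisited". The function name `linear_cka` and the random "activations" are assumptions made for this example; they are not taken from the video or from any particular library.

```python
import numpy as np


def linear_cka(X, Y):
    """Linear CKA between two activation matrices (hypothetical helper).

    X has shape (n_examples, d1) and Y has shape (n_examples, d2); the rows
    of both must correspond to the same inputs, in the same order.
    """
    # Center every feature (column) so each has zero mean over the examples.
    X = X - X.mean(axis=0, keepdims=True)
    Y = Y - Y.mean(axis=0, keepdims=True)

    # Squared Frobenius norm of the cross-covariance, normalized by the
    # Frobenius norms of the two self-covariances.
    cross = np.linalg.norm(Y.T @ X, ord="fro") ** 2
    norm_x = np.linalg.norm(X.T @ X, ord="fro")
    norm_y = np.linalg.norm(Y.T @ Y, ord="fro")
    return cross / (norm_x * norm_y)


# Stand-in data (an assumption): random matrices in place of real activations.
rng = np.random.default_rng(0)
acts_a = rng.normal(size=(200, 32))

# The same representation under a random orthogonal change of basis.
Q, _ = np.linalg.qr(rng.normal(size=(32, 32)))
acts_rotated = acts_a @ Q

# An unrelated representation with a different width.
acts_other = rng.normal(size=(200, 16))

sim_same = linear_cka(acts_a, acts_rotated)
sim_diff = linear_cka(acts_a, acts_other)
print(f"rotated copy: {sim_same:.3f}, unrelated: {sim_diff:.3f}")
```

Because linear CKA is invariant to orthogonal transformations, the rotated copy should score essentially 1, while the unrelated random matrix should score much lower. In practice, the two matrices would hold the activations of two layers (or two networks) on the same batch of inputs.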
