Facebook AI's DINO | PyTorch Code Explained
👨‍👩‍👧‍👦 JOIN OUR DISCORD COMMUNITY:
Discord ► https://discord.gg/peBrCpheKE
📢 SUBSCRIBE TO MY MONTHLY AI NEWSLETTER:
Substack ► https://aiepiphany.substack.com/
❤️ Become The AI Epiphany Patreon ❤️ ► https://www.patreon.com/theaiepiphany
With this video, I kick off a brand new series of PyTorch coding videos where I'll be explaining some of the most impactful AI research by walking through its code!
In this video I cover DINO from the "Emerging Properties in Self-Supervised Vision Transformers" paper.
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Paper: https://arxiv.org/abs/2104.14294
Code: https://github.com/facebookresearch/dino
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Timetable:
00:00 DINO paper - short overview
02:38 Code analysis starts - training arguments (argparse)
07:30 Training main function
08:25 DINO augmentations
11:30 Main function resumed
16:05 DINO head (MLP)
18:05 Main function resumed
20:40 DINO loss overview
21:45 Main function resumed
24:05 Augmentations visualized (matplotlib)
27:00 Main function resumed
28:00 DINO Core part!!! (in-depth shape analysis)
33:45 DINO loss in depth
38:00 Main function resumed
39:50 Visualizing attention script explained
48:15 Outro
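
If you'd like something concrete to hold on to during the two DINO loss chapters (20:40 and 33:45), here is a minimal sketch of the loss idea for the views of a single image. It is not the code from the repo: it uses plain Python lists instead of PyTorch tensors, and the names `log_softmax`, `dino_loss` and `update_center` are hypothetical helpers made up for this illustration. The temperatures (0.1 for the student, 0.04 for the teacher) and the center momentum (0.9) are assumed values that I believe match the defaults, so double-check them against the training arguments. It also assumes the student's first views are the same global views the teacher sees.

```python
import math

def log_softmax(logits, temperature):
    """Numerically stable log-softmax of a list of floats, after dividing by a temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_z = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_z for s in scaled]

def dino_loss(student_views, teacher_views, center,
              student_temp=0.1, teacher_temp=0.04):
    """Mean cross-entropy between teacher and student distributions,
    taken over every pair of *different* views of one image.

    Assumption: student_views[i] and teacher_views[i] come from the same
    view for every index i the teacher has, so equal indices are skipped.
    """
    total, n_pairs = 0.0, 0
    for t_idx, t_logits in enumerate(teacher_views):
        # Teacher side: subtract the running center, then sharpen with a low temperature.
        centered = [x - c for x, c in zip(t_logits, center)]
        t_probs = [math.exp(lp) for lp in log_softmax(centered, teacher_temp)]
        for s_idx, s_logits in enumerate(student_views):
            if s_idx == t_idx:
                continue  # both networks saw the same view, no learning signal wanted here
            s_log_probs = log_softmax(s_logits, student_temp)
            total += -sum(p * lq for p, lq in zip(t_probs, s_log_probs))
            n_pairs += 1
    return total / n_pairs

def update_center(center, teacher_views, momentum=0.9):
    """Exponential moving average of the mean teacher output (simplified to one image)."""
    k = len(center)
    mean = [sum(v[i] for v in teacher_views) / len(teacher_views) for i in range(k)]
    return [momentum * c + (1.0 - momentum) * m for c, m in zip(center, mean)]

# Toy usage: 2 teacher (global) views, 3 student views, 3 output dimensions.
teacher_views = [[2.0, 0.0, 0.0], [2.0, 0.0, 0.0]]
student_agrees = [[2.0, 0.0, 0.0]] * 3
student_disagrees = [[0.0, 2.0, 0.0]] * 3
center = [0.0, 0.0, 0.0]

print(dino_loss(student_agrees, teacher_views, center))
print(dino_loss(student_disagrees, teacher_views, center))
center = update_center(center, teacher_views)
```

The first printed loss should come out much smaller than the second, since the student is only rewarded for matching the teacher's sharpened distribution on a different view of the same image.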
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💰 BECOME A PATREON OF THE AI EPIPHANY ❤️
If these videos, GitHub projects, and blogs help you,
consider helping me out by supporting me on Patreon!
The AI Epiphany ► https://www.patreon.com/theaiepiphany
One-time donation:
https://www.paypal.com/paypalme/theaiepiphany
Much love! ❤️
Huge thank you to these AI Epiphany patrons:
Eli Mahler
Petar Veličković
Zvonimir Sabljic
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💡 The AI Epiphany is a channel dedicated to simplifying the field of AI using creative visualizations and, in general, a stronger focus on geometrical and visual intuition, rather than the algebraic and numerical "intuition".
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
CONNECT WITH ME ON SOCIAL
LinkedIn ► https://www.linkedin.com/in/aleksagordic/
Twitter ► https://twitter.com/gordic_