Google DeepMind's Deep Q-Learning & Superhuman Atari Gameplays | Two Minute Papers #27

Two Minute Papers · Beginner ·📐 ML Fundamentals ·10y ago
Google DeepMind implemented an artificial intelligence program using deep reinforcement learning that plays Atari games and improves itself to a superhuman level. The technique is called deep Q-learning, it uses a combination of deep neural networks and reinforcement learning, and it is capable of playing many Atari games as good or better than humans. After presenting their initial results with the algorithm, Google almost immediately acquired the company for several hundred million dollars, hence the name Google DeepMind. I am sure that this is one of the biggest triumphs of deep learning, especially given the fact that now the first few successful experiments for 3D games are out there! ________________________ The Nature paper "Human-level control through deep reinforcement learning" is available here: http://www.nature.com/nature/journal/v518/n7540/full/nature14236.html http://www.cs.swarthmore.edu/~meeden/cs63/s15/nature15b.pdf The code is available here: https://sites.google.com/a/deepmind.com/dqn/ Ilya Kuzovkin's fork with visualization: https://github.com/kuz/DeepMind-Atari-Deep-Q-Learner This configuration file will run Ilya Kuzovkin's version with less than 1GB of VRAM: http://cg.tuwien.ac.at/~zsolnai/wp/wp-content/uploads/2015/03/run_gpu Recommended for you: Artificial Neural Networks and Deep Learning - https://www.youtube.com/watch?v=rCWTOOgVXyE&list=PLujxSBD-JXgnqDD1n-V30pKtp6Q886x7e&index=13 Recurrent Neural Network Writes Sentences About Images - https://www.youtube.com/watch?v=e-WB4lfg30M&list=PLujxSBD-JXgnqDD1n-V30pKtp6Q886x7e&index=15 Deep Neural Network Learns Van Gogh's Art - https://www.youtube.com/watch?v=-R9bJGNHltQ&list=PLujxSBD-JXgnqDD1n-V30pKtp6Q886x7e&index=22 Terrain Traversal with Reinforcement Learning - https://www.youtube.com/watch?v=_yjHPu1aYCY&list=PLujxSBD-JXgnqDD1n-V30pKtp6Q886x7e&index=9 Subscribe if you would like to see more of these! - http://www.youtube.com/subscription_center?add_user=keeroyz The thumbnail was made
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Two Minute Papers · Two Minute Papers · 31 of 60

1 Fluid Simulations with Blender and Wavelet Turbulence | Two Minute Papers #1
Fluid Simulations with Blender and Wavelet Turbulence | Two Minute Papers #1
Two Minute Papers
2 Capturing Waves of Light With Femto-photography | Two Minute Papers #2
Capturing Waves of Light With Femto-photography | Two Minute Papers #2
Two Minute Papers
3 Artificial Neural Networks and Deep Learning | Two Minute Papers #3
Artificial Neural Networks and Deep Learning | Two Minute Papers #3
Two Minute Papers
4 Blender Rendering - Top 7 LuxRender Features
Blender Rendering - Top 7 LuxRender Features
Two Minute Papers
5 Simulating Breaking Glass | Two Minute Papers #4
Simulating Breaking Glass | Two Minute Papers #4
Two Minute Papers
6 Time Lapse Videos From Community Photos | Two Minute Papers #5
Time Lapse Videos From Community Photos | Two Minute Papers #5
Two Minute Papers
7 AI Learns Van Gogh's Art
AI Learns Van Gogh's Art
Two Minute Papers
8 Hydrographic Printing | Two Minute Papers #7
Hydrographic Printing | Two Minute Papers #7
Two Minute Papers
9 Announcing LuxRender 1.5
Announcing LuxRender 1.5
Two Minute Papers
10 Digital Creatures Learn To Walk | Two Minute Papers #8
Digital Creatures Learn To Walk | Two Minute Papers #8
Two Minute Papers
11 Manipulating Photorealistic Renderings | Two Minute Papers #9
Manipulating Photorealistic Renderings | Two Minute Papers #9
Two Minute Papers
12 Adaptive Fluid Simulations | Two Minute Papers #10
Adaptive Fluid Simulations | Two Minute Papers #10
Two Minute Papers
13 Building Bridges With Flying Machines | Two Minute Papers #11
Building Bridges With Flying Machines | Two Minute Papers #11
Two Minute Papers
14 Reconstructing Sound From Vibrations | Two Minute Papers #12
Reconstructing Sound From Vibrations | Two Minute Papers #12
Two Minute Papers
15 Creating Photographs Using Deep Learning | Two Minute Papers #13
Creating Photographs Using Deep Learning | Two Minute Papers #13
Two Minute Papers
16 Adaptive Cloth Simulations | Two Minute Papers #14
Adaptive Cloth Simulations | Two Minute Papers #14
Two Minute Papers
17 Synthesizing Sound From Collisions | Two Minute Papers #15
Synthesizing Sound From Collisions | Two Minute Papers #15
Two Minute Papers
18 Metropolis Light Transport | Two Minute Papers #16
Metropolis Light Transport | Two Minute Papers #16
Two Minute Papers
19 3D Printing a Glockenspiel | Two Minute Papers #17
3D Printing a Glockenspiel | Two Minute Papers #17
Two Minute Papers
20 Modeling Colliding and Merging Fluids | Two Minute Papers #18
Modeling Colliding and Merging Fluids | Two Minute Papers #18
Two Minute Papers
21 Recurrent Neural Network Writes Music and Shakespeare Novels | Two Minute Papers #19
Recurrent Neural Network Writes Music and Shakespeare Novels | Two Minute Papers #19
Two Minute Papers
22 Gradients, Poisson's Equation and Light Transport | Two Minute Papers #20
Gradients, Poisson's Equation and Light Transport | Two Minute Papers #20
Two Minute Papers
23 Real-Time Facial Expression Transfer | Two Minute Papers #21
Real-Time Facial Expression Transfer | Two Minute Papers #21
Two Minute Papers
24 Automatic Lecture Notes From Videos | Two Minute Papers #22
Automatic Lecture Notes From Videos | Two Minute Papers #22
Two Minute Papers
25 Be a Part of Two Minute Papers on Patreon!
Be a Part of Two Minute Papers on Patreon!
Two Minute Papers
26 Recurrent Neural Network Writes Sentences About Images | Two Minute Papers #23
Recurrent Neural Network Writes Sentences About Images | Two Minute Papers #23
Two Minute Papers
27 How Does Deep Learning Work? | Two Minute Papers #24
How Does Deep Learning Work? | Two Minute Papers #24
Two Minute Papers
28 Cryptography, Perfect Secrecy and One Time Pads | Two Minute Papers #25
Cryptography, Perfect Secrecy and One Time Pads | Two Minute Papers #25
Two Minute Papers
29 Terrain Traversal with Reinforcement Learning | Two Minute Papers #26
Terrain Traversal with Reinforcement Learning | Two Minute Papers #26
Two Minute Papers
30 Multiple-Scattering Microfacet BSDFs with the Smith Model
Multiple-Scattering Microfacet BSDFs with the Smith Model
Two Minute Papers
Google DeepMind's Deep Q-Learning & Superhuman Atari Gameplays | Two Minute Papers #27
Google DeepMind's Deep Q-Learning & Superhuman Atari Gameplays | Two Minute Papers #27
Two Minute Papers
32 Are We Living In a Computer Simulation? | Two Minute Papers #28
Are We Living In a Computer Simulation? | Two Minute Papers #28
Two Minute Papers
33 Artificial Superintelligence [Audio only] | Two Minute Papers #29
Artificial Superintelligence [Audio only] | Two Minute Papers #29
Two Minute Papers
34 Automatic Parameter Control for Metropolis Light Transport | Two Minute Papers #30
Automatic Parameter Control for Metropolis Light Transport | Two Minute Papers #30
Two Minute Papers
35 Randomness and Bell's Inequality [Audio only] | Two Minute Papers #31
Randomness and Bell's Inequality [Audio only] | Two Minute Papers #31
Two Minute Papers
36 OpenAI - Non-profit AI company by Elon Musk and Sam Altman
OpenAI - Non-profit AI company by Elon Musk and Sam Altman
Two Minute Papers
37 How Do Genetic Algorithms Work? | Two Minute Papers #32
How Do Genetic Algorithms Work? | Two Minute Papers #32
Two Minute Papers
38 Painting with Fluid Simulations | Two Minute Papers #33
Painting with Fluid Simulations | Two Minute Papers #33
Two Minute Papers
39 Peer Review #1 [Audio only] | Two Minute Papers
Peer Review #1 [Audio only] | Two Minute Papers
Two Minute Papers
40 Neural Programmer-Interpreters Learn To Write Programs | Two Minute Papers #34
Neural Programmer-Interpreters Learn To Write Programs | Two Minute Papers #34
Two Minute Papers
41 9 Cool Deep Learning Applications | Two Minute Papers #35
9 Cool Deep Learning Applications | Two Minute Papers #35
Two Minute Papers
42 Designing Cities and Furnitures With Machine Learning | Two Minute Papers #36
Designing Cities and Furnitures With Machine Learning | Two Minute Papers #36
Two Minute Papers
43 Designing 3D Printable Robotic Creatures | Two Minute Papers #37
Designing 3D Printable Robotic Creatures | Two Minute Papers #37
Two Minute Papers
44 3D Printing Objects With Caustics | Two Minute Papers #38
3D Printing Objects With Caustics | Two Minute Papers #38
Two Minute Papers
45 Interactive Editing of Subsurface Scattering | Two Minute Papers #39
Interactive Editing of Subsurface Scattering | Two Minute Papers #39
Two Minute Papers
46 Simulating Viscosity and Melting Fluids | Two Minute Papers #40
Simulating Viscosity and Melting Fluids | Two Minute Papers #40
Two Minute Papers
47 What Do Virtual Objects Sound Like? | Two Minute Papers #41
What Do Virtual Objects Sound Like? | Two Minute Papers #41
Two Minute Papers
48 How DeepMind Conquered Go With Deep Learning (AlphaGo) | Two Minute Papers #42
How DeepMind Conquered Go With Deep Learning (AlphaGo) | Two Minute Papers #42
Two Minute Papers
49 Breaking Deep Learning Systems With Adversarial Examples | Two Minute Papers #43
Breaking Deep Learning Systems With Adversarial Examples | Two Minute Papers #43
Two Minute Papers
50 Extrapolations and Crowdfunded Research (Experiment) | Two Minute Papers #44
Extrapolations and Crowdfunded Research (Experiment) | Two Minute Papers #44
Two Minute Papers
51 Biophysical Skin Aging Simulations | Two Minute Papers #45
Biophysical Skin Aging Simulations | Two Minute Papers #45
Two Minute Papers
52 What is Impostor Syndrome? | Two Minute Papers #46
What is Impostor Syndrome? | Two Minute Papers #46
Two Minute Papers
53 Should You Take the Stairs at Work? (For Weight Loss) | Two Minute Papers #47
Should You Take the Stairs at Work? (For Weight Loss) | Two Minute Papers #47
Two Minute Papers
54 Artistic Manipulation of Caustics | Two Minute Papers #48
Artistic Manipulation of Caustics | Two Minute Papers #48
Two Minute Papers
55 Deep Learning Program Learns to Paint | Two Minute Papers #49
Deep Learning Program Learns to Paint | Two Minute Papers #49
Two Minute Papers
56 Interactive Photo Recoloring | Two Minute Papers #50
Interactive Photo Recoloring | Two Minute Papers #50
Two Minute Papers
57 How To Get Started With Machine Learning? | Two Minute Papers #51
How To Get Started With Machine Learning? | Two Minute Papers #51
Two Minute Papers
58 Awesome Research For Everyone! - Two Minute Papers Channel Trailer
Awesome Research For Everyone! - Two Minute Papers Channel Trailer
Two Minute Papers
59 10 More Cool Deep Learning Applications | Two Minute Papers #52
10 More Cool Deep Learning Applications | Two Minute Papers #52
Two Minute Papers
60 How DeepMind's AlphaGo Defeated Lee Sedol | Two Minute Papers #53
How DeepMind's AlphaGo Defeated Lee Sedol | Two Minute Papers #53
Two Minute Papers

Related AI Lessons

🌸 Iris Classifier ML Pipeline — Complete Tutorial & Instructions Manual
Build a complete Iris Classifier ML pipeline using Python and scikit-learn, and learn how to train and deploy a machine learning model
Dev.to · Aniket Singh
I Spent a Full Day Debugging This Python Error — Here's What Fixed It in 30 Seconds
A Python beginner's embarrassing debugging story teaches a valuable lesson about paying attention to detail in code
Dev.to · Prashik besekar
Gradient Descent: How AI Learns
Learn how AI learns through gradient descent, a key concept in machine learning, and understand its application in optimizing functions
Dev.to · Akhilesh
Derivatives: Understanding Change
Learn how derivatives help adjust model weights to reduce loss in machine learning predictions
Dev.to · Akhilesh
Up next
Advanced Algorithms, Dynamic Programming & Graph Algorithms
Coursera
Watch →