George Hotz | Programming | can we fit a LLaMA inside a tinygrad? | $1499 comma.ai/shop/comma-three
Date of stream 08 Mar 2023.
from $1499 buy https://comma.ai/shop/comma-three
Live-stream chat added as Subtitles/CC - English (Twitch Chat) - three-dot menu icon - Show transcript
Source files:
- https://github.com/geohot/tinygrad/blob/llama/examples/llama.py
- https://github.com/geohot/tinygrad/tree/llama
- https://github.com/geohot/tinygrad
Follow for notifications:
- https://twitch.tv/georgehotz
Support George:
- https://twitch.tv/subs/georgehotz
Programming playlist:
- https://www.youtube.com/playlist?list=PLzFUMGbVxlQs5s-LNAyKgcq5SL28ZLLKC
Chapters:
00:00:00 intro
00:01:30 guitar tuning and playing
00:04:30 giving advice to elon, woke
00:07:25 llama, trip to india, bengaluru, bike to mumbai
00:08:40 when software was cool
00:11:20 llama, privilege
00:13:45 discord ban, pressure, power
00:16:05 prime, blue crown, money
00:18:00 tiny corp, making value, food
00:22:25 capitalism
00:23:50 facebook LLaMA, apply for access, torrent
00:25:10 ai safety, enjoy the decline
00:28:00 deepmind, google, rants
00:28:40 thinking for yourself, jesus, george is christian, god
00:30:30 a progression to christian
00:31:50 big text, copilot
00:35:10 running on gpu m1
00:39:30 using tinygrad tiny ram usage, float16
00:46:00 METAL=1
00:47:25 numpy size of datatype
00:49:12 interesting, np.frombuffer
00:52:10 problem raw buffers to have a datatype?
00:52:55 python read into buffer, not making copies = fast
00:59:40 loading from disk, making it lazy
01:00:50 tinygrad is the future, replacing programing with dsp paradigm
01:02:17 tinygrad has grown
01:03:50 metal buffer, float16 hard
01:05:20 interview question how big is a float
01:06:40 making metal buffer, supporting dtypes
01:09:40 tinygrad recognize llama
01:10:25 facebook llama arxiv, unbiased model
01:13:25 class Attention, FeedForward
01:21:40 washed heads
01:24:00 class RMSNorm
01:30:25 testing inference
01:35:30 error and slow loading because torch
01:39:00 wasting 0.6 seconds
01:40:45 how transformers work, Linear
01:47:2
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from george hotz archive · george hotz archive · 0 of 60
← Previous
Next →
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
comma ai Driving to self racing cars with openpilot
george hotz archive
comma ai Still driving
george hotz archive
comma ai was live
george hotz archive
comma ai Going home
george hotz archive
comma ai We go to the airport
george hotz archive
comma ai Reversing Prius with cabana + panda telethon!
george hotz archive
comma ai panda manufacturing!
george hotz archive
comma ai Self driving to Best Buy
george hotz archive
comma ai shilling for giraffe!
george hotz archive
comma ai Toyota Prius Driving!!!
george hotz archive
comma ai Late night civic driving
george hotz archive
comma ai Toyota giraffe shilling
george hotz archive
comma ai Live car hacking with panda this time or bust!
george hotz archive
comma ai Product launch question time
george hotz archive
comma ai Driving with the RAV4, launching Tuesday!
george hotz archive
comma ai giraffe ship o' clock
george hotz archive
comma ai openpilot 0.3.9
george hotz archive
comma ai EON assembly!
george hotz archive
comma ai Going through the GM investor deck
george hotz archive
comma ai I love my EON
george hotz archive
comma ai RAV4 driving
george hotz archive
comma ai Shilling at the holiday party
george hotz archive
comma ai EON shipping party
george hotz archive
comma ai EON unboxing!
george hotz archive
comma ai The very straight roads of Nevada
george hotz archive
comma ai Starting our trip with openpilot 0.4
george hotz archive
comma ai Little EON on the prairie
george hotz archive
comma ai The urban sprawl of Colorado
george hotz archive
comma ai Onward to Omaha
george hotz archive
comma ai nothing, nowhere
george hotz archive
comma ai shop.comma.ai Buy things!!!
george hotz archive
comma ai The youth are woke
george hotz archive
comma ai Photo shoot!
george hotz archive
comma ai Product announcements are LIT!
george hotz archive
comma ai Breaking down hype of CES
george hotz archive
comma ai Salt Lakes Everywhere!
george hotz archive
comma ai This is the last one
george hotz archive
comma ai Corolla port o’clock!
george hotz archive
comma ai Presentation where it’s like you are in Omaha with us
george hotz archive
comma ai Asking the scopies the banned question
george hotz archive
comma ai Driving in the Corolla!
george hotz archive
comma ai We got new products! shop.comma.ai
george hotz archive
comma ai Sunday w scopies!
george hotz archive
comma ai Our first Lexus, the Lexus RX!
george hotz archive
comma ai Scopie saturday!
george hotz archive
comma ai Panda!
george hotz archive
comma ai Scopie Sunday! *NOT CLICKBAIT*
george hotz archive
comma ai comma Tree!
george hotz archive
comma ai Scopie Saturday
george hotz archive
comma ai Ok scopie Friday
george hotz archive
comma ai comma pedal!
george hotz archive
comma ai okay this time comma pedal!
george hotz archive
comma ai Why aren’t car companies good
george hotz archive
comma ai How can driving be better
george hotz archive
comma ai Scopie Sunday
george hotz archive
comma ai comma got a new car!
george hotz archive
comma ai Mapping Sunday!
george hotz archive
comma ai Let’s go buy a car
george hotz archive
comma ai Ok I take back all the bad things I said about Ford
george hotz archive
comma ai comma smays are in stock!
george hotz archive
Related AI Lessons
⚡
⚡
⚡
⚡
When Feelings Need a Graph How SurrealDB Became the Heart of Our Mental Wellness #SurrealDB #MongoDB #MentalHealthAI #MultiModal
Dev.to · BAPANAPALLI PRANEETA
I Tried Harder With AI , But It Only Worked When I Simplified My Thinking
Medium · AI
Why OpenAI Privacy Filter feels like real AI infrastructure
Medium · LLM
GPT-5.5 vs Claude Opus vs Gemini — real benchmark breakdown
Dev.to AI
Chapters (38)
intro
1:30
guitar tuning and playing
4:30
giving advice to elon, woke
7:25
llama, trip to india, bengaluru, bike to mumbai
8:40
when software was cool
11:20
llama, privilege
13:45
discord ban, pressure, power
16:05
prime, blue crown, money
18:00
tiny corp, making value, food
22:25
capitalism
23:50
facebook LLaMA, apply for access, torrent
25:10
ai safety, enjoy the decline
28:00
deepmind, google, rants
28:40
thinking for yourself, jesus, george is christian, god
30:30
a progression to christian
31:50
big text, copilot
35:10
running on gpu m1
39:30
using tinygrad tiny ram usage, float16
46:00
METAL=1
47:25
numpy size of datatype
49:12
interesting, np.frombuffer
52:10
problem raw buffers to have a datatype?
52:55
python read into buffer, not making copies = fast
59:40
loading from disk, making it lazy
1:00:50
tinygrad is the future, replacing programing with dsp paradigm
1:02:17
tinygrad has grown
1:03:50
metal buffer, float16 hard
1:05:20
interview question how big is a float
1:06:40
making metal buffer, supporting dtypes
1:09:40
tinygrad recognize llama
1:10:25
facebook llama arxiv, unbiased model
1:13:25
class Attention, FeedForward
1:21:40
washed heads
1:24:00
class RMSNorm
1:30:25
testing inference
1:35:30
error and slow loading because torch
1:39:00
wasting 0.6 seconds
1:40:45
how transformers work, Linear
🎓
Tutor Explanation
DeepCamp AI