📰 Hacker News · gpjt
Articles from Hacker News · gpjt · 5 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (10147)
ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
Hacker News · gpjt
4mo ago
LLM from scratch, part 28 – training a base model from scratch on an RTX 3090
LLM from scratch, part 28 – training a base model from scratch on an RTX 3090. 121 comments, 540 points on Hacker News.
Hacker News · gpjt
5mo ago
Writing an LLM from scratch, part 22 – training our LLM
Writing an LLM from scratch, part 22 – training our LLM. 10 comments, 254 points on Hacker News.
Hacker News · gpjt
7mo ago
The maths you need to start understanding LLMs
The maths you need to start understanding LLMs. 120 comments, 616 points on Hacker News.
Hacker News · gpjt
11mo ago
Writing an LLM from scratch, part 13 – attention heads are dumb
Writing an LLM from scratch, part 13 – attention heads are dumb. 67 comments, 351 points on Hacker News.
DeepCamp AI