Writing an LLM from scratch, part 8 – trainable self-attention

📰 Hacker News · gpjt

Writing an LLM from scratch, part 8 – trainable self-attention. 31 comments, 380 points on Hacker News.

Published 5 Mar 2025
Read full article → ← Back to Reads