Writing an LLM from scratch, part 8 – trainable self-attention
📰 Hacker News · gpjt
Writing an LLM from scratch, part 8 – trainable self-attention. 31 comments, 380 points on Hacker News.