Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs
📰 Hacker News · cpldcpu
Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs. 53 comments, 230 points on Hacker News.