Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs

📰 Hacker News · cpldcpu

53 comments and 230 points on Hacker News.

Published 4 May 2025