» Front Page » Permalink » Source ↑ 1 ↓ Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs