Lossless LLM compression for efficient GPU inference via dynamic-length float
Article URL: https://arxiv.org/abs/2504.11651
Comments URL: https://news.ycombinator.com/item?id=43796935
Points: 282
# Comments: 94
Comments