ML Anthology
Authors
Search
About
Hariri, Mohsen
1 publications
NeurIPS
2025
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11)
Tianyi Zhang
,
Mohsen Hariri
,
Shaochen Zhong
,
Vipin Chaudhary
,
Yang Sui
,
Xia Hu
,
Anshumali Shrivastava