Yang, Lita

2 publications

NeurIPSW 2024 Enabling On-Device Large Language Models with 3D-Stacked Memory Lita Yang, Kavya Sreedhar, Huichu Liu, Edith Beigne
NeurIPS 2020 Pushing the Limits of Narrow Precision Inferencing at Cloud Scale with Microsoft Floating Point Bita Darvish Rouhani, Daniel Lo, Ritchie Zhao, Ming Liu, Jeremy Fowers, Kalin Ovtcharov, Anna Vinogradsky, Sarah Massengill, Lita Yang, Ray Bittner, Alessandro Forin, Haishan Zhu, Taesik Na, Prerak Patel, Shuai Che, Lok Chand Koppaka, Xia Song, Subhojit Som, Kaustav Das, Saurabh T, Steve Reinhardt, Sitaram Lanka, Eric Chung, Doug Burger