Yao, Zhewei
23 publications
ICLRW
2025
ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration
AAAI
2024
Exploring Post-Training Quantization in LLMs from Comprehensive Study to Low Rank Compensation
NeurIPS
2024
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
ICML
2022
DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale