Ahia, Orevaoghene

2 publications

NeurIPS 2025 Broken Tokens? Your Language Model Can Secretly Handle Non-Canonical Tokenizations Brian Siyuan Zheng, Alisa Liu, Orevaoghene Ahia, Jonathan Hayase, Yejin Choi, Noah A. Smith
NeurIPS 2024 MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization Orevaoghene Ahia, Sachin Kumar, Hila Gonen, Valentin Hofmann, Tomasz Limisiewicz, Yulia Tsvetkov, Noah A. Smith