Wang, Shiqiang
29 publications
ICLR
2026
GneissWeb: Preparing High Quality Data for LLMs at Scale
Hajar Emami Gohari, Swanand Ravindra Kadhe, Yousaf Shah, Constantin M Adam, Abdulhamid Adebayo, Praneet Adusumilli, Farhan Ahmed, Nathalie Baracaldo, Santosh Subhashrao Borse, Yuan-Chi Chang, Xuan-Hong Dang, Nirmit Desai, Revital Eres, Ran Iwamoto, Alexei A. Karve, Yan Koyfman, Wei-Han Lee, Changchang Liu, Boris Lublinsky, Takuya Ohko, Pablo Pesce, Maroun Touma, Shiqiang Wang, Shalisha Witherspooon, Herbert Woisetschläger, David Wood, Kun-Lung Wu, Issei Yoshida, Syed Zawad, Petros Zerfos, Yi Zhou, Bishwaranjan Bhattacharjee