Hsieh, Cheng-Ping

1 publications

ICLR 2025 nGPT: Normalized Transformer with Representation Learning on the Hypersphere Ilya Loshchilov, Cheng-Ping Hsieh, Simeng Sun, Boris Ginsburg