Gnaneshwar, Dwaraknath

6 publications

ICLR 2025 Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models Laura Ruis, Maximilian Mozes, Juhan Bae, Siddhartha Rao Kamalakara, Dwaraknath Gnaneshwar, Acyr Locatelli, Robert Kirk, Tim Rocktäschel, Edward Grefenstette, Max Bartolo
NeurIPS 2025 Rope to Nope and Back Again: A New Hybrid Attention Strategy Bowen Yang, Bharat Venkitesh, Dwaraknath Gnaneshwar, Hangyu Lin, David Cairuz, Phil Blunsom, Acyr Locatelli
NeurIPS 2024 BAM! Just like That: Simple and Efficient Parameter Upcycling for Mixture of Experts Qizhen Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo, David Cairuz, Bharat Venkitesh, Jakob Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Üstün, Acyr Locatelli
ICMLW 2024 BAM! Just like That: Simple and Efficient Parameter Upcycling for Mixture of Experts Qizhen Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo, David Cairuz, Bharat Venkitesh, Jakob Nicolaus Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Üstün, Acyr Locatelli
NeurIPSW 2020 Know Where to Drop Your Weights: Towards Faster Uncertainty Estimation Akshatha Kamath, Dwaraknath Gnaneshwar, Matias Valdenegro-Toro
AAAI 2020 Leveraging BERT with Mixup for Sentence Classification (Student Abstract) Amit Jindal, Dwaraknath Gnaneshwar, Ramit Sawhney, Rajiv Ratn Shah