ML Anthology
Singh, Sidak Pal
22 publications
ICML
2025
Avoiding Spurious Sharpness Minimization Broadens Applicability of SAM
Sidak Pal Singh, Hossein Mobahi, Atish Agarwala, Yann Dauphin
NeurIPS
2025
Generalized Linear Mode Connectivity for Transformers
Alexander Theus, Alessandro Cabodi, Sotiris Anagnostidis, Antonio Orvieto, Sidak Pal Singh, Valentina Boeva
ICLR
2025
The Directionality of Optimization Trajectories in Neural Networks
Sidak Pal Singh, Bobby He, Thomas Hofmann, Bernhard Schölkopf
ICLR
2025
What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis
Weronika Ormaniec, Felix Dangel, Sidak Pal Singh
ICMLW
2024
Closed Form of the Hessian Spectrum for Some Neural Networks
Sidak Pal Singh, Thomas Hofmann
ICMLW
2024
Hallmarks of Optimization Trajectories in Neural Networks and LLMs: Directional Exploration and Redundancy
Sidak Pal Singh, Bobby He, Thomas Hofmann, Bernhard Schölkopf
ICMLW
2024
Landscaping Linear Mode Connectivity
Sidak Pal Singh, Linara Adilova, Michael Kamp, Asja Fischer, Bernhard Schölkopf, Thomas Hofmann
CoLLAs
2024
Local vs Global Continual Learning
Giulia Lanzillotta, Sidak Pal Singh, Benjamin F Grewe, Thomas Hofmann
AAAI
2024
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers (Student Abstract)
Danilo Dordevic, Vukasin Bozic, Joseph Thommes, Daniele Coppola, Sidak Pal Singh
ICLR
2024
Some Fundamental Aspects About Lipschitz Continuity of Neural Networks
Grigory Khromov, Sidak Pal Singh
NeurIPS
2024
Theoretical Characterisation of the Gauss Newton Conditioning in Neural Networks
Jim Zhao, Sidak Pal Singh, Aurelien Lucchi
ICLR
2024
Towards Meta-Pruning via Optimal Transport
Alexander Theus, Olin Geimer, Friedrich Wicke, Thomas Hofmann, Sotiris Anagnostidis, Sidak Pal Singh
ICLR
2024
Transformer Fusion with Optimal Transport
Moritz Imfeld, Jacopo Graldi, Marco Giordano, Thomas Hofmann, Sotiris Anagnostidis, Sidak Pal Singh
NeurIPSW
2023
Escaping Random Teacher Initialization Enhances Signal Propagation and Representation
Felix Sarnthein, Sidak Pal Singh, Antonio Orvieto, Thomas Hofmann
ICML
2023
The Hessian Perspective into the Nature of Convolutional Neural Networks
Sidak Pal Singh, Thomas Hofmann, Bernhard Schölkopf
ICLR
2022
Phenomenology of Double Descent in Finite-Width Neural Networks
Sidak Pal Singh, Aurelien Lucchi, Thomas Hofmann, Bernhard Schölkopf
NeurIPS
2022
Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse
Lorenzo Noci, Sotiris Anagnostidis, Luca Biggio, Antonio Orvieto, Sidak Pal Singh, Aurelien Lucchi
NeurIPS
2021
Analytic Insights into Structure and Rank of Neural Network Hessian Maps
Sidak Pal Singh, Gregor Bachmann, Thomas Hofmann
AISTATS
2020
Context Mover’s Distance & Barycenters: Optimal Transport of Contexts for Building Representations
Sidak Pal Singh, Andreas Hug, Aymeric Dieuleveut, Martin Jaggi
NeurIPS
2020
Model Fusion via Optimal Transport
Sidak Pal Singh, Martin Jaggi
NeurIPS
2020
WoodFisher: Efficient Second-Order Approximation for Neural Network Compression
Sidak Pal Singh, Dan Alistarh
ICLRW
2019
Context Mover's Distance & Barycenters: Optimal Transport of Contexts for Building Representations
Sidak Pal Singh, Andreas Hug, Aymeric Dieuleveut, Martin Jaggi