International Conference on Machine Learning (ICML)
Enhancing LLM Training via Spectral Clipping
Conference on Learning Theory (COLT)
On the Stability of Nonlinear Dynamics in GD and SGD: Beyond Quadratic Potentials
International Conference on Artificial Intelligence and Statistics (AISTATS)
Accelerated Distributed Optimization with Compression and Error Feedback
International Conference on Artificial Intelligence and Statistics (AISTATS)
Accelerated Distributed Optimization with Compression and Error Feedback
International Conference on Learning Representations (ICLR)
Composite Optimization with Error Feedback: the Dual Averaging Approach