Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling

Published in NeurIPS 2024 (CCF-A), 2024

Shuaipeng Li, Penghao Zhao, Hailin Zhang, Samm Sun, Hao Wu, Dian Jiao, Weiyan Wang, Chengjun Liu, Zheng Fang, Jinbao Xue, Yangyu Tao, Bin Cui, Di Wang

NeurIPS 2024 (CCF-A)

Recommended citation: Shuaipeng Li, Penghao Zhao, Hailin Zhang, Samm Sun, Hao Wu, Dian Jiao, Weiyan Wang, Chengjun Liu, Zheng Fang, Jinbao Xue, Yangyu Tao, Bin Cui, Di Wang. "Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling." NeurIPS 2024.