TY - GEN
T1 - Swalp
T2 - 36th International Conference on Machine Learning, ICML 2019
AU - Yang, Guandao
AU - Zhang, Tianyi
AU - Kirichenko, Polina
AU - Bai, Junwen
AU - Wilson, Andrew Gordon
AU - de Sa, Christopher
N1 - Funding Information:
Polina Kirichenko and Andrew Gordon Wilson were supported by NSF IIS-1563887, an Amazon Research Award, and Facebook Research Award. We thank Google Cloud Platform Research Credits program for providing computational resources.
Publisher Copyright:
Copyright © 2019 ASME
PY - 2019
Y1 - 2019
N2 - Low precision operations can provide scalability, memory savings, portability, and energy efficiency. This paper proposes SWALP, an approach to low precision training that averages low-precision SGD iterates with a modified learning rate schedule. SWALP is easy to implement and can match the performance of full-precision SGD even with all numbers quantized down to 8 bits, including the gradient accumulators. Additionally, we show that SWALP converges arbitrarily close to the optimal solution for quadratic objectives, and to a noise ball asymptotically smaller than low precision SGD in strongly convex settings.
AB - Low precision operations can provide scalability, memory savings, portability, and energy efficiency. This paper proposes SWALP, an approach to low precision training that averages low-precision SGD iterates with a modified learning rate schedule. SWALP is easy to implement and can match the performance of full-precision SGD even with all numbers quantized down to 8 bits, including the gradient accumulators. Additionally, we show that SWALP converges arbitrarily close to the optimal solution for quadratic objectives, and to a noise ball asymptotically smaller than low precision SGD in strongly convex settings.
UR - http://www.scopus.com/inward/record.url?scp=85078283689&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85078283689&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85078283689
T3 - 36th International Conference on Machine Learning, ICML 2019
SP - 12125
EP - 12151
BT - 36th International Conference on Machine Learning, ICML 2019
PB - International Machine Learning Society (IMLS)
Y2 - 9 June 2019 through 15 June 2019
ER -