TY - JOUR
T1 - Rayleigh-Gauss-Newton optimization with enhanced sampling for variational Monte Carlo
AU - Webber, Robert J.
AU - Lindsey, Michael
N1 - Funding Information:
R.J.W. and M.L. would like to acknowledge helpful conversations with Timothy Berkelbach, Aaron Dinner, Sam Greene, Lin Lin, Verena Neufeld, James Smith, Jonathan Siegel, Erik Thiede, Jonathan Weare, and Huan Zhang. R.J.W. is supported by New York University's Dean's Dissertation Fellowship and by the National Science Foundation through Award No. DMS-1646339. M.L. is supported by the National Science Foundation under Award No. DMS-1903031. The authors acknowledge support from the Advanced Scientific Computing Research Program within the DOE Office of Science through Award No. DE-SC0020427. Computing resources were provided by New York University's High Performance Computing.
Publisher Copyright:
© 2022 authors. Published by the American Physical Society.
PY - 2022/7
Y1 - 2022/7
N2 - Variational Monte Carlo (VMC) is an approach for computing ground-state wave functions that has recently become more powerful due to the introduction of neural network-based wave-function parametrizations. However, efficiently training neural wave functions to converge to an energy minimum remains a difficult problem. In this work, we analyze optimization and sampling methods used in VMC and introduce alterations to improve their performance. First, based on theoretical convergence analysis in a noiseless setting, we motivate a new optimizer that we call the Rayleigh-Gauss-Newton method, which can improve upon gradient descent and natural gradient descent to achieve superlinear convergence at no more than twice the computational cost. Second, to realize this favorable comparison in the presence of stochastic noise, we analyze the effect of sampling error on VMC parameter updates and experimentally demonstrate that it can be reduced by the parallel tempering method. In particular, we demonstrate that RGN can be made robust to energy spikes that occur when the sampler moves between metastable regions of configuration space. Finally, putting theory into practice, we apply our enhanced optimization and sampling methods to the transverse-field Ising and XXZ models on large lattices, yielding ground-state energy estimates with remarkably high accuracy after just 200 parameter updates.
AB - Variational Monte Carlo (VMC) is an approach for computing ground-state wave functions that has recently become more powerful due to the introduction of neural network-based wave-function parametrizations. However, efficiently training neural wave functions to converge to an energy minimum remains a difficult problem. In this work, we analyze optimization and sampling methods used in VMC and introduce alterations to improve their performance. First, based on theoretical convergence analysis in a noiseless setting, we motivate a new optimizer that we call the Rayleigh-Gauss-Newton method, which can improve upon gradient descent and natural gradient descent to achieve superlinear convergence at no more than twice the computational cost. Second, to realize this favorable comparison in the presence of stochastic noise, we analyze the effect of sampling error on VMC parameter updates and experimentally demonstrate that it can be reduced by the parallel tempering method. In particular, we demonstrate that RGN can be made robust to energy spikes that occur when the sampler moves between metastable regions of configuration space. Finally, putting theory into practice, we apply our enhanced optimization and sampling methods to the transverse-field Ising and XXZ models on large lattices, yielding ground-state energy estimates with remarkably high accuracy after just 200 parameter updates.
UR - http://www.scopus.com/inward/record.url?scp=85135918270&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85135918270&partnerID=8YFLogxK
U2 - 10.1103/PhysRevResearch.4.033099
DO - 10.1103/PhysRevResearch.4.033099
M3 - Article
AN - SCOPUS:85135918270
SN - 2643-1564
VL - 4
JO - Physical Review Research
JF - Physical Review Research
IS - 3
M1 - 033099
ER -