PIRSA:26020043

Double descent: When do neural quantum states generalize?

APA

Moss, M.S. (2026). Double descent: When do neural quantum states generalize?. Perimeter Institute. https://pirsa.org/26020043

MLA

Moss, Megan Schuyler. Double descent: When do neural quantum states generalize?. Perimeter Institute, Feb. 02, 2026, https://pirsa.org/26020043

BibTex

          @misc{ pirsa_PIRSA:26020043,
            doi = {10.48660/26020043},
            url = {https://pirsa.org/26020043},
            author = {Moss, Megan Schuyler},
            keywords = {Other},
            language = {en},
            title = {Double descent: When do neural quantum states generalize?},
            publisher = {Perimeter Institute},
            year = {2026},
            month = {feb},
            note = {PIRSA:26020043 see, \url{https://pirsa.org}}
          }
          

Megan Schuyler Moss Perimeter Institute for Theoretical Physics

Talk numberPIRSA:26020043
Talk Type Scientific Series
Subject

Abstract

Neural quantum states (NQS) provide flexible and compact wavefunction parameterizations for numerical studies of quantum many-body physics. In particular, NQS aim to circumvent the exponential scaling of the Hilbert space by compressing quantum many-body wavefunctions with a tractable amount of parameters.
While inspired by deep learning, it remains unclear to what extent NQS share characteristics with neural networks used for standard machine learning tasks. We demonstrate that, in a simplified supervised setting, NQS exhibit the double descent phenomenon, a key feature of modern deep learning, where generalization worsens as network size increases before improving again in an overparameterized regime. Notably, we find the second descent to occur only for network sizes much larger than the Hilbert space dimension, i.e. network sizes that are out of reach for problems of practical interest. Within our setting, this observation places typical NQS in the underparameterized regime.
We also observe that the optimal network size in the underparameterized regime depends on the number of unique training samples. While the double descent phenomenon does indeed translate to the NQS setting, potential practical consequences of our findings point more towards the need for symmetry-aware, physics-informed architecture design, rather than directly adopting machine learning heuristics.