Accurate nuclear quantum statistics on machine-learned classical effective potentials
Abstract
The contribution of nuclear quantum effects (NQEs) to the properties of various hydrogen-bound systems, including biomolecules, is increasingly recognized. Despite the development of many acceleration techniques, the computational overhead of incorporating NQEs in complex systems is sizable, particularly at low temperatures. In this work, we leverage deep learning and multiscale coarse-graining techniques to mitigate the computational burden of path integral molecular dynamics (PIMD). Specifically, we employ a machine-learned potential to accurately represent corrections to classical potentials, thereby significantly reducing the computational cost of simulating NQEs. We validate our approach using four distinct systems: Morse potential, Zundel cation, single water molecule, and bulk water. Our framework allows us to accurately compute position-dependent static properties, as demonstrated by the excellent agreement obtained between the machine-learned potential and computationally intensive PIMD calculations, even in the presence of strong NQEs. This approach opens the way to the development of transferable machine-learned potentials capable of accurately reproducing NQEs in a wide range of molecular systems.
I Introduction
Molecular dynamics (MD) simulations provide insight into phenomena at atomic resolution and are an invaluable tool in chemistry, biophysics, and material science.
Recent advances in force field [1, 2, 3], algorithms [4] and hardware development [5] allow for increasingly higher accuracy and a broader scope of application of molecular dynamics simulations.
Traditional MD simulations rely on the classical evolution of nuclear positions, neglecting an explicit treatment of so-called quantum nuclear effects (NQEs) such as zero-point fluctuations, tunneling phenomena, and isotope effects.
This simplification has a historical justification as the parameters of classical empirical force field simulations are fit to reproduce (quantum) experimental data.
Hence, NQEs are effectively included in the classical potential energy surface, at least partially, albeit with limited transferability.
On the other hand, bottom-up ab initio MD methods or machine learning interatomic potentials (MLIPs), based on the Born-Oppenheimer approximation [6], only account for the quantization of electrons.
Hence, these simulations require an explicit treatment of quantum nuclear motion on the Born-Oppenheimer potential energy surface (PES) in theory.
Particularly when accurate electronic structure methods are used to estimate the Born-Oppenheimer PES, the neglect of quantum nuclear motion may be the largest source of error for obtaining quantitative agreement with experiments [7, 8, 9, 10].
Unfortunately, NQEs are typically neglected due to their significant computational overhead or because they are assumed to have small effects.
However, even at ambient temperatures, NQEs significantly affect properties of hydrogen-bounded systems, e.g. static and dynamic properties of water [11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23], free energies of hydrogen bonding and proton transfer [24, 25, 26, 27, 28, 29], stability of molecular crystals [30, 31], strength of intermolecular interactions [32, 9], and enzyme activity [33, 19].
The standard condensed-phase simulation technique for incorporating NQEs in equilibrium properties is Feynman’s imaginary time path integral (PI) formalism [34].
In this framework, a system’s quantum statistical mechanics is mapped onto the classical statistical mechanics of a ring polymer, which consists of multiple replicas of the system connected by harmonic interactions that depend on temperature and mass.
The effective classical PI partition function of the ring polymer system can be sampled using MD or Monte Carlo techniques, yielding the PIMD [35] and PI Monte Carlo [36] methods.
In theory, PI methods incorporate quantum statistics exactly with classical sampling, in the limit of the number of replicas going to infinity [37].
However, typically, the required number of replicas to converge structural properties, and thus the increased computational cost compared to classical MD, rises with increasing physical frequencies or lowering temperatures [38].
As a result, most molecular systems require replicas at room temperature, which increases proportionally with lowering temperature, such as over replicas to obtain converged energies for molecular systems at cryogenic temperatures [39].
Over the last decades, several systematically improvable reduced-cost PI approaches have been proposed.
These include high-order factorization of the Boltzmann operator [40, 41, 42, 43, 44, 45], , perturbative expansion of the Boltzmann operator [46, 47] , multiple time stepping [48], ring polymer contraction [49], and coloured-noise thermostats [50, 51, 52].
Despite their notable success towards mainstreaming the use of PI methods [38], these approaches do not yet present a “fix-all" solution.
Issues still exist, including, but not limited to, lack of universal applicability of multiple time stepping and ring polymer contraction, tedious implementation of high-order path integral techniques, and poor ergodicity [53] or zero-point energy leakage [50] due to the coupling between physical and fictitious PI collective modes.
Finally, these approaches do not eliminate the diverging cost of path integral simulations at low temperatures.
An appealing direction that avoids the computational overhead of PI methods is to avoid modeling the entire imaginary-time path.
For instance, the Wigner-Kirkwood effective potential [54, 55] expands the partition functions in powers of the reduced Plank’s constant and retains computationally tractable terms.
Similarly, an effective quantum potential was suggested by Feynman and Hibbs [34, 56] by integrating out all degrees of freedom except the centroid of the ring polymer, i.e., by coarse-graining the ring polymer to the centroid.
Ever since, the idea of ring polymer coarse-graining has been explored for over 20 years [57, 58, 59] and has recently gained increasingly more attention thanks to the advent of MLIPs [60, 61, 62].
A number of coarse-grained effective potentials can be developed by identifying (a) a suitable mapping from the fine-grained ring polymer to the coarse-grained classical system and (b) the functional form of the interactions in the coarse-grained system.
For instance, the so-called “CG-PI theory", developed by Voth et al., [63, 64, 65] proposes to map the imaginary time path to a two-replica coarse-grained system.
This mapping gives access to both diagonal and nondiagonal elements of the thermal density matrix.
However, the corresponding formalism is challenging to implement for realistic molecular systems in the regime of strong NQEs [65].
Another common mapping involves coarse-graining of the ring polymer to the centroid [61, 62, 60].
Amongst others, we have recently shown that [60], the resulting effective potential can be used in the same manner as a classical force field, providing an approximation to the dynamical properties of the system within the centroid molecular dynamics approach [66].
However, centroid-based ring-polymer coarse-graining does not give rigorous access to exact static properties [67].
For instance, for a harmonic oscillator (which is a good approximation for many chemical systems with stiff bonds), the centroid approximation is equivalent to a classical description.
On the other hand, a prominent feature of our work is the path-integral coarse-grained simulations (PIGS) method that uses many-body universal approximators [68] to fit the coarse-grained potential of mean force.
The PIGS approach utilizes machine-learned potentials, capable of approximating arbitrary complex functions, to learn the full many-body coarse-grained potentials as demonstrated recently on biomolecular simulations [69, 70, 71, 72].
Here, instead of dynamical properties, we focus on characterizing static position-dependent properties, including NQEs, and we develop an effective machine-learned potential to compute them accurately and at the cost of classical MD. Our approach builds upon the PIGS approach that uses multiscale coarse-graining [68] and our work on the use of deep learning architectures for classical coarse-grained force-fields [73] but maps the ring polymer onto a single replica instead of the centroid. This coarse-grained mapping is distinct from earlier choices and allows quantum thermodynamic expectation values of position-dependent observables to be estimated rigorously as simple time averages akin to classical MD. We demonstrate that this approach yields accurate quantum nuclear statistics of bulk and isolated systems. The remaining sections of this paper are organized as follows: in Section II, we briefly summarize the imaginary-time PI theory and the application of thermodynamic coarse-graining to the problem. Section III discusses the numeric implementation of the approach for four model systems: one-dimensional Morse potential, single H2O molecule, Zundel cation, and bulk water. The results are presented and discussed in Section IV. Section V provides concluding remarks.
II Theory and Methods
II.1 The path-integral approach
We consider a Hamiltonian of distinguishable nuclei, living in three-dimensional Cartesian space, with masses , and position vector , interacting with the potential . The equivalent PI partition function comprising replicas can be expressed as,
(1) |
where
(2) |
Here, represents the position vector of the -th replica of the system in the ring polymer with cyclic boundary conditions . In addition, is a shorthand for the set of positions of the replicas of the ring polymer, and is a shorthand notation for the total potential experienced by the ring polymer.
In the limit , the ring polymer statistics is an unbiased estimator of quantum statistics of the target system.
The quantum thermodynamic average of a position-dependent operator is typically estimated as the ensemble average of the position-dependent function averaged over all the replicas,
(3) |
where corresponds to the ensemble average defined in Eq. 2. Estimating Eq. 3 is computationally demanding due to the need for simulating replicas of the system. Typically, should be larger than with being the largest physical frequency of the system [74]. Notably, for a given system (with fixed ), the number of replicas scales inversely with , implying a sleepy rising computational cost as the temperature goes to zero.
II.2 Path-integral coarse-graining for quantum statistics
Our goal is to reduce the computational cost of simulating ring polymer statistics by integrating additional degrees of freedom, such as (linear combinations of) the replicas of the system, in a way that retains the accuracy of Eq. 2.
In the most extreme and computationally beneficial scenario, we can integrate the replicas of the physical system to an effective potential felt by a single replica, reducing the dimensionality of the ring polymer to the physical system.
In doing so, we reduce the cost of estimating thermodynamic properties to that of classical MD.
The problem can be reformulated as follows, using the bottom-up coarse-graining technique widely used in biomolecular simulations. We wish to coarse-grain the ring polymer system in the high dimensional configurational space described by a potential ) into an effective thermodynamically consistent potential with ,
(4) |
where is a Dirac delta function, and is a linear operator that maps coordinates from . The effective potential of mean force in Eq. 4 can be obtained using the force-matching variational approach [68]. We first represent the effective potential as a function, , parametrized by , and then minimize the force-matching loss [69] with respect to :
(5) |
Here, is the force associated with the (high-dimensional) ring polymer systems, is a linear operator that maps the forces from , i.e., the high-dimensional ring polymer space into the low-dimensional space associated with the physical system.
II.3 Single replica coarse-graining
While several choices of the mapping operators and are possible [73], not all choices are ideal for estimating generic position-dependent operators with the full accuracy of Eq. 3. To make a physically meaningful and computationally favorable choice, we note that all replicas of the ring polymer are equivalent thanks to the invariance of the trace in Eq. 2 to cyclic shifts. Hence, a thermodynamic expectation value can be estimated by averaging the position-dependent function estimated on any replica
(6) |
Eq. 6 suggests that the statistics of a single replica are sufficient to estimate equilibrium averages of position-dependent observables. From the perspective of developing an effective potential, it is sufficient to integrate all but one (arbitrary) replica. Hence, in this work, we coarse-grain the ring polymer to a single replica using as a configurational map the following transformation:
(7) |
Given this configurational map, a thermodynamically consistent force map can be constructed with the matrix elements given by following relationship [73]:
(8) |
where are arbitrary. For a ring polymer, setting leads to the following primitive estimator of force projected onto coarse-grained space,
(9) |
However, the numerical efficiency of the force-matching minimization can be further improved by optimizing to minimize the variance of the mapped force [73]. With the configurational mapping operator given by Eq. 7, expectation values of a position-dependent operator can be estimated as a simple ensemble average
(10) |
akin to standard MD, albeit in the classical ensemble of the effective potential .
The choice of the mapping operator in Eq. 7 is distinct from earlier choices made in the context of coarse-graining of imaginary-time PIs.
The most popular coarse-graining approach maps the ring polymer positions to its centroid.
While this approach is useful for simulating dynamical properties, it doesn’t give correct equilibrium averages of generic position-dependent operators when estimated as an ensemble average similar to Eq. 10.
Similarly, the PICG theory introduced in Ref. 63, 64, 65 coarse-grains the ring polymer to two replicas, which are related to the positions at which the off-diagonal elements of the Boltzmann operator are estimated.
This approach gives access to generic position- and momentum-dependent operators; however, it requires derivations of bespoke (complicated) operators.
On the other hand, our mapping reduces the ring polymer to the position at which the trace of the Boltzmann operator is estimated, with , modulo an additive constant, giving access to position-dependent operators as simple ensemble averages.
II.4 Workflow using the PIGS method
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5709116/figures/picg_statistics_workflow.png)
To develop the effective potentials that incorporate NQEs, we utilize the PIGS approach, outlined in Fig. 1.
First, we generate training data via PIMD simulation. The resulting ring polymer trajectories are then projected onto a coarse-grained space using a configurational and force map discussed above. Notice that the remaining replica in Eq. 7 is selected arbitrarily, and for a ring polymer with P replicas Eq. 7 defines P equivalent projections. Thus, each ring polymer configuration yields P data points for training.
The effective potential is represented in a form
(11) |
where is a learnable correction. The hyperparameter determines the contribution of the physical potential to the total energy. A common approach is to set so the ML network learns an effective NQEs correction to the classical potential [62, 60]. This choice is effective at high temperatures when NQEs are small. However, as temperature decreases, prior classical distribution quickly becomes more localized, and the difference between the classical and target quantum potential increases, making the correction harder to learn. However, in the harmonic limit, it is possible to make classical distribution more "quantum-like" by changing the simulation temperature. For this reason, to simplify the learning, enhance the data efficiency of the training, and the model stability, we consider to be temperature dependent: , where denotes the target temperature and is a hyperparameter. If not specified otherwise, we set K. The learnable model parameters are then optimized with standard ML approaches. The resulting effective potential can then be used for calculations as a standard MD force field.
II.5 Applicability of the models to different thermodynamic conditions
The thermodynamically consistent CG model approximates the potential of mean force, which depends on the thermodynamic state, e.g., system temperature. Consequently, the model is strictly applicable only at the same conditions it was trained, and targeting the coarse-grained model to different conditions requires a different set of training data. However, in the case of ring polymer coarse-graining, we notice two cases where the applicability of a trained model can be extended, or training data generated for one condition can be reused for a different one. In the first scenario, the molecule predominantly populates the ground vibrational state, and the population of the excited states is negligible. In this case, the general equation for the quantum PMF (see Eq. 18) simplifies to:
(12) |
where denotes the ground-state nuclear wave function.
Consequently, the effective CG potentials at different temperatures in the ground vibrational state are related to each other as:
(13) |
This relationship implies that if a model is trained at a temperature that is low enough to predominately populate the ground state, the same model can be used
to run simulations and compute the quantum statistics at any lower temperature by rescaling and the corresponding forces according to Eq. 13.
This simple relation breaks down when the population of the excited vibrational states is non-negligible. However, a dataset generated with expensive PIMD simulation at one temperature can be used to generate synthetic datasets corresponding to different temperatures and isotope compositions. This data recycling process involves two steps. First, we reweight the configuration to the target temperature and isotope composition, efficiently using the coordinate rescaling approaches from Refs. [75, 76]. Within the coordinate scaling approach, given the initial coordinates , inverse temperature and mass , to model a system at inverse temperature and mass , we rescale the coordinates as
(14) |
where represents the centroid position. Then, the probability of a configuration in the ensemble characterized by and is given by (see SI for the details):
(15) |
We used Eq. 15 to reweight the original dataset.
Second, we recompute the forces originating from the harmonic couplings between adjacent replicas, , for the target temperature and isotope compositions as:
(16) |
The resulting dataset is then used for model training, as described above.
III Computational details
III.1 1-D Morse potential
III.1.1 Model and training procedure
We first demonstrate the approach on a systems for which the exact effective potential can be estimated numerically. We study a particle with mass experiencing a one-dimensional Morse potential,
(17) |
where is the particle position. The parameters, , , and , that determine the equilibrium position, shape, and depth of the potential, are chosen to reproduce the potential of the O–H bond, and is set to the reduced mass of the O–H bond (see SI for the numeric values).
The effective potential corresponding to the mapping in Eq. 7 can be estimated numerically by solving the Schrödinger equation for the bound states of a Morse potential [77] as
(18) |
where and are the the energy eigenvalues and wave functions associated with the -th eigenstate, respectively. To estimate the effective potential using the PIGS approach, we optimize a coarse-grained potential in the following Eq. 11 with :
(19) |
where is a linear combination of radial basis functions (see SI for details) [78], and is the total number of basis functions (we set ). Hence, instead of learning the full effective potential, we fit a correction to the classical potential that captures the quantum effects learned from the full-ring polymer simulations. The adjustable parameters are obtained by minimizing Eq. 5 using ridge regression, with the regularization parameter set to 0.1 and 5-fold cross-validation.
III.1.2 Training data generation
The training data was generated from PIMD simulations using the i-PI code [79] and a custom Python-based force provider implementing the one-dimensional Morse potential. Six simulations were performed in an NVT ensemble for 100 - 600 K in steps of 100 K, using 64 replicas and a PILE-L thermostat with a time constant of fs [80] The integration was performed with a timestep of fs and positions and forces acting on each bead were sampled every steps. The first ps of the simulation were discarded, and the positions and forces sampled in the subsequent ps were used to train the model.
III.2 Molecular systems: water molecule, Zundel cation and liquid water
III.2.1 Model and training procedure
We use Eq. 11 to model the effective potential of the single water molecule, the Zundel cation, and bulk water.
We represent with the MACE graph neural network architecture [81].
One of the hallmarks of the MACE architecture is the use of many-body features to represent atomic local environments, improving the model accuracy and data efficiency.
The models were trained with the AdamW optimizer, a learning rate of , and a weight decay of . The MACE neural network cutoff was set to Å, with radial basis functions, polynomial, and radial channels. The correlation order for Zundel cation and bulk water was set to , which corresponds to -body features. For a single water molecule, -body features were used. In all cases, % of the data available were used for training, and the remaining % were used for validation. The training was performed for epochs for the H2O molecule and the Zundel cation and for epochs for liquid H2O. For each system, ten different models were trained, each using different parameter initialization and train-validation splitting.
III.2.2 Training data generation
The training data was generated from PIMD simulations using the i-PI code using the Partridge and Schwenke potential for a single water molecule [82], the HBB potential [83] for the Zundel cation and the MB-pol for bulk water [84]. The simulation parameters are summarized in Table S1. We use a force map optimized to minimize the average magnitude of the mapped forces, subject to the constraints of the valid force map. This force optimization was performed according to the procedure introduced in [73], with an regularization parameter set to .
III.2.3 Model evaluation
For each of the trained models, we performed molecular dynamics simulations, using as a force field a combination of classical potential and a trained correction as defined in Eq. 11. The simulations were performed in the NVT ensemble, with a Langevin integrator and a friction thermostat with a time constant set to ps-1. The time step and periodic boundary conditions were the same as in the training simulations. All the frames were used for further analysis.
For each of the systems, we performed additional simulations with only the classical potential as a control to compare the classical and quantum statistics.
IV Results and Discussion
IV.1 1-D Morse potential
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5709116/figures/Fig1_Morse_potential.png)
First, we study a particle in a one-dimensional Morse potential, a model for a single O–H bond. The Schrödinger equation for this system can be solved analytically [77], thus making Morse potential an ideal test system. It also exhibits significant nuclear quantum effects, as shown in Fig. 2A where the classical Morse potential (Eq. 17) is compared with the corresponding quantum PMFs (Eq. 18) at different temperatures. The classical approximation leads to the over-localization of the particle, and, as expected, the quantum effects are more prominent at low temperatures. However, for this system, even at a temperature of K, there are significant deviations between the classical and quantum pictures.
The developed approach allows us to get a quantitative agreement with the reference, especially in the high-probability regions that are well-sampled in the training dataset (Fig. 2B-D). Further away from the minimum energy, the model’s prediction slightly deviates from the ground truth. This is to be expected, as these regions of the phase space are not as well represented in the training data as the high-probability regions.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5709116/figures/Fig2_H2O_mol_with_molecule.png)
We test the reweighting scheme proposed in Section II.5 by generating datasets corresponding to different temperatures ( K) and mass of the particle that is either Protium (corresponding to an O-H bond) or increased to Tritium (corresponding to an O-T bond) from the K O-H dataset previously used. The resulting reweighted datasets were used to train new models. As shown in Fig. 2E, the models trained with the reweighted dataset are in excellent agreement with the analytical results for the corresponding temperature and isotope composition. Such data recycling allows for a further decrease in the computational cost of generating training data, though it becomes less effective for more complex systems. Nevertheless, we expect that this approach may be particularly useful for generating large datasets for training transferable models.
IV.2 Water molecule
The next test case we considered is a single water molecule in a vacuum. The quantum statistics of this 3-body system can be exhaustively described in terms of the length of two OH bonds and the HOH bond angle. The comparison of the probability distributions for the OH distances and HOH bond angles is shown in Fig. 3, with additional details provided in Fig. S1. As was the case for the Morse potential, the classical distribution is significantly more localized than the quantum counterpart. The ML model consistently samples the correct quantum distribution: the results obtained from 10 independently trained models produce similar results with low variance.
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5709116/figures/Zundel_paper_figure.png)
For all trained models, we run the MD simulation for one ns, which is four times longer than the PIMD simulation used to generate the training data. This allows us to obtain slowly converging properties by performing short PIMD simulations and using those to train a model that can be run for a much longer time.
Even at K, the water molecule predominantly populates the ground vibrational state, with a negligible population of the excited states. Thus, as discussed in Section II.5, the model trained at K can be used to simulate the system at any lower temperatures. To demonstrate this point further, we report in Fig. S2 the results obtained with a model trained on PIMD data generated at K and then simulated at lower temperatures, with forces rescaled according to Eq. 13.
IV.3 Zundel cation
A more challenging case is the Zundel cation, which consists of two water molecules sharing a hydrogen-bonded proton H∗. This system has been extensively studied both theoretically [85, 86] and experimentally [87, 88, 89, 90, 91]. The Zundel cation exhibits large conformational fluctuations, including anharmonic and floppy motion associated with the transfer of the hydrogen-bounded proton. Following Ref. [86], we characterize these fluctuations by means of the OO distance , the proton transfer coordinate , the dihedral angle that measures the rotation of the H3O+ fragments around the OHO axis, and the that measures the nonplanarity of the H3O+ fragment. The dihedral angle is defined by points XOOX2, where X1 and X2 represent a midpoint between the hydrogen atoms that are not involved in the hydrogen bond formation and bound to O1 and O2, correspondingly. The non-planarity of H3O+ is characterized by a distance between an O atom and the plane, defined by the three closest H atoms (see the lower panel of Fig. 4).
![Refer to caption](https://cdn.statically.io/img/arxiv.org/extracted/5709116/figures/bulk_H2O.png)
At K, the proton transfer coordinate and the rotational motion are almost classical, and only a small correction is needed to account for NQEs. The OO distance and the Oplane distance require larger corrections, yet the NQEs are relatively small. At K, however, significant quantum effects appear in all the collective variables, including changes in the equilibrium geometry. Nevertheless, our methods recover the accurate quantum distribution both at K and at K.
Despite the diversity of conformational fluctuations, the model not only reproduces the target probability distributions but leads to remarkably stable simulations: all of the models were used to perform ns simulation and did not exhibit any instabilities, often plaguing ML force fields [92, 93].
IV.4 Liquid water
Lastly, we applied our approach to model liquid water to reproduce the NQEs manifesting in both bonded and nonbonded interactions. Since NQEs are highly local, the former are more affected than the latter. The quality of the machine-learned model is evaluated in Fig. 5 by computing radial distribution functions (RDFs). At K, the first peak in the OH (Fig. 5B) and HH (Fig. 5C) RDFs predicted by classical MD (P=1) are too narrow and significantly deviate from the correct quantum PIMD results. The deviations between the classical and quantum RDFs rapidly decrease with the distance.
Our neural-network-based potential is able to accurately predict the correct quantum RDFs, including the contributions that arise from both bonded and nonbonded interactions (Fig. 5), in contrast to another similar study [62]. There is also a remarkable agreement between independently trained models. The high accuracy and low uncertainty of our ML model prediction show that it reliably captures the subtle differences between classical and quantum RDFs.
Compared to the simulations for the ML models of a single water molecule and of the Zundel cation, the simulations for bulk water exhibit a decreased stability. We note that some simulations of our ML models, trained on ps of PIMD simulations, exhibit numerical instabilities. However, the ML models allow, on average, at least ps of simulation time before the onset of any instability. Simulations can be reinitialized and restarted to improve statistics. The numerical instability of the model can be potentially cured by increasing the size of the training dataset. Alternatively, one can modify the learning problem by simplifying the ML correction, for example, by introducing physics-informed energy terms, as it is done in the learning of classical ML coarse-grained force-fields [69, 70, 71, 72]. Given that the stability of the simulation strongly depends on the parameter (see Eq. 11), this approach may be especially promising.
V Conclusion
We developed an approach to coarse-grain the ring-polymer dynamics describing NQEs and we obtain results thermodynamically consistent with PIMD simulations. This approach allows us to accurately compute position-dependent NQEs at the cost of classical MD. We have demonstrated this method on four distinct systems where NQEs are prevalent — the Morse potential, a single water molecule, the Zundel cation, and bulk water - and reported results in excellent agreement with (much more expensive) reference PIMD simulations.
The proposed method builds an effective NQEs correction potential by combining the rigorous framework of multiscale coarse-graining [68] with the MACE architecture [81], an expressive state-of-the-art graph neural network potential. We have shown that these effective potentials accurately reproduce the quantum statistics of nuclei, both near the classical limit and in the regime of strong NQE. Moreover, our method eliminates the need to simulate multiple replicas. Once parametrized, the model seamlessly integrates into existing classical force fields and doesn’t require specialized algorithms other than ones already used for classical MD simulations. However, this approach is limited to reproducing quantum thermodynamics, so dynamical and momentum-dependent properties should be approximated using a previously proposed method [60].
For the results presented here, different sets of training data were used to train models for different molecular species at different temperatures. However, if the temperature associated with the training data is low enough to significantly populate only the quantum ground state, the trained model can be used to also simulate any lower temperature, as we have demonstrated in the case of the H2O molecule. The proposed method is designed such that the forces originating from the spring coupling between the adjacent replicas have a non-zero contribution to the force projected onto the coarse-grained space. As the dependence of these forces on the atomic mass and temperature is known (Eq. 9), the training dataset obtained for one isotope composition and temperature can be, in principle, reweighted to construct a dataset for different isotope compositions and temperatures. Thus, a single PIMD simulation can provide data for training several models, as we have shown in the case of the Morse potential.
The rapid proliferation of machine-learned force fields parametrized using accurate quantum chemical calculations to account for the electronic effects means the missing NQEs are now becoming a dominant source of error. At the same time, NQEs are crucial to describe biomolecular processes accurately, for instance, in the case of proton transfer reactions. While the results presented here are model-specific, building upon the recent development of machine-learned force-fields (either coarse-grained [69, 70, 71, 72], or atomistic [94, 78, 95, 96, 97, 81]), we believe this study represents a significant step forward towards the design of transferable potentials that explicitly incorporate also NQEs but retain the computational cost of classical MD.
Supplementary material
Section I of the supplementary material details the Morse potential (I.A) and ML model used for a particle in Morse potential (I.B). Section II contains the simulation parameters used to generate the training data. Section III provides the derivation of coordinate rescaling. Section IV presents extended results for a water molecule.
Acknowledgements
We thank members of the Clementi’s group at FU for insightful discussions and comments on the manuscript. We gratefully acknowledge funding from the Deutsche Forschungsgemeinschaft DFG (SFB/TRR 186, Project A12; SFB 1114, Projects B03, B08, and A04; SFB 1078, Project C7), the National Science Foundation (PHY-2019745), and the Einstein Foundation Berlin (project 0420815101), and the computing time provided on the supercomputer Lise at NHR@ZIB as part of the NHR infrastructure (project beb00040), the HPC Service of FUB-IT, Freie Universität Berlin, the HPC Service of the Physics department, Freie Universität Berlin, and the Swiss National Supercomputing Centre (under projects s1209 and s1288). V.K. acknowledges support from the Ernest Oppenheimer Early Career Fellowship and the Sydney Harvey Junior Research Fellowship, Churchill College, University of Cambridge.
Author declarations
Conflict of Interest
The authors have no conflicts to disclose.
Data availability
All the training data, code, and scripts required to reproduce the results are made available in the online repository: cg_nuclear_quantum_statistics.
References
- Chmiela et al. [2023] S. Chmiela, V. Vassilev-Galindo, O. T. Unke, A. Kabylda, H. E. Sauceda, A. Tkatchenko, and K.-R. Müller, Sci. Adv. 9, eadf0873 (2023).
- Batatia et al. [2023a] I. Batatia, P. Benner, Y. Chiang, A. M. Elena, D. P. Kovács, J. Riebesell, X. R. Advincula, M. Asta, W. J. Baldwin, N. Bernstein, A. Bhowmik, S. M. Blau, V. Cărare, J. P. Darby, S. De, F. D. Pia, V. L. Deringer, R. Elijošius, Z. El-Machachi, E. Fako, A. C. Ferrari, A. Genreith-Schriever, J. George, R. E. A. Goodall, C. P. Grey, S. Han, W. Handley, H. H. Heenen, K. Hermansson, C. Holm, J. Jaafar, S. Hofmann, K. S. Jakob, H. Jung, V. Kapil, A. D. Kaplan, N. Karimitari, N. Kroupa, J. Kullgren, M. C. Kuner, D. Kuryla, G. Liepuoniute, J. T. Margraf, I.-B. Magdău, A. Michaelides, J. H. Moore, A. A. Naik, S. P. Niblett, S. W. Norwood, N. O’Neill, C. Ortner, K. A. Persson, K. Reuter, A. S. Rosen, L. L. Schaaf, C. Schran, E. Sivonxay, T. K. Stenczel, V. Svahn, C. Sutton, C. van der Oord, E. Varga-Umbrich, T. Vegge, M. Vondrák, Y. Wang, W. C. Witt, F. Zills, and G. Csányi, arXiv preprint arXiv:2401.00096 (2023a), arXiv:2401.00096 [physics.chem-ph] .
- van der Spoel [2021] D. van der Spoel, Curr. Opin. Struct. Biol. 67, 18 (2021).
- Jung et al. [2021] J. Jung, C. Kobayashi, K. Kasahara, C. Tan, A. Kuroda, K. Minami, S. Ishiduki, T. Nishiki, H. Inoue, Y. Ishikawa, M. Feig, and Y. Sugita, J. Comput. Chem. 42, 231 (2021).
- Shaw et al. [2021] D. E. Shaw, P. J. Adams, A. Azaria, J. A. Bank, B. Batson, A. Bell, M. Bergdorf, J. Bhatt, J. A. Butts, T. Correia, R. M. Dirks, R. O. Dror, M. P. Eastwood, B. Edwards, A. Even, P. Feldmann, M. Fenn, C. H. Fenton, A. Forte, J. Gagliardo, G. Gill, M. Gorlatova, B. Greskamp, J. Grossman, J. Gullingsrud, A. Harper, W. Hasenplaugh, M. Heily, B. C. Heshmat, J. Hunt, D. J. Ierardi, L. Iserovich, B. L. Jackson, N. P. Johnson, M. M. Kirk, J. L. Klepeis, J. S. Kuskin, K. M. Mackenzie, R. J. Mader, R. McGowen, A. McLaughlin, M. A. Moraes, M. H. Nasr, L. J. Nociolo, L. O’Donnell, A. Parker, J. L. Peticolas, G. Pocina, C. Predescu, T. Quan, J. K. Salmon, C. Schwink, K. S. Shim, N. Siddique, J. Spengler, T. Szalay, R. Tabladillo, R. Tartler, A. G. Taube, M. Theobald, B. Towles, W. Vick, S. C. Wang, M. Wazlowski, M. J. Weingarten, J. M. Williams, and K. A. Yuh, in Proc. - Int. Conf. High Perform. Comput. Netw. Storage Anal., SC ’21 (Association for Computing Machinery, New York, NY, USA, 2021).
- Born and Oppenheimer [1927] M. Born and R. Oppenheimer, Ann. Phys. (Leipzig) 389, 457 (1927).
- Medders and Paesani [2016] G. R. Medders and F. Paesani, J. Am. Chem. Soc. 138, 3912 (2016).
- Marsalek and Markland [2017] O. Marsalek and T. E. Markland, J. Phys. Chem. Lett. 8, 1545 (2017).
- Pereyaslavets et al. [2018] L. Pereyaslavets, I. Kurnikov, G. Kamath, O. Butin, A. Illarionov, I. Leontyev, M. Olevanov, M. Levitt, R. D. Kornberg, and B. Fain, Proc. Natl. Acad. Sci. U.S.A. 115, 8878 (2018).
- Daru et al. [2022] J. Daru, H. Forbert, J. Behler, and D. Marx, Phys. Rev. Lett. 129, 226001 (2022).
- Stern and Berne [2001] H. A. Stern and B. J. Berne, J. Chem. Phys. 115, 7622 (2001).
- Shiga and Shinoda [2005] M. Shiga and W. Shinoda, J. Chem. Phys. 123, 134502 (2005).
- Paesani et al. [2006] F. Paesani, W. Zhang, D. A. Case, I. Cheatham, Thomas E., and G. A. Voth, J. Chem. Phys. 125, 184507 (2006).
- Morrone and Car [2008] J. A. Morrone and R. Car, Phys. Rev. Lett. 101, 017801 (2008).
- Habershon et al. [2009] S. Habershon, T. E. Markland, and D. E. Manolopoulos, J. Chem. Phys. 131, 024501 (2009).
- Vega et al. [2010] C. Vega, M. M. Conde, C. McBride, J. L. F. Abascal, E. G. Noya, R. Ramirez, and L. M. Sesé, J. Chem. Phys. 132, 046101 (2010).
- Pamuk et al. [2012] B. Pamuk, J. M. Soler, R. Ramírez, C. P. Herrero, P. W. Stephens, P. B. Allen, and M.-V. Fernández-Serra, Phys. Rev. Lett. 108, 193003 (2012).
- Fritsch et al. [2014] S. Fritsch, R. Potestio, D. Donadio, and K. Kremer, J. Chem. Theory Comput. 10, 816 (2014).
- Wang et al. [2014] L. Wang, M. Ceriotti, and T. E. Markland, J. Chem. Phys. 141, 104502 (2014).
- Litman et al. [2017] Y. Litman, D. Donadio, M. Ceriotti, and M. Rossi, J. Chem. Phys. 148, 102320 (2017).
- Shepherd et al. [2021] S. Shepherd, J. Lan, D. M. Wilkins, and V. Kapil, J. Phys. Chem. Lett. , 9108 (2021).
- Eltareb et al. [2023] A. Eltareb, G. E. Lopez, and N. Giovambattista, J. Phys. Chem. B 127, 4633 (2023).
- Kapil et al. [2023] V. Kapil, D. P. Kovacs, G. Csányi, and A. Michaelides, Faraday Discuss. (2023), 10.1039/D3FD00113J.
- Tuckerman et al. [1997] M. E. Tuckerman, D. Marx, M. L. Klein, and M. Parrinello, Science 275, 817 (1997).
- Walker and Michaelides [2010] B. Walker and A. Michaelides, J. Chem. Phys. 133, 174306 (2010).
- Ceriotti et al. [2013] M. Ceriotti, J. Cuny, M. Parrinello, and D. E. Manolopoulos, Proc. Natl. Acad. Sci. U.S.A. 110, 15591 (2013).
- Fang et al. [2016] W. Fang, J. Chen, M. Rossi, Y. Feng, X.-Z. Li, and A. Michaelides, J. Phys. Chem. Lett. 7, 2125 (2016).
- Kapil et al. [2019a] V. Kapil, E. Engel, M. Rossi, and M. Ceriotti, J. Chem. Theory Comput. 15, 5845 (2019a).
- Kapil and Engel [2022] V. Kapil and E. A. Engel, Proc. Natl. Acad. Sci. U.S.A. 119 (2022), 10.1073/pnas.2111769119.
- Buxton et al. [2019] S. J. Buxton, D. Quigley, and S. Habershon, J. Chem. Phys. 151, 144503 (2019).
- Štoček et al. [2022] J. R. Štoček, O. Socha, I. Cásařová, T. Slanina, and M. Dračínský, J. Am. Chem. Soc. 144, 7111 (2022).
- Sauceda et al. [2021] H. E. Sauceda, V. Vassilev-Galindo, S. Chmiela, K.-R. Müller, and A. Tkatchenko, Nat. Commun. 12, 442 (2021).
- Major et al. [2009] D. T. Major, A. Heroux, A. M. Orville, M. P. Valley, P. F. Fitzpatrick, and J. Gao, Proc. Natl. Acad. Sci. U.S.A. 106, 20734 (2009).
- Feynman and Hibbs [1965] R. Feynman and A. Hibbs, Quantum Mechanics and Path Integrals, International series in pure and applied physics (McGraw-Hill, 1965).
- Chandler and Wolynes [1981] D. Chandler and P. G. Wolynes, J. Chem. Phys. 74, 4078 (1981).
- Berne and Thirumalai [1986] B. J. Berne and D. Thirumalai, Annu. Rev. Phys. Chem. 37, 401 (1986).
- Tuckerman [2023] M. E. Tuckerman, Statistical Mechanics: Theory and Molecular Simulation (Oxford University Press, 2023).
- Markland and Ceriotti [2018] T. E. Markland and M. Ceriotti, Nat. Rev. Chem. 2, 1 (2018).
- Uhl et al. [2016] F. Uhl, D. Marx, and M. Ceriotti, J.Chem. Phys. 145, 054101 (2016).
- Takahashi and Imada [1984] M. Takahashi and M. Imada, J. Phys. Soc. Jpn. 53, 3765 (1984).
- Chin [1997] S. A. Chin, Phys. Lett. A 226, 344 (1997).
- Pérez and Tuckerman [2011] A. Pérez and M. E. Tuckerman, J. Chem. Phys. 135, 064104 (2011).
- Kapil et al. [2016a] V. Kapil, J. Behler, and M. Ceriotti, J. Chem. Phys. 145, 234103 (2016a).
- Kapil et al. [2019b] V. Kapil, J. Wieme, S. Vandenbrande, A. Lamaire, V. Van Speybroeck, and M. Ceriotti, J. Chem. Theory Comput. 15, 3237 (2019b).
- Kamibayashi and Miura [2016] Y. Kamibayashi and S. Miura, J. Chem. Phys. 145, 074114 (2016).
- Poltavsky et al. [2020] I. Poltavsky, V. Kapil, M. Ceriotti, K. S. Kim, and A. Tkatchenko, J. Chem. Theory Comput. (2020), 10.1021/acs.jctc.9b00881.
- Poltavsky and Tkatchenko [2016] I. Poltavsky and A. Tkatchenko, Chem. Sci. 7, 1368 (2016).
- Kapil et al. [2016b] V. Kapil, J. VandeVondele, and M. Ceriotti, J. Chem. Phys. 144, 054111 (2016b).
- Markland and Manolopoulos [2008a] T. E. Markland and D. E. Manolopoulos, Chemical Physics Letters 464, 256 (2008a).
- Ceriotti et al. [2009] M. Ceriotti, G. Bussi, and M. Parrinello, Phys. Rev. Lett. 103, 030603 (2009).
- Dammak et al. [2009] H. Dammak, Y. Chalopin, M. Laroche, M. Hayoun, and J.-J. Greffet, Phys. Rev. Lett. 103, 190601 (2009).
- Ceriotti and Manolopoulos [2012] M. Ceriotti and D. E. Manolopoulos, Phys. Rev. Lett. 109, 100604 (2012).
- Korol et al. [2019] R. Korol, N. Bou-Rabee, and T. F. Miller, III, J. Chem. Phys. 151, 124103 (2019).
- Wigner [1932] E. Wigner, Phys. Rev. 40, 749 (1932).
- Kirkwood [1933] J. G. Kirkwood, Phys. Rev. 44, 31 (1933).
- Feynman [1972] R. P. Feynman, Statistical Mechanics (Benjamin, New York, 1972).
- Blinov et al. [2001] N. V. Blinov, P.-N. Roy, and G. A. Voth, J. Chem. Phys. 115, 4484 (2001).
- Blinov and Roy [2001] N. Blinov and P.-N. Roy, J. Chem. Phys. 115, 7822 (2001).
- Roy and Blinov [2002] P.-N. Roy and N. Blinov, Isr. J. Chem. 42, 183 (2002).
- Musil et al. [2022] F. Musil, I. Zaporozhets, F. Noé, C. Clementi, and V. Kapil, J. Chem. Phys. 157, 181102 (2022).
- Loose et al. [2022] T. D. Loose, P. G. Sahrmann, and G. A. Voth, J. Chem. Theory Comput. 18, 5856 (2022).
- Kurnikov et al. [2024] I. V. Kurnikov, L. Pereyaslavets, G. Kamath, S. N. Sakipov, E. Voronina, O. Butin, A. Illarionov, I. Leontyev, G. Nawrocki, M. Darkhovskiy, M. Olevanov, I. Ivahnenko, Y. Chen, C. B. Lock, M. Levitt, R. D. Kornberg, and B. Fain, J. Chem. Theory Comput. 20, 1347 (2024).
- Ryu et al. [2019] W. H. Ryu, Y. Han, and G. A. Voth, J. Chem. Phys. 150, 244103 (2019).
- Sinitskiy and Voth [2015] A. V. Sinitskiy and G. A. Voth, J. Chem. Phys. 143, 094104 (2015).
- Ryu and Voth [2022] W. H. Ryu and G. A. Voth, J. Phys. Chem. A 126, 6004 (2022).
- Cao and Voth [1994a] J. Cao and G. A. Voth, J. Chem. Phys. 100, 5106 (1994a).
- Cao and Voth [1994b] J. Cao and G. A. Voth, J. Chem. Phys. 100, 5093 (1994b).
- Noid et al. [2008] W. G. Noid, J.-W. Chu, G. S. Ayton, V. Krishna, S. Izvekov, G. A. Voth, A. Das, and H. C. Andersen, J. Chem. Phys. 128 (2008), 10.1063/1.2938860.
- Wang et al. [2019] J. Wang, S. Olsson, C. Wehmeyer, A. Pérez, N. E. Charron, G. de Fabritiis, F. Noé, and C. Clementi, ACS Cent. Sci. 5, 755 (2019).
- Husic et al. [2020] B. E. Husic, N. E. Charron, D. Lemm, J. Wang, A. Pérez, M. Majewski, A. Krämer, Y. Chen, S. Olsson, G. de Fabritiis, F. Noé, and C. Clementi, J. Chem. Phys. 153 (2020), 10.1063/5.0026133.
- Majewski et al. [2023] M. Majewski, A. Pérez, P. Thölke, S. Doerr, N. E. Charron, T. Giorgino, B. E. Husic, C. Clementi, F. Noé, and G. De Fabritiis, Nat. Commun. 14 (2023), 10.1038/s41467-023-41343-1.
- Charron et al. [2023] N. E. Charron, F. Musil, A. Guljas, Y. Chen, K. Bonneau, A. S. Pasos-Trejo, J. Venturin, D. Gusew, I. Zaporozhets, A. Krämer, C. Templeton, A. Kelkar, A. E. P. Durumeric, S. Olsson, A. Pérez, M. Majewski, B. E. Husic, A. Patel, G. De Fabritiis, F. Noé, and C. Clementi, arXiv:2310.18278 (2023), 10.48550/ARXIV.2310.18278.
- Krämer et al. [2023] A. Krämer, A. E. P. Durumeric, N. E. Charron, Y. Chen, C. Clementi, and F. Noé, J. Phys. Chem. Lett. 14, 3970 (2023).
- Markland and Manolopoulos [2008b] T. E. Markland and D. E. Manolopoulos, Chem. Phys. Lett. 464, 256 (2008b).
- Yamamoto [2005] T. M. Yamamoto, J. Chem. Phys. 123, 104101 (2005).
- Ceriotti and Markland [2013] M. Ceriotti and T. E. Markland, J. Chem. Phys. 138, 014112 (2013).
- Dahl and Springborg [1988] J. P. Dahl and M. Springborg, J. Chem. Phys. 88, 4535 (1988).
- Unke and Meuwly [2019] O. T. Unke and M. Meuwly, J. Chem. Theory Comput. 15, 3678 (2019).
- Kapil et al. [2019c] V. Kapil, M. Rossi, O. Marsalek, R. Petraglia, Y. Litman, T. Spura, B. Cheng, A. Cuzzocrea, R. H. Meißner, D. M. Wilkins, B. A. Helfrecht, P. Juda, S. P. Bienvenue, W. Fang, J. Kessler, I. Poltavsky, S. Vandenbrande, J. Wieme, C. Corminboeuf, T. D. K uhne, D. E. Manolopoulos, T. E. Markland, J. O. Richardson, A. Tkatchenko, G. A. Tribello, V. Van Speybroeck, and M. Ceriotti, Comput. Phys. Commun. 236, 214 (2019c).
- Ceriotti et al. [2010] M. Ceriotti, M. Parrinello, T. E. Markland, and D. E. Manolopoulos, J. Chem. Phys. 133, 124104 (2010).
- Batatia et al. [2023b] I. Batatia, D. P. Kovács, G. N. C. Simm, C. Ortner, and G. Csányi, arXiv preprint (Machine Learning), arXiv:2206.07697, v2 (2023b), arXiv:2206.07697 [stat.ML] .
- Partridge and Schwenke [1997] H. Partridge and D. W. Schwenke, J. Chem. Phys. 106, 4618 (1997).
- Huang et al. [2005] X. Huang, B. J. Braams, and J. M. Bowman, J. Chem. Phys. 122, 044308 (2005).
- Babin et al. [2013] V. Babin, C. Leforestier, and F. Paesani, J. Chem. Theory Comput. 9, 5395 (2013).
- Schröder et al. [2022] M. Schröder, F. Gatti, D. Lauvergnat, H.-D. Meyer, and O. Vendrell, Nat. Commun. 13, 6170 (2022).
- Suzuki et al. [2013] K. Suzuki, M. Tachikawa, and M. Shiga, J. Chem. Phys. 138, 184307 (2013).
- Headrick et al. [2004] J. M. Headrick, J. C. Bopp, and M. A. Johnson, J. Chem. Phys. 121, 11523 (2004).
- Diken et al. [2005] E. G. Diken, J. M. Headrick, J. R. Roscioli, J. C. Bopp, M. A. Johnson, and A. B. McCoy, J. Phys. Chem. A 109, 1487 (2005).
- Hammer et al. [2005] N. I. Hammer, E. G. Diken, J. R. Roscioli, M. A. Johnson, E. M. Myshakin, K. D. Jordan, A. B. McCoy, X. Huang, J. M. Bowman, and S. Carter, J. Chem. Phys. 122, 244301 (2005).
- McCunn et al. [2008] L. R. McCunn, J. R. Roscioli, M. A. Johnson, and A. B. McCoy, J. Phys. Chem. B 112, 321 (2008).
- Vendrell et al. [2008] O. Vendrell, F. Gatti, and H. Meyer, Angew. Chem. 121, 358–361 (2008).
- Fu et al. [2023] X. Fu, Z. Wu, W. Wang, T. Xie, S. Keten, R. Gomez-Bombarelli, and T. Jaakkola, arXiv preprint arXiv:2210.07237 (2023), arXiv:2210.07237 [physics.comp-ph] .
- Stocker et al. [2022] S. Stocker, J. Gasteiger, F. Becker, S. Günnemann, and J. T. Margraf, Mach. Learn.: Sci. Technol. 3, 045010 (2022).
- Schütt et al. [2018] K. T. Schütt, H. E. Sauceda, P.-J. Kindermans, A. Tkatchenko, and K.-R. Müller, J. Chem. Phys. 148, 241722 (2018).
- Unke et al. [2021a] O. T. Unke, S. Chmiela, M. Gastegger, K. T. Schütt, H. E. Sauceda, and K.-R. Müller, Nat. Commun. 12, 7273 (2021a).
- Unke et al. [2021b] O. T. Unke, S. Chmiela, H. E. Sauceda, M. Gastegger, I. Poltavsky, K. T. Schütt, A. Tkatchenko, and K.-R. Müller, Chem. Rev. 121, 10142 (2021b).
- Batzner et al. [2022] S. Batzner, A. Musaelian, L. Sun, M. Geiger, J. P. Mailoa, M. Kornbluth, N. Molinari, T. E. Smidt, and B. Kozinsky, Nat. Commun. 13, 2453 (2022).