-
Augmenting Human Expertise in Weighted Ensemble Simulations through Deep Learning based Information Bottleneck
Authors:
Dedi Wang,
Pratyush Tiwary
Abstract:
The weighted ensemble (WE) method stands out as a widely used segment-based sampling technique renowned for its rigorous treatment of kinetics. The WE framework typically involves initially mapping the configuration space onto a low-dimensional collective variable (CV) space and then partitioning it into bins. The efficacy of WE simulations heavily depends on the selection of CVs and binning schem…
▽ More
The weighted ensemble (WE) method stands out as a widely used segment-based sampling technique renowned for its rigorous treatment of kinetics. The WE framework typically involves initially mapping the configuration space onto a low-dimensional collective variable (CV) space and then partitioning it into bins. The efficacy of WE simulations heavily depends on the selection of CVs and binning schemes. The recently proposed State Predictive Information Bottleneck (SPIB) method has emerged as a promising tool for automatically constructing CVs from data and guiding enhanced sampling through an iterative manner. In this work, we advance this data-driven pipeline by incorporating prior expert knowledge. Our hybrid approach combines SPIB-learned CVs to enhance sampling in explored regions with expert-based CVs to guide exploration in regions of interest, synergizing the strengths of both methods. Through benchmarking on alanine dipeptide and chignoin systems, we demonstrate that our hybrid approach effectively guides WE simulations to sample states of interest, and reduces run-to-run variances. Moreover, our integration of the SPIB model also enhances the analysis and interpretation of WE simulation data by effectively identifying metastable states and pathways, and offering direct visualization of dynamics.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Empowering AlphaFold2 for protein conformation selective drug discovery with AlphaFold2-RAVE
Authors:
Xinyu Gu,
Akashnathan Aranganathan,
Pratyush Tiwary
Abstract:
Small molecule drug design hinges on obtaining co-crystallized ligand-protein structures. Despite AlphaFold2's strides in protein native structure prediction, its focus on apo structures overlooks ligands and associated holo structures. Moreover, designing selective drugs often benefits from the targeting of diverse metastable conformations. Therefore, direct application of AlphaFold2 models in vi…
▽ More
Small molecule drug design hinges on obtaining co-crystallized ligand-protein structures. Despite AlphaFold2's strides in protein native structure prediction, its focus on apo structures overlooks ligands and associated holo structures. Moreover, designing selective drugs often benefits from the targeting of diverse metastable conformations. Therefore, direct application of AlphaFold2 models in virtual screening and drug discovery remains tentative. Here, we demonstrate an AlphaFold2 based framework combined with all-atom enhanced sampling molecular dynamics and induced fit docking, named AF2RAVE-Glide, to conduct computational model based small molecule binding of metastable protein kinase conformations, initiated from protein sequences. We demonstrate the AF2RAVE-Glide workflow on three different protein kinases and their type I and II inhibitors, with special emphasis on binding of known type II kinase inhibitors which target the metastable classical DFG-out state. These states are not easy to sample from AlphaFold2. Here we demonstrate how with AF2RAVE these metastable conformations can be sampled for different kinases with high enough accuracy to enable subsequent docking of known type II kinase inhibitors with more than 50% success rates across docking calculations. We believe the protocol should be deployable for other kinases and more proteins generally.
△ Less
Submitted 4 July, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
An Information Bottleneck Approach for Markov Model Construction
Authors:
Dedi Wang,
Yunrui Qiu,
Eric Beyerle,
Xuhui Huang,
Pratyush Tiwary
Abstract:
Markov state models (MSMs) are valuable for studying dynamics of protein conformational changes via statistical analysis of molecular dynamics (MD) simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with the dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific…
▽ More
Markov state models (MSMs) are valuable for studying dynamics of protein conformational changes via statistical analysis of molecular dynamics (MD) simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with the dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific lag time requires state defined without significant internal energy barriers, enabling internal dynamics relaxation within the lag time. This process coarse grains time and space, integrating out rapid motions within metastable states. This work introduces a continuous embedding approach for molecular conformations using the state predictive information bottleneck (SPIB), which unifies dimensionality reduction and state space partitioning via a continuous, machine learned basis set. Without explicit optimization of VAMP-based scores, SPIB demonstrates state-of-the-art performance in identifying slow dynamical processes and constructing predictive multi-resolution Markovian models. When applied to mini-proteins trajectories, SPIB showcases unique advantages compared to competing methods. It automatically adjusts the number of metastable states based on a specified minimal time resolution, eliminating the need for manual tuning. While maintaining efficacy in dynamical properties, SPIB excels in accurately distinguishing metastable states and capturing numerous well-populated macrostates. Furthermore, SPIB's ability to learn a low-dimensional continuous embedding of the underlying MSMs enhances the interpretation of dynamic pathways. Accordingly, we propose SPIB as an easy-to-implement methodology for end-to-end MSM construction.
△ Less
Submitted 10 June, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
Atomic scale insights into NaCl nucleation in nanoconfined environments
Authors:
Ruiyu Wang,
Pratyush Tiwary
Abstract:
In this work we examine the nucleation from NaCl aqueous solutions within nano-confined environments, employing enhanced sampling molecular dynamics simulations integrated with machine learning-derived reaction coordinates. Through our simulations, we successfully induce phase transitions between solid, liquid, and a hydrated phase, typically observed at lower temperatures in bulk environments. In…
▽ More
In this work we examine the nucleation from NaCl aqueous solutions within nano-confined environments, employing enhanced sampling molecular dynamics simulations integrated with machine learning-derived reaction coordinates. Through our simulations, we successfully induce phase transitions between solid, liquid, and a hydrated phase, typically observed at lower temperatures in bulk environments. Interestingly, nano-confinement serves to stabilize the solid phase and elevate melting points. Our simulations explain these findings by underscoring the significant role of water, alongside ion aggregation and subtle, anistropic dielectric behavior, in driving nucleation within nano-constrained environments. This letter thus provides a framework for sampling, analyzing and understanding nucleation processes under nano-confinement.
△ Less
Submitted 13 July, 2024; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Thermodynamically Optimized Machine-learned Reaction Coordinates for Hydrophobic Ligand Dissociation
Authors:
Eric Beyerle,
Pratyush Tiwary
Abstract:
Ligand unbinding is mediated by the free energy change, which has intertwined contributions from both energy and entropy. It is important but not easy to quantify their individual contributions. We model hydrophobic ligand unbinding for two systems, a methane particle and a C60 fullerene, both unbinding from hydrophobic pockets in all-atom water. By using a modified deep learning framework, we lea…
▽ More
Ligand unbinding is mediated by the free energy change, which has intertwined contributions from both energy and entropy. It is important but not easy to quantify their individual contributions. We model hydrophobic ligand unbinding for two systems, a methane particle and a C60 fullerene, both unbinding from hydrophobic pockets in all-atom water. By using a modified deep learning framework, we learn a thermodynamically optimized reaction coordinate to describe hydrophobic ligand dissociation for both systems. Interpretation of these reaction coordinates reveals the roles of entropic and enthalpic forces as ligand and pocket sizes change. Irrespective of the contrasting roles of energy and entropy, we also find that for both the systems the transition from the bound to unbound states is driven primarily by solvation of the pocket and ligand, independent of ligand size. Our framework thus gives useful thermodynamic insight into hydrophobic ligand dissociation problems that are otherwise difficult to glean.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Is the Local Ion Density Sufficient to Drive NaCl Nucleation from the Melt and Aqueous Solution?
Authors:
Ruiyu Wang,
Shams Mehdi,
Ziyue Zou,
Pratyush Tiwary
Abstract:
Even though nucleation is ubiquitous in different science and engineering problems, investigating nucleation is extremely difficult due to the complicated ranges of time and length scales involved. In this work, we simulate NaCl nucleation in both molten and aqueous environments using enhanced sampling all-atom molecular dynamics with deep learning-based estimation of reaction coordinates. By inco…
▽ More
Even though nucleation is ubiquitous in different science and engineering problems, investigating nucleation is extremely difficult due to the complicated ranges of time and length scales involved. In this work, we simulate NaCl nucleation in both molten and aqueous environments using enhanced sampling all-atom molecular dynamics with deep learning-based estimation of reaction coordinates. By incorporating various structural order parameters and learning the reaction coordinate as a function thereof, we achieve significantly improved sampling relative to traditional ad hoc descriptions of what drives nucleation, particularly in the aqueous medium. Our results reveal a one-step nucleation mechanism in both environments, with reaction coordinate analysis highlighting the importance of local ion density in distinguishing solid and liquid states. However, while fluctuations in the local ion density are necessary to drive nucleation, they are not sufficient. Our analysis shows that near the transition states, descriptors such as enthalpy and local structure become crucial. Our protocol proposed here enables robust nucleation analysis and phase sampling, and could offer insights into nucleation mechanisms for generic small molecules in different environments.
△ Less
Submitted 27 December, 2023; v1 submitted 17 September, 2023;
originally announced September 2023.
-
Exploring kinase DFG loop conformational stability with AlphaFold2-RAVE
Authors:
Bodhi P. Vani,
Akashnathan Aranganathan,
Pratyush Tiwary
Abstract:
Kinases compose one of the largest fractions of the human proteome, and their misfunction is implicated in many diseases, in particular cancers. The ubiquitousness and structural similarities of kinases makes specific and effective drug design difficult. In particular, conformational variability due to the evolutionarily conserved DFG motif adopting in and out conformations and the relative stabil…
▽ More
Kinases compose one of the largest fractions of the human proteome, and their misfunction is implicated in many diseases, in particular cancers. The ubiquitousness and structural similarities of kinases makes specific and effective drug design difficult. In particular, conformational variability due to the evolutionarily conserved DFG motif adopting in and out conformations and the relative stabilities thereof are key in structure-based drug design for ATP competitive drugs. These relative conformational stabilities are extremely sensitive to small changes in sequence, and provide an important problem for sampling method development. Since the invention of AlphaFold2, the world of structure-based drug design has noticably changed. In spite of it being limited to crystal-like structure prediction, several methods have also leveraged its underlying architecture to improve dynamics and enhanced sampling of conformational ensembles, including AlphaFold2-RAVE. Here, we extend AlphaFold2-RAVE and apply it to a set of kinases: the wild type DDR1 sequence and three mutants with single point mutations that are known to behave drastically differently. We show that AlphaFold2-RAVE is able to efficiently recover the changes in relative stability using transferable learnt order parameters and potentials, thereby supplementing AlphaFold2 as a tool for exploration of Boltzmann-weighted protein conformations.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
Inferring phase transitions and critical exponents from limited observations with Thermodynamic Maps
Authors:
Lukas Herron,
Kinjal Mondal,
John S. Schneekloth,
Pratyush Tiwary
Abstract:
Phase transitions are ubiquitous across life, yet hard to quantify and describe accurately. In this work, we develop an approach for characterizing generic attributes of phase transitions from very limited observations made deep within different phases' domains of stability. Our approach is called Thermodynamic Maps, which combines statistical mechanics and molecular simulations with score-based g…
▽ More
Phase transitions are ubiquitous across life, yet hard to quantify and describe accurately. In this work, we develop an approach for characterizing generic attributes of phase transitions from very limited observations made deep within different phases' domains of stability. Our approach is called Thermodynamic Maps, which combines statistical mechanics and molecular simulations with score-based generative models. Thermodynamic Maps enable learning the temperature dependence of arbitrary thermodynamic observables across a wide range of temperatures. We show its usefulness by calculating phase transition attributes such as melting temperature, temperature-dependent heat capacities, and critical exponents. For instance, we demonstrate the ability of thermodynamic maps to infer the ferromagnetic phase transition of the Ising model, including temperature-dependent heat capacity and critical exponents, despite never having seen samples from the transition region. In addition, we efficiently characterize the temperature-dependent conformational ensemble and compute melting curves of the two RNA systems GCAA tetraloop and HIV-TAR, which are notoriously hard to sample due to glassy-like landscapes.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Enhanced Sampling with Machine Learning: A Review
Authors:
Shams Mehdi,
Zachary Smith,
Lukas Herron,
Ziyue Zou,
Pratyush Tiwary
Abstract:
Molecular dynamics (MD) enables the study of physical systems with excellent spatiotemporal resolution but suffers from severe time-scale limitations. To address this, enhanced sampling methods have been developed to improve exploration of configurational space. However, implementing these is challenging and requires domain expertise. In recent years, integration of machine learning (ML) technique…
▽ More
Molecular dynamics (MD) enables the study of physical systems with excellent spatiotemporal resolution but suffers from severe time-scale limitations. To address this, enhanced sampling methods have been developed to improve exploration of configurational space. However, implementing these is challenging and requires domain expertise. In recent years, integration of machine learning (ML) techniques in different domains has shown promise, prompting their adoption in enhanced sampling as well. Although ML is often employed in various fields primarily due to its data-driven nature, its integration with enhanced sampling is more natural with many common underlying synergies. This review explores the merging of ML and enhanced MD by presenting different shared viewpoints. It offers a comprehensive overview of this rapidly evolving field, which can be difficult to stay updated on. We highlight successful strategies like dimensionality reduction, reinforcement learning, and flow-based methods. Finally, we discuss open problems at the exciting ML-enhanced MD interface.
△ Less
Submitted 16 June, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
From latent dynamics to meaningful representations
Authors:
Dedi Wang,
Yihang Wang,
Luke Evans,
Pratyush Tiwary
Abstract:
While representation learning has been central to the rise of machine learning and artificial intelligence, a key problem remains in making the learned representations meaningful. For this, the typical approach is to regularize the learned representation through prior probability distributions. However, such priors are usually unavailable or are ad hoc. To deal with this, recent efforts have shift…
▽ More
While representation learning has been central to the rise of machine learning and artificial intelligence, a key problem remains in making the learned representations meaningful. For this, the typical approach is to regularize the learned representation through prior probability distributions. However, such priors are usually unavailable or are ad hoc. To deal with this, recent efforts have shifted towards leveraging the insights from physical principles to guide the learning process. In this spirit, we propose a purely dynamics-constrained representation learning framework. Instead of relying on predefined probabilities, we restrict the latent representation to follow overdamped Langevin dynamics with a learnable transition density - a prior driven by statistical mechanics. We show this is a more natural constraint for representation learning in stochastic dynamical systems, with the crucial ability to uniquely identify the ground truth representation. We validate our framework for different systems including a real-world fluorescent DNA movie dataset. We show that our algorithm can uniquely identify orthogonal, isometric and meaningful latent representations.
△ Less
Submitted 9 April, 2024; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Computing committors via Mahalanobis diffusion maps with enhanced sampling data
Authors:
Luke Evans,
Maria K. Cameron,
Pratyush Tiwary
Abstract:
The study of phenomena such as protein folding and conformational changes in molecules is a central theme in chemical physics. Molecular dynamics (MD) simulation is the primary tool for the study of transition processes in biomolecules, but it is hampered by a huge timescale gap between the processes of interest and atomic vibrations which dictate the time step size. Therefore, it is imperative to…
▽ More
The study of phenomena such as protein folding and conformational changes in molecules is a central theme in chemical physics. Molecular dynamics (MD) simulation is the primary tool for the study of transition processes in biomolecules, but it is hampered by a huge timescale gap between the processes of interest and atomic vibrations which dictate the time step size. Therefore, it is imperative to combine MD simulations with other techniques in order to quantify the transition processes taking place on large timescales. In this work, the diffusion map with Mahalanobis kernel, a meshless approach for approximating the Backward Kolmogorov Operator (BKO) in collective variables, is upgraded to incorporate standard enhanced sampling techniques such as metadynamics. The resulting algorithm, which we call the "target measure Mahalanobis diffusion map" (tm-mmap), is suitable for a moderate number of collective variables in which one can approximate the diffusion tensor and free energy. Imposing appropriate boundary conditions allows use of the approximated BKO to solve for the committor function and utilization of transition path theory to find the reactive current delineating the transition channels and the transition rate. The proposed algorithm, tm-mmap, is tested on the two-dimensional Moro-Cardin two-well system with position-dependent diffusion coefficient and on alanine dipeptide in two collective variables where the committor, the reactive current, and the transition rate are compared to those computed by the finite element method (FEM). Finally, tm-mmap is applied to alanine dipeptide in four collective variables where the use of finite elements is infeasible.
△ Less
Submitted 2 November, 2022; v1 submitted 26 August, 2022;
originally announced August 2022.
-
Thermodynamics-inspired Explanations of Artificial Intelligence
Authors:
Shams Mehdi,
Pratyush Tiwary
Abstract:
In recent years, predictive machine learning methods have gained prominence in various scientific domains. However, due to their black-box nature, it is essential to establish trust in these models before accepting them as accurate. One promising strategy for assigning trust involves employing explanation techniques that elucidate the rationale behind a black-box model's predictions in a manner th…
▽ More
In recent years, predictive machine learning methods have gained prominence in various scientific domains. However, due to their black-box nature, it is essential to establish trust in these models before accepting them as accurate. One promising strategy for assigning trust involves employing explanation techniques that elucidate the rationale behind a black-box model's predictions in a manner that humans can understand. However, assessing the degree of human interpretability of the rationale generated by such methods is a nontrivial challenge. In this work, we introduce interpretation entropy as a universal solution for assessing the degree of human interpretability associated with any linear model. Using this concept and drawing inspiration from classical thermodynamics, we present Thermodynamics-inspired Explainable Representations of AI and other black-box Paradigms (TERP), a method for generating accurate, and human-interpretable explanations for black-box predictions in a model-agnostic manner. To demonstrate the wide-ranging applicability of TERP, we successfully employ it to explain various black-box model architectures, including deep learning Autoencoders, Recurrent Neural Networks, and Convolutional Neural Networks, across diverse domains such as molecular simulations, text, and image classification.
△ Less
Submitted 8 April, 2024; v1 submitted 27 June, 2022;
originally announced June 2022.
-
Quantifying Energetic and Entropic Pathways in Molecular Systems
Authors:
E. R. Beyerle,
Shams Mehdi,
Pratyush Tiwary
Abstract:
When examining dynamics occurring at non-zero temperatures, both energy and entropy must be taken into account while describing activated barrier crossing events. Furthermore, good reaction coordinates need to be constructed to describe different metastable states and the transition mechanisms between them. Here we use a physics-based machine learning method called the State Predictive Information…
▽ More
When examining dynamics occurring at non-zero temperatures, both energy and entropy must be taken into account while describing activated barrier crossing events. Furthermore, good reaction coordinates need to be constructed to describe different metastable states and the transition mechanisms between them. Here we use a physics-based machine learning method called the State Predictive Information Bottleneck (SPIB) to find non-linear reaction coordinates for three systems of varying complexity. The SPIB is able to predict correctly an entropic bottleneck for an analytical flat-energy double-well system and identify the entropy- and energy-dominated pathways for an analytical four-well system. Finally, for a simulation of benzoic acid permeation through a lipid bilayer, SPIB is able to discover the entropic and energetic barriers to the permeation process. Given these results, we thus establish that SPIB is a reasonable and robust method for finding the important entropy and energy/enthalpy barriers in physical systems, which can then be used for enhanced understanding and sampling of different activated mechanisms.
△ Less
Submitted 14 March, 2022;
originally announced March 2022.
-
Path sampling of recurrent neural networks by incorporating known physics
Authors:
Sun-Ting Tsai,
Eric Fields,
Yijia Xu,
En-Jui Kuo,
Pratyush Tiwary
Abstract:
Recurrent neural networks have seen widespread use in modeling dynamical systems in varied domains such as weather prediction, text prediction and several others. Often one wishes to supplement the experimentally observed dynamics with prior knowledge or intuition about the system. While the recurrent nature of these networks allows them to model arbitrarily long memories in the time series used i…
▽ More
Recurrent neural networks have seen widespread use in modeling dynamical systems in varied domains such as weather prediction, text prediction and several others. Often one wishes to supplement the experimentally observed dynamics with prior knowledge or intuition about the system. While the recurrent nature of these networks allows them to model arbitrarily long memories in the time series used in training, it makes it harder to impose prior knowledge or intuition through generic constraints. In this work, we present a path sampling approach based on principle of Maximum Caliber that allows us to include generic thermodynamic or kinetic constraints into recurrent neural networks. We show the method here for a widely used type of recurrent neural network known as long short-term memory network in the context of supplementing time series collected from different application domains. These include classical Molecular Dynamics of a protein and Monte Carlo simulations of an open quantum system continuously losing photons to the environment and displaying Rabi oscillations. Our method can be easily generalized to other generative artificial intelligence models and to generic time series in different areas of physical and social sciences, where one wishes to supplement limited data with intuition or theory based corrections.
△ Less
Submitted 20 April, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Accelerating all-atom simulations and gaining mechanistic understanding of biophysical systems through State Predictive Information Bottleneck
Authors:
Shams Mehdi,
Dedi Wang,
Shashank Pant,
Pratyush Tiwary
Abstract:
An effective implementation of enhanced sampling algorithms for molecular dynamics simulations requires a priori knowledge of the approximate reaction coordinate describing the relevant mechanisms in the system. Here we demonstrate how the artificial intelligence based recent State Predictive Information Bottleneck (SPIB) approach can learn such a reaction coordinate as a deep neural network even…
▽ More
An effective implementation of enhanced sampling algorithms for molecular dynamics simulations requires a priori knowledge of the approximate reaction coordinate describing the relevant mechanisms in the system. Here we demonstrate how the artificial intelligence based recent State Predictive Information Bottleneck (SPIB) approach can learn such a reaction coordinate as a deep neural network even from under-sampled trajectories. We demonstrate its usefulness by achieving more than 40 magnitudes of acceleration in simulating two test-piece biophysical systems through well-tempered metadynamics performed by biasing along the SPIB learned reaction coordinate. These include left- to right- handed chirality transitions in a synthetic protein (Aib)_9, and permeation of a small, asymmetric molecule benzoic acid through a synthetic, symmetric phospholipid bilayer. In addition to significantly accelerating the dynamics and achieving back-and-forth movement between different metastable states, the SPIB based reaction coordinate gives mechanistic insight into the processes driving these two important problems.
△ Less
Submitted 21 December, 2021;
originally announced December 2021.
-
Influence of long range forces on the transition states and dynamics of NaCl ion-pair dissociation in water
Authors:
Dedi Wang,
Renjie Zhao,
John D. Weeks,
Pratyush Tiwary
Abstract:
We study NaCl ion-pair dissociation in a dilute aqueous solution using computer simulations both for the full system with long range Coulomb interactions and for a well chosen reference system with short range intermolecular interactions. Analyzing results using concepts from Local Molecular Field (LMF) theory and the recently proposed AI-based analysis tool "State predictive information bottlenec…
▽ More
We study NaCl ion-pair dissociation in a dilute aqueous solution using computer simulations both for the full system with long range Coulomb interactions and for a well chosen reference system with short range intermolecular interactions. Analyzing results using concepts from Local Molecular Field (LMF) theory and the recently proposed AI-based analysis tool "State predictive information bottleneck" (SPIB) we show that the system with short range interactions can accurately reproduce the transition rate for the dissociation process, the dynamics for moving between the underlying metastable states, and the transition state ensemble. Contributions from long range interactions can be largely neglected for these processes because long range forces from the direct interionic Coulomb interactions are almost completely canceled ($>90\%$) by those from solvent interactions over the length scale where the transition takes place. Thus for this important monovalent ion-pair system, short range forces alone are able to capture detailed consequences of the collective solvent motion, allowing the use of physically suggestive and computationally efficient short range models for the disassociation event. We believe that the framework here should be applicable to disentangling mechanisms for more complex processes such as multivalent ion disassociation, where previous work has suggested that long range contributions may be more important.
△ Less
Submitted 11 October, 2021;
originally announced October 2021.
-
Towards automated sampling of polymorph nucleation and free energies with SGOOP and metadynamics
Authors:
Ziyue Zou,
Sun-Ting Tsai,
Pratyush Tiwary
Abstract:
Understanding the driving forces behind the nucleation of different polymorphs is of great importance for material sciences and the pharmaceutical industry. This includes understanding the reaction coordinate that governs the nucleation process as well as correctly calculating the relative free energies of different polymorphs. Here we demonstrate, for the prototypical case of urea nucleation from…
▽ More
Understanding the driving forces behind the nucleation of different polymorphs is of great importance for material sciences and the pharmaceutical industry. This includes understanding the reaction coordinate that governs the nucleation process as well as correctly calculating the relative free energies of different polymorphs. Here we demonstrate, for the prototypical case of urea nucleation from melt, how one can learn such a 1-dimensional reaction coordinate as a function of pre-specified order parameters, and use it to perform efficient biased all-atom molecular dynamics simulations. The reaction coordinate is learnt as a function of generic thermodynamic and structural order parameters using the "Spectral Gap Optimization of Order Parameters (SGOOP)" approach [P. Tiwary and B. J. Berne, Proc. Natl. Acad. Sci. (2016)], and is biased using well-tempered metadynamics simulations. The reaction coordinate gives insight into the role played by different structural and thermodynamics order parameters, and the biased simulations obtain accurate relative free energies for different polymorphs. This includes accurate prediction of the approximate pressure at which urea undergoes a phase transition and one of the metastable polymorphs becomes the most stable conformation. We believe the ideas demonstrated in thus work will facilitate efficient sampling of nucleation in complex, generic systems.
△ Less
Submitted 23 August, 2021;
originally announced August 2021.
-
Computing committors in collective variables via Mahalanobis diffusion maps
Authors:
Luke Evans,
Maria K. Cameron,
Pratyush Tiwary
Abstract:
The study of rare events in molecular and atomic systems such as conformal changes and cluster rearrangements has been one of the most important research themes in chemical physics. Key challenges are associated with long waiting times rendering molecular simulations inefficient, high dimensionality impeding the use of PDE-based approaches, and the complexity or breadth of transition processes lim…
▽ More
The study of rare events in molecular and atomic systems such as conformal changes and cluster rearrangements has been one of the most important research themes in chemical physics. Key challenges are associated with long waiting times rendering molecular simulations inefficient, high dimensionality impeding the use of PDE-based approaches, and the complexity or breadth of transition processes limiting the predictive power of asymptotic methods. Diffusion maps are promising algorithms to avoid or mitigate all these issues. We adapt the diffusion map with Mahalanobis kernel proposed by Singer and Coifman (2008) for the SDE describing molecular dynamics in collective variables in which the diffusion matrix is position-dependent and, unlike the case considered by Singer and Coifman, is not associated with a diffeomorphism. We offer an elementary proof showing that one can approximate the generator for this SDE discretized to a point cloud via the Mahalanobis diffusion map. We use it to calculate the committor functions in collective variables for two benchmark systems: alanine dipeptide, and Lennard-Jones-7 in 2D. For validating our committor results, we compare our committor functions to the finite-difference solution or by conducting a "committor analysis" as used by molecular dynamics practitioners. We contrast the outputs of the Mahalanobis diffusion map with those of the standard diffusion map with isotropic kernel and show that the former gives significantly more accurate estimates for the committors than the latter.
△ Less
Submitted 2 October, 2022; v1 submitted 19 August, 2021;
originally announced August 2021.
-
From data to noise to data: mixing physics across temperatures with generative artificial intelligence
Authors:
Yihang Wang,
Lukas Herron,
Pratyush Tiwary
Abstract:
Using simulations or experiments performed at some set of temperatures to learn about the physics or chemistry at some other arbitrary temperature is a problem of immense practical and theoretical relevance. Here we develop a framework based on statistical mechanics and generative Artificial Intelligence that allows solving this problem. Specifically, we work with denoising diffusion probabilistic…
▽ More
Using simulations or experiments performed at some set of temperatures to learn about the physics or chemistry at some other arbitrary temperature is a problem of immense practical and theoretical relevance. Here we develop a framework based on statistical mechanics and generative Artificial Intelligence that allows solving this problem. Specifically, we work with denoising diffusion probabilistic models, and show how these models in combination with replica exchange molecular dynamics achieve superior sampling of the biomolecular energy landscape at temperatures that were never even simulated without assuming any particular slow degrees of freedom. The key idea is to treat the temperature as a fluctuating random variable and not a control parameter as is usually done. This allows us to directly sample from the joint probability distribution in configuration and temperature space. The results here are demonstrated for a chirally symmetric peptide and single-strand ribonucleic acid undergoing conformational transitions in all-atom water. We demonstrate how we can discover transition states and metastable states that were previously unseen at the temperature of interest, and even bypass the need to perform further simulations for wide range of temperatures. At the same time, any unphysical states are easily identifiable through very low Boltzmann weights. The procedure while shown here for a class of molecular simulations should be more generally applicable to mixing information across simulations and experiments with varying control parameters.
△ Less
Submitted 2 March, 2022; v1 submitted 15 July, 2021;
originally announced July 2021.
-
SGOOP-d: Estimating kinetic distances and reaction coordinate dimensionality for rare event systems from biased/unbiased simulations
Authors:
Sun-Ting Tsai,
Zachary Smith,
Pratyush Tiwary
Abstract:
Understanding kinetics including reaction pathways and associated transition rates is an important yet difficult problem in numerous chemical and biological systems especially in situations with multiple competing pathways. When these high-dimensional systems are projected on low-dimensional coordinates, which are often needed for enhanced sampling or for interpretation of simulations and experime…
▽ More
Understanding kinetics including reaction pathways and associated transition rates is an important yet difficult problem in numerous chemical and biological systems especially in situations with multiple competing pathways. When these high-dimensional systems are projected on low-dimensional coordinates, which are often needed for enhanced sampling or for interpretation of simulations and experiments, one can end up losing the kinetic connectivity of the underlying high-dimensional landscape. Thus in the low-dimensional projection metastable states might appear closer or further than they actually are. To deal with this issue, in this work we develop a formalism that learns a multi-dimensional yet minimally complex reaction coordinate (RC) for generic high-dimensional systems. When projected along this RC, all possible kinetically relevant pathways can be demarcated and the true high-dimensional connectivity is maintained. One of the defining attributes of our method lies in that it can work on long unbiased simulations as well as biased simulations often needed for rare event systems. We demonstrate the utility of the method by studying a range of model systems including conformational transitions in a small peptide Ace-Ala$_3$-Nme, where we show how two-dimensional and three-dimensional reaction coordinate found by our previously published spectral gap optimization method "SGOOP" [P. Tiwary and B. J. Berne, Proc. Natl. Acad. Sci. 113, 2839 (2016)] can capture the kinetics for 23 and all 28 out of the 28 dominant state-to-state transitions respectively.
△ Less
Submitted 21 September, 2021; v1 submitted 27 April, 2021;
originally announced April 2021.
-
State Predictive Information Bottleneck
Authors:
Dedi Wang,
Pratyush Tiwary
Abstract:
The ability to make sense of the massive amounts of high-dimensional data generated from molecular dynamics (MD) simulations is heavily dependent on the knowledge of a low dimensional manifold (parameterized by a reaction coordinate or RC) that typically distinguishes between relevant metastable states and which captures the relevant slow dynamics of interest. Methods based on machine learning and…
▽ More
The ability to make sense of the massive amounts of high-dimensional data generated from molecular dynamics (MD) simulations is heavily dependent on the knowledge of a low dimensional manifold (parameterized by a reaction coordinate or RC) that typically distinguishes between relevant metastable states and which captures the relevant slow dynamics of interest. Methods based on machine learning and artificial intelligence have been proposed over the years to deal with learning such low-dimensional manifolds, but they are often criticized for a disconnect from more traditional and physically interpretable approaches. To deal with such concerns, in this work, we propose a deep learning based State Predictive Information Bottleneck (SPIB) approach to learn the RC from high dimensional molecular simulation trajectories. We demonstrate analytically and numerically how the RC learnt in this approach is deeply connected to the committor in chemical physics, and can be used to accurately identify transition states. A crucial hyperparameter in this approach is the time-delay, or how far into the future the algorithm should make predictions about. Through careful comparisons for benchmark systems, we demonstrate that this hyperparameter choice gives useful control over how coarse-grained we want the metastable state classification of the system to be. We thus believe that this work represents a step forward in systematic application of deep learning based ideas to molecular simulations in a way that bridges the gap between artificial intelligence and traditional chemical physics.
△ Less
Submitted 12 February, 2021; v1 submitted 19 November, 2020;
originally announced November 2020.
-
Learning Molecular Dynamics with Simple Language Model built upon Long Short-Term Memory Neural Network
Authors:
Sun-Ting Tsai,
En-Jui Kuo,
Pratyush Tiwary
Abstract:
Recurrent neural networks (RNNs) have led to breakthroughs in natural language processing and speech recognition, wherein hundreds of millions of people use such tools on a daily basis through smartphones, email servers and other avenues. In this work, we show such RNNs, specifically Long Short-Term Memory (LSTM) neural networks can also be applied to capturing the temporal evolution of typical tr…
▽ More
Recurrent neural networks (RNNs) have led to breakthroughs in natural language processing and speech recognition, wherein hundreds of millions of people use such tools on a daily basis through smartphones, email servers and other avenues. In this work, we show such RNNs, specifically Long Short-Term Memory (LSTM) neural networks can also be applied to capturing the temporal evolution of typical trajectories arising in chemical and biological physics. Specifically, we use a character-level language model based on LSTM. This learns a probabilistic model from 1-dimensional stochastic trajectories generated from molecular dynamics simulations of a higher dimensional system. We show that the model can not only capture the Boltzmann statistics of the system but it also reproduce kinetics at a large spectrum of timescales. We demonstrate how the embedding layer, introduced originally for representing the contextual meaning of words or characters, exhibits here a nontrivial connectivity between different metastable states in the underlying physical system. We demonstrate the reliability of our model and interpretations through different benchmark systems and a single molecule force spectroscopy trajectory for multi-state riboswitch. We anticipate that our work represents a stepping stone in the understanding and use of RNNs for modeling and predicting dynamics of complex stochastic molecular systems.
△ Less
Submitted 4 August, 2020; v1 submitted 26 April, 2020;
originally announced April 2020.
-
Understanding the role of predictive time delay and biased propagator in RAVE
Authors:
Yihang Wang,
Pratyush Tiwary
Abstract:
In this work, we revisit our recent iterative machine learning (ML) -- molecular dynamics (MD) technique "Reweighted autoencoded variational Bayes for enhanced sampling (RAVE)" (Ribeiro, Bravo, Wang, Tiwary, J. Chem. Phys. 149 072301 (2018) and Wang, Ribeiro, Tiwary, Nature Commun. 10 3573 (2019)) and analyze as well as formalize some of its approximations. These including: (a) the choice of a pre…
▽ More
In this work, we revisit our recent iterative machine learning (ML) -- molecular dynamics (MD) technique "Reweighted autoencoded variational Bayes for enhanced sampling (RAVE)" (Ribeiro, Bravo, Wang, Tiwary, J. Chem. Phys. 149 072301 (2018) and Wang, Ribeiro, Tiwary, Nature Commun. 10 3573 (2019)) and analyze as well as formalize some of its approximations. These including: (a) the choice of a predictive time-delay, or how far into the future should the ML try to predict the state of a given system output from MD, and (b) for short time-delays, how much of an error is made in approximating the biased propagator for the dynamics as the unbiased propagator. We demonstrate through a master equation framework as to why the exact choice of time-delay is irrelevant as long as a small non-zero value is adopted. We also derive a correction to reweight the biased propagator, and somewhat to our dissatisfaction but also to our reassurance, find that it barely makes a difference to the intuitive picture we had previously derived and used.
△ Less
Submitted 14 February, 2020;
originally announced February 2020.
-
Machine learning approaches for analyzing and enhancing molecular dynamics simulations
Authors:
Yihang Wang,
Joao Marcelo Lamim Ribeiro,
Pratyush Tiwary
Abstract:
Molecular dynamics (MD) has become a powerful tool for studying biophysical systems, due to increasing computational power and availability of software. Although MD has made many contributions to better understanding these complex biophysical systems, there remain methodological difficulties to be surmounted. First, how to make the deluge of data generated in running even a microsecond long MD sim…
▽ More
Molecular dynamics (MD) has become a powerful tool for studying biophysical systems, due to increasing computational power and availability of software. Although MD has made many contributions to better understanding these complex biophysical systems, there remain methodological difficulties to be surmounted. First, how to make the deluge of data generated in running even a microsecond long MD simulation human comprehensible. Second, how to efficiently sample the underlying free energy surface and kinetics. In this short perspective, we summarize machine learning based ideas that are solving both of these limitations, with a focus on their key theoretical underpinnings and remaining challenges.
△ Less
Submitted 25 September, 2019;
originally announced September 2019.
-
Reaction coordinates and rate constants for liquid droplet nucleation: quantifying the interplay between driving force and memory
Authors:
Sun-Ting Tsai,
Zachary Smith,
Pratyush Tiwary
Abstract:
In this work we revisit the classic problem of homogeneous nucleation of a liquid droplet in a supersaturated vapor phase. We consider this at different extents of the driving force, which here is the extent of supersaturation, and calculate a reaction coordinate (RC) for nucleation as the driving force is varied. The RC is constructed as a linear combination of three order parameters, where one a…
▽ More
In this work we revisit the classic problem of homogeneous nucleation of a liquid droplet in a supersaturated vapor phase. We consider this at different extents of the driving force, which here is the extent of supersaturation, and calculate a reaction coordinate (RC) for nucleation as the driving force is varied. The RC is constructed as a linear combination of three order parameters, where one accounts for the number of liquid-like atoms, and the other two for local density fluctuations. The RC is calculated from all-atom biased and unbiased molecular dynamics (MD) simulations using the spectral gap optimization approach "SGOOP" [P. Tiwary and B. J. Berne, Proc. Natl. Acad. Sci. U. S. A. 113, 2839 (2016)]. Our key finding is that as the supersaturation decreases, the RC ceases to simply be the number of liquid-like atoms, and instead it becomes important to explicitly consider local density fluctuations that correlate with shape and density variations in the nucleus. All three order parameters are found to have similar barriers in their respective potentials of mean force, however, as the supersaturation decreases the density fluctuations decorrelate slower and thus carry longer memory. Thus at lower supersaturations density fluctuations are non-Markovian and can not be simply ignored from the RC by virtue of being noise. Finally, we use this optimized RC to calculate nucleation rates in the infrequent metadynamics framework, and show it leads to more accurate estimate of the nucleation rate with four orders of magnitude acceleration relative to unbiased MD.
△ Less
Submitted 13 August, 2019;
originally announced August 2019.
-
Ligand dissociation mechanisms from all-atom simulations: Are we there yet?
Authors:
Joao Marcelo Lamim Ribeiro,
Sun-Ting Tsai,
Debabrata Pramanik,
Yihang Wang,
Pratyush Tiwary
Abstract:
Large parallel gains in the development of both computational resources as well as sampling methods have now made it possible to simulate dissociation events in ligand-protein complexes with all--atom resolution. Such encouraging progress, together with the inherent spatiotemporal resolution associated with molecular simulations, has left their use for investigating dissociation processes brimming…
▽ More
Large parallel gains in the development of both computational resources as well as sampling methods have now made it possible to simulate dissociation events in ligand-protein complexes with all--atom resolution. Such encouraging progress, together with the inherent spatiotemporal resolution associated with molecular simulations, has left their use for investigating dissociation processes brimming with potential, both in rational drug design, where it can be an invaluable tool for determining the mechanistic driving forces behind dissociation rate constants, as well as in force-field development, where it can provide a catalog of transient molecular structures on which to refine force-fields. Although much progress has been made in making force-fields more accurate, reducing their error for transient structures along a transition path could yet prove to be a critical development helping to make kinetic predictions much more accurate. In what follows we will provide a state-of-the-art compilation of the molecular dynamics (MD) methods used to investigate the kinetics and mechanisms of ligand-protein dissociation processes. Due to the timescales of such processes being slower than what is accessible using straightforward MD simulations, several ingenious schemes are being devised at a rapid rate to overcome this obstacle. Here we provide an up-to-date compendium of such methods and their achievements/shortcomings in extracting mechanistic insight into ligand-protein dissociation. We conclude with a critical and provocative appraisal attempting to answer the title of this review.
△ Less
Submitted 12 September, 2018;
originally announced September 2018.
-
Frequency adaptive metadynamics for the calculation of rare-event kinetics
Authors:
Yong Wang,
Omar Valsson,
Pratyush Tiwary,
Michele Parrinello,
Kresten Lindorff-Larsen
Abstract:
The ability to predict accurate thermodynamic and kinetic properties in biomolecular systems is of both scientific and practical utility. While both remain very difficult, predictions of kinetics are particularly difficult because rates, in contrast to free energies, depend on the route taken and are thus not amenable to all enhanced sampling methods. It has recently been demonstrated that it is p…
▽ More
The ability to predict accurate thermodynamic and kinetic properties in biomolecular systems is of both scientific and practical utility. While both remain very difficult, predictions of kinetics are particularly difficult because rates, in contrast to free energies, depend on the route taken and are thus not amenable to all enhanced sampling methods. It has recently been demonstrated that it is possible to recover kinetics through so called `infrequent metadynamics' simulations, where the simulations are biased in a way that minimally corrupts the dynamics of moving between metastable states. This method, however, requires the bias to be added slowly, thus hampering applications to processes with only modest separations of timescales. Here we present a frequency-adaptive strategy which bridges normal and infrequent metadynamics. We show that this strategy can improve the precision and accuracy of rate calculations at fixed computational cost, and should be able to extend rate calculations for much slower kinetic processes.
△ Less
Submitted 12 February, 2018;
originally announced February 2018.
-
Reweighted Autoencoded Variational Bayes for Enhanced Sampling (RAVE)
Authors:
Joao Marcelo Lamim Ribeiro,
Pablo Bravo Collado,
Yihang Wang,
Pratyush Tiwary
Abstract:
Here we propose the Reweighted Autoencoded Variational Bayes for Enhanced Sampling (RAVE) method, a new iterative scheme that uses the deep learning framework of variational autoencoders to enhance sampling in molecular simulations. RAVE involves iterations between molecular simulations and deep learning in order to produce an increasingly accurate probability distribution along a low-dimensional…
▽ More
Here we propose the Reweighted Autoencoded Variational Bayes for Enhanced Sampling (RAVE) method, a new iterative scheme that uses the deep learning framework of variational autoencoders to enhance sampling in molecular simulations. RAVE involves iterations between molecular simulations and deep learning in order to produce an increasingly accurate probability distribution along a low-dimensional latent space that captures the key features of the molecular simulation trajectory. Using the Kullback-Leibler divergence between this latent space distribution and the distribution of various trial reaction coordinates sampled from the molecular simulation, RAVE determines an optimum, yet nonetheless physically interpretable, reaction coordinate and optimum probability distribution. Both then directly serve as the biasing protocol for a new biased simulation, which is once again fed into the deep learning module with appropriate weights accounting for the bias, the procedure continuing until estimates of desirable thermodynamic observables are converged. Unlike recent methods using deep learning for enhanced sampling purposes, RAVE stands out in that (a) it naturally produces a physically interpretable reaction coordinate, (b) is independent of existing enhanced sampling protocols to enhance the fluctuations along the latent space identified via deep learning, and (c) it provides the ability to easily filter out spurious solutions learned by the deep learning procedure. The usefulness and reliability of RAVE is demonstrated by applying it to model potentials of increasing complexity, including computation of the binding free energy profile for a hydrophobic ligand-substrate system in explicit water with dissociation time of more than three minutes, in computer time at least twenty times less than that needed for umbrella sampling or metadynamics.
△ Less
Submitted 9 February, 2018;
originally announced February 2018.
-
Predicting reaction coordinates in energy landscapes with diffusion anisotropy
Authors:
Pratyush Tiwary,
B. J. Berne
Abstract:
We consider a range of model potentials with metastable states undergoing molecular dynamics coupled to a thermal bath in the high friction regime, and consider how the optimal reaction coordinate depends on the diffusion anisotropy. For this we use our recently proposed method 'Spectral gap optimization of order parameters (SGOOP)' (Tiwary and Berne, Proc. Natl. Acad. Sci. 113 2839 2016). We show…
▽ More
We consider a range of model potentials with metastable states undergoing molecular dynamics coupled to a thermal bath in the high friction regime, and consider how the optimal reaction coordinate depends on the diffusion anisotropy. For this we use our recently proposed method 'Spectral gap optimization of order parameters (SGOOP)' (Tiwary and Berne, Proc. Natl. Acad. Sci. 113 2839 2016). We show how available information about dynamical observables in addition to static information can be incorporated into SGOOP, which can then be used to accurately determine the 'best' reaction coordinate for arbitrary anisotropies. We compare our results with transmission coefficient calculations and published benchmarks where applicable or available respectively.
△ Less
Submitted 1 May, 2017; v1 submitted 12 April, 2017;
originally announced April 2017.
-
How wet should be the reaction coordinate for ligand unbinding?
Authors:
Pratyush Tiwary,
B. J. Berne
Abstract:
We use a recently proposed method called Spectral Gap Optimization of Order Parameters (SGOOP) (Tiwary and Berne, Proc. Natl. Acad. Sci 2016, 113, 2839 (2016)), to determine an optimal 1-dimensional reaction coordinate (RC) for the unbinding of a bucky-ball from a pocket in explicit water. This RC is estimated as a linear combination of the multiple available order parameters that collectively can…
▽ More
We use a recently proposed method called Spectral Gap Optimization of Order Parameters (SGOOP) (Tiwary and Berne, Proc. Natl. Acad. Sci 2016, 113, 2839 (2016)), to determine an optimal 1-dimensional reaction coordinate (RC) for the unbinding of a bucky-ball from a pocket in explicit water. This RC is estimated as a linear combination of the multiple available order parameters that collectively can be used to distinguish the various stable states relevant for unbinding. We pay special attention to determining and quantifying the degree to which water molecules should be included in the RC. Using SGOOP with under-sampled biased simulations, we predict that water plays a distinct role in the reaction coordinate for unbinding in the case when the ligand is sterically constrained to move along an axis of symmetry. This prediction is validated through extensive calculations of the unbinding times through metadynamics, and by comparison through detailed balance with unbiased molecular dynamics estimate of the binding time. However when the steric constraint is removed, we find that the role of water in the reaction coordinate diminishes. Here instead SGOOP identifies a good one-dimensional RC involving various motional degrees of freedom.
△ Less
Submitted 23 May, 2016;
originally announced May 2016.
-
Kramers turnover: from energy diffusion to spatial diffusion using metadynamics
Authors:
Pratyush Tiwary,
B. J. Berne
Abstract:
We consider the rate of transition for a particle between two metastable states coupled to a thermal environment for various magnitudes of the coupling strength, using the recently proposed infrequent metadynamics approach (Tiwary and Parrinello, Phys. Rev. Lett. 111, 230602 (2013)). We are interested in understanding how this approach for obtaining rate constants performs as the dynamics regime c…
▽ More
We consider the rate of transition for a particle between two metastable states coupled to a thermal environment for various magnitudes of the coupling strength, using the recently proposed infrequent metadynamics approach (Tiwary and Parrinello, Phys. Rev. Lett. 111, 230602 (2013)). We are interested in understanding how this approach for obtaining rate constants performs as the dynamics regime changes from energy diffusion to spatial diffusion. Reassuringly, we find that the approach works remarkably well for various coupling strengths in the strong coupling regime, and to some extent even in the weak coupling regime.
△ Less
Submitted 21 February, 2016;
originally announced February 2016.
-
A perturbative solution to metadynamics ordinary differential equation
Authors:
Pratyush Tiwary,
James F. Dama,
Michele Parrinello
Abstract:
Metadynamics is a popular enhanced sampling scheme wherein by periodic application of a repulsive bias, one can surmount high free energy barriers and explore complex landscapes. Recently metadynamics was shown to be mathematically well founded, in the sense that the biasing procedure is guaranteed to converge to the true free energy surface in the long time limit irrespective of the precise choic…
▽ More
Metadynamics is a popular enhanced sampling scheme wherein by periodic application of a repulsive bias, one can surmount high free energy barriers and explore complex landscapes. Recently metadynamics was shown to be mathematically well founded, in the sense that the biasing procedure is guaranteed to converge to the true free energy surface in the long time limit irrespective of the precise choice of biasing parameters. A differential equation governing the post-transient convergence behavior of metadynamics was also derived. In this short communication, we revisit this differential equation, expressing it in a convenient and elegant Riccati-like form. A perturbative solution scheme is then developed for solving this differential equation, which is valid for any generic biasing kernel. The solution clearly demonstrates the robustness of metadynamics to choice of biasing parameters and gives further confidence in the widely used method.
△ Less
Submitted 6 October, 2015;
originally announced October 2015.
-
Caliber based spectral gap optimization of order parameters (SGOOP) for sampling complex molecular systems
Authors:
Pratyush Tiwary,
B. J. Berne
Abstract:
In modern day simulations of many-body systems much of the computational complexity is shifted to the identification of slowly changing molecular order parameters called collective variables (CV) or reaction coordinates. A vast array of enhanced sampling methods are based on the identification and biasing of these low-dimensional order parameters, whose fluctuations are important in driving rare e…
▽ More
In modern day simulations of many-body systems much of the computational complexity is shifted to the identification of slowly changing molecular order parameters called collective variables (CV) or reaction coordinates. A vast array of enhanced sampling methods are based on the identification and biasing of these low-dimensional order parameters, whose fluctuations are important in driving rare events of interest. Here describe a new algorithm for finding optimal low-dimensional collective variables for use in enhanced sampling biasing methods like umbrella sampling, metadynamics and related methods, when limited prior static and dynamic information is known about the system, and a much larger set of candidate CVs is specified. The algorithm involves estimating the best combination of these candidate CVs, as quantified by a maximum path entropy estimate of the spectral gap for dynamics viewed as a function of that CV. Through multiple practical examples, we show how this post-processing procedure can lead to optimization of CV and several orders of magnitude improvement in the convergence of the free energy calculated through metadynamics, essentially giving the ability to extract useful information even from unsuccessful metadynamics runs.
△ Less
Submitted 8 November, 2015; v1 submitted 21 September, 2015;
originally announced September 2015.
-
The role of water and steric constraints in the kinetics of cavity-ligand unbinding
Authors:
Pratyush Tiwary,
Jagannath Mondal,
Joseph A. Morrone,
B. J. Berne
Abstract:
A key factor influencing a drug's efficacy is its residence time in the binding pocket of the host protein. Using atomistic computer simulation to predict this residence time and the associated dissociation process is a desirable but extremely difficult task due to the long timescales involved. This gets further complicated by the presence of biophysical factors such as steric and solvation effect…
▽ More
A key factor influencing a drug's efficacy is its residence time in the binding pocket of the host protein. Using atomistic computer simulation to predict this residence time and the associated dissociation process is a desirable but extremely difficult task due to the long timescales involved. This gets further complicated by the presence of biophysical factors such as steric and solvation effects. In this work, we perform molecular dynamics (MD) simulations of the unbinding of a popular prototypical hydrophobic cavity-ligand system using a metadynamics based approach that allows direct assessment of kinetic pathways and parameters. When constrained to move in an axial manner, we find the unbinding time to be on the order of 4000 sec. In accordance with previous studies, we find that the ligand must pass through a region of sharp dewetting transition manifested by sudden and high fluctuations in solvent density in the cavity. When we remove the steric constraints on ligand, the unbinding happens predominantly by an alternate pathway, where the unbinding becomes 20 times faster, and the sharp dewetting transition instead becomes continuous. We validate the unbinding timescales from metadynamics through a Poisson analysis, and by comparison through detailed balance to binding timescale estimates from unbiased MD. This work demonstrates that enhanced sampling can be used to perform explicit solvent molecular dynamics studies at timescales previously unattainable, obtaining direct and reliable pictures of the underlying physio-chemical factors including free energies and rate constants.
△ Less
Submitted 10 July, 2015;
originally announced July 2015.
-
From Metadynamics to Dynamics
Authors:
Pratyush Tiwary,
Michele Parrinello
Abstract:
Metadynamics is a commonly used and successful enhanced sampling method. By the introduction of a history dependent bias which depends on a restricted number of collective variables(CVs) it can explore complex free energy surfaces characterized by several metastable states separated by large free energy barriers. Here we extend its scope by introducing a simple yet powerful method for calculating…
▽ More
Metadynamics is a commonly used and successful enhanced sampling method. By the introduction of a history dependent bias which depends on a restricted number of collective variables(CVs) it can explore complex free energy surfaces characterized by several metastable states separated by large free energy barriers. Here we extend its scope by introducing a simple yet powerful method for calculating the rates of transition between different metastable states. The method does not rely on a previous knowledge of the transition states or reaction co-ordinates, as long as CVs are known that can distinguish between the various stable minima in free energy space. We demonstrate that our method recovers the correct escape rates out of these stable states and also preserves the correct sequence of state-to-state transitions, with minimal extra computational effort needed over ordinary metadynamics. We apply the formalism to three different problems and in each case find excellent agreement with the results of long unbiased molecular dynamics runs.
△ Less
Submitted 5 December, 2013; v1 submitted 20 September, 2013;
originally announced September 2013.
-
Ab initio calculation of anisotropic interfacial excess free energies
Authors:
Axel van de Walle,
Chirranjeevi Balaji Gopal,
Steve Demers,
Qijun Hong,
Adam Kowalski,
Ljubomir Miljacic,
Gregory Pomrehn,
Pratyush Tiwary
Abstract:
We describe a simple method to determine, from ab initio calculations, the complete orientation-dependence of interfacial free energies in solid-state crystalline systems. We illustrate the method with an application to precipitates in the Al-Ti alloy system. The method combines the cluster expansion formalism in its most general form (to model the system's energetics) with the inversion of the we…
▽ More
We describe a simple method to determine, from ab initio calculations, the complete orientation-dependence of interfacial free energies in solid-state crystalline systems. We illustrate the method with an application to precipitates in the Al-Ti alloy system. The method combines the cluster expansion formalism in its most general form (to model the system's energetics) with the inversion of the well-known Wulff construction (to recover interfacial energies from equilibrium precipitate shapes). Although the inverse Wulff construction only provides the relative magnitude of the various interfacial free energies, absolute free energies can be recovered from a calculation of a single, conveniently chosen, planar interface. The method is able to account for essentially all sources of entropy (arising from phonons, bulk point defects, as well as interface roughness) and is thus able to transparently handle both atomically smooth and rough interfaces. The approach expresses the resulting orientation-dependence of the interfacial properties using symmetry-adapted bases for general orientation-dependent quantities. As a by-product, this paper thus provides a simple and general method to generate such basis functions, which prove useful in a variety of other applications, for instance to represent the anisotropy of the so-called constituent strain elastic energy.
△ Less
Submitted 22 April, 2014; v1 submitted 1 January, 2013;
originally announced January 2013.
-
Accelerated Molecular Dynamics through stochastic iterations to strengthen yield of path hopping over upper states (SISYPHUS)
Authors:
Pratyush Tiwary,
Axel van de Walle
Abstract:
We present a new method, called SISYPHUS (Stochastic Iterations to Strengthen Yield of Path Hopping over Upper States), for extending accessible time-scales in atomistic simulations. The method proceeds by separating phase space into basins, and transition regions between the basins based on a general collective variable (CV) criterion. The transition regions are treated via traditional molecular…
▽ More
We present a new method, called SISYPHUS (Stochastic Iterations to Strengthen Yield of Path Hopping over Upper States), for extending accessible time-scales in atomistic simulations. The method proceeds by separating phase space into basins, and transition regions between the basins based on a general collective variable (CV) criterion. The transition regions are treated via traditional molecular dynamics (MD) while Monte Carlo (MC) methods are used to (i) estimate the expected time spent in each basin and (ii) thermalize the system between two MD episodes. In particular, an efficient adiabatic switching based scheme is used to estimate the time spent inside the basins. The method offers various advantages over existing approaches in terms of (i) providing an accurate real time scale, (ii) avoiding reliance on harmonic transition state theory and (iii) avoiding the need to enumerate all possible transition events. Applications of SISYPHUS to low temperature vacancy diffusion in BCC Ta and adatom island ripening in FCC Al are presented. A new CV appropriate for such condensed phases, especially for transitions involving collective motions of several atoms, is also introduced.
△ Less
Submitted 2 January, 2013; v1 submitted 29 December, 2012;
originally announced December 2012.
-
Realistic time-scale fully atomistic simulations of surface nucleation of dislocations in pristine nanopillars
Authors:
Pratyush Tiwary,
Axel van de Walle
Abstract:
We use our recently proposed accelerated dynamics algorithm (Tiwary & van de Walle, 2011) to calculate temperature and stress dependence of activation free energy for surface nucleation of dislocations in pristine Gold nanopillars under realistic loads. While maintaining fully atomistic resolution, we achieve the fraction of a second time-scale regime. We find that the activation free energy depen…
▽ More
We use our recently proposed accelerated dynamics algorithm (Tiwary & van de Walle, 2011) to calculate temperature and stress dependence of activation free energy for surface nucleation of dislocations in pristine Gold nanopillars under realistic loads. While maintaining fully atomistic resolution, we achieve the fraction of a second time-scale regime. We find that the activation free energy depends significantly on the driving force (stress or strain) and temperature, leading to very high activation entropies. We also perform compression tests on Gold nanopillars for strain rates varying between 7 orders of magnitudes, reaching as low as 10^3/s. Our calculations show the quantitative effects on the yield point of unrealistic strain-rate Molecular Dynamics calculations: we find that while the failure mechanism for <001> compression of Gold nanopillars remains the same across the entire strain-rate range, the elastic limit (defined as stress for nucleation of the first dislocation) depends significantly on the strain-rate. We also propose a new methodology that overcomes some of the limits in our original accelerated dynamics scheme (and accelerated dynamics methods in general). We lay out our methods in sufficient details so as to be used for understanding and predicting deformation mechanism under realistic driving forces for various problems.
△ Less
Submitted 11 May, 2013; v1 submitted 21 February, 2012;
originally announced February 2012.
-
Hybrid deterministic and stochastic approach for efficient atomistic simulations at long time scales
Authors:
Pratyush Tiwary,
Axel van de Walle
Abstract:
We propose a hybrid deterministic and stochastic approach to achieve extended time scales in atomistic simulations that combines the strengths of molecular dynamics (MD) and Monte Carlo (MC) simulations in an easy-to-implement way. The method exploits the rare event nature of the dynamics similar to most current accelerated MD approaches but goes beyond them by providing, without any further compu…
▽ More
We propose a hybrid deterministic and stochastic approach to achieve extended time scales in atomistic simulations that combines the strengths of molecular dynamics (MD) and Monte Carlo (MC) simulations in an easy-to-implement way. The method exploits the rare event nature of the dynamics similar to most current accelerated MD approaches but goes beyond them by providing, without any further computational overhead, (a) rapid thermalization between infrequent events, thereby minimizing spurious correlations, and (b) control over accuracy of time-scale correction, while still providing similar or higher boosts in computational efficiency. We present two applications of the method: (a) Vacancy-mediated diffusion in Fe yields correct diffusivities over a wide range of temperatures and (b) source-controlled plasticity and deformation behavior in Au nanopillars at realistic strain rates (10^4/s and lower), with excellent agreement with previous theoretical predictions and in situ high-resolution transmission electron microscopy observations. The method gives several orders-of-magnitude improvements in computational efficiency relative to standard MD and good scalability with the size of the system.
△ Less
Submitted 17 October, 2011; v1 submitted 30 May, 2011;
originally announced May 2011.