paper/basic_training.tex (13 additions & 7 deletions)
@@ -301,6 +301,7 @@ \subsubsection{Key concepts}
301
301
What is the connection between MD simulation and equilibrium? The most precise statement we can make is that an MD trajectory is a single sample of a process that is relaxing to equilibrium from the starting configuration~\cite{Zuckerman:2015:StatisticalBiophysicsBlog, Zuckerman:2010:}.
302
302
\emph{If} the trajectory is long enough, it should sample the equilibrium distribution -- where each configuration occurs with frequency proportional to its Boltzmann factor.
303
303
Only in such a very long trajectory will a time average give the same result as a Boltzmann-factor-weighted, or ensemble, average.
304
+
We refer to such a system, where the time and ensemble averages are equivalent, as ``ergodic.''
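As a sketch of what this equivalence means, consider an observable that depends only on the configuration, sampled in the canonical ensemble. The notation here is assumed for illustration and is not taken from the rest of this document: $A$ is the observable, $x(t)$ the configuration at time $t$, $U(x)$ the potential energy, $k_B T$ the thermal energy, and $\tau$ the trajectory length.
% Assumed notation (illustrative only): A, x(t), U(x), k_B T, and \tau are not defined elsewhere in this section.
\begin{equation}
\lim_{\tau \to \infty} \frac{1}{\tau} \int_0^{\tau} A\bigl(x(t)\bigr)\, dt
= \frac{\int A(x)\, e^{-U(x)/k_B T}\, dx}{\int e^{-U(x)/k_B T}\, dx}
\end{equation}
The left-hand side is the time average along a single trajectory and the right-hand side is the Boltzmann-factor-weighted (ensemble) average; for a trajectory of finite length the two need not agree.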
304
305
Note that the Boltzmann-factor distribution implies that every configuration has some probability, and so it is unlikely that a single conformation or even a single basin dominates an ensemble.
305
306
Beware that in a typical MD trajectory it is likely that only a small subset of basins will be sampled well -- those most quickly accessible to the initial configuration.
306
307
It is sometimes suggested that multiple MD trajectories started from different structures can aid sampling, but unless the equilibrium distribution is known in advance, the bias from the set of starting structures is simply unknown and harder to diagnose.
@@ -575,14 +576,14 @@ \subsection{Main steps of a molecular dynamics simulation}
575
576
576
577
\subsubsection{System preparation}
577
578
578
-
System preparation focuses on preparing the starting state of the desired system for simulation with the desired simulation package, including building a starting structure, solvating (if necessary), applying a force field etc.
579
-
Because this step differs so much depending on the composition of the system and what information is available about the starting structure, it is a step which varies a great deal depending on the type of system and each category may require unique tools.
579
+
System preparation focuses on preparing the starting state of the desired system for input to an appropriate simulation package, including building a starting structure, solvating (if necessary), applying a force field, etc.
580
+
Because this step depends so strongly on the composition of the system and on what information is available about the starting structure, it varies a great deal with the nature of the system at hand and, as a result, may require unique tools.
580
581
581
582
Given the variable nature of system preparation, it is highly recommended that best practices documents specific to this issue and to the type of system of interest be consulted.
582
583
If such documents do not exist, considerable care should be exercised to determine best practices from the literature.
583
584
584
585
Loosely speaking, system preparation can be thought of as consisting of two \emph{logical} components which are not necessarily consecutive or separate.
585
-
One comprises building the configuration of the system in the desired chemical state, and the other, applying force field parameters.
586
+
One is building the configuration of the system in the desired chemical state; the other is applying force field parameters.
586
587
587
588
Freely available tools for building systems exist and can be a reasonable option (though their mention here should not be taken as an endorsement that they necessarily encapsulate best practices).
588
589
Examples include tools for constructing specific crystal structures, proteins, and lipid membranes, such as Moltemplate, Packmol, and Atomsk.
System preparation is arguably the most critical stage of a simulation and in many cases receives the least attention.
595
596
Specifically, if your system preparation is flawed, such flaws may prove fatal.
596
597
Potentially the worst possible outcome is if the prepared system is not what you intended (e.g. it contains incorrect molecules or protonation states) but is chemically valid and well described by your force field and thus proceeds without error through the remaining steps --- and in fact this is a frequent outcome of problems in system preparation.
597
-
It should not be assumed that if a system can proceed in a well-behaved manner through the other steps, it was necessarily prepared correctly; considerable care should be taken here.
598
+
It should not be assumed that a system has been prepared correctly if it is well-behaved in subsequent equilibration steps; considerable care should be taken here.
598
599
599
600
Assignment or development of force field parameters is also critical, but is outside the scope of this work.
600
-
For our purposes here, we will assume you have already obtained or developed force field parameters suitable for your system of interest.
601
+
For our purposes, we will assume you have already obtained or developed force field parameters suitable for your system of interest.
601
602
602
603
\subsubsection{Minimization}
603
604
@@ -617,7 +618,9 @@ \subsubsection{Assignment of velocities}
617
618
\subsubsection{Equilibration}
618
619
619
620
Ultimately, we usually seek to run a simulation in a particular thermodynamic ensemble (e.g., the NVE or NVT ensemble) at a particular state point (e.g., target energy, temperature, and pressure) and collect data for analysis which is appropriate for those conditions and not biased by our starting conditions/configuration.
620
-
This means that usually we need to invest simulation time in bringing the system to the appropriate state point and allowing it to essentially forget about its history and reach equilibrium (or pseudo-equilibrium -- for some systems, such as biomolecular systems, reaching true equilibrium may be impractical) before we begin retaining data for analysis.
621
+
This means that usually we need to invest simulation time in bringing the system to the appropriate state point as well as relaxing away from any artificially induced metastable starting states.
622
+
In other words, we are usually interested in sampling the most relevant (or most probable) configurations in the equilibrium ensemble of interest.
623
+
However, if we start in a less stable configuration, a large part of our equilibration may consist of the relaxation (which may take a very long time for biomolecules or systems at phase equilibrium) necessary to reach the more relevant region of configuration space.
621
624
622
625
The most straightforward portion of equilibration is bringing the system to the target state point.
623
626
Usually, even though velocities are assigned according to the correct distribution, a thermostat will still need to add or remove heat from the system as it approaches the correct partitioning of kinetic and potential energies.
Once equilibration is complete, we may begin collecting data for analysis, and typically this phase is called ``production''.
664
+
Once equilibration is complete, we may begin collecting data for analysis.
665
+
Typically this phase is called ``production''.
662
666
The main difference between equilibration and production is simply that in the production simulation, we plan to retain and analyze the collected data.
663
667
Production must always be preceded by equilibration appropriate for the target production ensemble, and production data should never be collected immediately after a change in conditions (such as rescaling a box size, energy minimizing, or suddenly changing the temperature or pressure) except in very specific applications where this is the goal.
664
668
665
669
For bookkeeping purposes, sometimes practitioners choose to discard some initial production data as additional equilibration; usually this is simply to allow additional equilibration time after a change in protocol (such as a switch from NVT to NPT), and the usual considerations for equilibration apply in such cases (see \citet{ShellNotes}, lecture on Computing Properties).
666
670
667
671
Analysis of production data is largely outside the scope of this work, but requires considerable care in computing observables and assessing the uncertainty in any computed properties.
668
672
Usually, analysis involves computing expectation values of particular observables, and a key consideration is to obtain \emph{converged} estimates of these properties --- that is, estimates that are based on adequate simulation data so that they no longer depend substantially on the length of the simulation which was run or on its initial conditions.
673
+
This is closely related to the above discussion of equilibration.
674
+
Depending on the relaxation timescales involved, one may only realize after analysis of a ``production'' trajectory that the system was still equilibrating in some sense.
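One way to make this concrete is to write the estimate of an observable as a function of how much initial data is discarded. This is a sketch with assumed notation that does not come from the rest of this document: $A$ is the observable, $x(t)$ the configuration at time $t$, $\tau$ the total trajectory length, and $\tau_{\mathrm{eq}}$ the amount of initial data discarded as equilibration.
% Assumed notation (illustrative only): \hat{A}, \tau_{\mathrm{eq}}, and \tau are hypothetical symbols introduced for this sketch.
\begin{equation}
\hat{A}(\tau_{\mathrm{eq}}, \tau) = \frac{1}{\tau - \tau_{\mathrm{eq}}} \int_{\tau_{\mathrm{eq}}}^{\tau} A\bigl(x(t)\bigr)\, dt
\end{equation}
In this picture, a converged estimate is one that no longer changes substantially when $\tau$ is increased or when $\tau_{\mathrm{eq}}$ is varied over a reasonable range. If $\hat{A}$ drifts systematically as $\tau_{\mathrm{eq}}$ is increased, this suggests that part of the nominal production data was still equilibrating.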
669
675
670
676
A separate Best Practices document addresses these critical issues of convergence and error analysis (\url{https://github.com/dmzuckerman/Sampling-Uncertainty}).
671
677
For more specific details on procedures and parameters used in production simulations, see the appropriate best practices document for the system of interest.