Skip to content

Commit 7753c90

Browse files
authored
Merge pull request #57 from MobleyLab/editsJIM
Suggested edits to "steps of an md simulation" section
2 parents 9f00e58 + c5302ed commit 7753c90

File tree

2 files changed

+13
-7
lines changed

2 files changed

+13
-7
lines changed

paper/basic_training.pdf

2.58 KB
Binary file not shown.

paper/basic_training.tex

Lines changed: 13 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -301,6 +301,7 @@ \subsubsection{Key concepts}
301301
What is the connection between MD simulation and equilibrium? The most precise statement we can make is that an MD trajectory is a single sample of a process that is relaxing to equilibrium from the starting configuration~\cite{Zuckerman:2015:StatisticalBiophysicsBlog, Zuckerman:2010:}.
302302
\emph{If} the trajectory is long enough, it should sample the equilibrium distribution -- where each configuration occurs with frequency proportional to its Boltzmann factor.
303303
In such a very long trajectory (only), a time average thus will give the same result as a Boltzmann-factor-weighted, or ensemble, average.
304+
We refer to such a system, where the time and ensemble averages are equivialent, as ``ergodic.''
304305
Note that the Boltzmann-factor distribution implies that every configuration has some probability, and so it is unlikely that a single conformation or even a single basin dominates an ensembles.
305306
Beware that in a typical MD trajectory it is likely that only a small subset of basins will be sampled well -- those most quickly accessible to the initial configuration.
306307
It is sometimes suggested that multiple MD trajectories starting structures can aid sampling, but unless the equilibrium distribution is known in advance, the bias from the set of starting structures is simply unknown and harder to diagnose.
@@ -575,14 +576,14 @@ \subsection{Main steps of a molecular dynamics simulation}
575576

576577
\subsubsection{System preparation}
577578

578-
System preparation focuses on preparing the starting state of the desired system for simulation with the desired simulation package, including building a starting structure, solvating (if necessary), applying a force field etc.
579-
Because this step differs so much depending on the composition of the system and what information is available about the starting structure, it is a step which varies a great deal depending on the type of system and each category may require unique tools.
579+
System preparation focuses on preparing the starting state of the desired system for input to an appropriate simulation package, including building a starting structure, solvating (if necessary), applying a force field, etc.
580+
Because this step differs so much depending on the composition of the system and what information is available about the starting structure, it is a step which varies a great deal depending on the nature of the system at hand and as a result may require unique tools.
580581

581582
Given the variable nature of system preparation, it is highly recommended that best practices documents specific to this issue and to the type of system of interest be consulted.
582583
If such documents do not exist, considerable care should be exercised to determine best practices from the literature.
583584

584585
Loosely speaking, system preparation can be thought of as consisting of two \emph{logical} components which are not necessarily consecutive or separate.
585-
One comprises building the configuration of the system in the desired chemical state, and the other, applying force field parameters.
586+
One comprises building the configuration of the system in the desired chemical state and the other applying force field parameters.
586587

587588
For building systems, freely available tools for constructing systems are available and can be a reasonable option (though their mention here should not be taken as an endorsement that they necessarily encapsulate best practices).
588589
Examples include tools for constructing specific crystal structures, proteins, and lipid membranes, such as Moltemplate, Packmol, and Atomsk.
@@ -594,10 +595,10 @@ \subsubsection{System preparation}
594595
System preparation is arguably the most critical stage of a simulation and in many cases receives the least attention.
595596
Specifically, if your system preparation is flawed, such flaws may prove fatal.
596597
Potentially the worst possible outcome is if the prepared system is not what you intended (e.g. it contains incorrect molecules or protonation states) but is chemically valid and well described by your force field and thus proceeds without error through the remaining steps --- and in fact this is a frequent outcome of problems in system preparation.
597-
It should not be assumed that if a system can proceed in a well-behaved manner through the other steps, it was necessarily prepared correctly; considerable care should be taken here.
598+
It should not be assumed that a system has been prepared correctly if it is well-behaved in subsequent equilibration steps; considerable care should be taken here.
598599

599600
Assignment or development of force field parameters is also critical, but is outside the scope of this work.
600-
For our purposes here, we will assume you have already obtained or developed force field parameters suitable for your system of interest.
601+
For our purposes, we will assume you have already obtained or developed force field parameters suitable for your system of interest.
601602

602603
\subsubsection{Minimization}
603604

@@ -617,7 +618,9 @@ \subsubsection{Assignment of velocities}
617618
\subsubsection{Equilibration}
618619

619620
Ultimately, we usually seek to run a simulation in a particular thermodynamic ensemble (e.g. the NVE or NVT ensemble) at a particular state point (e.g. target energy, temperature, and pressure) and collect data for analysis which is appropriate for those conditions and not biased depending on our starting conditions/configuration.
620-
This means that usually we need to invest simulation time in bringing the system to the appropriate state point and allowing it to essentially forget about its history and reach equilibrium (or pseudo-equilibrium -- for some systems, such as biomolecular systems, reaching true equilibrium may be impractical) before we begin retaining data for analysis.
621+
This means that usually we need to invest simulation time in bringing the system to the appropriate state point as well as relaxing away from any artificially induced metastable starting states.
622+
In other words, we are usually interested in sampling the most relevant (or most probable) configurations in the equilibrium ensemble of interest.
623+
However, if we start in a less-stable configuration a large part of our equilibration may be the relaxation time (this may be very long for biomolecules or systems at phase equilibrium) necessary to reach the more relevant configuration space.
621624

622625
The most straightforward portion of equilibrium is bringing the system to the target state point.
623626
Usually, even though velocities are assigned according to the correct distribution, a thermostat will still need to add or remove heat from the system as it approaches the correct partitioning of kinetic and potential energies.
@@ -658,14 +661,17 @@ \subsubsection{Equilibration}
658661

659662
\subsubsection{Production}
660663

661-
Once equilibration is complete, we may begin collecting data for analysis, and typically this phase is called ``production''.
664+
Once equilibration is complete, we may begin collecting data for analysis.
665+
Typically this phase is called ``production''.
662666
The main difference between equilibration and production is simply that in the production simulation, we plan to retain and analyze the collected data.
663667
Production must always be preceded by equilibration appropriate for the target production ensemble, and production data should never be collected immediately after a change in conditions (such as rescaling a box size, energy minimizing, or suddenly changing the temperature or pressure) except in very specific applications where this is the goal.
664668

665669
For bookkeeping purposes, sometimes practitioners choose to discard some initial production data as additional equilibration; usually this is simply to allow additional equilibration time after a change in protocol (such as a switch from NVT to NPT), and the usual considerations for equilibration apply in such cases (see \citet{ShellNotes}, lecture on Computing Properties).
666670

667671
Analysis of production is largely outside the scope of this work, but requires considerable care in computing observables and assessing the uncertainty in any computed properties.
668672
Usually, analysis involves computing expectation values of particular observables, and a key consideration is to obtain \emph{converged} estimates of these properties --- that is, estimates that are based on adequate simulation data so that they no longer depend substantially on the length of the simulation which was run or on its initial conditions.
673+
This is closely related to the above discussion of equilibration.
674+
Depending on the relaxation timescales involved, one may only realize after analysis of a ``production'' trajectory that the system was still equilibrating in some sense.
669675

670676
A separate Best Practices document addresses these critical issues of convergence and error analysis (\url{https://github.com/dmzuckerman/Sampling-Uncertainty}).
671677
For more specific details on procedures and parameters used in production simulations, see the appropriate best practices document for the system of interest.

0 commit comments

Comments
 (0)