You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: manuscript/manuscript.tex
+12-8Lines changed: 12 additions & 8 deletions
Original file line number
Diff line number
Diff line change
@@ -86,7 +86,7 @@ \section{Introduction}
86
86
87
87
\subsection{Scope}
88
88
89
-
In this tutorial, we assume that the reader is familiar with MD simulation and standard analysis of MD simulations of peptides and proteins, such as computation of torsion angles and distances. (see~\cite{dror2012biomolecular} for a review).
89
+
In this tutorial, we assume that the reader is familiar with MD simulation and standard analysis of MD simulations of peptides and proteins, such as computation of torsion angles and distances (see~\cite{dror2012biomolecular} for a review).
90
90
91
91
We further assume that the reader is familiar with the basic ideas and theory underlying Markov modeling and will only give a brief reminder of the basic concepts in Section 2.
92
92
@@ -256,7 +256,7 @@ \subsection{Hidden Markov state models}
256
256
The estimation of an MSM requires the dynamics between microstates to be Markovian.
257
257
However, in case of a poor dimension reduction and/or discretization or short trajectories,
258
258
we cannot anticipate this to be the case.
259
-
We illustrate this point in Notebook~07.
259
+
We illustrate this point in notebook~07.
260
260
261
261
An alternative, which is much less sensitive to poor discretization,
262
262
is to estimate a hidden Markov model (HMM)~\cite{hmm-baum-welch-alg,hmm-tutorial,jhp-spectral-rate-theory,noe-proj-hid-msm,bhmm-preprint}.
@@ -279,7 +279,7 @@ \subsection{Hidden Markov state models}
279
279
280
280
An HMM estimation always yields a model with a small number of (hidden) states
281
281
where each state is considered to be metastable and,
282
-
thus, the number of hidden states is a new hyper-parameter which needs to be chosen carefully (see Notebook~07).
282
+
thus, the number of hidden states is a new hyper-parameter which needs to be chosen carefully (see notebook~07).
283
283
As the HMMs---like MSMs---approximate the full phase-space dynamics,
284
284
we can similarly compute the metastable kinetics, apply TPT, visualize the network, and obtain physical observables.
Throughout this tutorial, we utilize the VAMP-2 score, which maximizes the kinetic variance contained in the features~\cite{kinetic-maps}.
377
377
We should always evaluate the score in a cross-validated manner to ensure that we neither include too few features (under-fitting) or too many features (over-fitting)~\cite{gmrq,vamp-preprint}.
378
378
To choose among three different molecular features reflecting protein structure,
379
-
we compute the (cross-validated) VAMP-2 score (Notebook 00).
379
+
we compute the (cross-validated) VAMP-2 score (notebook 00).
380
380
Although we cannot MSM optimize lag times with a variational score\cite{husic2017note}, such as VAMP-2,
381
381
it is important to ensure that properties that we optimize are robust as a function of lag time.
382
-
Consequently, we compute the VAMP-2 score at several lag times (Notebook 00).
382
+
Consequently, we compute the VAMP-2 score at several lag times (notebook 00).
383
383
We find that the relative rankings of the different molecular features are highly robust as a function of lag time.
384
384
We show one example of this ranking and the absolute VAMP-2 scores for lag time~$0.5$~ns in Fig.~\ref{fig:io-to-tica}b.
385
385
We find that backbone torsions contain more kinetic variance than the backbone heavy atom positions or the distances between them (Fig.~\ref{fig:io-to-tica}b).
@@ -594,7 +594,7 @@ \subsection{Modeling large systems}
594
594
This problem may be mitigated by choosing a more specific set of features.
595
595
596
596
Additional technical challenges for large systems include high demands on memory and computation time;
597
-
we explain how to deal with those in the tutorials (Notebook 00 and 02).
597
+
we explain how to deal with those in the tutorials (notebook~01).
598
598
599
599
More details on how to model complex systems with the techniques presented here are described, e.g., by~\cite{plattner_protein_2015,plattner_complete_2017}.
600
600
We further examine some symptoms that may indicate problematic or difficult datasets, and demonstrate how to deal with them in Notebook (08).
@@ -603,7 +603,8 @@ \subsection{Advanced Methods}
603
603
604
604
The present tutorial presents the basics of modern Markov state modeling with PyEMMA.
605
605
However, recent years have seen many extensions of the methodology---many of which are available within PyEMMA.
606
-
We encourage interested readers to look into these methods in the software documentation and to make use of the specific Jupyter notebooks distributed with PyEMMA.
606
+
We encourage interested readers to look into these methods in the software documentation
607
+
and to make use of the specific Jupyter notebooks distributed with PyEMMA (\url{http://emma-project.org}).
607
608
608
609
Conventional Markov state modeling often relies on large simulation datasets to ensure proper convergence of thermodynamic and kinetic properties.
609
610
In one extension, Multi-ensemble Markov models (MEMMs)~\cite{dtram,tram},
0 commit comments