Skip to content

Commit eb240ff

Browse files
committed
[manuscript] unify notation for notebook numbers
1 parent f3503ce commit eb240ff

File tree

1 file changed

+17
-17
lines changed

1 file changed

+17
-17
lines changed

manuscript/manuscript.tex

Lines changed: 17 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -318,11 +318,11 @@ \subsection{Software and installation}
318318
\section{PyEMMA tutorials}
319319

320320
This tutorial consists of nine Jupyter notebooks which introduce the basic features of PyEMMA.
321-
The first notebook (00), which we will summarize in the following, showcases the entire estimation,
321+
The first notebook~00, which we will summarize in the following, showcases the entire estimation,
322322
validation, and analysis workflow for a small example system.
323-
The goal of this introductory notebook (00) is to provide the user with the typical steps required to obtain a validated MSM analysis of protein or peptide simulation data.
324-
The seven subsequent notebooks (01--07) provide in-depth lessons on specific topics,
325-
and the last notebook (08) contains guidelines on how to deal with common problems during MSM estimation.
323+
The goal of this introductory notebook~00 is to provide the user with the typical steps required to obtain a validated MSM analysis of protein or peptide simulation data.
324+
The seven subsequent notebooks~01--07 provide in-depth lessons on specific topics,
325+
and the last notebook~08 contains guidelines on how to deal with common problems during MSM estimation.
326326

327327
\subsection{The PyEMMA workflow}
328328

@@ -338,12 +338,12 @@ \subsection{The PyEMMA workflow}
338338
\item coarse-graining the MSM using a hidden Markov model approach (07).
339339
\end{itemize}
340340

341-
For the remainder of this manuscript we will walk through the first notebook (00).
341+
For the remainder of this manuscript we will walk through the first notebook~00.
342342
In notebook~00 we analyze a dataset of the Trp-Leu-Ala-Leu-Leu pentapeptide (Fig.~\ref{fig:io-to-tica}a),
343343
consisting of~$25$ independent MD trajectories conducted in implicit solvent with frames saved at an interval of~$0.1$~ns.
344344
We present the results obtained in this notebook,
345345
thereby providing an example of how results generated using PyEMMA can be integrated into research publications.
346-
The figures that will be displayed in the following are created in the showcase notebook (00) and can be easily reproduced.
346+
The figures that will be displayed in the following are created in the showcase notebook~00 and can be easily reproduced.
347347

348348
Note that the modeler has to select hyper-parameters at most stages throughout the workflow.
349349
This selection must be done carefully as poor choices make it hard, or even impossible, to build a good MSM.
@@ -371,15 +371,15 @@ \subsection{Feature selection}
371371
and the more general variational approach for Markov processes (VAMP)~\cite{vamp-preprint}
372372
provide a systematic means to quantitatively compare multiple representations of the simulation data.
373373
In particular, we can use a scalar score obtained using VAMP to directly compare the ability of certain features to capture slow dynamical modes in a particular molecular system.
374-
In Notebook (01), we present in detail how to extract features from MD datasets and how to systematically compare them.
374+
In notebook~01, we present in detail how to extract features from MD datasets and how to systematically compare them.
375375

376376
Throughout this tutorial, we utilize the VAMP-2 score, which maximizes the kinetic variance contained in the features~\cite{kinetic-maps}.
377377
We should always evaluate the score in a cross-validated manner to ensure that we neither include too few features (under-fitting) or too many features (over-fitting)~\cite{gmrq,vamp-preprint}.
378378
To choose among three different molecular features reflecting protein structure,
379-
we compute the (cross-validated) VAMP-2 score (notebook 00).
379+
we compute the (cross-validated) VAMP-2 score (notebook~00).
380380
Although we cannot MSM optimize lag times with a variational score\cite{husic2017note}, such as VAMP-2,
381381
it is important to ensure that properties that we optimize are robust as a function of lag time.
382-
Consequently, we compute the VAMP-2 score at several lag times (notebook 00).
382+
Consequently, we compute the VAMP-2 score at several lag times (notebook~00).
383383
We find that the relative rankings of the different molecular features are highly robust as a function of lag time.
384384
We show one example of this ranking and the absolute VAMP-2 scores for lag time~$0.5$~ns in Fig.~\ref{fig:io-to-tica}b.
385385
We find that backbone torsions contain more kinetic variance than the backbone heavy atom positions or the distances between them (Fig.~\ref{fig:io-to-tica}b).
@@ -399,7 +399,7 @@ \subsection{Dimensionality reduction}
399399
Discrete jumps between the minima can be observed by visualizing the transformation of the first trajectory into these ICs (Fig.~\ref{fig:io-to-tica}d).
400400
We thus assume that our TICA-transformed backbone torsion features describe one or more metastable processes.
401401

402-
We demonstrate how to apply TICA, suggest how to interpret the projected coordinates, and compare the results to other dimension reduction techniques in Notebook (02).
402+
We demonstrate how to apply TICA, suggest how to interpret the projected coordinates, and compare the results to other dimension reduction techniques in notebook~02.
403403

404404
\begin{figure}
405405
\includegraphics{figure_3}
@@ -417,8 +417,8 @@ \subsection{Discretization}
417417
which can greatly facilitate the decomposition of our system into the discrete Markovian states necessary for MSM estimation.
418418
Here, we use the $k$-means algorithm to segment the four dimensional TICA space into $k=75$ cluster centers.
419419
The number of cluster centers has been chosen to optimize the VAMP-2 score in a manner identical to how the feature selection was carried out above,
420-
which is shown in the showcase Notebook (00).
421-
A detailed comparison between different clustering techniques is provided in Notebook (02).
420+
which is shown in the showcase notebook~00.
421+
A detailed comparison between different clustering techniques is provided in notebook~02.
422422

423423
\subsection{MSM estimation and validation}
424424

@@ -451,7 +451,7 @@ \subsection{MSM estimation and validation}
451451
and shows that the MSM we have estimated at lag time $\tau=0.5$~ns indeed predicts the
452452
long-timescale behavior of our system within error (blue/shaded area).
453453

454-
In Notebook (03), we demonstrate in detail how to estimate and validate MSMs with PyEMMA.
454+
In notebook~03, we demonstrate in detail how to estimate and validate MSMs with PyEMMA.
455455

456456
\subsection{Analyzing the MSM}
457457

@@ -532,7 +532,7 @@ \subsection{Analyzing the MSM}
532532
The transition network can be additionally visualized by plotting representative structures of the five metastable states $\mathcal{S}_{(1-5)}$ according to their committor probability (Fig.~\ref{fig:tpt-network}).
533533
It is easy to see from this depiction that the dominant pathway from $\mathcal{S}_2$ to $\mathcal{S}_4$ proceeds through $\mathcal{S}_5$.
534534

535-
More details about (spectral) properties of MSMs and how to analyze them with PyEMMA are discussed in Notebook (04) and Notebook (05).
535+
More details about (spectral) properties of MSMs and how to analyze them with PyEMMA are discussed in notebook~04 and notebook~05.
536536

537537
\subsection{Connecting the MSM with experimental data}
538538

@@ -568,12 +568,12 @@ \subsection{Connecting the MSM with experimental data}
568568
We see that the predicted relaxation signal has a much larger amplitude for the nonequilibrium initialization,
569569
making it more likely to be experimentally measurable.
570570

571-
In addition to a detailed demonstration of the above, Notebook (06) demonstrates how to compute J-couplings and dynamic fingerprints from MSMs.
571+
In addition to a detailed demonstration of the above, notebook~06 demonstrates how to compute J-couplings and dynamic fingerprints from MSMs.
572572

573573
\subsection{Summary}
574574

575575
In this section, we have summarized how to conduct an MSM-based analysis of biomolecular dynamics data using PyEMMA.
576-
For the full analysis, please refer to the first notebook (00).
576+
For the full analysis, please refer to the first notebook~00.
577577
All notebooks as well as detailed installation instructions are available on \githubrepository{}.
578578

579579
\subsection{Modeling large systems}
@@ -597,7 +597,7 @@ \subsection{Modeling large systems}
597597
we explain how to deal with those in the tutorials (notebook~01).
598598

599599
More details on how to model complex systems with the techniques presented here are described, e.g., by~\cite{plattner_protein_2015,plattner_complete_2017}.
600-
We further examine some symptoms that may indicate problematic or difficult datasets, and demonstrate how to deal with them in Notebook (08).
600+
We further examine some symptoms that may indicate problematic or difficult datasets, and demonstrate how to deal with them in notebook~08.
601601

602602
\subsection{Advanced Methods}
603603

0 commit comments

Comments
 (0)