You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
and the more general variational approach for Markov processes (VAMP)~\cite{vamp-preprint}
372
372
provide a systematic means to quantitatively compare multiple representations of the simulation data.
373
373
In particular, we can use a scalar score obtained using VAMP to directly compare the ability of certain features to capture slow dynamical modes in a particular molecular system.
374
+
In Notebook (01), we present in detail how to extract features from MD datasets and how to systematically compare them.
374
375
375
-
Here, we utilize the VAMP-2 score, which maximizes the kinetic variance contained in the features~\cite{kinetic-maps}.
376
+
Throughout this tutorial, we utilize the VAMP-2 score, which maximizes the kinetic variance contained in the features~\cite{kinetic-maps}.
376
377
We should always evaluate the score in a cross-validated manner to ensure that we neither include too few features (under-fitting) or too many features (over-fitting)~\cite{gmrq,vamp-preprint}.
377
378
To choose among three different molecular features reflecting protein structure,
378
379
we compute the (cross-validated) VAMP-2 score (Notebook 00).
Discrete jumps between the minima can be observed by visualizing the transformation of the first trajectory into these ICs (Fig.~\ref{fig:io-to-tica}d).
399
400
We thus assume that our TICA-transformed backbone torsion features describe one or more metastable processes.
400
401
402
+
We demonstrate how to apply TICA, suggest how to interpret the projected coordinates and compare the results to other dimension reduction techniques in Notebook (02).
403
+
401
404
\begin{figure}
402
405
\includegraphics{figure_3}
403
406
\caption{Example analysis of the conformational dynamics of a pentapeptide backbone:
@@ -414,7 +417,8 @@ \subsection{Discretization}
414
417
which can greatly facilitate the decomposition of our system into the discrete Markovian states necessary for MSM estimation.
415
418
Here, we use the $k$-means algorithm to segment the four dimensional TICA space into $k=75$ cluster centers.
416
419
The number of cluster centers has been chosen to optimize the VAMP-2 score in a manner identical to how the feature selection was carried out above,
417
-
which is shown in the showcase notebook (00).
420
+
which is shown in the showcase Notebook (00).
421
+
A detailed comparison between different clustering techniques is provided in Notebook (02).
418
422
419
423
\subsection{MSM estimation and validation}
420
424
@@ -447,6 +451,8 @@ \subsection{MSM estimation and validation}
447
451
and shows that the MSM we have estimated at lag time $\tau=0.5$~ns indeed predicts the
448
452
long-timescale behavior of our system within error (blue/shaded area).
449
453
454
+
In Notebook (03), we demonstrate in detail how to estimate and validate MSMs with PyEMMA.
455
+
450
456
\subsection{Analyzing the MSM}
451
457
452
458
\begin{figure}
@@ -526,6 +532,8 @@ \subsection{Analyzing the MSM}
526
532
The transition network can be additionally visualized by plotting representative structures of the five metastable states $\mathcal{S}_{(1-5)}$ according to their committor probability (Fig.~\ref{fig:tpt-network}).
527
533
It is easy to see from this depiction that the dominant pathway from $\mathcal{S}_2$ to $\mathcal{S}_4$ proceeds through $\mathcal{S}_5$.
528
534
535
+
More details about (spectral) properties of MSMs and how to analyze them with PyEMMA are discussed in Notebook (04) and Notebook (05).
536
+
529
537
\subsection{Connecting the MSM with experimental data}
530
538
531
539
\begin{figure}
@@ -560,6 +568,8 @@ \subsection{Connecting the MSM with experimental data}
560
568
We see that the predicted relaxation signal has a much larger amplitude for the nonequilibrium initialization,
561
569
making it more likely to be experimentally measurable.
562
570
571
+
Besides a detailed demonstration of the above, Notebook (06) demonstrates how to compute J-couplings and dynamic fingerprints from MSMs.
572
+
563
573
\subsection{Summary}
564
574
565
575
In this section, we have summarized how to conduct an MSM-based analysis of biomolecular dynamics data using PyEMMA.
@@ -587,6 +597,7 @@ \subsection{Modeling large systems}
587
597
we explain how to deal with those in the tutorials.
588
598
589
599
More details on how to model complex systems with the techniques presented here are described, e.g., by~\cite{plattner_protein_2015,plattner_complete_2017}.
600
+
We further demonstrate the symptoms of difficult data situations and how to deal with them in Notebook (08).
0 commit comments