You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: manuscript/manuscript.tex
+32-7Lines changed: 32 additions & 7 deletions
Original file line number
Diff line number
Diff line change
@@ -112,7 +112,7 @@ \subsection{Markov state models}
112
112
113
113
Markov state modeling is a mathematical framework for the analysis of time-series data, often but not limited to high-dimensional MD simulation datasets.
114
114
In its standard formulation, the creation of a Markov state model involves decomposing the phase or configuration space occupied by a system into a set of disjoint, discrete states,
115
-
and a transition matrix $P(\tau) = [p_{ij}(\tau)]$ denoting the conditional probability of finding the system in state $j$ at time $t+\tau$ given that it was in state $i$ at time $t$.
115
+
and a transition matrix $\mathbf{P}(\tau) = [p_{ij}(\tau)]$ denoting the conditional probability of finding the system in state $j$ at time $t+\tau$ given that it was in state $i$ at time $t$.
116
116
Let us make two remarks to avoid common misconceptions:
117
117
\begin{enumerate}
118
118
\item Equilibrium:
@@ -129,8 +129,8 @@ \subsection{Markov state models}
129
129
130
130
In order to create a Markov state model for a dynamical system, each data point in the time series is assigned to a state.
131
131
Given an appropriate lag time, every pairwise transition at that lag time is counted and stored in a count matrix.
132
-
Then, the count matrix is converted to a row-stochastic transition probability matrix $P$, which is defined for the specified lag time.
133
-
For MD simulations in equilibrium, $P$ should obey detailed balance which is enforced by constraining the estimation of $P$ to the following equations:
132
+
Then, the count matrix is converted to a row-stochastic transition probability matrix $\mathbf{P}$, which is defined for the specified lag time.
133
+
For MD simulations in equilibrium, $\mathbf{P}$ should obey detailed balance which is enforced by constraining the estimation of $\mathbf{P}$ to the following equations:
134
134
\begin{equation}
135
135
\label{eq:balance}
136
136
\pi_i p_{ij} = \pi_j p_{ji},
@@ -149,10 +149,10 @@ \subsection{Markov state models}
149
149
\end{equation}
150
150
When the ITS become approximately constant with the lag time, we say that our timescales have converged and choose the smallest lag time with the converged timescales in order to maximize the model's temporal resolution.
151
151
152
-
Once we have used the ITS to choose the lag time, we can check whether a given transition probability matrix $T(\tau)$ is approximately Markovian using the Chapman-Kolmogorov (CK) test~\cite{noe-folding-pathways}.
152
+
Once we have used the ITS to choose the lag time, we can check whether a given transition probability matrix $\mathbf{P}(\tau)$ is approximately Markovian using the Chapman-Kolmogorov (CK) test~\cite{noe-folding-pathways}.
153
153
The CK property for a Markovian matrix is,
154
154
\begin{equation}
155
-
P(k \tau) = P^k(\tau),
155
+
\mathbf{P}(k \tau) = \mathbf{P}^k(\tau),
156
156
\end{equation}
157
157
where the left-hand side of the equation corresponds to an MSM estimated at lag time $k\tau$, where $k$ is an integer larger than~$1$, whereas the right-hand side of the equation is our estimated MSM transition probability matrix to the $k^\textrm{th}$ power.
158
158
By assessing how well the approximated transition probability matrix adheres to the CK property, we can validate the appropriateness of the Markovian assumption for the model.
@@ -161,13 +161,13 @@ \subsection{Markov state models}
161
161
The highest eigenvalue, $\lambda_1(\tau)$, is unique and equal to $1$.
162
162
Its corresponding left eigenvector is the stationary distribution, $\bm{\pi}$:
163
163
\begin{equation}
164
-
\bm{\pi}^\topP(\tau) = \bm{\pi}^\top.
164
+
\bm{\pi}^\top\mathbf{P}(\tau) = \bm{\pi}^\top.
165
165
\end{equation}
166
166
167
167
The subsequent eigenvalues $\lambda_{i>1}(\tau)$ are real with absolute values less than~$1$ and are related to the \emph{characteristic} or \emph{implied} timescales of dynamical processes within the system (eq.~\ref{eq:its}).
168
168
The dynamical process themself (for $i>1$) are encoded by the right eigenvectors $\bm{\psi}_i$,
0 commit comments