Skip to content

Commit adbd4d0

Browse files
committed
typo and wording corrections within lecture03
1 parent 71e7a51 commit adbd4d0

File tree

3 files changed

+4
-3
lines changed

3 files changed

+4
-3
lines changed

lecture_slides/main.tex

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -177,7 +177,7 @@
177177

178178
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
179179
%%%Lecture Include Onlys%%%
180-
%\includeonly{tex/Lecture02}
180+
%\includeonly{tex/Lecture03}
181181
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
182182

183183
\begin{document}

lecture_slides/tex/Lecture02.tex

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -598,6 +598,7 @@ \section{Finite Markov Decision Processes}
598598
\frame{\frametitle{Bellman Expectation Equation (3)}
599599
Inserting \eqref{eq:q_MDP_finite} into \eqref{eq:v_MDP_finite} directly results in:
600600
\begin{equation}
601+
\label{eq:Bellman_MDP_linear_non_matrix}
601602
v_\pi(x_k) = \sum_{u_k\in\mathcal{U}}\pi(u_k|x_k)\left(\mathcal{R}^u_x + \gamma\sum_{x_{k+1}\in\mathcal{X}}p_{xx'}^u v_\pi(x_{k+1})\right) \, .
602603
\end{equation}
603604
\pause

lecture_slides/tex/Lecture03.tex

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -221,11 +221,11 @@ \section{Policy Evaluation}
221221
%% Iterative Policy Evaluation by Richardson Iteration (1)%%
222222
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
223223
\frame{\frametitle{Iterative Policy Evaluation by Richardson Iteration (1)}
224-
General form for any $x_k\in\mathcal{X}$ at iteration $i$ is given as:
224+
Applying the Richardson iteration \eqref{eq:richardson_general} to the Bellman equation \eqref{eq:Bellman_MDP_linear_non_matrix} for any $x_k\in\mathcal{X}$ at iteration $i$ results in:
225225
\begin{equation}
226226
v_{i+1}(x_k) = \sum_{u_k\in\mathcal{U}}\bm{\pi}(u_k|x_k)\left(\mathcal{R}^u_x + \gamma\sum_{x_{k+1}\in\mathcal{X}}p_{xx'}^u v_{i}(x_{k+1})\right)\, .
227227
\end{equation}\pause
228-
Matrix form then is:
228+
Matrix form based on \eqref{eq:Bellman_MDP_linear} then is:
229229
\begin{equation}
230230
\label{eq:iterative_policy_eval_matrix}
231231
\bm{v}_{\mathcal{X},i+1}^{\pi} =\bm{r}_{\mathcal{X}}^{\pi}+\gamma\bm{\mathcal{P}}_{xx'}^{\pi}\bm{v}_{\mathcal{X},i}^{\pi}\, .

0 commit comments

Comments
 (0)