Skip to content

Commit a053e75

Browse files
Merge pull request #24 from LearningToOptimize/mk/ytou
Class 1 typo: y -> u
2 parents 5714e66 + 627af29 commit a053e75

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

class01/class01_intro.jl

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -86,7 +86,7 @@ In this course, we are interested in problems with the following structure:
8686
\phantom{\substack{(\mathbf u_1,\mathbf x_1)\\\mathrm{s.t.}}}%
8787
\!\!\!\!\!\!\!\!\!\!(\mathbf u_1,\mathbf x_1)\in\mathcal X_1(\mathbf x_0)%
8888
}{%
89-
\!\!\!\!c(\mathbf x_1,\mathbf y_1)%
89+
\!\!\!\!c(\mathbf x_1,\mathbf u_1)%
9090
}
9191
+\mathbb{E}_1\Bigl[
9292
\quad \cdots
@@ -123,7 +123,7 @@ constraints can be generally posed as:
123123
&\mathcal{X}_t(\mathbf{x}_{t-1}, w_t)=
124124
\begin{cases}
125125
f(\mathbf{x}_{t-1}, w_t, \mathbf{u}_t) = \mathbf{x}_t \\
126-
h(\mathbf{x}_t, \mathbf{y}_t) \geq 0
126+
h(\mathbf{x}_t, \mathbf{u}_t) \geq 0
127127
\end{cases}
128128
\end{align}
129129
```
@@ -135,7 +135,7 @@ where the outgoing state of the system $\mathbf{x}_t$ is a
135135
transformation based on the incoming state, the realized uncertainty,
136136
and the control variables. In the Markov Decision Process (MDP) framework, we refer to $f$ as the "transition kernel" of the system. State and
137137
control variables are restricted further by additional constraints
138-
captured by $h(\mathbf{x}_t, \mathbf{y}_t) \geq 0$. We
138+
captured by $h(\mathbf{x}_t, \mathbf{u}_t) \geq 0$. We
139139
consider policies that map the past information into decisions: $\pi_t : (\mathbf{x}_{t-1}, w_t) \rightarrow \mathbf{x}_t$. In
140140
period $t$, an optimal policy is given by the solution of the dynamic
141141
equations:

0 commit comments

Comments
 (0)