File tree Expand file tree Collapse file tree 1 file changed +3
-3
lines changed
lectures/dynamic_programming Expand file tree Collapse file tree 1 file changed +3
-3
lines changed Original file line number Diff line number Diff line change @@ -283,7 +283,7 @@ By the definition of greedy policies given above, this means that
283283$$
284284\sigma^*(s) \in \operatorname*{arg\,max}_{a \in A(s)}
285285 \left\{
286- r(s, a) + \beta \sum_{s' \in S} v^*(s') Q(s, \sigma(s) , s')
286+ r(s, a) + \beta \sum_{s' \in S} v^*(s') Q(s, a , s')
287287 \right\}
288288\qquad (s \in S)
289289$$
907907
908908Let's try different solution methods. The results below show that policy
909909function iteration and modified policy function iteration are much
910- faster that value function iteration.
910+ faster than value function iteration.
911911
912912``` {code-cell} julia
913913@benchmark results = solve(ddp, PFI)
998998```
999999
10001000We next plot the consumption policies along the value iteration. First
1001- we write a function to generate the and record the policies at given
1001+ we write a function to generate and record the policies at given
10021002stages of iteration.
10031003
10041004``` {code-cell} julia
You can’t perform that action at this time.
0 commit comments