Chap5 Lack Math Proof #23
anyinlover
started this conversation in
General
Replies: 1 comment
-
These two algos are variants of MC Basic, whose convergence can be inferred from the policy iteration algo. The introduction of many tricks makes the convergence analysis difficult, which is also the reason why I introduced MC Basic first. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Thanks for great book, but it seems there are no math proof for MC Exploring Starts and MC-Greedy algorithms?
Beta Was this translation helpful? Give feedback.
All reactions