内容修改建议
#28
Replies: 1 comment
-
谢谢建议,不过这个是一个惯例,广泛使用,习惯就好 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
P21处公式建议修改,由
$$E[R_{t+1}|S_t=s]=\sum_{\alpha\in A} \pi(a|s)E[R_{t+1}|S_t=s,A_t=a]=\sum_{\alpha\in A} \pi(a|s) \sum_{r\in R}p(r|s,a)r$$ 改为
$$E[R_{t+1}|S_t=s]=\sum_{\alpha\in A} \pi(a|s)E[R_{t+1}|S_t=s,A_t=a]=\sum_{\alpha\in A} (\pi(a|s) \sum_{r\in R}p(r|s,a)r)$$ 以避免歧义

Beta Was this translation helpful? Give feedback.
All reactions