Skip to content

Commit c575e39

Browse files
committed
fix headings and subheadings to match style guide
1 parent c8814f3 commit c575e39

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

lectures/mccall_q.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -85,7 +85,7 @@ key = jax.random.PRNGKey(123)
8585
jax.config.update('jax_platform_name', 'cpu')
8686
```
8787

88-
## Review of McCall Model
88+
## Review of McCall model
8989

9090
We begin by reviewing the McCall model described in {doc}`this quantecon lecture <mccall_model>`.
9191

@@ -239,7 +239,7 @@ We'll use this value function as a benchmark later after we have done some Q-lea
239239
print(valfunc_VFI)
240240
```
241241
242-
## Implied Quality Function $Q$
242+
## Implied quality function $Q$
243243
244244
245245
A **quality function** $Q$ map state-action pairs into optimal values.
@@ -313,7 +313,7 @@ $$
313313

314314
+++
315315

316-
## From Probabilities to Samples
316+
## From probabilities to samples
317317

318318
We noted above that the optimal Q function for our McCall worker satisfies the Bellman equations
319319

@@ -735,7 +735,7 @@ The above graphs indicates that
735735
736736
* the quality of approximation to the "true" value function computed by value function iteration improves for longer epochs
737737
738-
## Employed Worker Can't Quit
738+
## Employed worker can't quit
739739
740740
741741
The preceding version of temporal difference Q-learning described in equation system {eq}`eq:old4` lets an employed worker quit, i.e., reject her wage as an incumbent and instead receive unemployment compensation this period
@@ -775,7 +775,7 @@ We illustrate these possibilities with the following code and graph.
775775
plot_epochs(epochs_to_plot=[100, 1000, 10000, 100000, 200000], quit_allowed=0)
776776
```
777777
778-
## Possible Extensions
778+
## Possible extensions
779779
780780
To extend the algorthm to handle problems with continuous state spaces,
781781
a typical approach is to restrict Q-functions and policy functions to take particular

0 commit comments

Comments
 (0)