Update README.md (#164)

erikcs · web-flow · commit 453d44ce8e52 · 2023-03-18T09:16:57.000-07:00
diff --git a/README.md b/README.md
@@ -74,7 +74,7 @@ head(predict(opt.tree, X[-train, ]))
 `policy_tree()`: fits a depth-_k_ tree by exhaustive search (an _Nxp_ matrix of features and an _Nxd_ matrix of rewards, one column per action). The optimal tree maximizes the average reward: let $\Gamma_i \in \mathbb R^d$ be a vector of unit-specific rewards for each action 1 to $d$ and $\pi(X_i) \in \\{1, ..., d\\}$ a mapping from covariates $X_i$ to an action. `policy_tree` solves the following:
 
 $$
-\pi^* = argmax_{\pi \in \Pi} \left[ \sum_{i=1}^{n} \Gamma_i(\pi(X_i)) \right],
+\pi^* = argmax_{\pi \in \Pi} \left[\frac{1}{n} \sum_{i=1}^{n} \Gamma_i(\pi(X_i)) \right],
 $$
 
 where $\Pi$ is the class of depth-_k_ decision trees. (`hybrid_policy_tree()` mixes the optimal and greedy approaches and can be used to fit deeper trees.)
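
The objective above can be illustrated with a minimal sketch of an exhaustive search over depth-1 trees. This is not the package's implementation: it is written in plain Python for illustration, the names `mean_reward` and `best_stump` and the toy data are hypothetical, and actions are indexed from 0 rather than 1.

```python
def mean_reward(gamma, actions):
    """Average of Gamma_i(pi(X_i)); `actions` holds 0-based column indices."""
    return sum(row[a] for row, a in zip(gamma, actions)) / len(gamma)


def best_stump(X, gamma):
    """Try every (feature, threshold, left action, right action) combination
    and return the depth-1 tree with the highest average reward."""
    p, d = len(X[0]), len(gamma[0])
    best = None
    for j in range(p):
        # Candidate thresholds are the observed values of feature j.
        for t in sorted({row[j] for row in X}):
            for a_left in range(d):
                for a_right in range(d):
                    actions = [a_left if row[j] <= t else a_right for row in X]
                    value = mean_reward(gamma, actions)
                    if best is None or value > best[0]:
                        best = (value, j, t, a_left, a_right)
    return best


# Toy data: action 0 pays off when the single feature is small, action 1 otherwise.
X = [[0.1], [0.4], [0.6], [0.9]]
gamma = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]

print(best_stump(X, gamma))  # (1.0, 0, 0.4, 0, 1)
```

The result reads as: split feature 0 at 0.4, assign action 0 on the left and action 1 on the right, for an average reward of 1.0. Because $n$ is a positive constant, dividing the sum by $n$ rescales the objective without changing which tree maximizes it.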