adding video resource and notebook to how-to

NathanielF · NathanielF · commit 6aea139adcdf · 2025-07-11T23:41:38.000+01:00
Signed-off-by: Nathaniel &lt;NathanielF@users.noreply.github.com&gt;
diff --git a/docs/source/knowledgebase/causal_video_resources.md b/docs/source/knowledgebase/causal_video_resources.md
@@ -44,3 +44,10 @@
 <div class="video-container">
     <iframe width="560" height="315" src="https://www.youtube.com/embed/QAzAFess1AA?si=zD6PrljOFUyvjm1I" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
 </div>
+
+
+## Uncertainty and Causal Inference in Python with CausalPy
+
+<div class="video-container">
+    <iframe width="560" height="315" src="https://www.youtube.com/embed/-C4p4b2cUp8?si=klS3Ze8PjOpajqaQ" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+</div>
diff --git a/docs/source/notebooks/index.md b/docs/source/notebooks/index.md
@@ -71,4 +71,5 @@ iv_weak_instruments.ipynb
 :maxdepth: 1
 
 inv_prop_pymc.ipynb
+inv_prop_latent.ipynb
 :::
diff --git a/docs/source/notebooks/inv_prop_latent.ipynb b/docs/source/notebooks/inv_prop_latent.ipynb
@@ -33,7 +33,7 @@
     "\n",
     "In this notebook we'll show why we should be careful attempting to model the joint-distribution of the propensity score and the outcome variable, but still make good use of the propensity score. \n",
     "\n",
-    "### Brief Digression on the Mathematics\n",
+    "#### Brief Digression on the Mathematics\n",
     "\n",
     "Consider that we have the following three variables:\n",
     "\n",
@@ -3785,7 +3785,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Nets Example\n",
+    "### Nets Example\n",
     "\n",
     "Next we'll asses a data set used by Andrew Heiss to demonstrate propensity score methods with `brms`. "
    ]
@@ -5439,7 +5439,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## LaLonde Example\n",
+    "### LaLonde Example\n",
     "\n",
     "The Lalonde Data set is famous because it highlights a problem with naive causal contrasts. It is discussed by Angrist and Pischke in their _Mostly Harmless Econometrics_ as an example of how regression controls can tolerably address selection effects in a way similar to propensity score weighting. So we should hope the a well specified outcome model can identify the treatment effects plausibly here too. "
    ]
@@ -6133,7 +6133,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## NHEFS \n",
+    "### NHEFS \n",
     "\n",
     "Finally we turn to the NHEFS data. This data is known to be have a complex covariate profile for measuring aspects smokers health. We might suspect that there is some unmeasured confounding in this data set that would be hard to pick up on with simple regression controls. "
    ]
@@ -6733,7 +6733,9 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Two-Stage Outcome Modelling with CausalPy\n"
+    "### Two-Stage Outcome Modelling with CausalPy\n",
+    "\n",
+    "Next we show how to achieve these steps with the simpler CausalPy experiment API. "
    ]
   },
   {
@@ -6796,7 +6798,9 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "### Comparing Inverse Propensity Score Weighting and Covariate Adjustment"
+    "### Comparing Inverse Propensity Score Weighting and Covariate Adjustment\n",
+    "\n",
+    "The two step procedure doesn't jusst apply for regression adjustment methods as we've seen here, but can be used to apply inverse weighting techniques too. "
    ]
   },
   {
@@ -6829,6 +6833,13 @@
     "result.plot_ate(result.idata);"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "which can be compared against the two-step regression adjustment here. "
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 34,
@@ -6859,6 +6870,13 @@
     ")"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Yielding similar, but not identical results. "
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 35,
@@ -7144,16 +7162,27 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Conclusion: It' the Model Stupid!\n",
+    "### Conclusion: Modularity as Causal Discipline\n",
+    "\n",
+    "When attempting to estimate treatment effects using Bayesian inference, a natural but risky strategy is to fit a joint model for both the treatment assignment and the outcome. That is, to specify a full model and infer the parameters of both components simultaneously.\n",
     "\n",
+    "However, this joint approach introduces a feedback loop: the outcome $Y$ can influence the estimation of the treatment mechanism $P(T | X)$. This violates the original logic of design-based inference, where treatment assignment should be modeled independently of the observed outcomes. This phenomenon is often subtle but can lead to biased treatment effect estimates.\n",
     "\n",
+    "Across several examples, we have shown that fitting a full joint model distorts the treatment effect estimate relative to a two-step (modular) approach.\n",
+    "In other cases, joint and modular approaches yield nearly identical estimates — usually when the treatment mechanism is well-identified from covariates alone. With these observations in scope, we recommend that practioners generally follow a two-step or modular approach. Either two-stage inverse propensity score weighting or regression adjustment with the propensity score as an additional covariate. Both methods are available now in `CausalPy`. \n",
     "\n",
+    "Framed this way we can see that joint model violates the temporal precedence of the treatment assignment and outcome process. The 2-stage Bayesian procedures ensure that the causal ordering encoded in the actual data generating process is respected in the estimation process. The confounding adjustment achieved with propensity score must occur without access to information about the outcome. A well-specified propensity score model can substantially improve causal estimates (as we've seen), especially when the outcome model is weak or mis-specified. Propensity scores do not only serve to reduce dimensionality; they formalize the treatment mechanism and encode information that the outcome model might fail to recover. This explains their continued prominence in modern causal inference.\n",
     "\n",
-    "## References\n",
+    "### References\n",
     ":::{bibliography}\n",
     ":filter: docname in docnames\n",
     ":::"
    ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": []
   }
  ],
  "metadata": {