Skip to content

Commit aa39226

Browse files
mjwillsonWeatherBenchX authors
authored andcommitted
Add documentation and citations to README.md for methods implemented under statistical_inference/.
This includes some discussion of methods to consider implementing in future. PiperOrigin-RevId: 823564609
1 parent 8ff97c8 commit aa39226

File tree

1 file changed

+70
-0
lines changed

1 file changed

+70
-0
lines changed

weatherbenchX/statistical_inference/README.md

Lines changed: 70 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -42,3 +42,73 @@ with further caveats and trade-offs.
4242
Spatial correlation can be an issue too, although we typically avoid the need
4343
to account for this by treating spatial locations (on a grid, for example) as
4444
fixed, and averaging over them before performing any statistical inference.
45+
46+
## Methods implemented
47+
48+
* The standard t-test.
49+
* The t-test with AR(2) autocorrelation correction from [^1]
50+
* Both the above with delta-method confidence intervals for metrics which are
51+
nonlinear functions of the mean statistics.
52+
* The IID bootstrap.
53+
* A cluster bootstrap [^3] [^4] which assumes independence only between
54+
clusters.
55+
* The stationary bootstrap of [^5], with optimal block length selection from
56+
[^6] [^7] but generalized to support non-linear functions of means of
57+
multivariate statistics via a delta-method trick.
58+
59+
## Methods to consider implementing in future
60+
61+
* Diebold-Mariano test [^8] or another of the family of tests based on HAC
62+
(Heteroskedasticity and Autocorrelation Consistent) variance estimators, as a
63+
a better-studied and more standard alternative to [^1]. These methods also
64+
have problems at small sample sizes and/or high degrees of autocorrelation
65+
however, and there are a number of choices e.g. of kernel and window length
66+
selection method with different trade-offs. [^10] offers a modern review and
67+
some practical recommendations.
68+
* One of the second-order-correct block bootstrap CI methods of [^9] for smooth
69+
functions of vector means, either the studentized or the BCa-style intervals.
70+
Perhaps adapted to the circular block bootstrap to avoid the need to correct
71+
for endpoint bias. Unlike a naive application of studentized (bootstrap-t) or
72+
BCa intervals to the block bootstrap, these methods are more principled and
73+
retain the good asymptotics of studentized and BCa intervals from the IID
74+
case. It remains to be seen how effective they are at practical sample sizes
75+
though despite the improved asymptotic order, and they also add some
76+
complexity and further choices.
77+
78+
[^1]: A. J. Geer, Significance of changes in medium-range forecast scores.
79+
Tellus A Dyn. Meterol. Oceanogr. 68, 30229 (2016).
80+
81+
[^2]: Efron, B. Better bootstrap confidence intervals. J.A.S.A. 82, 171-185
82+
(1987)
83+
84+
[^3]: Davison, A. C. & Hinkley, D. V. Bootstrap Methods and their Application
85+
(Cambridge University Press, 1997), pp.100-101.
86+
87+
[^4]: Sherman, M. & le Cessie, Saskia, A comparison between bootstrap methods
88+
and generalized estimating equations for correlated outcomes in generalized
89+
linear models, Communications in Statistics - Simulation and Computation,
90+
26:3, 901-925 (1997).
91+
92+
[^5]: Politis, D. N. & Romano, J. P. The stationary bootstrap. J.A.S.A. 89,
93+
1303-1313 (1994).
94+
95+
[^6]: Politis, D. N. & White, H. Automatic Block-Length Selection for the
96+
Dependent Bootstrap, Econometric Reviews, 23:1, 53-70 (2004).
97+
98+
[^7]: Patton, A., Politis, D. N. & White, H. Correction to "Automatic
99+
Block-Length Selection for the Dependent Bootstrap" by D. Politis and
100+
H. White, Econometric Reviews, 28:4, 372-375 (2009).
101+
102+
[^8]: Diebold, F. X. & Mariano, R. S. Comparing predictive accuracy. J. Bus. Econ.
103+
Stat. 20, 134–144 (2002).
104+
105+
[^9]: Götze, F. & Künsch, H. R. Second-order correctness of the blockwise
106+
bootstrap for stationary observations. Ann. Stat. 24, 1914-1933 (1996).
107+
108+
[^10]: Lazarus, E., Lewis, D. J., Stock, J. H. & Watson, M. W. HAR inference:
109+
Recommendations for practice. J. Bus. Econ. Stat. 36, 541–559 (2018).
110+
111+
112+
113+
114+

0 commit comments

Comments
 (0)