@@ -42,3 +42,73 @@ with further caveats and trade-offs.
4242Spatial correlation can be an issue too, although we typically avoid the need
4343to account for this by treating spatial locations (on a grid, for example) as
4444fixed, and averaging over them before performing any statistical inference.
45+
46+ ## Methods implemented
47+
48+ * The standard t-test.
49+ * The t-test with AR(2) autocorrelation correction from [ ^ 1 ]
50+ * Both the above with delta-method confidence intervals for metrics which are
51+ nonlinear functions of the mean statistics.
52+ * The IID bootstrap.
53+ * A cluster bootstrap [ ^ 3 ] [ ^ 4 ] which assumes independence only between
54+ clusters.
55+ * The stationary bootstrap of [ ^ 5 ] , with optimal block length selection from
56+ [ ^ 6 ] [ ^ 7 ] but generalized to support non-linear functions of means of
57+ multivariate statistics via a delta-method trick.
58+
59+ ## Methods to consider implementing in future
60+
61+ * Diebold-Mariano test [ ^ 8 ] or another of the family of tests based on HAC
62+ (Heteroskedasticity and Autocorrelation Consistent) variance estimators, as a
63+ a better-studied and more standard alternative to [ ^ 1 ] . These methods also
64+ have problems at small sample sizes and/or high degrees of autocorrelation
65+ however, and there are a number of choices e.g. of kernel and window length
66+ selection method with different trade-offs. [ ^ 10 ] offers a modern review and
67+ some practical recommendations.
68+ * One of the second-order-correct block bootstrap CI methods of [ ^ 9 ] for smooth
69+ functions of vector means, either the studentized or the BCa-style intervals.
70+ Perhaps adapted to the circular block bootstrap to avoid the need to correct
71+ for endpoint bias. Unlike a naive application of studentized (bootstrap-t) or
72+ BCa intervals to the block bootstrap, these methods are more principled and
73+ retain the good asymptotics of studentized and BCa intervals from the IID
74+ case. It remains to be seen how effective they are at practical sample sizes
75+ though despite the improved asymptotic order, and they also add some
76+ complexity and further choices.
77+
78+ [ ^ 1 ] : A. J. Geer, Significance of changes in medium-range forecast scores.
79+ Tellus A Dyn. Meterol. Oceanogr. 68, 30229 (2016).
80+
81+ [ ^ 2 ] : Efron, B. Better bootstrap confidence intervals. J.A.S.A. 82, 171-185
82+ (1987)
83+
84+ [ ^ 3 ] : Davison, A. C. & Hinkley, D. V. Bootstrap Methods and their Application
85+ (Cambridge University Press, 1997), pp.100-101.
86+
87+ [ ^ 4 ] : Sherman, M. & le Cessie, Saskia, A comparison between bootstrap methods
88+ and generalized estimating equations for correlated outcomes in generalized
89+ linear models, Communications in Statistics - Simulation and Computation,
90+ 26:3, 901-925 (1997).
91+
92+ [ ^ 5 ] : Politis, D. N. & Romano, J. P. The stationary bootstrap. J.A.S.A. 89,
93+ 1303-1313 (1994).
94+
95+ [ ^ 6 ] : Politis, D. N. & White, H. Automatic Block-Length Selection for the
96+ Dependent Bootstrap, Econometric Reviews, 23:1, 53-70 (2004).
97+
98+ [ ^ 7 ] : Patton, A., Politis, D. N. & White, H. Correction to "Automatic
99+ Block-Length Selection for the Dependent Bootstrap" by D. Politis and
100+ H. White, Econometric Reviews, 28:4, 372-375 (2009).
101+
102+ [ ^ 8 ] : Diebold, F. X. & Mariano, R. S. Comparing predictive accuracy. J. Bus. Econ.
103+ Stat. 20, 134–144 (2002).
104+
105+ [ ^ 9 ] : Götze, F. & Künsch, H. R. Second-order correctness of the blockwise
106+ bootstrap for stationary observations. Ann. Stat. 24, 1914-1933 (1996).
107+
108+ [ ^ 10 ] : Lazarus, E., Lewis, D. J., Stock, J. H. & Watson, M. W. HAR inference:
109+ Recommendations for practice. J. Bus. Econ. Stat. 36, 541–559 (2018).
110+
111+
112+
113+
114+
0 commit comments