Authors: Markus Pelger (mpelger@stanford.edu), Jiacheng Zou (jiachengzou@alumni.stanford.edu)
We provide a one-stop collection of resources for covariate selective inference with Family-Wise Error Rate (FWER) control on large panel asset pricing data and models, as described in the paper Selective Multiple Testing: Inference for Large Panels with Many Covariates. Specifically, we enable users to perform the rolling-window estimations described below.
- Response variables:
$Y \in \mathbb{R}^{660 \times 243}$: test portfolio excess returns downloaded and processed from Kenneth French's website. The cross-section size is 243 test portfolios and there are 660 monthly observations. We regress out the market factor from each of the individual factors.
- Covariates:
$X \in \mathbb{R}^{660 \times 114}$: asset pricing high-minus-low factors downloaded and processed from Hou, K., Xue, C. and Zhang, L., 2020. Replicating anomalies. The Review of Financial Studies, 33(5), pp.2019-2133. The covariates' dimension is 114 and there are 660 monthly observations. We regress out the market factor from each of the individual factors.
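Regressing out the market factor from each series can be sketched as follows. This is a minimal illustration on simulated data, not the repo's actual data loader; `market` and `factors` are hypothetical placeholders:

```python
import numpy as np

rng = np.random.default_rng(0)
T = 660                            # monthly observations, as in the paper's sample
market = rng.normal(size=T)        # hypothetical market excess returns
factors = rng.normal(size=(T, 5))  # hypothetical raw factor return series

# OLS of each factor on the market (with intercept), keeping the residuals
M = np.column_stack([np.ones(T), market])
beta, *_ = np.linalg.lstsq(M, factors, rcond=None)
residuals = factors - M @ beta     # market regressed out, column by column
```

By construction the residuals are orthogonal to both the intercept and the market series, which is the sense in which the market factor has been "regressed out".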
- step 1:
`multiple_fixed_window.R` performs a rolling window regression to generate a $P \in \mathbb{R}^{114 \times 243}$ matrix for each rolling window. Specifically, `posi_lognorm_pval_enforce_dimension` admits an $(X, Y)$ tuple and optional priors, and returns the valid post-LASSO selective inference p-values from Lee, J.D., Sun, D.L., Sun, Y. and Taylor, J.E., 2016. Exact post-selection inference, with application to the lasso. The Annals of Statistics, 44(3), pp.907-927.
- step 2:
`select.R` performs the Selective Multiple Testing selection described in the paper.
- step 3:
`eval_performance_multiple_fixed_window.R` performs the evaluations described in the paper.
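The construction of a per-window $J \times N$ matrix of log p-values in step 1 can be sketched as below. For brevity this toy version uses a univariate OLS slope test from `scipy` as a stand-in for the post-LASSO p-values of Lee et al. (2016) that `posi_lognorm_pval_enforce_dimension` actually computes, and only fills the last window; all sizes are illustrative:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
T, J, N = 120, 4, 3          # toy sizes (the paper uses T=660, J=114, N=243)
X = rng.normal(size=(T, J))  # covariates
Y = rng.normal(size=(T, N))  # responses

window = 60
Xw, Yw = X[-window:], Y[-window:]  # last rolling window only, for brevity
log_P = np.full((J, N), np.nan)    # J x N matrix of log p-values
for j in range(J):
    for n in range(N):
        # univariate OLS t-test p-value as a stand-in for post-LASSO inference
        res = stats.linregress(Xw[:, j], Yw[:, n])
        log_P[j, n] = np.log(res.pvalue)
```

A rolling estimation would repeat this for each window start; entries for covariates outside a unit's selected support would stay `NaN`.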
`simulation.R` is a self-contained R script.
- python:
`funs.py` provides a minimal stand-alone function that only requires `pandas` and `numpy` to perform our Selective Multiple Testing selection method given a matrix of p-values, controlling the Family-Wise Error Rate (FWER).
The input is:
- a $J \times N$ matrix $P$ of log p-values;
- whenever $P_{jn}$ is missing, the $j$th feature is not in the support set of the $n$th unit-level model.
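As a toy illustration of this input convention (all values and names below are made up), a feature can be marked as outside a unit's support set with `NaN`:

```python
import numpy as np
import pandas as pd

# 3 x 2 matrix of log p-values; rows are features, columns are units
log_pval_matrix = pd.DataFrame(
    np.log([[0.001, 0.20],
            [0.04,  0.50],
            [0.30,  0.01]]),
    index=["factor_0", "factor_1", "factor_2"],
    columns=["unit_0", "unit_1"],
)
# factor_1 is not in unit_1's support set: mark the entry as missing
log_pval_matrix.loc["factor_1", "unit_1"] = np.nan
```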
To run the code, we can select features subject to a target FWER level as follows:
```python
import numpy as np
import pandas as pd
from funs import panel_unordered  # Selective Multiple Testing routine from funs.py

# log_pval_matrix: a J x N pandas DataFrame of log p-values (see above)
J, N = log_pval_matrix.shape
alpha_vec = [0.00001, 0.01, 0.05]  # the FWER thresholds you want to try

pmt_rejection_table = panel_unordered(log_pval_matrix)
rho = pmt_rejection_table['rho'].unique()[0]  # the panel cohesiveness coefficient

for alpha in alpha_vec:
    # our Selective Multiple Testing selection at FWER level alpha
    selected_panel_multiple_testing = np.sort(
        pmt_rejection_table.index[pmt_rejection_table['rho_inv.N.p_1'] <= alpha]
    ).tolist()
    # Bonferroni benchmark over all J * N tests
    selected_Bonferroni_multiple_testing = np.sort(
        pmt_rejection_table.index[pmt_rejection_table['p_1'] <= alpha / (J * N)]
    ).tolist()
```
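The Bonferroni benchmark above does not depend on `panel_unordered` and can be checked on a toy matrix of log p-values; everything below (values, row and column names) is illustrative:

```python
import numpy as np
import pandas as pd

# toy 3 x 2 matrix of log p-values (made-up values)
log_pval_matrix = pd.DataFrame(
    np.log([[1e-6, 0.20],
            [0.04, 0.50],
            [0.30, 0.60]]),
    index=["factor_0", "factor_1", "factor_2"],
    columns=["unit_0", "unit_1"],
)
J, N = log_pval_matrix.shape
alpha = 0.05

# Bonferroni over all J * N tests: keep a feature if its smallest p-value
# across units clears the threshold alpha / (J * N)
min_p = np.exp(log_pval_matrix.min(axis=1))
selected_bonferroni = sorted(min_p.index[min_p <= alpha / (J * N)].tolist())
print(selected_bonferroni)  # → ['factor_0']
```

Here only `factor_0` survives, since its minimum p-value (1e-6) is below 0.05 / 6 while the others are not; the panel method in the repo replaces this conservative cutoff with the `rho`-adjusted one.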
For a method-focused code base, we also provide a Python version of Selective Multiple Testing in the GitHub repository for our accompanying paper Large Dimensional Change Point Detection with FWER Control as Automatic Stopping.
To cite this code, in addition to the data sources, please use the following citation:
@article{pelger2022selective,
title={Selective Multiple Testing: Inference for Large Panels with Many Covariates},
author={Pelger, Markus and Zou, Jiacheng},
journal={Available at SSRN 4315891},
year={2022}
}