Conditional Analysis with LD Matrix
Get the same results as GCTA COJO, but with LD matrix.
pip install cojopyor with uv:
uv pip install cojopyThe summary statistics file should have the following columns (same as the cojo file of GCTA):
- SNP: SNP ID
- A1: Effect allele
- A2: Other allele
- b: Effect size
- se: Standard error
- p: P-value
- freq: Minor allele frequency
- N: Sample size
- A tab-delimited LD matrix, same as the output of
plink --r square.
Note
The allele order of the summary statistics file should be the same as the allele order of the LD matrix file.
cojo slct \
--sumstats ./exampledata/sim_gwas.ma \
--ld-matrix ./exampledata/sim_r.ld \
--output ./exampledata/slct.txtFit all the included SNPs to estimate their joint effects without model selection (same as gcta --cojo-joint)
cojo joint \
--sumstats ./exampledata/sim_gwas.ma \
--ld-matrix ./exampledata/sim_r.ld \
--extract-snps ./exampledata/extract_snps.txt \
--output ./exampledata/joint.txtPerform association analysis of the included SNPs conditional on the given list of SNPs (same as gcta --cojo-cond)
cojo cond \
--sumstats ./exampledata/sim_gwas.ma \
--ld-matrix ./exampledata/sim_r.ld \
--cond-snps ./exampledata/cond_snps.txt \
--output ./exampledata/cond.txtcond-snps: Forcondcommand, path to the file containing the SNPs to condition on, one SNP per line.extract-snps: Forjointandcondcommands, path to the file containing the SNPs to extract, one SNP per line.ld-freq: Path to the LD frequency file, optional, use freq in sumstats if not provided.p-cutoff: P-value cutoff, default is 5e-8.collinear-cutoff: Collinearity cutoff, default is 0.9.maf-cutoff: Minor allele frequency cutoff, default is 0.01.diff-freq-cutoff: Difference in minor allele frequency cutoff, default is 0.2, won't take effect unlessld-freqis provided.