This is the set of scripts which comprise a pipeline from raw sequence data to gene expression analyses for an experiment on Callosobruchus maculatus adaptation to temperature extremes.
- Initial stats via FastQC
- Read trimming and filtering with Trimmomatic
- HiSat2 reference genome index generation and subsequence read alignment to reference
- SAMtools processing to sort the mapped reads by co-ordinates, filter unmapped reads, and produce indexes for each library
- HTSeq count to count the number of reads at each genomic feature
- Gene whitelist generation via EdgeR
- General gene expression analysis via EdgeR
- Gene Ontology analysis
- Statistical analaysis approaches using Fisher's Exact tests etc.
- WGCNA & Circlize visualisation
- Upset plots, Venn diagrams, and other visualisation tools