Phylogenetic analysis of Varicella-Zoster Virus (VZV) using Nextstrain tools (Augur, Auspice, Nextclade, Snakemake)
This repository contains three main workflows:
ingest/– Download and curate VZV data from GenBank and generate normalized metadataphylogenetic/– Filter and align sequences, build the phylogenetic tree, assign clades and mutations, and export results for visualization.nextclade/– Create a custom Nextclade dataset compatible with Nextstrain clade definitions and tree visualizationcheck_sequences/– Utilities and scripts to extract, compare and align duplicated genes (e.g. ORF62/71, ORF63/70, ORF64/69) and check the insertion for each sequence.
Each workflow directory (duplicated_genes_check/, phylogenetic/, nextclade/) contains its own README.md file with detailed documentation, including the purpose of the workflow, input/output files, configuration settings, and usage instructions.
Follow the standard installation instructions for Nextstrain's suite of software tools.
Run the phylogenetic workflow:
cd phylogenetic/
nextstrain build .
nextstrain view .Or run the full ingest workflow:
cd ingest/
snakemake