Skip to content

Commit 65dca28

Browse files
committed
moving the sarscov2 preset from using the refseq Wuhan-1 accession to the more commonly referenced genbank Wuhan-1 accession
1 parent f6ee0f5 commit 65dca28

File tree

1 file changed

+8
-5
lines changed

1 file changed

+8
-5
lines changed

conf/presets/sarscov2.config

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
*
44
* This preset extends the virus preset with SARS-CoV-2 specific settings:
55
* - Inherits: Sylph IMG/VR 4.1 database, Panhuman-1 decontamination
6-
* - Adds: Wuhan-Hu-1 reference sequence (NCBI RefSeq NC_045512.2)
6+
* - Adds: Wuhan-Hu-1 reference sequence (GenBank MN908947.3)
77
* - Adds: Nextclade SARS-CoV-2 dataset for clade assignment and QC
88
*
99
* The reference sequence is automatically fetched from NCBI if not provided
@@ -18,10 +18,13 @@
1818
includeConfig 'virus.config'
1919

2020
params {
21-
// SARS-CoV-2 Wuhan-Hu-1 reference sequence (NCBI RefSeq)
22-
// Automatically fetched from NCBI Entrez if not a local file
23-
refseq = "NC_045512.2"
24-
ref_gbk = "NC_045512.2"
21+
// SARS-CoV-2 Wuhan-Hu-1 reference sequence (GenBank)
22+
// Using GenBank accession rather than RefSeq (NC_045512.2) because:
23+
// - Most primer schemes (ARTIC, QIAseq, etc.) use MN908947.3 coordinates
24+
// - Avoids chromosome name mismatches with common primer BED files
25+
// - Immutable versioned record ensures reproducibility
26+
refseq = "MN908947.3"
27+
ref_gbk = "MN908947.3"
2528

2629
// Nextclade dataset for SARS-CoV-2 clade assignment and phylogenetic QC
2730
// See available datasets: nextclade dataset list --only-names

0 commit comments

Comments
 (0)