Skip to content

Commit 1784c34

Browse files
chore(ci): disable output gff validation for hav dataset
Our GFF files are not strictly compliant currently: in presence of input sequences with duplicated seqids, the output GFF files will also have duplicated #sequence-region pragmas, which is not allowed according to the GFF3 spec, and fails validation.
1 parent df664cf commit 1784c34

File tree

1 file changed

+13
-0
lines changed

1 file changed

+13
-0
lines changed

scripts/validate-gff

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,19 @@ IGNORED_PATH_PREFIXES=(
3737
# (these 2 run modes ignore pathogen.json). This results in empty annotation files.
3838
"tmp/smoke-tests/result/nextstrain/yellow-fever/prM-E/with_ref_and_annotation/nextclade.gff"
3939
"tmp/smoke-tests/result/nextstrain/yellow-fever/prM-E/with_ref_and_annotation_and_tree/nextclade.gff"
40+
41+
# error: the sequence region "PQ496830.1" has already been defined
42+
# If multiple sequences have the same fasta seqid (if they have different fasta description they are not
43+
# considered duplicates), we emit multiple sequence-region pragmas with the same seqid (spaces are not allowed there).
44+
# This makes it invalid GFF3.
45+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/whole-genome/with_name/nextclade.gff"
46+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/whole-genome/with_ref_and_annotation/nextclade.gff"
47+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/whole-genome/with_ref_and_annotation_and_tree/nextclade.gff"
48+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/whole-genome/with_dataset/nextclade.gff"
49+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/vp1-2b-junction/with_name/nextclade.gff"
50+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/vp1-2b-junction/with_ref_and_annotation/nextclade.gff"
51+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/vp1-2b-junction/with_ref_and_annotation_and_tree/nextclade.gff"
52+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/vp1-2b-junction/with_dataset/nextclade.gff"
4053
)
4154

4255
IGNORED_SEQUENCE_REGEXES=(

0 commit comments

Comments
 (0)