Skip to content

Commit 835ce49

Browse files
chore(ci): disable output gff validation for hav dataset
Our GFF files are not strictly compliant currently: in presence of input sequences with duplicated seqids, the output GFF files will also have duplicated #sequence-region pragmas, which is not allowed according to the GFF3 spec, and fails validation.
1 parent df664cf commit 835ce49

File tree

1 file changed

+12
-0
lines changed

1 file changed

+12
-0
lines changed

scripts/validate-gff

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,18 @@ IGNORED_PATH_PREFIXES=(
3737
# (these 2 run modes ignore pathogen.json). This results in empty annotation files.
3838
"tmp/smoke-tests/result/nextstrain/yellow-fever/prM-E/with_ref_and_annotation/nextclade.gff"
3939
"tmp/smoke-tests/result/nextstrain/yellow-fever/prM-E/with_ref_and_annotation_and_tree/nextclade.gff"
40+
41+
# error: the sequence region "PQ496830.1" has already been defined
42+
# If multiple sequences have the same fasta seqid (if they have different fasta description),
43+
# we emit multiple sequence-region pragmas with the same seqid. This is not strictly valid GFF3.
44+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/whole-genome/with_name/nextclade.gff"
45+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/whole-genome/with_ref_and_annotation/nextclade.gff"
46+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/whole-genome/with_ref_and_annotation_and_tree/nextclade.gff"
47+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/whole-genome/with_dataset/nextclade.gff"
48+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/vp1-2b-junction/with_name/nextclade.gff"
49+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/vp1-2b-junction/with_ref_and_annotation/nextclade.gff"
50+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/vp1-2b-junction/with_ref_and_annotation_and_tree/nextclade.gff"
51+
"tmp/smoke-tests/result/community/masphl-bioinformatics/hav/vp1-2b-junction/with_dataset/nextclade.gff"
4052
)
4153

4254
IGNORED_SEQUENCE_REGEXES=(

0 commit comments

Comments
 (0)