You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+25-30Lines changed: 25 additions & 30 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,34 +15,29 @@ The germline variant annotator (*gvanno*) is a software package intended for ana
15
15
*gvanno* accepts query files encoded in the VCF format, and can analyze both SNVs and short InDels. The workflow relies heavily upon [Ensembl’s Variant Effect Predictor (VEP)](http://www.ensembl.org/info/docs/tools/vep/index.html), and [vcfanno](https://github.com/brentp/vcfanno). It produces an annotated VCF file and a file of tab-separated values (.tsv), the latter listing all annotations pr. variant record. Note that if your input VCF contains data (genotypes) from multiple samples (i.e. a multisample VCF), the output TSV file will contain one line/record __per sample variant__.
16
16
17
17
### News
18
+
* September 24th 2022 - **1.5.0 release**
19
+
* Data updates: ClinVar, GENCODE GWAS catalog, CancerMine, Open Targets Platform
20
+
* Software updates: VEP 107
21
+
* Excluded UniProt KB from annotation tracks
18
22
* December 21st 2021 - **1.4.4 release**
19
23
* Data updates: ClinVar, GWAS catalog, CancerMine, UniProt KB, Open Targets Platform
20
24
* Software updates: VEP (v105)
21
25
* August 25th 2021 - **1.4.3 release**
22
26
* Data updates: ClinVar, GWAS catalog, CancerMine, UniProt, Open Targets Platform
23
-
* May 24th 2021 - **1.4.2 release**
24
-
* Software update (VEP 104)
25
-
* Data updates: ClinVar, GWAS catalog, CancerMine, Pfam, dbNSFP, UniProt
26
-
* Two new options added:
27
-
* `--vep_regulatory` - annotates variants for overlap with regulatory regions (details below)
28
-
* `--docker-uid` - set Docker user id
29
-
* New variant annotations for enhanced non-coding interpretation:
30
-
* _REGULATORY_ANNOTATION_ : A comma-separated list of regulatory annotations from VEP's `--regulatory` option, i.e. __TF_binding_site__, overlap with __enhancer/promoter/open_chromatin__, __CTCF_binding_site__ etc. Included when the `--vep_regulatory` option is turned on in gvanno.
31
-
* _NCER_PERCENTILE_: A genome-wide percentile rank score from the ncER algorithm (**n**on-**c**oding **E**ssential **R**egulation), [Wells et al., Nat Comm. (2019)](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6868241/).
32
-
33
-
### Annotation resources (v1.4.4)
34
-
35
-
*[VEP](http://www.ensembl.org/info/docs/tools/vep/index.html) - Variant Effect Predictor v105 (GENCODE v39/v19 as the gene reference dataset)
27
+
28
+
### Annotation resources (v1.5.0)
29
+
30
+
*[VEP](http://www.ensembl.org/info/docs/tools/vep/index.html) - Variant Effect Predictor v107 (GENCODE v41/v19 as the gene reference dataset)
36
31
*[dBNSFP](https://sites.google.com/site/jpopgen/dbNSFP) - Database of non-synonymous functional predictions (v4.2, March 2021)
37
32
*[gnomAD](http://gnomad.broadinstitute.org/) - Germline variant frequencies exome-wide (release 2.1, October 2018) - from VEP
38
33
*[dbSNP](http://www.ncbi.nlm.nih.gov/SNP/) - Database of short genetic variants (build 154) - from VEP
*[ClinVar](http://www.ncbi.nlm.nih.gov/clinvar/) - Database of variants related to human health/disease phenotypes (December 2021)
41
-
*[CancerMine](http://bionlp.bcgsc.ca/cancermine/) - literature-mined database of drivers, oncogenes and tumor suppressors in cancer (version 41, December 2021)
*[UniProt/SwissProt KnowledgeBase](http://www.uniprot.org) - Resource on protein sequence and functional information (2021_04, November 2021)
35
+
*[ClinVar](http://www.ncbi.nlm.nih.gov/clinvar/) - Database of variants related to human health/disease phenotypes (September 2022)
36
+
*[CancerMine](http://bionlp.bcgsc.ca/cancermine/) - literature-mined database of drivers, oncogenes and tumor suppressors in cancer (version 47, July 2022)
37
+
*[Open Targets Platform](https://targetvalidation.org) - Target-disease and target-drug associations (2022_06, June 2022)
44
38
*[Pfam](http://pfam.xfam.org) - Database of protein families and domains (v35.0, November 2021)
45
-
*[NHGRI-EBI GWAS Catalog](https://www.ebi.ac.uk/gwas/home) - Catalog of published genome-wide association studies (December 7th 2021)
39
+
*[Mutation hotspots](cancerhotspots.org) - Database of mutation hotspots in cancer
40
+
*[NHGRI-EBI GWAS Catalog](https://www.ebi.ac.uk/gwas/home) - Catalog of published genome-wide association studies (August 26th 2022)
46
41
47
42
48
43
### Getting started
@@ -76,17 +71,17 @@ An installation of Python (version >=_3.6_) is required to run *gvanno*. Check t
76
71
77
72
#### STEP 2: Download *gvanno* and data bundle
78
73
79
-
1.[Download the latest version](https://github.com/sigven/gvanno/releases/tag/v1.4.4) (gvanno run script, v1.4.4)
74
+
1.[Download the latest version](https://github.com/sigven/gvanno/releases/tag/v1.5.0) (gvanno run script, v1.5.0)
80
75
2. Download (preferably using `wget`) and unpack the latest assembly-specific data bundle in the gvanno directory
81
-
*[grch37 data bundle](http://insilico.hpc.uio.no/pcgr/gvanno/gvanno.databundle.grch37.20211221.tgz) (approx 18Gb)
82
-
*[grch38 data bundle](http://insilico.hpc.uio.no/pcgr/gvanno/gvanno.databundle.grch38.20211221.tgz) (approx 20Gb)
76
+
*[grch37 data bundle](http://insilico.hpc.uio.no/pcgr/gvanno/gvanno.databundle.grch37.20220921.tgz) (approx 20Gb)
77
+
*[grch38 data bundle](http://insilico.hpc.uio.no/pcgr/gvanno/gvanno.databundle.grch38.20220921.tgz) (approx 28Gb)
0 commit comments