|
1 | 1 | About |
2 | | -===== |
| 2 | +----- |
3 | 3 |
|
4 | | -Arriba is a command-line tool for the detection of gene fusions from RNA-Seq data. It was developed for the use in a clinical research setting. Therefore, short runtimes and high sensitivity were important design criteria. It is based on the ultrafast [STAR aligner](https://github.com/alexdobin/STAR) and the post-alignment runtime is typically just ~2 minutes. In contrast to many other fusion detection tools which build on STAR, Arriba does not require to reduce the `alignIntronMax` parameter of STAR to detect fusions arising from focal deletions. |
| 4 | +Arriba is a command-line tool for the detection of gene fusions from RNA-Seq data. It was developed for the use in a clinical research setting. Therefore, short runtimes and high sensitivity were important design criteria. It is based on the ultrafast [STAR aligner](https://github.com/alexdobin/STAR), and the post-alignment runtime is typically just ~2 minutes. Arriba's workflow produces fully reusable alignments, which can serve as input to other common analyses, such as quantification of gene expression. In contrast to many other fusion detection tools which build on STAR, Arriba does not require to reduce the STAR parameter `--alignIntronMax` to detect fusions arising from focal deletions. Reducing this parameter impairs mapping of reads to genes with long introns and may affect expression quantification, hence. |
5 | 5 |
|
6 | | -Apart from gene fusions, Arriba can detect other structural rearrangements with potential clinical relevance, such as viral integration sites, internal tandem duplications, whole exon duplications, truncations of genes (i.e., breakpoints in introns and intergenic regions). |
| 6 | +Apart from gene fusions, Arriba can detect other structural rearrangements with potential clinical relevance, including viral integration sites, internal tandem duplications, whole exon duplications, intragenic inversions, enhancer hijacking events involving immunoglobulin/T-cell receptor loci, translocations affecting genes with many paralogs such as DUX4, and truncations of genes (i.e., breakpoints in introns or intergenic regions). |
7 | 7 |
|
8 | | -Arriba is the winner of the [DREAM SMC-RNA Challenge](https://www.synapse.org/SMC_RNA), an international competition organized by ICGC, TCGA, IBM, and Sage Bionetworks to determine the current gold standard for the detection of gene fusions from RNA-Seq data. The final results of the challenge are posted on the [Round 5 Leaderboard](https://www.synapse.org/#!Synapse:syn2813589/wiki/588511). |
| 8 | +Arriba is the winner of the [DREAM SMC-RNA Challenge](https://www.synapse.org/SMC_RNA), an international competition organized by ICGC, TCGA, IBM, and Sage Bionetworks to determine the current gold standard for the detection of gene fusions from RNA-Seq data. The final results of the challenge are posted on the [Round 5 Leaderboard](https://www.synapse.org/#!Synapse:syn2813589/wiki/588511) and discussed in the accompanying [publication](https://doi.org/10.1016/j.cels.2021.05.021). |
9 | 9 |
|
10 | 10 | Get help |
11 | | -======== |
| 11 | +-------- |
12 | 12 |
|
13 | 13 | Use the [GitHub issue tracker](https://github.com/suhrig/arriba/issues) to get help or to report bugs. |
14 | 14 |
|
15 | 15 | Citation |
16 | | -======== |
| 16 | +-------- |
17 | 17 |
|
18 | | -Sebastian Uhrig, Julia Ellermann, Tatjana Walther, Pauline Burkhardt, Martina Fröhlich, Barbara Hutter, Umut H. Toprak, Olaf Neumann, Albrecht Stenzinger, Claudia Scholl, Stefan Fröhling and Benedikt Brors: *Accurate and efficient detection of gene fusions from RNA sequencing data.* Genome Research. Published in Advance January 13, 2021. doi: [10.1101/gr.257246.119](https://doi.org/10.1101/gr.257246.119) |
| 18 | +Sebastian Uhrig, Julia Ellermann, Tatjana Walther, Pauline Burkhardt, Martina Fröhlich, Barbara Hutter, Umut H. Toprak, Olaf Neumann, Albrecht Stenzinger, Claudia Scholl, Stefan Fröhling and Benedikt Brors: *Accurate and efficient detection of gene fusions from RNA sequencing data.* Genome Research. March 2021 31: 448-460; Published in Advance January 13, 2021. doi: [10.1101/gr.257246.119](https://doi.org/10.1101/gr.257246.119) |
19 | 19 |
|
20 | 20 | License |
21 | 21 | ------- |
22 | 22 |
|
23 | 23 | The code, software and database files of Arriba are distributed under the MIT/Expat License, with the exception of the script `draw_fusions.R`, which is distributed under the GNU GPL v3 due to dependencies on GPL-licensed R packages. The terms and conditions of both licenses can be found in the [LICENSE file](https://raw.githubusercontent.com/suhrig/arriba/master/LICENSE). |
24 | 24 |
|
25 | 25 | User manual |
26 | | -=========== |
| 26 | +----------- |
27 | 27 |
|
28 | 28 | Please refer to the [user manual](http://arriba.readthedocs.io/en/latest/) for installation instructions and information about usage. **Note: You should not use `git clone` to download Arriba, because the git repository does not include the blacklist and other database files!** |
29 | 29 |
|
@@ -75,12 +75,24 @@ Please refer to the [user manual](http://arriba.readthedocs.io/en/latest/) for i |
75 | 75 | - [Multiple transcript variants](https://arriba.readthedocs.io/en/latest/interpretation-of-results/#multiple-transcript-variants) |
76 | 76 | - [Cohort analysis](https://arriba.readthedocs.io/en/latest/interpretation-of-results/#cohort-analysis) |
77 | 77 |
|
78 | | -8. [Current limitations](https://arriba.readthedocs.io/en/latest/current-limitations/) |
| 78 | +8. [Utility scripts](https://arriba.readthedocs.io/en/latest/utility-scripts/) |
| 79 | + |
| 80 | + - [Extract fusion-supporting alignments](https://arriba.readthedocs.io/en/latest/utility-scripts/#extract-fusion-supporting-alignments) |
| 81 | + - [Convert fusions.tsv to VCF format](https://arriba.readthedocs.io/en/latest/utility-scripts/#convert-fusionstsv-to-vcf-format) |
| 82 | + - [Run Arriba on prealigned BAM file](https://arriba.readthedocs.io/en/latest/utility-scripts/#run-arriba-on-prealigned-bam-file) |
| 83 | + - [Quantify virus expression](https://arriba.readthedocs.io/en/latest/utility-scripts/#quantify-virus-expression) |
| 84 | + |
| 85 | +9. [Current limitations](https://arriba.readthedocs.io/en/latest/current-limitations/) |
79 | 86 |
|
80 | 87 | - [Intragenic deletions](https://arriba.readthedocs.io/en/latest/current-limitations/#intragenic-deletions) |
| 88 | + - [RefSeq annotation](https://arriba.readthedocs.io/en/latest/current-limitations/#refseq-annotation) |
81 | 89 | - [Memory consumption](https://arriba.readthedocs.io/en/latest/current-limitations/#memory-consumption) |
| 90 | + - [Adapter trimming](https://arriba.readthedocs.io/en/latest/current-limitations/#adapter-trimming) |
| 91 | + - [Small insert size](https://arriba.readthedocs.io/en/latest/current-limitations/#small-insert-size) |
| 92 | + - [Viral detection](https://arriba.readthedocs.io/en/latest/current-limitations/#viral-detection) |
| 93 | + - [Targeted sequencing](https://arriba.readthedocs.io/en/latest/current-limitations/#targeted-sequencing) |
82 | 94 |
|
83 | | -9. [Internal algorithm](https://arriba.readthedocs.io/en/latest/internal-algorithm/) |
| 95 | +10. [Internal algorithm](https://arriba.readthedocs.io/en/latest/internal-algorithm/) |
84 | 96 |
|
85 | 97 | - [Read-level filters](https://arriba.readthedocs.io/en/latest/internal-algorithm/#read-level-filters) |
86 | 98 | - [Event-level filters](https://arriba.readthedocs.io/en/latest/internal-algorithm/#event-level-filters) |
|
0 commit comments