You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+10-1Lines changed: 10 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,7 @@ in a regulatory and splicing context.
16
16
17
17
## Installation
18
18
19
-
Clone and install regtools by running:
19
+
Clone and install regtools by running the following:
20
20
```
21
21
git clone https://github.com/griffithlab/regtools
22
22
cd regtools/
@@ -26,6 +26,8 @@ Clone and install regtools by running:
26
26
make
27
27
```
28
28
29
+
For convienience we also maintain a docker image available at [https://hub.docker.com/r/griffithlab/regtools/](https://hub.docker.com/r/griffithlab/regtools/)
30
+
29
31
## Usage:
30
32
31
33
```
@@ -55,6 +57,7 @@ If you would like to build the documentation locally, please install
55
57
work on most machines. Then run `mkdocs serve` from within the `regtools`
56
58
base directory.
57
59
60
+
58
61
## Acknowledgements
59
62
60
63
Regtools uses several open-source libraries. We would like to thank the
@@ -64,3 +67,9 @@ useful comments and code.
64
67
## License
65
68
66
69
The project is licensed under the [MIT license](https://opensource.org/licenses/MIT).
Copy file name to clipboardExpand all lines: docs/commands/cis-ase-identify.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -10,7 +10,7 @@ The `cis-ase identify` command is used to identify allele-specific expression ev
10
10
| somatic-variants.vcf | Somatic variant calls in VCF format. The tool looks for allele specific expression at polymorphic loci near the somatic variants|
11
11
| polymorphisms.vcf | List of polymorphic loci in the VCF format. RNA expression is checked at these sites to identify evidence of allele speciific expression|
12
12
| dna-alignments.bam | Aligned DNA reads in the BAM format that has been indexed for example with `samtools index`. We have tested this command with alignments from BWA.|
13
-
|dna-alignments.bam | Aligned RNAseq BAM produced with a splice aware aligner, that has been indexed for example with `samtools index`. We have tested this command with alignments from TopHat.|
13
+
|rna-alignments.bam | Aligned RNAseq BAM produced with a splice aware aligner, that has been indexed for example with `samtools index`. We have tested this command with alignments from TopHat.|
14
14
| ref.fa | The reference FASTA file. The donor and acceptor sequences used in the "splice-site" column of the annotated junctions are extracted from the FASTA file. |
15
15
| annotations.gtf | The GTF file specifies the transcriptome that is used to annotate the junctions and variants. For examples, the Ensembl GTFs for release78 are [here](ftp://ftp.ensembl.org/pub/release-78/gtf/).|
The `cis-splice-effects associate` command is used to identify splicing misregulation events. This command is similar to `cis-splice-effects identify`, but takes the BED output of `junctions extract` in lieu of a BAM file with RNA alignments. The tool then proceeds to associate non-canonical splicing junctions near the variant sites.
| variants.vcf | Variant call in VCF format from which to look for cis-splice-effects.|
13
+
| junctions.bed | BED file of junctions to look through for evidence of splice events. The file is expected to be in the [BED12 format](junctions-extract.md#output) of the `junctions extract` output. |
14
+
| ref.fa | The reference FASTA file. The donor and acceptor sequences used in the "splice-site" column of the annotated junctions are extracted from the FASTA file. |
15
+
| annotations.gtf | The GTF file specifies the transcriptome that is used to annotate the junctions and variants. For examples, the Ensembl GTFs for release78 are [here](ftp://ftp.ensembl.org/pub/release-78/gtf/).|
16
+
17
+
**Note** - Please make sure that the version of the annotation GTF that you use corresponds with the version of the assembly build (ref.fa) and that the co-ordinates in the VCF file are also from the same build.
18
+
19
+
###Options
20
+
| Option | Description |
21
+
| ------ | ----------- |
22
+
| -o STR | Output file containing the aberrant splice junctions with annotations. [STDOUT]|
| -j STR | Output file containing the aberrant junctions in BED12 format. |
25
+
| -w INT | Window size in b.p to associate splicing events in. The tool identifies events in variant.start +/- w basepairs. Default behaviour is to look at the window between previous and next exons. |
26
+
| -e INT | Maximum distance from the start/end of an exon to annotate a variant as relevant to splicing, the variant is in exonic space, i.e a coding variant. [3]|
27
+
| -i INT | Maximum distance from the start/end of an exon to annotate a variant as relevant to splicing, the variant is in intronic space. [2]|
28
+
| -I | Annotate variants in intronic space within a transcript(not to be used with -i). |
29
+
| -E | Annotate variants in exonic space within a transcript(not to be used with -e). |
30
+
| -S | Don't skip single exon transcripts. |
31
+
32
+
###Output
33
+
For an explanation of the annotated junctions that are identified by this command please refer to the output of the `junctions annotate` command [here](junctions-annotate.md#output)
34
+
For an explanation of the annotated variants that are identified by this command when using the -v option, please refer to the output of the `variants annotate` command [here](variants-annotate.md#output)
| -w | Window around the variant file (in basepairs) to identify splicing events in. If specified the tool looks at +/- n b.p around the variant start position. For example -w 500 will look at a 1kb window around the variant. If this option is not specified, the default option is to look at a window that ranges from the start co-ordinate of the previous exon and ends at the end co-ordinate of the next exon i.e by treating the current exon as a cassette exon. |
25
-
| -j | Optional file containing the aberrant junctions in BED12 format. |
26
-
| -e | Maximum distance from the start/end of an exon to annotate a variant as relevant to splicing, the variant is in exonic space, i.e a coding variant. [default = 3]|
27
-
| -i | Maximum distance from the start/end of an exon to annotate a variant as relevant to splicing, the variant is in intronic space. [default = 2]|
28
-
| -I | Annotate variants in intronic space within a transcript (not to be used with -i).
29
-
| -E | Annotate variants in exonic space within a transcript (not to be used with -e).
30
-
| -S | Dont skip single exon transcripts. The default is to skip the single exon transcripts. |
22
+
| -o STR | Output file containing the aberrant splice junctions with annotations. [STDOUT]|
| -j STR | Output file containing the aberrant junctions in BED12 format. |
25
+
| -s INT | Strand specificity of RNA library preparation, where 0 = unstranded/XS, 1 = first-strand/RF, 2 = second-strand/FR. This option is required. If your alignments contain XS tags, these will be used in the "unstranded" mode. |
26
+
| -w INT | Window size in b.p to identify splicing events in. The tool identifies events in variant.start +/- w basepairs. Default behaviour is to look at the window between previous and next exons. |
27
+
| -e INT | Maximum distance from the start/end of an exon to annotate a variant as relevant to splicing, the variant is in exonic space, i.e a coding variant. [3]|
28
+
| -i INT | Maximum distance from the start/end of an exon to annotate a variant as relevant to splicing, the variant is in intronic space. [2]|
29
+
| -I | Annotate variants in intronic space within a transcript(not to be used with -i). |
30
+
| -E | Annotate variants in exonic space within a transcript(not to be used with -e). |
31
+
| -S | Don't skip single exon transcripts. |
31
32
32
33
###Output
33
34
For an explanation of the annotated junctions that are identified by this command please refer to the output of the `junctions annotate` command [here](junctions-annotate.md#output)
Copy file name to clipboardExpand all lines: docs/commands/commands.md
+1Lines changed: 1 addition & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -13,6 +13,7 @@ This set of tools helps identify and work with aberrant splicing events near var
13
13
Below are links to detailed explanations of the `cis-splice-effects` sub-commands:
14
14
15
15
-[identify](cis-splice-effects-identify.md)
16
+
-[associate](cis-splice-effects-associate.md)
16
17
17
18
##cis-ase
18
19
This set of tools helps identify and work with allele-specific-expression near variants, these could be somatic variants or germline polymorphisms/mutations. These variants are hypothesized to act in cis and affect how the gene is transcribed.
0 commit comments