Skip to content

Commit 2c556d5

Browse files
authored
Merge branch 'dev' into 107-docs-add-contributing-guidelines
2 parents 5603959 + 2152b6f commit 2c556d5

24 files changed

+2007
-476
lines changed

pipeline_documentation.md

Lines changed: 112 additions & 48 deletions
Original file line numberDiff line numberDiff line change
@@ -1190,7 +1190,7 @@ Intersect the aligned reads with the extended pri-miR annotations with
11901190
- pri-miR extended annotations (`.gff3`); from
11911191
[**extend_mirs_annotations**](#extend_mirs_annotations)
11921192
- **Output**
1193-
- pri-miR intersections file (`.bed`); used in
1193+
- pri-miR intersections file (`.intersect`); used in
11941194
[**filter_sam_by_intersecting_primir**](#filter_sam_by_intersecting_primir)
11951195
and [**quantify_primir**](#quantify_primir)
11961196

@@ -1206,7 +1206,7 @@ Remove alignments that do not intersect with any pri-miR with
12061206
- **Input**
12071207
- Alignments file, filtered (`.sam`); from
12081208
[**filter_by_indels**](#filter_by_indels)
1209-
- pri-miR intersections file (`.bed`); from
1209+
- pri-miR intersections file (`.intersect`); from
12101210
[**intersect_extended_primir**](#intersect_extended_primir)
12111211
- **Output**
12121212
- (**Workflow output**) Alignments file, filtered (`.sam`); used in
@@ -1275,7 +1275,7 @@ Intersect the aligned reads with the extended miRNA annotations with
12751275
- Mature miRNA extended annotations (`.gff3`); from
12761276
[**extend_mirs_annotations**](#extend_mirs_annotations)
12771277
- **Output**
1278-
- Mature miRNA intersections file (`.bed`); used in
1278+
- Mature miRNA intersections file (`.intersect`); used in
12791279
[**filter_sam_by_intersecting_mirna**](#filter_sam_by_intersecting_mirna)
12801280
and [**add_intersecting_mirna_tag**](#add_intersecting_mirna_tag)
12811281

@@ -1290,7 +1290,7 @@ Remove alignments that do not intersect with any miRNA with
12901290
- **Input**
12911291
- Alignments file, filtered (`.sam`); from
12921292
[**filter_sam_by_intersecting_primir**](#filter_sam_by_intersecting_primir)
1293-
- Mature miRNA intersections file (`.bed`); from
1293+
- Mature miRNA intersections file (`.intersect`); from
12941294
[**intersect_extended_mirna**](#intersect_extended_mirna)
12951295
- **Output**
12961296
- (**Workflow output**) Alignments file, filtered (`.sam`); used in
@@ -1313,7 +1313,7 @@ with a [**custom script**][custom-script-iso-tag].
13131313
- **Input**
13141314
- Alignments file, filtered (`.sam`); from
13151315
[**filter_sam_by_intersecting_mirna**](#filter_sam_by_intersecting_mirna)
1316-
- Mature miRNA intersections file (`.bed`); from
1316+
- Mature miRNA intersections file (`.intersect`); from
13171317
[**intersect_extended_mirna**](#intersect_extended_mirna)
13181318
- **Parameters**
13191319
- **config_template.yaml**
@@ -1590,64 +1590,128 @@ Tabulate alignments according to its intersecting pri-miR with a
15901590
> name format set in [**mirna_extension**](#mirna_extension).
15911591
15921592
- **Input**
1593-
- pri-miR intersections file (`.bed`); from
1593+
- pri-miR intersections file (`.intersect`); from
15941594
[**intersect_extended_primir**](#intersect_extended_primir)
15951595
- **Output**
15961596
- pri-miR counts tab-delimited file; used in
15971597
[**merge_tables**](#merge_tables)
15981598
- **Examples**
15991599

16001600
```console
1601-
Example 1 | One single pri-miR with different alignments
1601+
Example 1 | Contribution when using '--collapsed'
1602+
use case:
1603+
A single feature with several intersecting reads.
1604+
The flag '--collapsed' is used, so contribution equals the # of reads
1605+
per alignment.
16021606

1603-
IN BED records:
1604-
19 . miRNA_primary_transcript 27766 27788 . + . ID=MI0003150;Alias=MI0003150;Name=hsa-mir-526b_-0_+0 19 27765 27788 68-2_1 255 +
1605-
19 . miRNA_primary_transcript 27766 27787 . + . ID=MI0003150;Alias=MI0003150;Name=hsa-mir-526b_-0_+0 19 27765 27787 316-1_7 1 +
1606-
19 . miRNA_primary_transcript 27804 27823 . + . ID=MI0003150;Alias=MI0003150;Name=hsa-mir-526b_-0_+0 19 27803 27823 599-1_3 255 +
1607-
19 . miRNA_primary_transcript 27805 27822 . + . ID=MI0003150;Alias=MI0003150;Name=hsa-mir-526b_-0_+0 19 27804 27822 226-1_4 1 +
1607+
IN INTERSECT records:
1608+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 8-2 255 + 21
1609+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 24-1 255 + 21
16081610

1609-
Alignments:
1610-
Read ID: 68
1611-
Number of collapsed reads: 2
1612-
Number of mapped genomic loci: 1
1613-
Contribution: 2/1 = 2
1614-
1615-
Read ID: 316
1616-
Number of collapsed reads: 1
1617-
Number of mapped genomic loci: 7
1618-
Contribution: 1/7 = 0.143
1619-
1620-
Read ID: 599
1621-
Number of collapsed reads: 1
1622-
Number of mapped genomic loci: 3
1623-
Contribution: 1/3 = 0.33
1624-
1625-
Read ID: 226
1626-
Number of collapsed reads: 1
1627-
Number of mapped genomic loci: 4
1628-
Contribution: 1/4 = 0.25
1611+
alignments:
1612+
Read ID: 8-2
1613+
Number of collapsed reads: 2
1614+
Contribution: 2
16291615

1630-
OUT table:
1631-
ID lib_name
1632-
hsa-mir-526b_-0_+0 2.723
1616+
Read ID: 24-1
1617+
Number of collapsed reads: 1
1618+
Contribution: 1
16331619

1620+
OUT table:
1621+
hsa-mir-524_-0_+0 3
16341622

1635-
Example 2 | Different pri-miRs for a single read
16361623

1637-
IN BED records:
1638-
19 . miRNA_primary_transcript 40866 40886 . + . ID=MI0003158;Alias=MI0003158;Name=hsa-mir-520c_-0_+0 19 40865 40886 10-4_2 255 +
1639-
19 . miRNA_primary_transcript 34627 34647 . + . ID=MI0003155;Alias=MI0003155;Name=hsa-mir-520b_-5_+6 19 34626 34647 10-4_2 255 +
1624+
Example 2 | Contribution when using '--nh'
1625+
use case:
1626+
A single feature with several intersecting reads.
1627+
The flag '--nh' is used, so contribution equals 1/NH.
16401628

1641-
Alignment:
1642-
Read ID: 10
1643-
Number of collapsed reads: 4
1644-
Number of mapped genomic loci: 2
1645-
Contribution: 4/2 = 2
1629+
IN INTERSECT records:
1630+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 8_1 255 + 21
1631+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 24_1 255 + 21
16461632

1647-
OUT table:
1648-
ID lib_name
1649-
hsa-mir-520c_-0_+0 2
1650-
hsa-mir-520b_-5_+6 2
1633+
alignments:
1634+
Read ID: 8_1
1635+
Number of mapped genomic loci: 1
1636+
Contribution: 1
1637+
1638+
Read ID: 24_1
1639+
Number of mapped genomic loci: 1
1640+
Contribution: 1
1641+
1642+
OUT table:
1643+
hsa-mir-524_-0_+0 2
1644+
1645+
1646+
Example 3 | Contribution when using '--collapsed' and '--nh'
1647+
use case:
1648+
A single feature with several intersecting reads.
1649+
The flags '--nh' and '--contribution' are used, so contribution equals
1650+
# of reads/NH.
1651+
1652+
IN INTERSECT records:
1653+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 8-2_1 255 + 21
1654+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 24-1_1 255 + 21
1655+
1656+
alignments:
1657+
Read ID: 8-2_1
1658+
Number of collapsed reads: 2
1659+
Number of mapped genomic loci: 1
1660+
Contribution: 2/1 = 2
1661+
1662+
Read ID: 23-1_1
1663+
Number of collapsed reads: 1
1664+
Number of mapped genomic loci: 1
1665+
Contribution: 1/1 = 1
1666+
1667+
OUT table:
1668+
hsa-mir-524_-0_+0 3
1669+
1670+
Example 4 | Column with intersecting reads; using '--read-ids'
1671+
use case:
1672+
A single feature with several intersecting reads.
1673+
Each read contributes with 1.
1674+
Read IDs intersecting the feature are appended as the last column.
1675+
1676+
IN INTERSECT records:
1677+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 8-2_1 255 + 21
1678+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 24-1_1 255 + 21
1679+
1680+
alignments:
1681+
Read ID: 8-2_1
1682+
Contribution: 1
1683+
1684+
Read ID: 24-1_1
1685+
Contribution: 1
1686+
1687+
OUT table:
1688+
hsa-mir-524_-0_+0 2 8-2_1;24-1_1
1689+
1690+
Example 5 | Columns with feature shifts; using '--feat-extension'
1691+
use case:
1692+
A single feature with several intersecting reads.
1693+
Each read contributes with 1.
1694+
Feature start and end coordinates shift are appended as the third and
1695+
fourth column respectively.
1696+
1697+
IN INTERSECT records:
1698+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-1_+3 19 44413 44434 8-2_1 255 + 21
1699+
19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-1_+3 19 44413 44434 24-1_1 255 + 21
1700+
1701+
alignments:
1702+
Read ID: 8-2_1
1703+
Contribution: 1
1704+
1705+
Read ID: 24-1_1
1706+
Contribution: 1
1707+
1708+
feature:
1709+
Feature name: hsa-mir-524_-1_+3
1710+
5' shift: -1
1711+
3' shift: +3
1712+
1713+
OUT table:
1714+
hsa-mir-524_-1_+3 2 -1 +3
16511715
```
16521716

16531717

0 commit comments

Comments
 (0)