@@ -1190,7 +1190,7 @@ Intersect the aligned reads with the extended pri-miR annotations with
11901190 - pri-miR extended annotations (` .gff3 ` ); from
11911191 [ ** extend_mirs_annotations** ] ( #extend_mirs_annotations )
11921192- ** Output**
1193- - pri-miR intersections file (` .bed ` ); used in
1193+ - pri-miR intersections file (` .intersect ` ); used in
11941194 [ ** filter_sam_by_intersecting_primir** ] ( #filter_sam_by_intersecting_primir )
11951195 and [ ** quantify_primir** ] ( #quantify_primir )
11961196
@@ -1206,7 +1206,7 @@ Remove alignments that do not intersect with any pri-miR with
12061206- ** Input**
12071207 - Alignments file, filtered (` .sam ` ); from
12081208 [ ** filter_by_indels** ] ( #filter_by_indels )
1209- - pri-miR intersections file (` .bed ` ); from
1209+ - pri-miR intersections file (` .intersect ` ); from
12101210 [ ** intersect_extended_primir** ] ( #intersect_extended_primir )
12111211- ** Output**
12121212 - (** Workflow output** ) Alignments file, filtered (` .sam ` ); used in
@@ -1275,7 +1275,7 @@ Intersect the aligned reads with the extended miRNA annotations with
12751275 - Mature miRNA extended annotations (` .gff3 ` ); from
12761276 [ ** extend_mirs_annotations** ] ( #extend_mirs_annotations )
12771277- ** Output**
1278- - Mature miRNA intersections file (` .bed ` ); used in
1278+ - Mature miRNA intersections file (` .intersect ` ); used in
12791279 [ ** filter_sam_by_intersecting_mirna** ] ( #filter_sam_by_intersecting_mirna )
12801280 and [ ** add_intersecting_mirna_tag** ] ( #add_intersecting_mirna_tag )
12811281
@@ -1290,7 +1290,7 @@ Remove alignments that do not intersect with any miRNA with
12901290- ** Input**
12911291 - Alignments file, filtered (` .sam ` ); from
12921292 [ ** filter_sam_by_intersecting_primir** ] ( #filter_sam_by_intersecting_primir )
1293- - Mature miRNA intersections file (` .bed ` ); from
1293+ - Mature miRNA intersections file (` .intersect ` ); from
12941294 [ ** intersect_extended_mirna** ] ( #intersect_extended_mirna )
12951295- ** Output**
12961296 - (** Workflow output** ) Alignments file, filtered (` .sam ` ); used in
@@ -1313,7 +1313,7 @@ with a [**custom script**][custom-script-iso-tag].
13131313- ** Input**
13141314 - Alignments file, filtered (` .sam ` ); from
13151315 [ ** filter_sam_by_intersecting_mirna** ] ( #filter_sam_by_intersecting_mirna )
1316- - Mature miRNA intersections file (` .bed ` ); from
1316+ - Mature miRNA intersections file (` .intersect ` ); from
13171317 [ ** intersect_extended_mirna** ] ( #intersect_extended_mirna )
13181318- ** Parameters**
13191319 - ** config_template.yaml**
@@ -1590,64 +1590,128 @@ Tabulate alignments according to its intersecting pri-miR with a
15901590> name format set in [ ** mirna_extension** ] ( #mirna_extension ) .
15911591
15921592- ** Input**
1593- - pri-miR intersections file (` .bed ` ); from
1593+ - pri-miR intersections file (` .intersect ` ); from
15941594 [ ** intersect_extended_primir** ] ( #intersect_extended_primir )
15951595- ** Output**
15961596 - pri-miR counts tab-delimited file; used in
15971597 [ ** merge_tables** ] ( #merge_tables )
15981598- ** Examples**
15991599
16001600``` console
1601- Example 1 | One single pri-miR with different alignments
1601+ Example 1 | Contribution when using '--collapsed'
1602+ use case:
1603+ A single feature with several intersecting reads.
1604+ The flag '--collapsed' is used, so contribution equals the # of reads
1605+ per alignment.
16021606
1603- IN BED records:
1604- 19 . miRNA_primary_transcript 27766 27788 . + . ID=MI0003150;Alias=MI0003150;Name=hsa-mir-526b_-0_+0 19 27765 27788 68-2_1 255 +
1605- 19 . miRNA_primary_transcript 27766 27787 . + . ID=MI0003150;Alias=MI0003150;Name=hsa-mir-526b_-0_+0 19 27765 27787 316-1_7 1 +
1606- 19 . miRNA_primary_transcript 27804 27823 . + . ID=MI0003150;Alias=MI0003150;Name=hsa-mir-526b_-0_+0 19 27803 27823 599-1_3 255 +
1607- 19 . miRNA_primary_transcript 27805 27822 . + . ID=MI0003150;Alias=MI0003150;Name=hsa-mir-526b_-0_+0 19 27804 27822 226-1_4 1 +
1607+ IN INTERSECT records:
1608+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 8-2 255 + 21
1609+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 24-1 255 + 21
16081610
1609- Alignments:
1610- Read ID: 68
1611- Number of collapsed reads: 2
1612- Number of mapped genomic loci: 1
1613- Contribution: 2/1 = 2
1614-
1615- Read ID: 316
1616- Number of collapsed reads: 1
1617- Number of mapped genomic loci: 7
1618- Contribution: 1/7 = 0.143
1619-
1620- Read ID: 599
1621- Number of collapsed reads: 1
1622- Number of mapped genomic loci: 3
1623- Contribution: 1/3 = 0.33
1624-
1625- Read ID: 226
1626- Number of collapsed reads: 1
1627- Number of mapped genomic loci: 4
1628- Contribution: 1/4 = 0.25
1611+ alignments:
1612+ Read ID: 8-2
1613+ Number of collapsed reads: 2
1614+ Contribution: 2
16291615
1630- OUT table:
1631- ID lib_name
1632- hsa-mir-526b_-0_+0 2.723
1616+ Read ID: 24-1
1617+ Number of collapsed reads: 1
1618+ Contribution: 1
16331619
1620+ OUT table:
1621+ hsa-mir-524_-0_+0 3
16341622
1635- Example 2 | Different pri-miRs for a single read
16361623
1637- IN BED records:
1638- 19 . miRNA_primary_transcript 40866 40886 . + . ID=MI0003158;Alias=MI0003158;Name=hsa-mir-520c_-0_+0 19 40865 40886 10-4_2 255 +
1639- 19 . miRNA_primary_transcript 34627 34647 . + . ID=MI0003155;Alias=MI0003155;Name=hsa-mir-520b_-5_+6 19 34626 34647 10-4_2 255 +
1624+ Example 2 | Contribution when using '--nh'
1625+ use case:
1626+ A single feature with several intersecting reads.
1627+ The flag '--nh' is used, so contribution equals 1/NH.
16401628
1641- Alignment:
1642- Read ID: 10
1643- Number of collapsed reads: 4
1644- Number of mapped genomic loci: 2
1645- Contribution: 4/2 = 2
1629+ IN INTERSECT records:
1630+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 8_1 255 + 21
1631+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 24_1 255 + 21
16461632
1647- OUT table:
1648- ID lib_name
1649- hsa-mir-520c_-0_+0 2
1650- hsa-mir-520b_-5_+6 2
1633+ alignments:
1634+ Read ID: 8_1
1635+ Number of mapped genomic loci: 1
1636+ Contribution: 1
1637+
1638+ Read ID: 24_1
1639+ Number of mapped genomic loci: 1
1640+ Contribution: 1
1641+
1642+ OUT table:
1643+ hsa-mir-524_-0_+0 2
1644+
1645+
1646+ Example 3 | Contribution when using '--collapsed' and '--nh'
1647+ use case:
1648+ A single feature with several intersecting reads.
1649+ The flags '--nh' and '--contribution' are used, so contribution equals
1650+ # of reads/NH.
1651+
1652+ IN INTERSECT records:
1653+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 8-2_1 255 + 21
1654+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 24-1_1 255 + 21
1655+
1656+ alignments:
1657+ Read ID: 8-2_1
1658+ Number of collapsed reads: 2
1659+ Number of mapped genomic loci: 1
1660+ Contribution: 2/1 = 2
1661+
1662+ Read ID: 23-1_1
1663+ Number of collapsed reads: 1
1664+ Number of mapped genomic loci: 1
1665+ Contribution: 1/1 = 1
1666+
1667+ OUT table:
1668+ hsa-mir-524_-0_+0 3
1669+
1670+ Example 4 | Column with intersecting reads; using '--read-ids'
1671+ use case:
1672+ A single feature with several intersecting reads.
1673+ Each read contributes with 1.
1674+ Read IDs intersecting the feature are appended as the last column.
1675+
1676+ IN INTERSECT records:
1677+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 8-2_1 255 + 21
1678+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-0_+0 19 44413 44434 24-1_1 255 + 21
1679+
1680+ alignments:
1681+ Read ID: 8-2_1
1682+ Contribution: 1
1683+
1684+ Read ID: 24-1_1
1685+ Contribution: 1
1686+
1687+ OUT table:
1688+ hsa-mir-524_-0_+0 2 8-2_1;24-1_1
1689+
1690+ Example 5 | Columns with feature shifts; using '--feat-extension'
1691+ use case:
1692+ A single feature with several intersecting reads.
1693+ Each read contributes with 1.
1694+ Feature start and end coordinates shift are appended as the third and
1695+ fourth column respectively.
1696+
1697+ IN INTERSECT records:
1698+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-1_+3 19 44413 44434 8-2_1 255 + 21
1699+ 19 . miRNA_primary_transcript 44362 44448 . + . ID=MI0003160;Alias=MI0003160;Name=hsa-mir-524_-1_+3 19 44413 44434 24-1_1 255 + 21
1700+
1701+ alignments:
1702+ Read ID: 8-2_1
1703+ Contribution: 1
1704+
1705+ Read ID: 24-1_1
1706+ Contribution: 1
1707+
1708+ feature:
1709+ Feature name: hsa-mir-524_-1_+3
1710+ 5' shift: -1
1711+ 3' shift: +3
1712+
1713+ OUT table:
1714+ hsa-mir-524_-1_+3 2 -1 +3
16511715```
16521716
16531717
0 commit comments