Skip to content

stephschustermann/piRNA-detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

piRNA-detection via multidimensional small RNA clustering

Piwi-interacting RNAs (piRNA) are a novel class of small RNA molecules thought to mediate retrotransposon silecing. Since transposable elements (TEs) can serve as mutagenic factors that contribute to genomic instability, piRNA function is extremely important for genome defense.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites - File structures

You will need the following files to use all the features in this project:

  1. Transposable Elements file

The program expects an eland based file with the format example shown below:

trans_id        trans_coord     target_seq      probe_id        probe_seq       num_mismatch    strand
TAR1    Repeat sequence Homo sapiens    170     GGGAGGAGTAGGCTG HWI-ST1391:140326:C441WACXX:7:2114:14096:41705 1:N:0: CAACTAA   GGGAGGAGTAGGATG 1       -
TAR1    Repeat sequence Homo sapiens    789     GTCCCCGGCGGCGCAGAGACGAGTGGAACCTG        HWI-ST1391:140326:C441WACXX:7:1110:14668:100150 1:N:0: CACCTAA  GTACCCGGCGGCGCAGAGACGAGTGGAACCTG  1       -
MamGypLTR2c_LTR Gypsy   Mammalia        527     TCTTGGGGGGCTCAGG        HWI-ST1391:140326:C441WACXX:7:1114:17283:59379 1:N:0: CAACAAT   TCTTGGGGGGCTCAGG        0-
MLT1H   MaLR family     Homo sapiens    188     CTTGAGGCCTAGGCT HWI-ST1391:140326:C441WACXX:7:1113:15313:71495 1:N:0: CAACTAA   CTTGAGGACTAGGCT 1       -
LTR29   LTR     Homo sapiens    43      ATGCTGAACTGAAGAAGCCTCAAGGT      HWI-ST1391:140326:C441WACXX:7:2310:17684:30008 1:N:0: CAACTAA   ATGCTGAACTGAAGAAGCCTCAAGGT0       +
MLT2F   LTR     Homo sapiens    582     TCTGATTGAATCCTGACTGATACA        HWI-ST1391:140326:C441WACXX:7:1213:21078:77562 1:N:0: CAACTAA   TCTGGTTGAATCCTGACTGATACA 1+
MER68A  LTR Retrotransposon     Eutheria        267     CCTGGGCACTGAGTC HWI-ST1391:140326:C441WACXX:7:2216:14800:78111 1:N:0: CAACTAA   CCGGGGCACTGAGTC 1       +
Kanga1  Mariner/Tc1     Eutheria        266     TTTCATTTCAATGCC HWI-ST1391:140326:C441WACXX:7:2304:3686:56287 1:N:0: CAACTAA    TATCATTTCAATGCC 1       -
HERVK3I endogenous retrovirus   Homo sapiens    332     GTTAATTCTCAGACA HWI-ST1391:140326:C441WACXX:7:2201:19850:22283 1:N:0: CAACTAA   GTTAATTCTCAGACC 1       +

  1. Intragenic file

This BED file was created using the function intersect from bedtools (see more - https://bedtools.readthedocs.io/en/latest/content/tools/intersect.html).

Note: This file needs to contain the info of the sequences which map to the known genes and not the genes' id.

TODO: Update this format with the utr data and gene type.

The program expects a bed based file with the format example shown below:

chr1	14406	14445	HWI-ST1391:140326:C441WACXX:7:2314:11417:70781 1:N:0: CAACTAA	-
chr1	14406	14445	HWI-ST1391:140326:C441WACXX:7:2314:11417:70781 1:N:0: CAACTAA	-
chr1	14574	14608	HWI-ST1391:140326:C441WACXX:7:2301:12782:18038 1:N:0: CAACTAA	-
chr1	14630	14670	HWI-ST1391:140326:C441WACXX:7:1315:3844:94402 1:N:0: CAACTAA	-
chr1	14643	14682	HWI-ST1391:140326:C441WACXX:7:1212:19864:22541 1:N:0: CAACTAA	-
chr1	14646	14688	HWI-ST1391:140326:C441WACXX:7:1108:3792:24233 1:N:0: CACCTAA	-
chr1	14680	14710	HWI-ST1391:140326:C441WACXX:7:1309:16224:44217 1:N:0: CAACTAA	-

Installing

A step by step series of examples that tell you how to get a development env running

Say what the step will be

Give the example

And repeat

until finished

End with an example of getting some data out of the system or using it for a little demo

Running the program

Explain how to run the automated tests for this system

Files

Explain what these tests test and why

Give an example

Authors

See also the list of contributors who participated in this project.

Acknowledgments

  • Hat tip to anyone whose code was used
  • Inspiration
  • etc

About

piRNA detection via multidimensional small RNA clustering

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages