Piwi-interacting RNAs (piRNA) are a novel class of small RNA molecules thought to mediate retrotransposon silecing. Since transposable elements (TEs) can serve as mutagenic factors that contribute to genomic instability, piRNA function is extremely important for genome defense.
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.
You will need the following files to use all the features in this project:
- Transposable Elements file
The program expects an eland based file with the format example shown below:
trans_id trans_coord target_seq probe_id probe_seq num_mismatch strand
TAR1 Repeat sequence Homo sapiens 170 GGGAGGAGTAGGCTG HWI-ST1391:140326:C441WACXX:7:2114:14096:41705 1:N:0: CAACTAA GGGAGGAGTAGGATG 1 -
TAR1 Repeat sequence Homo sapiens 789 GTCCCCGGCGGCGCAGAGACGAGTGGAACCTG HWI-ST1391:140326:C441WACXX:7:1110:14668:100150 1:N:0: CACCTAA GTACCCGGCGGCGCAGAGACGAGTGGAACCTG 1 -
MamGypLTR2c_LTR Gypsy Mammalia 527 TCTTGGGGGGCTCAGG HWI-ST1391:140326:C441WACXX:7:1114:17283:59379 1:N:0: CAACAAT TCTTGGGGGGCTCAGG 0-
MLT1H MaLR family Homo sapiens 188 CTTGAGGCCTAGGCT HWI-ST1391:140326:C441WACXX:7:1113:15313:71495 1:N:0: CAACTAA CTTGAGGACTAGGCT 1 -
LTR29 LTR Homo sapiens 43 ATGCTGAACTGAAGAAGCCTCAAGGT HWI-ST1391:140326:C441WACXX:7:2310:17684:30008 1:N:0: CAACTAA ATGCTGAACTGAAGAAGCCTCAAGGT0 +
MLT2F LTR Homo sapiens 582 TCTGATTGAATCCTGACTGATACA HWI-ST1391:140326:C441WACXX:7:1213:21078:77562 1:N:0: CAACTAA TCTGGTTGAATCCTGACTGATACA 1+
MER68A LTR Retrotransposon Eutheria 267 CCTGGGCACTGAGTC HWI-ST1391:140326:C441WACXX:7:2216:14800:78111 1:N:0: CAACTAA CCGGGGCACTGAGTC 1 +
Kanga1 Mariner/Tc1 Eutheria 266 TTTCATTTCAATGCC HWI-ST1391:140326:C441WACXX:7:2304:3686:56287 1:N:0: CAACTAA TATCATTTCAATGCC 1 -
HERVK3I endogenous retrovirus Homo sapiens 332 GTTAATTCTCAGACA HWI-ST1391:140326:C441WACXX:7:2201:19850:22283 1:N:0: CAACTAA GTTAATTCTCAGACC 1 +
- Intragenic file
This BED file was created using the function intersect from bedtools (see more - https://bedtools.readthedocs.io/en/latest/content/tools/intersect.html).
Note: This file needs to contain the info of the sequences which map to the known genes and not the genes' id.
TODO: Update this format with the utr data and gene type.
The program expects a bed based file with the format example shown below:
chr1 14406 14445 HWI-ST1391:140326:C441WACXX:7:2314:11417:70781 1:N:0: CAACTAA -
chr1 14406 14445 HWI-ST1391:140326:C441WACXX:7:2314:11417:70781 1:N:0: CAACTAA -
chr1 14574 14608 HWI-ST1391:140326:C441WACXX:7:2301:12782:18038 1:N:0: CAACTAA -
chr1 14630 14670 HWI-ST1391:140326:C441WACXX:7:1315:3844:94402 1:N:0: CAACTAA -
chr1 14643 14682 HWI-ST1391:140326:C441WACXX:7:1212:19864:22541 1:N:0: CAACTAA -
chr1 14646 14688 HWI-ST1391:140326:C441WACXX:7:1108:3792:24233 1:N:0: CACCTAA -
chr1 14680 14710 HWI-ST1391:140326:C441WACXX:7:1309:16224:44217 1:N:0: CAACTAA -
A step by step series of examples that tell you how to get a development env running
Say what the step will be
Give the example
And repeat
until finished
End with an example of getting some data out of the system or using it for a little demo
Explain how to run the automated tests for this system
Explain what these tests test and why
Give an example
- Stephanie Schustermann - Initial work - StephSchustermann
See also the list of contributors who participated in this project.
- Hat tip to anyone whose code was used
- Inspiration
- etc