-
Notifications
You must be signed in to change notification settings - Fork 9
Description
I was wondering why Ribotricer does not have an ORF category that is called "internal". I was trying to compare results for ribotricer and ribocode and I identify an internal ORF in RiboCode that does not seem to be in the ribotricer index at all. They are based on the same human reference GRCh38 from Ensembl version 104.
I looked into the ribotricer code, and I can see that there is an ORF type called "internal", but you do not append it (prepare_orfs.py line 258 and 340). Can you help me explain the reasoning for this. Perhaps I am just misunderstanding the code.
I have an example of an internal ORF in transcript ENST00000675536 that is part of Ribocodes indexing, but not Ribotricer (in relation to that transcript). The AA sequence that it translates to is found in other transcripts, so it is not that the ORF can actually not be identified as translating, but the annotation is missing.
I don't know if it is a bug or intended, but I would really like to understand the reasoning. It seems that many ORFs are found in that transcript, so why not that one?
In ribotricer it has the coordinates 89646069_89649410_396 (ENSG00000131165).
Thank you in advance!
I look forward to your answer.
Kind regards,
Anne