wrong motif in STRchive? #327

Draft
strchive-bot wants to merge 2 commits into main from SCA17_TBP

@strchive-bot
Contributor

Name: Manuel Hofer
Username: -
Email: manuel.hofer@meduniwien.ac.at

Description
Hi,
I looked up SCA17 (TBP) in STRchive and it lists GCA as the motif. But in all publications the motif is CAG, sometimes interrupted by CAA. Altogether it is a polyglutamine (Q) problem, because CAG = Q and CAA = Q.

Is this an error in your database?

Greetings and thanks,
Manuel

@netlify

netlify bot commented Mar 4, 2026

Deploy Preview for strchive ready!

Name Link
🔨 Latest commit 39c2f3e
🔍 Latest deploy log https://app.netlify.com/projects/strchive/deploys/69a85a92c1b0b70008387057
😎 Deploy Preview https://deploy-preview-327--strchive.netlify.app

@hdashnow
Member

hdashnow commented Mar 5, 2026

Thanks for the feedback. The perfect repeat in hg38 does start "GCA". It's a circular permutation of CAG, so technically accurate. But I agree that it is a little confusing. Since this is a protein-coding repeat, I agree that CAG is clearer. I'll update this in the next version.
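
To illustrate the "circular permutation" point, here is a minimal Python sketch. The helper names `rotations`, `same_repeat`, and `canonical` are made up for this example and are not part of STRchive or any tool mentioned here. It shows that GCA and CAG describe the same tandem repeat read in a different frame, while CAA is a genuinely different motif.

```python
def rotations(motif: str) -> set[str]:
    """Return every circular permutation (rotation) of a repeat motif."""
    return {motif[i:] + motif[:i] for i in range(len(motif))}


def same_repeat(a: str, b: str) -> bool:
    """True if two motifs are rotations of each other, i.e. the same repeat in a shifted frame."""
    return len(a) == len(b) and b in rotations(a)


def canonical(motif: str) -> str:
    """Pick the alphabetically first rotation as a frame-independent label (an arbitrary convention)."""
    return min(rotations(motif))


print(same_repeat("GCA", "CAG"))  # True: GCA is a rotation of CAG
print(same_repeat("CAG", "CAA"))  # False: CAA is an interruption, not a rotation
print(canonical("GCA"), canonical("CAG"))  # AGC AGC
```

A rotation-based comparison like this only says the two labels are equivalent at the DNA level. For a protein-coding repeat the reading frame still matters, which is why CAG is the clearer label here.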

@mhmuw

mhmuw commented Mar 5, 2026

Thank you, hdashnow, for the quick reply.
Can you also add CAA? It is known and published that the CAG repeat is interrupted by CAA, which also codes for glutamine, and too many glutamines (Q) in a row is the pathomechanism of this repeat.

For example: I have a sample sequenced with the PacBio PureTarget repeat expansion kit. GCA count = 31/31.
But in reality there are 31 CAGs with 5 CAAs in between = 36/36 glutamines in a row. The TRGT analysis based on STRchive (31/31) is not correct, not even technically.

Greetings and thanks,
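
The difference between counting copies of a single motif and counting glutamine codons can be sketched in Python as follows. This is only an illustration: the function `longest_q_tract` and the toy sequence are hypothetical, the sequence is not the sample described above, and this is not how TRGT computes its counts.

```python
GLN_CODONS = {"CAG", "CAA"}  # both codons translate to glutamine (Q)


def longest_q_tract(seq: str) -> dict:
    """Find the longest in-frame run of CAG/CAA codons, trying all three reading frames."""
    best = {"CAG": 0, "CAA": 0, "Q": 0}
    for frame in range(3):
        run = {"CAG": 0, "CAA": 0}
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if codon in GLN_CODONS:
                run[codon] += 1
                total = run["CAG"] + run["CAA"]
                if total > best["Q"]:
                    best = {**run, "Q": total}
            else:
                # Any other codon ends the current glutamine run.
                run = {"CAG": 0, "CAA": 0}
    return best


# Toy sequence (made up): 3 CAG, 2 CAA, 4 CAG between flanking codons.
toy = "ATG" + "CAG" * 3 + "CAA" * 2 + "CAG" * 4 + "GGC"
print(longest_q_tract(toy))  # {'CAG': 7, 'CAA': 2, 'Q': 9}
```

In the toy sequence a CAG-only count gives 7, while the uninterrupted glutamine tract is 9 codons long. The same kind of gap separates the 31 and 36 reported in the comment above.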

Macayla-weiner mentioned this pull request Mar 27, 2026
@Macayla-weiner
Contributor

Hello! This will be updated in the most recent literature update (#334). We are currently working on making another field to list interruption data, but for now that CAA interruption has been added at the end of the 'locus details' field.


4 participants