-
Notifications
You must be signed in to change notification settings - Fork 4
Open
Labels
Description
Some datasets may be dominated by noise and have little difference between synonymous and stop variants. A warning should be added for these cases, as the visualization cannot be helped to be bad.
A specific example is urn:mavedb:00000003-b-1: BRCA1 RING domain - Enrich2 nucleotide variant scores for BRCA1 Y2H
> summary(sm.data$score[which(sm.mut$type == "synonymous")])
Min. 1st Qu. Median Mean 3rd Qu. Max.
-2.34200 -0.35260 -0.01884 -0.13260 0.24790 1.08200
> summary(sm.data$score[which(sm.mut$variant=="*")])
Min. 1st Qu. Median Mean 3rd Qu. Max.
-5.5870 -0.5913 0.4375 -0.1150 1.1710 2.0520
> head(sm.data[which(sm.mut$variant == "*"),1:4])
urn hgvs_nt hgvs_pro score
127 urn:mavedb:00000003-b-1#20851 c.877A>T p.Lys293Ter 1.1817745
557 urn:mavedb:00000003-b-1#21281 c.388A>T p.Arg130Ter -0.4453293
813 urn:mavedb:00000003-b-1#21537 c.625C>T p.Gln209Ter 0.3754994
1184 urn:mavedb:00000003-b-1#21908 c.675T>A p.Cys225Ter 2.0521364
1312 urn:mavedb:00000003-b-1#22036 c.904G>T p.Glu302Ter -0.4542513
1368 urn:mavedb:00000003-b-1#22092 c.772G>T p.Glu258Ter 0.2100059
> head(sm.data[which(sm.mut$type == "synonymous"),1:4])
urn hgvs_nt hgvs_pro score
32 urn:mavedb:00000003-b-1#20756 c.372C>T p.= 0.40005867
155 urn:mavedb:00000003-b-1#20879 c.786T>C p.= -1.78391714
391 urn:mavedb:00000003-b-1#21115 c.624T>C p.= 0.04629578
405 urn:mavedb:00000003-b-1#21129 c.192G>A p.= 0.02823471
425 urn:mavedb:00000003-b-1#21149 c.669T>C p.= -1.97649415
541 urn:mavedb:00000003-b-1#21265 c.453C>G p.= -0.35483822