Skip to content

[All] Warning message for broken datasets #42

@jweile

Description

@jweile

Some datasets may be dominated by noise and have little difference between synonymous and stop variants. A warning should be added for these cases, as the visualization cannot be helped to be bad.
A specific example is urn:mavedb:00000003-b-1: BRCA1 RING domain - Enrich2 nucleotide variant scores for BRCA1 Y2H

> summary(sm.data$score[which(sm.mut$type == "synonymous")])
    Min.  1st Qu.   Median     Mean  3rd Qu.     Max. 
-2.34200 -0.35260 -0.01884 -0.13260  0.24790  1.08200 
> summary(sm.data$score[which(sm.mut$variant=="*")])
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
-5.5870 -0.5913  0.4375 -0.1150  1.1710  2.0520 
> head(sm.data[which(sm.mut$variant == "*"),1:4])
                               urn  hgvs_nt    hgvs_pro      score
127  urn:mavedb:00000003-b-1#20851 c.877A>T p.Lys293Ter  1.1817745
557  urn:mavedb:00000003-b-1#21281 c.388A>T p.Arg130Ter -0.4453293
813  urn:mavedb:00000003-b-1#21537 c.625C>T p.Gln209Ter  0.3754994
1184 urn:mavedb:00000003-b-1#21908 c.675T>A p.Cys225Ter  2.0521364
1312 urn:mavedb:00000003-b-1#22036 c.904G>T p.Glu302Ter -0.4542513
1368 urn:mavedb:00000003-b-1#22092 c.772G>T p.Glu258Ter  0.2100059
> head(sm.data[which(sm.mut$type == "synonymous"),1:4])
                              urn  hgvs_nt hgvs_pro       score
32  urn:mavedb:00000003-b-1#20756 c.372C>T      p.=  0.40005867
155 urn:mavedb:00000003-b-1#20879 c.786T>C      p.= -1.78391714
391 urn:mavedb:00000003-b-1#21115 c.624T>C      p.=  0.04629578
405 urn:mavedb:00000003-b-1#21129 c.192G>A      p.=  0.02823471
425 urn:mavedb:00000003-b-1#21149 c.669T>C      p.= -1.97649415
541 urn:mavedb:00000003-b-1#21265 c.453C>G      p.= -0.35483822

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions