Skip to content

QA of GO-CAM ingest #339

@mbrush

Description

@mbrush

QA Resources:


Issues to consider / address for next release:

Generally, the initial release of this was pretty fast and loose, and there are plans to come back and improve things. Lots of notes to this effect in the RIG 'future considerations'. Noting some additional observations below:

  • uses predicates that violate Translator conventions (e.g. include direction of effect)
  • very few edges actually created
  • those that are lack detail e.g. qualifiers, edge properties that could be taken)
  • contains relatively large numbers of nonsensical edges (e.g. gene causes gene (878), gene has input gene (1400)

Also, the RIG is not in sync with what the code is actually doing to produce data, and seems to reference predicates that are not in Biolink (e.g. positively_regulates), and edge properties that are not in biolink (apparently as placeholders to capture qualifying info that will be captured using proper SPOQ patterns in future iterations of the ingest)


Issues to consider longer term:

  • Some Biolink predicates include directionality (e.g. acts upstream of negative effect) - which we capture in qualifiers for Translator use cases. We should not use these in our ingest, and define qualifier based patterns to represent directionality of effects.

Questions:

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions