-
Notifications
You must be signed in to change notification settings - Fork 56
Description
Background
Systems Biology Graphical Notation (SBGN) is a standard language for representing biological pathways with SBGNML being the exchange and storage format for SBGN. SBGN diagrams are created using the developed SBGN editors (Newt, CellDesigner, SBGN-ED etc.), and they can be saved as the XML-based SBGNML files. The specification that defines the rules governing SBGN can be difficult for many users. Validation tools can the ability of users to create clear SBGN diagrams that are unambiguous. Programmatic validation of SBGN is encoded in the Schematron language that can validate XML documents through an XSLT transformation; XSLT transformation is available in a variety of programming languages.
Goal
The goal of the project improve the validation of SBGN documents (and extend it if needed, by re-examining the SBGN specifications). Furthermore, an objective is also to update validation in common SBGN tools like Newt (written in Javascript) with any improvements. Possible improvements include
- Clarification to the diagnostic information provided for individual rules that help user understand rule failures,
- Addition of missing rules (needs re-examination of SBGN specs)
Starting point
Make sure you can run the Schematron validation in either Python or Javascript for example diagrams.
Difficulty Level: Medium
Difficulty is based on the having and understanding basics of SBGN
Size and Length of Project
- Medium: 175 hours
Skills
Essential skills: Javascript or Python, XML
Public Repository
- https://github.com/sbgn/libsbgn/tree/master/validation
- https://github.com/sciluna/sbgn_dataset_generator
- https://sbgn.github.io/ (specification exist here)
Potential Mentors
- Augustin Luna (augustin AT nih.gov)