Ingest UniProt database registry and prefixes #1838

@ialarmedalien

UniProt

Description

UniProt provides a list of external databases that it links to, along with information on how to create the appropriate URLs for entries.

Prefixes

185 freshly-baked prefixes and attendant database information

I'd like to get the prefixes used by UniProt integrated into the Bioregistry. As mentioned above, UniProt provides a list of the external databases that it links to. For each database, the list includes:

  • prefix
  • database name and description
  • URL
  • link format for an individual entry
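
To make the shape of that data concrete, here is a minimal sketch of one record. The class name, field names, and the example values are all made up for illustration, and it assumes the link format carries a single `%s` placeholder for the identifier; the only thing taken from the Bioregistry side is that its URI formats use `$1` as the placeholder.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UniProtDatabase:
    """One entry in UniProt's external database list (field names are illustrative)."""

    prefix: str
    name: str
    description: str
    url: str
    link_format: str  # assumption: contains a single "%s" placeholder

    def entry_url(self, identifier: str) -> str:
        """Build the URL for one entry by filling in the placeholder."""
        return self.link_format.replace("%s", identifier, 1)

    def bioregistry_uri_format(self) -> str:
        """Rewrite the placeholder as "$1", the form Bioregistry URI formats use."""
        return self.link_format.replace("%s", "$1", 1)


# Hypothetical record; these values do not come from UniProt.
example = UniProtDatabase(
    prefix="ExampleDB",
    name="Example database",
    description="A placeholder resource",
    url="https://example.org",
    link_format="https://example.org/entry/%s",
)

print(example.entry_url("X123"))         # https://example.org/entry/X123
print(example.bioregistry_uri_format())  # https://example.org/entry/$1
```

If the real link formats use a different placeholder, only the two `replace` calls would need to change.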

Some prefixes are already aligned, either with the Bioregistry preferred prefix or with a synonym, and some represent a subclass or superclass (e.g. UniProt uses merops to refer to merops.entry). Other entries, such as PATRIC, are missing from Bioregistry altogether.
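
The cases above could be triaged roughly as follows. This is only a sketch: `Alignment`, `classify`, and the three lookup tables are hypothetical stand-ins for real Bioregistry data, `exampledb` and `exdb` are invented, and case-insensitive matching is an assumption. Only the `merops` to `merops.entry` mapping and the missing PATRIC entry come from the examples above. It does not call the Bioregistry package.

```python
from enum import Enum
from typing import Optional, Tuple


class Alignment(Enum):
    EXACT = "exact"      # matches a preferred prefix
    SYNONYM = "synonym"  # matches a registered synonym
    CURATED = "curated"  # resolved by a hand-made mapping (sub/superclass)
    MISSING = "missing"  # no counterpart found


# Hypothetical stand-ins for data that would come from the Bioregistry.
PREFERRED = {"merops.entry", "exampledb"}
SYNONYMS = {"exdb": "exampledb"}
# Hand-curated mappings for prefixes that are not an exact match.
CURATED = {"merops": "merops.entry"}


def classify(uniprot_prefix: str) -> Tuple[Alignment, Optional[str]]:
    """Return how a UniProt prefix aligns and the target prefix, if any."""
    key = uniprot_prefix.lower()  # assumption: matching ignores case
    if key in PREFERRED:
        return Alignment.EXACT, key
    if key in SYNONYMS:
        return Alignment.SYNONYM, SYNONYMS[key]
    if key in CURATED:
        return Alignment.CURATED, CURATED[key]
    return Alignment.MISSING, None


for prefix in ["ExampleDB", "ExDB", "MEROPS", "PATRIC"]:
    alignment, target = classify(prefix)
    print(prefix, alignment.value, target)
```

Anything that comes back as `MISSING` would be a candidate for a new entry, while `CURATED` results are the ones that need a human to confirm the subclass or superclass relationship.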

It would be great to be able to incorporate this prefix data into Bioregistry programmatically. A coworker and I have already done some work on mapping the UniProt and Bioregistry prefixes where there is not an exact match.

What would be the best way to proceed?

Metadata

Labels: New (used in combination with prefix, metaprefix, or collection for new entries)