Reconsider what components get inforeses and need to be referenced in retrieval provenance chains. #345
Description
This issue will need to be discussed more broadly with other working groups, but I wanted to get the ingest folks' input here first.
At present, the retrieval chains captured in RetrievalSource objects in the TRAPI data (and displayed on the ARAX UI) reference components like dogpark tier 1, dogpark tier 0, retriever, and arax-shepherd.
For example, this is the provenance chain for a simple "Oxiglutation 'treats' Breast Cancer" edge provided by an external source (TTD), and retrieved through a Tier 0 lookup, in the ARAX UI here.
ttd (primary) -> dogpark tier 0 (aggregator) -> retriever (aggregator) -> arax (aggregator) -> shepherd-arax (aggregator)
Another example (where a Tier 1 resource is used, not Tier 0):
faers (primary) -> multiomics-dakp (aggregator) -> dogpark tier 1 (aggregator) -> retriever (aggregator) -> arax (aggregator) -> shepherd-arax (aggregator)
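To make the discussion concrete, here is a minimal sketch of how a chain like the ones above could be rebuilt from a list of TRAPI RetrievalSource-style dicts by following `upstream_resource_ids`. The field names (`resource_id`, `resource_role`, `upstream_resource_ids`) follow the TRAPI RetrievalSource schema; the infores identifiers for dogpark, retriever, and shepherd-arax are placeholders made up for illustration, not registered IDs, and the helper names are hypothetical.

```python
# Short labels for the two roles that appear in the chains above.
ROLE_LABELS = {
    "primary_knowledge_source": "primary",
    "aggregator_knowledge_source": "aggregator",
}


def ordered_chain(sources):
    """Order RetrievalSource dicts from the primary source to the final aggregator.

    Assumes a linear chain: each source lists at most one upstream resource.
    """
    by_id = {s["resource_id"]: s for s in sources}
    upstream_ids = {
        u for s in sources for u in (s.get("upstream_resource_ids") or [])
    }
    # The last link in the chain is the one nobody lists as upstream.
    terminals = [s for s in sources if s["resource_id"] not in upstream_ids]
    if len(terminals) != 1:
        raise ValueError("expected exactly one terminal source")

    chain, seen = [], set()
    current = terminals[0]
    while current is not None:
        if current["resource_id"] in seen:
            raise ValueError("cycle in upstream_resource_ids")
        seen.add(current["resource_id"])
        chain.append(current)
        upstream = current.get("upstream_resource_ids") or []
        if len(upstream) > 1:
            raise ValueError("branching chains are not handled in this sketch")
        current = by_id.get(upstream[0]) if upstream else None

    chain.reverse()  # walked downstream-to-upstream, so flip it
    return chain


def format_chain(sources):
    """Render the chain in the 'a (primary) -> b (aggregator)' style used above."""
    return " -> ".join(
        f"{s['resource_id']} ({ROLE_LABELS.get(s['resource_role'], s['resource_role'])})"
        for s in ordered_chain(sources)
    )


# Tier 0 example from this issue; all IDs except infores:ttd and infores:arax
# are hypothetical placeholders.
EXAMPLE_SOURCES = [
    {"resource_id": "infores:shepherd-arax",
     "resource_role": "aggregator_knowledge_source",
     "upstream_resource_ids": ["infores:arax"]},
    {"resource_id": "infores:arax",
     "resource_role": "aggregator_knowledge_source",
     "upstream_resource_ids": ["infores:retriever"]},
    {"resource_id": "infores:retriever",
     "resource_role": "aggregator_knowledge_source",
     "upstream_resource_ids": ["infores:dogpark-tier0"]},
    {"resource_id": "infores:dogpark-tier0",
     "resource_role": "aggregator_knowledge_source",
     "upstream_resource_ids": ["infores:ttd"]},
    {"resource_id": "infores:ttd",
     "resource_role": "primary_knowledge_source"},
]

if __name__ == "__main__":
    print(format_chain(EXAMPLE_SOURCES))
```

Every component that should appear in the chain needs its own `resource_id`, which is why the questions below about which components get inforeses matter for what ends up in the TRAPI data.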
Questions:
- Do we want/need to capture and distinguish all of these components (`retriever` and `dogpark`, `arax` and `shepherd arax`) in these chains?
- Should we create inforeses for each source KG Tier 1 endpoint (e.g. `infores:ttd-translator`)? (analogous to what was done in Phase 2, where we created separate inforeses for things like `ttd` and `automat-ttd`)