Thoughts on 'third generation' community taxonomy editing system
Jonathan A Rees edited this page Apr 2, 2014 · 5 revisions
While the open tree reference taxonomy assembly tool now has two generations of taxonomy editing features, it has become increasingly clear that neither is adequate, and we can learn from and improve on both. Here are some thoughts on what the third generation system ought to be like.
Goals:
- It should generally be possible to write and apply edit directives without the help or intervention of Open Tree personnel ("community editing").
- Provenance should be captured for all directives (typically in the form of a URI and a curator's name) and all uses of taxon names.
- Directives should be reusable. If a set of directives applies to one of Open Tree's input taxonomies, such as IRMNG, it should be possible to apply them to IRMNG on its own, independent of the Open Tree reference taxonomy. That is, the syntax and semantics of the directives should not be tied to the Open Tree project.
As before, the directives should be representable as human-readable and human-writable snippets.
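To make the goals above concrete, here is a minimal sketch of what a directive carrying its own provenance might look like as a one-line text snippet. Everything in it is an assumption made for illustration: the `Directive` class, the field names, the tab-separated layout, and the `child_of` claim name are hypothetical, not an existing Open Tree format. The taxon names and the URI are placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Directive:
    """One hypothetical directive: a claim plus its provenance."""
    claim: str    # kind of claim, e.g. "child_of" (assumed name)
    subject: str  # taxon name the claim is about
    target: str   # second taxon name (parent, accepted name, ...)
    source: str   # provenance: URI supporting the claim
    curator: str  # provenance: name of the curator who entered it

    def to_text(self) -> str:
        # Tab-separated, so taxon names containing spaces survive a round trip.
        return "\t".join(
            [self.claim, self.subject, self.target, self.source, self.curator]
        )

    @staticmethod
    def from_text(line: str) -> "Directive":
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 5:
            raise ValueError(f"expected 5 tab-separated fields, got {len(fields)}")
        return Directive(*fields)


# Placeholder taxon names and URI, for illustration only.
d = Directive("child_of", "Aus bus", "Aus", "http://example.org/ref/1", "A. Curator")
assert Directive.from_text(d.to_text()) == d
```

Nothing in this sketch mentions the Open Tree reference taxonomy, which is the point of the reusability goal: the same line could in principle be applied to an input taxonomy on its own.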
Features:
- Declarative semantics. Each directive should be interpretable as a scientific claim. The current systems don't do this; directives are edits to the tree, and the outcome of executing directives is very sensitive to the order in which they are processed. Order dependence is not completely avoidable but it should be minimized as much as possible.
- Idempotence. An edit that appears twice in an edit set should have the same effect as one that appears once. In other words, if a taxonomy already reflects a claim, then applying the claim to the taxonomy should have no effect. This is actually a consequence of declarative semantics but is important enough to call out.
- Completeness. Anything that can be done through taxonomy ingest should be doable (if not as efficiently) using the edit system. This includes synonyms and attaching "flags" to taxa. Similarly, operations performed by smasher's alignment procedure, such as establishing the identity or non-identity of taxa in different taxonomies, should be expressible.
- Graphical front end. Some users will find a graphical interface very attractive, and it may be possible to create one with a modest amount of effort. What I have in mind is that a part of a taxonomic tree would be displayed, and one would be allowed to do a few simple operations like adding a new node or changing the topology by moving a node from one location to another.
I envision that the front end would not itself do edits directly, but that the output of the graphical front end would be a set of directives (this was agreed in discussion with Cody back in August 2013).
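The declarative and idempotent behaviour described in the feature list can be sketched as follows. This is only an illustration under stated assumptions: the `Taxonomy` structure, the claim kinds (`child_of`, `synonym_of`, `flag`), and the flag value are hypothetical and do not describe smasher or either existing editing system. Each claim is treated as a statement that the taxonomy should reflect, and applying a claim that already holds changes nothing.

```python
from dataclasses import dataclass, field


@dataclass
class Taxonomy:
    """Toy taxonomy (hypothetical structure, for illustration only)."""
    parent: dict = field(default_factory=dict)    # taxon -> parent taxon
    synonyms: dict = field(default_factory=dict)  # synonym -> accepted name
    flags: dict = field(default_factory=dict)     # taxon -> set of flags


def apply_claim(tax, claim):
    """Make `tax` reflect `claim`; return True only if something changed."""
    kind, subject, value = claim
    if kind == "child_of":
        if tax.parent.get(subject) == value:
            return False  # claim already holds: no effect
        tax.parent[subject] = value
        return True
    if kind == "synonym_of":
        if tax.synonyms.get(subject) == value:
            return False
        tax.synonyms[subject] = value
        return True
    if kind == "flag":
        if value in tax.flags.get(subject, set()):
            return False
        tax.flags.setdefault(subject, set()).add(value)
        return True
    raise ValueError(f"unknown claim kind: {kind!r}")


def apply_all(tax, claims):
    """Apply every claim in order; return how many actually changed `tax`."""
    return sum(apply_claim(tax, c) for c in claims)


# Placeholder taxon names; the flag value is an assumed example.
claims = [
    ("child_of", "Aus bus", "Aus"),
    ("synonym_of", "Aus cus", "Aus bus"),
    ("flag", "Aus bus", "extinct"),
]

t1 = Taxonomy()
assert apply_all(t1, claims) == 3  # first pass changes the taxonomy
assert apply_all(t1, claims) == 0  # second pass is a no-op (idempotence)

# Non-conflicting claims give the same result in any order, even with repeats.
t2 = Taxonomy()
apply_all(t2, list(reversed(claims)) + claims)
assert t1 == t2
```

In this sketch, two `child_of` claims that name different parents for the same taxon would still be resolved by whichever comes last, which is the kind of residual order dependence that cannot be completely avoided. A graphical front end along the lines described above could emit a list like `claims` instead of modifying the tree itself.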
Although this project is very different from Nico Franz's work (e.g.), and much less rigorous, I take his method and philosophy to be inspirational.