For Aim I, @sergeitarasov and @uyedaj need:
An entity with good data coverage in Phenoscape (e.g. dorsal fin), together with all of its raw phenotypes and taxa.
We then need @balhoff to turn the raw phenotypes into a semantic similarity matrix (Jaccard) that gives us the pairwise similarity for every pair of phenotypes we have.
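
To make the request concrete, here is a rough sketch of the kind of matrix we mean. It assumes each phenotype is represented as a set of ontology class IDs (for example, its inferred subsumers); that representation, the function names, and the placeholder IDs are all hypothetical, and this is not meant to describe how the Phenoscape KB actually computes similarity.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard index |A ∩ B| / |A ∪ B| of two sets of class IDs."""
    union = a | b
    if not union:
        # Assumption: two empty profiles are treated as having no similarity.
        return 0.0
    return len(a & b) / len(union)


def similarity_matrix(profiles):
    """Return (names, matrix) with matrix[x][y] = Jaccard similarity.

    `profiles` maps a phenotype ID to its set of class IDs (assumed input shape).
    """
    names = sorted(profiles)
    # The diagonal is set to 1.0 by definition (a phenotype compared to itself).
    matrix = {name: {name: 1.0} for name in names}
    for x, y in combinations(names, 2):
        score = jaccard(profiles[x], profiles[y])
        matrix[x][y] = score
        matrix[y][x] = score  # the matrix is symmetric
    return names, matrix


# Placeholder data only: these are not real Phenoscape phenotypes or term IDs.
example_profiles = {
    "phenotype_1": {"TERM_A", "TERM_B"},
    "phenotype_2": {"TERM_B", "TERM_C"},
    "phenotype_3": {"TERM_D"},
}
example_names, example_matrix = similarity_matrix(example_profiles)
```

A nested dict keyed by phenotype ID keeps the sketch dependency-free; for the full set of phenotypes a dense or sparse numeric array would probably be more practical.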