-
Notifications
You must be signed in to change notification settings - Fork 44
Description
At a Hackathon session of the recent CF Workshop 2024, I began updating the code I created in 2020 to produce some visualisations of the standard names: a plot of the total number of standard names over time and per version, plus word clouds which show sets of names added between specified versions (see #110).
This week in-between other work I managed to finalise it to the point I am happy, in terms of the outputs being good enough and the code maintainable and flexible enough. I now have a branch here (standard-names-vis-cfworkshop24) on my fork with the final code script to generate the plot and wordclouds, with generated outputs. If you follow the link, you can see the code and the generated plots, for reference.
The next steps are:
- to get this code added to the repository, so it can be run when suitable e.g. after a new version of the table to update the totals plot and overall wordcloud, and add a new wordcloud to cover only the names added for the new version;
- to get the visualisation images included on the site.
I am opening this separate Issue because it is clear there is consensus from #110, and from conversations in person at the workshops etc., that people would like these included on the website somewhere (then in turn also available for people to take and use elsewhere e.g. to promote the CF Conventions overall). But now we need to agree where the best place to put them is, hence this Issue.
Proposal
I think it helps to start off with a proposal, so here is my suggestion of what to do to get my branch, and some or all of the generated images (the word cloud code is quite flexible and can show the names added from any version to any older one), incorporated into the site:
- Add the overall word cloud (see https://github.com/sadielbartholomew/cf-convention.github.io/blob/0ebe966ff92065b59c94e0f0f7b02fd032ef4b11/generated_vis_images/sn_wordcloud_versions1_to_current.png) showing all the names in the latest table to the 'Vocabularies' page, just above the 'Documents' heading. Perhaps we could also add it to the site homepage to add a little colour and media to what is otherwise a purely text-based page?
- Add the totals plot (see https://github.com/sadielbartholomew/cf-convention.github.io/blob/0ebe966ff92065b59c94e0f0f7b02fd032ef4b11/generated_vis_images/sn_totals_plot.png) to the same page, under a new title of 'History' before or after the 'Discussion' sub-heading, with a few sentences to introduce it summarising that the table has bene under development for a few decades now, or something along those lines.
- Create a new page, linked under that 'History' heading with a summarising sentence and after the totals plot, which shows a word cloud covering the new names added to a version for every version from 1 to the current, under a heading of the version name to cover every version in reverse order (so top heading 'Version 86', then 'Version 85' and so on).
And as for the code, I would keep it in its own self-contained directory vis, so that it can be separate from other parts of the repo. (I don't mind what we call it, but chose vis which is descriptive but short and notably avoids British vs. America English spelling decision for 'visualisation').