Skip to content
lrutter edited this page Feb 20, 2015 · 21 revisions

Make on-the-fly visualizations of genealogical data

Summary: Re-factor the phyViz package into a new package called ggenealogy, which provides additional methods to visualize the ancestors and descendants of a given line or variety.

Description: :

  • Interactively draw genealogy trees from database of known lineage.
  • Select generation number of ancestors and descendants to show.
  • Show shortest path between two varieties, and superimpose over full lineage structure.
  • Organize nodes horizontally by time of emergence, and vertically by a given additional variable.
  • Obtain graph theory measures of the full lineage structure.
  • Produce color matrix plots of variables between a subset of varieties.
> library(devtools)
> install_github("phyViz", "dicook")

Related work:

Potential tasks:

  • Order nodes/varieties by time of appearance, as well as by other given variables
  • Incorporate Shiny to allow users to examine genealogy visualization tools in a more interactive way
  • Test the package on multiple toy datasets
    • Soybean parent-child relationships (Pedigrees of soybean cultivars released in the United States and Canada." Theodore Hyivitz, C.A. Newell, S.G. Carmer. College of Agriculture, University of Illinois at Urbana-Champaign. 1977)
    • Barley parent-child relationships
    • Mathematics Genealogy Project (http://genealogy.math.ndsu.nodak.edu/)
  • Add more flexibility for data input types than the currently-required data frame format

Skills required:

Knowledge of genealogical data structures, igraph, ggplot2, shiny

Test: Fork the package on GH and create a pull-request implementing a method for tables::tabular. Quick and dirty solution:

> 

Mentor: Di Cook ([@](mailto:visnut {at} gmail {dot} com)) and Susan Vanderplas ([@](mailto:srvanderplas {at} gmail {dot} com)) as backup mentor

Disclaimer:

Clone this wiki locally