Code and metadata for research on biographies drawn from HathiTrust. Work by Ted Underwood, Natalie DeClerck, Ryan Dubnicek, and Hoi-Yan Wendy Wong.
Data was gathered and exploratory data analysis was done in the summer of 2017. Other projects became pressing; we returned to finish this one in winter 2018-19. The main outstanding task at that point was to use topic modeling to define a character space. Before building that model we preregistered 92 assumptions about character difference and similarity to guide our modeling choices: see the preregistration.