biographies

Code and metadata for research on biographies drawn from HathiTrust. Work by Ted Underwood, Natalie DeClerck, Ryan Dubnicek, and Hoi-Yan Wendy Wong.

We gathered the data and did exploratory data analysis in the summer of 2017. Other projects then became pressing, and we returned to finish this one in winter 2018-19. The main outstanding task at that point was to use topic modeling to define a character space. Before building that model, we preregistered 92 assumptions about character difference and similarity to guide our modeling choices: see the preregistration.