Skip to content
craig8196 edited this page Jan 30, 2015 · 9 revisions

The Topical Guide is a web application that facilitates the discovery of topical patterns and trends in large document collections. The Topical Guide relies on probabilistic topic models, such as LDA, to reveal the semantic content in such large corpora. Many individualized visualizations of topic models have been reported in the literature, showing the potential of topic models to give valuable insight into a corpus. However, good, general, interactive tools for browsing the entire output of a topic model along with the analyzed corpus have been lacking. The Topical Guide is an interactive tool that incorporates both prior work in displaying topic models as well as some novel ideas that greatly enhance the visualization of data analyzed by these models for the sake of discovery of trends.

Demo

Explore the State of the Union Addresses using the Topical Guide.

Download

The code is in a git repository hosted on Github. Use the following command to clone the repository:

git clone https://github.com/BYU-NLP-Lab/topicalguide.git

Installation and Setup

See the README for installation and setup instructions.

For more detailed information about the Topical Guide -- including how to import a data set, and navigate a data set using your browser -- visit our Documentation page.

Papers

Our "system" paper explaining the abilities of our Topical Guide:

Matthew J. Gardner, Joshua Lutes, Jeff Lund, Josh Hansen, Dan Walker, Eric Ringger, Kevin Seppi. "The Topic Browser: An Interactive Tool for Browsing Topic Models". Proceedings of the Workshop on Challenges of Data Visualization, held in conjunction with the 24th Annual Conference on Neural Information Processing Systems (NIPS 2010). December 11, 2010. Whistler, BC, Canada.

License

The code for the Topical Guide is released under the terms of the AGPLv3 or any later version of that license. If for any reason you wish to use the code under other terms, please contact the Copyright Licensing Office, Brigham Young University, 3760 HBLL, Provo, UT 84602, (801) 422-9339 or 422-3821, Email: copyright AT byu DOT edu.

We also ask that if you use this code for academic purposes, any papers that result from the use of this code should cite the Gardner et al. paper referenced above.

Contributions to the code are welcome. Currently the best way to contribute is to email a patch to the textmining AT cs DOT byu DOT edu. Because of licensing issues we ask that you assign the copyright of any patch that you contribute to BYU.

Credits

Project Leaders: Eric Ringger and Kevin Seppi

Project Members: Jeff Lund, Chris Tensmeyer, Joey Cozza, Craig Jacobson, Schuyler Goodman

Alumni: Jared Forsyth, Matt Gardner, Josh Hansen, Tobias Kin Hou Lei, Joshua Lutes, Dan Walker,

Original Author: Joshua Lutes

Third-party software: See the list of third-party software used by the Topical Guide.

Contents

User Documentation

Programmer Documentation

Clone this wiki locally