In this project, I dip my toes into NLP by exploring the basic functionality of NLTK (the Natural Language Toolkit) and the IBM Tone Analyzer. Since I do not have much experience in NLP, I used this project to learn about tokenizing texts, stripping out their stopwords, plotting word frequencies, and analyzing text with the IBM Tone Analyzer.

Run the program with the command "python nlp_practice.py" in the terminal. It reads two text files, tokenizes them, removes all English stopwords listed in nltk.corpus, prints the 20 most frequently used words (excluding stopwords), and analyzes both texts with the IBM Tone Analyzer. Before running it, replace the api_key and url with your own IBM credentials.
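
The tokenize, filter, and count steps described above can be sketched roughly as follows. This is not the code in nlp_practice.py: to stay runnable without any downloads, it swaps NLTK's tokenizer for a simple regular expression and the nltk.corpus stopword list for a tiny hand-written set. The names STOPWORDS, tokenize, remove_stopwords, and top_words are hypothetical and chosen only for illustration.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword set. The project itself uses the
# full English stopword list from nltk.corpus instead.
STOPWORDS = {
    "the", "of", "and", "a", "to", "in", "is", "that", "it",
    "for", "as", "with", "on", "are", "be", "this", "by", "not",
}


def tokenize(text):
    """Lowercase the text and split it into word tokens.

    A regex stand-in for an NLTK tokenizer: it keeps runs of letters and
    allows one internal apostrophe (e.g. "don't").
    """
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def remove_stopwords(tokens, stopwords=STOPWORDS):
    """Return the tokens that are not in the stopword set."""
    return [token for token in tokens if token not in stopwords]


def top_words(text, n=20):
    """Return the n most frequent non-stopword tokens as (word, count) pairs."""
    tokens = remove_stopwords(tokenize(text))
    return Counter(tokens).most_common(n)


if __name__ == "__main__":
    sample = "The light of the sun is a light that shines on the world."
    for word, count in top_words(sample, n=3):
        print(word, count)
```

To try it on one of the repository's texts, read the file into a string first (for example with open("sinners_in_the_hands_of_an_angry_god.txt").read()) and pass it to top_words. With the full NLTK stopword list the results would likely differ, since the set here covers only a handful of words.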

Files:
.DS_Store
README.md
a_divine_and_supernatural_light.txt
nlp_practice.py
sinners_in_the_hands_of_an_angry_god.txt

derek8bai/cs98_hack_a_thing_1