NLP-Toolkit-App

Overview

The NLP Toolkit is a Streamlit-based web application for performing basic text processing and analysis using Python. It leverages NLTK, SpaCy, and other NLP libraries to provide features such as tokenization, stemming, lemmatization, POS tagging, n-grams, Bag-of-Words, TF-IDF, parsing, Named Entity Recognition (NER), sentiment analysis, and visualization.

Features

Tokenization (words and sentences)
Stopwords removal
Stemming (Porter, Lancaster, Snowball)
Lemmatization
POS tagging
N-grams generation
Bag-of-Words representation
TF-IDF representation
Dependency parsing
Named Entity Recognition (NER)
Sentiment analysis
Word frequency plots and WordClouds
Download original and processed tokens

Installation

Clone the repository:

git clone <repository_url>
cd <repository_folder>

Create a virtual environment (optional but recommended):

python -m venv venv
source venv/bin/activate   # Linux/Mac
venv\Scripts\activate      # Windows

Install dependencies:

pip install -r requirements.txt

Download the SpaCy English model:

python -m spacy download en_core_web_sm

Usage

Run the Streamlit app:

streamlit run app.py

In the sidebar, choose your input source:
- Paste text
- Upload .txt or .csv file (text should be in the first column)
Select the desired NLP options and parameters:
- Tokenization
- Stopwords removal
- Stemming / Lemmatization
- POS tagging
- N-grams
- Bag-of-Words / TF-IDF
- Parsing / NER
- Sentiment analysis
Click Run NLP to process the text.
Visualizations and downloadable token files are available after processing.

Requirements

See requirements.txt for all dependencies.

NLTK Data

The app automatically downloads required NLTK datasets if not present:

punkt
averaged_perceptron_tagger
wordnet
omw-1.4
stopwords

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
fonts		fonts
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
sample.txt		sample.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NLP-Toolkit-App

Overview

Features

Installation

Usage

Requirements

NLTK Data

License

About

Uh oh!

Releases

Packages

Languages

HARSHAVINJAMURI/NLP-Toolkit-App

Folders and files

Latest commit

History

Repository files navigation

NLP-Toolkit-App

Overview

Features

Installation

Usage

Requirements

NLTK Data

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages