I'm keeping my documents/source codes related to this Manning liveProject in my GitHub account here.
In this liveProject, you will gain an overall impression of the job of a Natural Language Processing (NLP) Specialist working on the Growth Hacking Team of a freshly launched startup introducing a new video game to the market.
One of the key targets of a growth hacking team is to drive rapid growth for early startups. This often involves strategies to acquire as many customers as possible at the lowest cost. As part of the strategy, your boss wants to map the video game market to understand how customers evaluate competitors’ products — namely, what they like and dislike in a video game.
Knowing what makes a video game attractive to gamers helps the marketing team craft a more effective product message. To achieve this, you will dive into customer reviews to extract insights using various NLP methods.
Your main responsibilities in this project include:
- Download the dataset of Amazon reviews.
- Create a filtered dataset for video game reviews.
- Assign a sentiment score between -1 and 1 to each review.
- Compare sentiment scores with star ratings to evaluate accuracy.
- Experiment with multiple sentiment analysis methods.
- Classify reviews as Positive, Negative, or Neutral.
- Summarize findings to highlight what gamers like or dislike about video games.
To deeply understand gamers’ opinions, the project uses:
- Imbalanced Dataset Sampling – with the
imbalanced-learn
package. - Dictionary-based Sentiment Analysis – using NLTK sentiment tools.
- Model Evaluation – via scikit-learn metrics.
- Neural Network-based Analysis – using the DistilBERT model with PyTorch, transformers, and simpletransformers.
- Data Visualization – via Altair to show key “liked” and “disliked” terms.
- Interactive Web Application – powered by Streamlit for real-time sentiment analysis.
- Creating Your Dataset
- Dictionary-based Sentiment Analyzer
- Evaluator for Dictionary-based Analyzer
- Neural Network-based Sentiment Analyzers
- Reporting and Visualization
- Interactive Streamlit Demonstration
The Amazon review dataset can be downloaded from here.
Use the Video Games 5-core JSON file from Small subsets for experimentation.
An interactive Streamlit application was added to make the project more engaging and accessible for both technical and non-technical audiences.
Live Demo Features:
- Real-time sentiment analysis of text.
- Interactive gauge charts showing sentiment scores.
- Sample reviews for quick testing.
- Rule-based sentiment classification.
- Mobile-friendly interface.
- Install Python (3.8+ recommended).
- Clone the repository:
git clone https://github.com/longlivewama/sentiment_analysis_project.git cd sentiment_analysis_project