Sentiment Analysis API

CI License: MIT Python 3.8+

A robust sentiment analysis API powered by state-of-the-art transformer models. This API can analyze text sentiment with high accuracy, handling both short and long texts through intelligent chunking.

Features

  • πŸš€ Fast and Accurate: Powered by pre-trained transformer models
  • πŸ“ Long Text Support: Automatically chunks and analyzes texts longer than model capacity
  • 🎯 Detailed Analysis: Returns confidence scores, sentiment distribution, and chunk-level analysis
  • πŸ”§ RESTful API: Easy-to-use endpoints with comprehensive documentation
  • πŸ“Š Token Counting: Check text length before analysis
  • 🐳 Docker Support: Ready-to-deploy Docker configuration
  • βœ… Well-Tested: Comprehensive test suite with unit and integration tests

Project Structure

sentiment-analysis/
β”œβ”€β”€ sentiment_analysis/         # Main package
β”‚   β”œβ”€β”€ __init__.py
β”‚   β”œβ”€β”€ app.py                 # FastAPI application setup
β”‚   β”œβ”€β”€ api/                   # API layer
β”‚   β”‚   β”œβ”€β”€ __init__.py
β”‚   β”‚   └── routes.py          # Route definitions
β”‚   β”œβ”€β”€ core/                  # Core business logic
β”‚   β”‚   β”œβ”€β”€ __init__.py
β”‚   β”‚   └── analyzer.py        # Sentiment analysis logic
β”‚   β”œβ”€β”€ models/                # Data models
β”‚   β”‚   β”œβ”€β”€ __init__.py
β”‚   β”‚   └── schemas.py         # Pydantic schemas
β”‚   └── tests/                 # Test suite
β”‚       β”œβ”€β”€ __init__.py
β”‚       β”œβ”€β”€ test_analyzer.py   # Unit tests
β”‚       └── test_integration.py # Integration tests
β”œβ”€β”€ .github/                   # GitHub Actions workflows
β”‚   └── workflows/
β”‚       └── ci.yml
β”œβ”€β”€ main.py                    # Application entry point
β”œβ”€β”€ requirements.txt           # Production dependencies
β”œβ”€β”€ requirements-dev.txt       # Development dependencies
β”œβ”€β”€ setup.py                   # Package setup
β”œβ”€β”€ Dockerfile                 # Docker configuration
β”œβ”€β”€ LICENSE                    # MIT License
β”œβ”€β”€ CONTRIBUTING.md            # Contribution guidelines
└── README.md                 # This file

Installation

Using pip

# Clone the repository
git clone https://github.com/yourusername/sentiment-analysis.git
cd sentiment-analysis

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install the package
pip install -e .

# For development
pip install -e ".[dev]"

Using Docker

# Build the image
docker build -t sentiment-analysis .

# Run the container
docker run -p 8000:8000 sentiment-analysis

Quick Start

  1. Start the API server:

    python main.py
  2. Access the interactive API documentation at http://localhost:8000/docs (Swagger UI) or http://localhost:8000/redoc (ReDoc), both served automatically by FastAPI.

  3. Make your first request:

    curl -X POST "http://localhost:8000/api/v1/analyze" \
         -H "Content-Type: application/json" \
         -d '{"text": "I love this API! It works great."}'

API Endpoints

Sentiment Analysis

Endpoint: POST /api/v1/analyze

Analyzes the sentiment of provided text.

Request:

{
  "text": "Your text to analyze"
}

Response:

{
  "label": "POSITIVE",
  "score": 0.9876,
  "raw_scores": [0.001, 0.002, 0.009, 0.9876, 0.0004],
  "numerical_sentiment": 1.0,
  "sentiment_distribution": {"POSITIVE": 100.0},
  "confidence_level": "high",
  "num_chunks": 1,
  "chunk_votes": {"POSITIVE": 1}
}

Token Counting

Endpoint: POST /api/v1/count-tokens

Counts tokens in the provided text.

Request:

{
  "text": "Your text to count tokens"
}

Response:

{
  "token_count": 8,
  "max_tokens": 512,
  "percentage": 1.56,
  "truncated": false
}

Model Information

Endpoint: GET /api/v1/model-info/capacity

Returns the maximum token capacity of the model.

Response:

{
  "max_tokens": 512
}

Health Check

Endpoint: GET /api/v1/health

Check service health status.

Response:

{
  "status": "healthy",
  "service": "sentiment-analysis-api"
}

Long Text Analysis

The API automatically handles long texts that exceed the model's token limit (512 tokens) by:

  1. Chunking: Dividing text into overlapping segments
  2. Individual Analysis: Analyzing each chunk separately
  3. Aggregation: Combining results using:
    • Voting: Most common sentiment wins
    • Weighted Scoring: Confidence-weighted averaging
    • Distribution Analysis: Percentage breakdown by sentiment
    • Confidence Assessment: Based on agreement between chunks
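The aggregation steps above can be sketched as follows. This is illustrative, not the project's actual implementation: `aggregate_chunks`, its input shape (one `(label, score)` pair per chunk), and the confidence thresholds are all assumptions:

```python
# Illustrative sketch of chunk-result aggregation: majority vote, a
# confidence-weighted score for the winning label, a percentage
# distribution, and a confidence level based on chunk agreement.
from collections import Counter


def aggregate_chunks(chunk_results):
    """chunk_results: list of (label, score) pairs, one per chunk."""
    votes = Counter(label for label, _ in chunk_results)
    winner, win_count = votes.most_common(1)[0]
    total = len(chunk_results)
    # Average confidence of the chunks that voted for the winner.
    win_scores = [s for label, s in chunk_results if label == winner]
    distribution = {
        label: round(100.0 * n / total, 2) for label, n in votes.items()
    }
    # Confidence reflects how strongly the chunks agree (thresholds assumed).
    agreement = win_count / total
    confidence = "high" if agreement >= 0.8 else "medium" if agreement >= 0.5 else "low"
    return {
        "label": winner,
        "score": sum(win_scores) / len(win_scores),
        "sentiment_distribution": distribution,
        "confidence_level": confidence,
        "num_chunks": total,
        "chunk_votes": dict(votes),
    }
```

Note that a single-chunk text degenerates to the response shown earlier: one vote, a 100% distribution, and the chunk's own score.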

Development

Running Tests

# Run all tests
pytest

# Run with coverage
pytest --cov=sentiment_analysis --cov-report=html

# Run specific test file
pytest sentiment_analysis/tests/test_analyzer.py -v

Code Quality

# Format code
black sentiment_analysis/

# Lint code
flake8 sentiment_analysis/

# Type checking
mypy sentiment_analysis/

Pre-commit Hooks

# Install pre-commit
pip install pre-commit

# Set up hooks
pre-commit install
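A `.pre-commit-config.yaml` wiring up the tools above might look like this; the `rev` pins are placeholders, so match them to the versions your project actually uses:

```yaml
# Illustrative .pre-commit-config.yaml for black, flake8, and mypy.
repos:
  - repo: https://github.com/psf/black
    rev: 24.3.0  # pin to your project's version
    hooks:
      - id: black
  - repo: https://github.com/PyCQA/flake8
    rev: 7.0.0
    hooks:
      - id: flake8
  - repo: https://github.com/pre-commit/mirrors-mypy
    rev: v1.9.0
    hooks:
      - id: mypy
```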

Configuration

The API uses the following model by default:

  • Model: tabularisai/robust-sentiment-analysis
  • Max tokens: 512
  • Classes: 5 (Very Negative, Negative, Neutral, Positive, Very Positive)

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.


Made with ❀️ by the Sentiment Analysis API Team

About

🎯 Sentiment analysis without the computational overhead. A practical example of frugal AI - using the right tool for the job instead of throwing LLMs at everything. Open-source toolbox project.
