This project visualizes the most discussed research topics about carbon emissions across districts (latitude). Using scraped keyword data, we employ LDA (Latent Dirichlet Allocation) modeling and Typhoon API for topic prediction, providing insightful visualizations for carbon emission trends.
- Keyword Analysis: Extracts key topics from research data.
- LDA Modeling: Clusters research topics for deeper insights.
- Interactive Visualization: Displays topic distribution across districts by latitude.
- Programming Language: Python
- Libraries: Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, Gensim
- API: Typhoon API for topic name predictions
- Data Format: CSV
- Clone the repository:
git clone https://github.com/VizKey01/Datasci_project.git
cd Datasci_project - Install required libraries:
pip install -r requirements.txt Place your scraped research keyword data in the data/ folder as a CSV file.
Run the script for topic modeling and visualization:
python main.py - A graphical representation of carbon emission research topics by district latitude.
- CSV and JSON files of processed topic distributions.
Feel free to fork the repository, create a branch, and submit a pull request. Contributions are always welcome!