- Introduction and Motivation
- Project Description
- Datasets
- Project Structure
- Instructions
- Data Pipeline
- Analysis
- Contact Information
- License
Climate change is a pressing global issue with significant implications for environmental, economic, and social systems. In Europe, rising temperatures have raised concerns about their potential impact on energy consumption patterns. This project aims to investigate whether there has been a noticeable increase in temperatures in European countries since the 2000s and how energy consumption has changed over this period. Understanding these trends is essential to address the main question: Does a potential temperature increase correlate with energy consumption in Europe?
This project investigates the correlation between temperature increase and energy consumption in European countries. It follows a pipeline that includes data extraction, cleaning, transformation, and integration. A detailed description of the data origin preprocessing and preprocessing can be found in project/data-report.pdf. The final data is saved into data/final_data.csv. An analysis is then performed on the final data to answer the research question in project/analysis-report.pdf. The results of the analysis are saved into plots/.
In this project, unit tests have been implemented to verify the correctness of the data pipeline. These tests ensure that the pipeline produces the expected output. Additionally, GitHub Actions have been set up to automatically trigger these unit tests whenever changes are pushed to the repository.
The project combines two datasets to analyze the trends and patterns:
-
Climate Change Indicators: Measures temperature change relative to a baseline climatology corresponding to the period from 1951 to 1980. It is a CSV file and is licensed under CC0. Source
-
Primary Energy Consumption EU: Provides data about the primary energy consumption in the EU. The dataset is distributed by Eurostat, the statistical office of the European Union. The data format is CSV and it is licensed under the Open Data License. Source
For more detailed information, see project/data-report.pdf.
├───.github/
│ └───workflows/
├───data/
├───exercises/
├───plots/
├───project/
│ ├───analysis-report.pdf
│ ├───analysis.py
│ ├───data-report.pdf
│ ├───downloader.py
│ ├───pipeline.py
│ ├───pipeline.sh
│ ├───preprocessing.py
│ ├───project_plan.md
│ ├───requirements.txt
│ ├───tests.sh
│ └───unit_tests.sh
└───sample_data/
├───eurostat_sample.csv
└───kaggle_sample.csv
Important files:
project/pipeline.shorproject/pipeline.py: Executes the pipeline and saves the data intodata/final_data.csv. Make sure that the dependencies contained inproject/requirements.txtare installed and that a Kaggle API key is installed on the system.projects/analysis.py: Analyses the data and saves the according plots intoplots/.project/data-report.pdf: Provides a detailed overview of the original data and what changes are made to obtain the final data.project/analysis-report.pdf: Analysis of the research question.
- Clone the repository to your local machine with
git clone https://github.com/dennismstfc/FAU-MADE-project. - Install the dependencies from
project/requirements.txtusing pip:pip install -r project/requirements.txt. - Make sure you have a Kaggle API key installed on your system. See the documentation for more.
- Run the pipeline script:
python project/pipeline.pyorsh project/pipeline.sh. - Run the analysis script:
python projects/analysis.py.
The analysis is divided into three main parts:
- Surface Temperature Trends
- Energy Consumption Patterns
- Correlation Analysis
For a more comprehensive understanding of these trends and their implications, please refer to the detailed report available at project/analysis-report.pdf.
For any additional questions or comments, please contact me at dennis.mustafic@fau.de
This project is licensed under the terms of the CC-BY-4.0 license.



