Improving Tropical Cyclone Track Forecast

Overview

Between 2008 and 2022, the United States experienced a range of tropical cyclones, including hurricanes and tropical storms. Accurate prediction of tropical cyclone tracks is important for public safety and disaster preparedness because it reduces the risk to human life and property. The Global Ensemble Forecast System (GEFS) is a physics-based ensemble model created by the National Oceanic and Atmospheric Administration (NOAA) to make weather predictions from days to weeks into the future. However, the GEFS Mean can have relatively large biases that lead to large track errors. To correct the GEFS Mean 5-day forecasts for tropical cyclones in the Atlantic Basin, this research develops a CNN model trained on 5-day forecast sequences for tropical cyclones across the United States over a 15-year span (2008 to 2022).

Data Sources

Forecasts from the 21 GEFS ensemble members were obtained every 6 hours out to day 5 for 2008-2022. The tropical cyclone Best Tracks and GEFS tropical cyclone forecast tracks were retrieved from NOAA’s National Hurricane Center. Python version 3.9.13 on Jupyter Notebooks, a Python development environment, was used to develop the code for this project.

Methodology

Processing:

Some tropical cyclone ensemble member forecasts are missing from the GEFS data, so the missing entries were removed during preliminary screening. In addition, tropical cyclone sequences with a lifespan of less than 5 days were removed so that they would not affect accuracy.
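The screening step can be sketched roughly as follows. This is a minimal illustration, not the code in the repository: the data layout (a dict of sequence IDs mapped to lists of time, latitude, longitude tuples), the names `clean_sequences` and `make_track`, and the choice to drop a whole sequence when any position is missing are all assumptions made for the example.

```python
from datetime import datetime, timedelta

MIN_LIFESPAN = timedelta(days=5)  # 5-day threshold from the text


def clean_sequences(sequences):
    """Return only complete sequences that last at least 5 days.

    `sequences` (hypothetical layout) maps a sequence ID to a time-ordered
    list of (valid_time, lat, lon) tuples; lat or lon is None when missing.
    """
    kept = {}
    for seq_id, points in sequences.items():
        if not points:
            continue
        # Assumption: a sequence with any missing position is dropped whole.
        if any(lat is None or lon is None for _, lat, lon in points):
            continue
        # Lifespan is the time between the first and last point.
        if points[-1][0] - points[0][0] < MIN_LIFESPAN:
            continue
        kept[seq_id] = points
    return kept


def make_track(n_steps, missing_at=None):
    """Build a toy track with one point every 6 hours (illustrative data)."""
    start = datetime(2020, 8, 1)
    track = []
    for i in range(n_steps):
        lat = None if i == missing_at else 20.0 + 0.5 * i
        track.append((start + timedelta(hours=6 * i), lat, -60.0 - 0.5 * i))
    return track


example = {
    "full": make_track(21),               # spans 0 h to 120 h, kept
    "short": make_track(9),               # spans 0 h to 48 h, dropped
    "gap": make_track(21, missing_at=4),  # one missing latitude, dropped
}
print(sorted(clean_sequences(example)))
```

With 6-hourly points, 21 steps cover exactly 120 hours, which is why the "full" track passes the 5-day filter.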

Modeling:

The CNN model was built with the Python torch library. The input to the modeling process was the data frame produced by processing the data for the individual tropical cyclones. To randomize the order in which tropical cyclone sequences were fed to the CNN Model, the sequences in the data frame were shuffled using a constant seed, which keeps the shuffle reproducible. K-fold cross-validation with k = 5 was used to obtain a more reliable accuracy score.
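The seeded shuffle and 5-fold split can be illustrated with the standard library alone. The sketch below is an assumption about the mechanics, not the notebook's implementation: the function name `k_fold_indices` and the seed value 42 are hypothetical, and the notebook may well rely on a library helper instead.

```python
import random


def k_fold_indices(n_samples, k=5, seed=42):
    """Shuffle indices with a fixed seed and return k (train, val) splits.

    The seed value is an assumption; any constant gives a repeatable order.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)

    # Spread the remainder over the first folds so sizes differ by at most 1.
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        folds.append(indices[start:start + size])
        start += size

    splits = []
    for i in range(k):
        val = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        splits.append((train, val))
    return splits


# 2,218 is the training and validation sequence count given in the Steps section.
splits = k_fold_indices(2218, k=5)
for fold_number, (train, val) in enumerate(splits, start=1):
    print(fold_number, len(train), len(val))
```

Each sequence lands in exactly one validation fold, so averaging the five validation scores uses every training sequence once for evaluation.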

Evaluation:

To assess the accuracy of the CNN Model relative to the GEFS Mean, the haversine function was used to calculate distances in kilometers. For each forecast lead time, it gives the distance between the predicted latitude and longitude from the CNN Model or the GEFS Mean and the corresponding Best Track latitude and longitude.
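For reference, a standard haversine implementation looks like the sketch below. The function name `haversine_km`, the Earth radius of 6371 km, and the sample coordinates are assumptions for illustration; the repository may use a different radius or a library routine.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; the exact value is an assumption


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    # Clamp to 1.0 to guard against rounding slightly above the valid range.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


# Hypothetical forecast and Best Track positions at one lead time.
forecast_lat, forecast_lon = 25.4, -71.2
best_lat, best_lon = 25.0, -70.5
print(round(haversine_km(forecast_lat, forecast_lon, best_lat, best_lon), 1))
```

The returned distance is the total track error for that lead time when the two inputs are a forecast position and its matching Best Track position.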

The total track error (TTE) is the deviation of the forecast from the Best Track. The along-track error (ATE) represents how far ahead of or behind its expected position along the Best Track the forecast is. The cross-track error (CTE) represents how far the forecast has deviated sideways from the Best Track. ATE and CTE are calculated from TTE and the angle between the forecast and the Best Track.
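One common way to carry out this decomposition is to project the TTE onto the Best Track heading, as in the sketch below. The helper names `bearing_deg` and `track_error_components`, the sign conventions (positive ATE ahead, positive CTE to the right), and the planar treatment of the angle are assumptions; the notebooks may define the angle differently.

```python
import math


def bearing_deg(lat1, lon1, lat2, lon2):
    """Initial bearing from point 1 to point 2, in degrees clockwise from north."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dlam = math.radians(lon2 - lon1)
    y = math.sin(dlam) * math.cos(phi2)
    x = (math.cos(phi1) * math.sin(phi2)
         - math.sin(phi1) * math.cos(phi2) * math.cos(dlam))
    return math.degrees(math.atan2(y, x)) % 360.0


def track_error_components(best_prev, best_now, forecast, tte_km):
    """Split TTE into (ATE, CTE) using the angle to the Best Track heading.

    best_prev, best_now, forecast are (lat, lon) tuples in degrees.
    Assumed convention: ATE > 0 means the forecast is ahead of the storm,
    CTE > 0 means it lies to the right of the direction of motion.
    """
    heading = bearing_deg(*best_prev, *best_now)      # storm motion direction
    to_forecast = bearing_deg(*best_now, *forecast)   # direction of the error
    angle = math.radians(to_forecast - heading)
    return tte_km * math.cos(angle), tte_km * math.sin(angle)


# Storm moving due north; forecast placed directly ahead of the Best Track point.
ate, cte = track_error_components((20.0, -60.0), (21.0, -60.0), (22.0, -60.0), 100.0)
print(round(ate, 1), round(cte, 1))
```

Under this convention a negative ATE marks a forecast that lags behind the observed storm, and ATE squared plus CTE squared equals TTE squared.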

The Relative Improvement Percentage measures how much the CNN Model improves on the GEFS Mean. It is the percentage increase or decrease in total track error from the GEFS Mean to the CNN Model. A positive Relative Improvement Percentage signifies a reduction in the total track error, indicating that the CNN Model has produced predictions closer to the Best Track than the GEFS Mean.
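Read this way, the metric reduces to a one-line formula. The sketch below assumes the percentage is taken relative to the GEFS Mean TTE, which matches the stated sign convention; the function name `relative_improvement_pct` and the sample error values are hypothetical.

```python
def relative_improvement_pct(gefs_tte_km, cnn_tte_km):
    """Percentage reduction in TTE of the CNN Model relative to the GEFS Mean.

    Positive values mean the CNN Model is closer to the Best Track.
    """
    if gefs_tte_km <= 0:
        raise ValueError("GEFS Mean TTE must be positive to form a ratio")
    return (gefs_tte_km - cnn_tte_km) / gefs_tte_km * 100.0


# Made-up errors: the CNN Model cuts a 200 km error to 150 km.
print(relative_improvement_pct(200.0, 150.0))  # 25.0
```

A CNN Model error larger than the GEFS Mean error yields a negative value, which flags lead times where the correction made the forecast worse.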

Graphs of the median total track error by forecast lead time, with 95% confidence intervals, show that the CNN Model generally outperforms the GEFS Mean, especially at lead times between 0 and 96 hours. Graphs of the overall percentage performance by forecast lead time show that the CNN Model outperformed the GEFS Mean at lead times between 0 and 108 hours.

Additionally, the CNN Model was tested on new 2023 tropical cyclone data that it had not seen during training.
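The text does not say how the 95% confidence intervals were computed. One plausible approach is a percentile bootstrap of the median at each lead time, sketched below; the bootstrap itself, the function names, the resample count, the seed, and the sample errors are all assumptions for illustration.

```python
import random
import statistics


def median_with_ci(errors_km, n_boot=1000, seed=0):
    """Return (median, lower, upper) with a 95% percentile-bootstrap interval."""
    rng = random.Random(seed)
    n = len(errors_km)
    # Resample with replacement and record the median of each resample.
    boot = sorted(
        statistics.median(rng.choices(errors_km, k=n)) for _ in range(n_boot)
    )
    lower = boot[n_boot * 25 // 1000]        # 2.5th percentile
    upper = boot[n_boot * 975 // 1000 - 1]   # 97.5th percentile
    return statistics.median(errors_km), lower, upper


def summarize_by_lead_time(errors_by_lead):
    """Apply median_with_ci to each forecast lead time (hours -> errors)."""
    return {lead: median_with_ci(errs)
            for lead, errs in sorted(errors_by_lead.items())}


# Made-up TTE samples in kilometers for two lead times.
sample = {
    24: [40.0, 55.0, 62.0, 70.0, 85.0, 90.0, 110.0],
    96: [180.0, 220.0, 260.0, 300.0, 340.0, 410.0, 520.0],
}
for lead, (med, lo, hi) in summarize_by_lead_time(sample).items():
    print(lead, med, lo, hi)
```

Plotting the median against lead time, with the lower and upper bounds as error bars, gives a chart of the kind described above for both the GEFS Mean and the CNN Model.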

Steps:

Software: Python version 3.9.13 on Jupyter Notebooks.

Data:

The consolidated GEFS Mean tropical cyclone track predictions and Best Track data are stored in df_master_X.pkl.

Modeling:

Using the input data files, develop the CNN Model (or use the previously developed model, mod_five_0-1_500__3.joblib) with Modelling_2008_2022.ipynb. The tropical cyclones from 2008 to 2022 and their corresponding 5-day forecast sequences were split into 90% training and validation (2,218 sequences) and 10% holdout evaluation (177 sequences) datasets.

Evaluation:

Analyze the GEFS Mean and CNN Model tropical cyclone track predictions in Modelling_2008_2022.ipynb, for example by comparing the Total Track Error (TTE), Along-Track Error (ATE), and Cross-Track Error (CTE). Use the TTE to compute the Relative Improvement Percentage, which is the percentage increase or decrease from the GEFS Mean TTE to the CNN Model TTE. Additionally, graph the median TTE per forecast lead time with 95% confidence intervals and the Overall Percentage Performance per forecast lead time. Furthermore, plot example GEFS Mean and CNN Model tropical cyclone track predictions alongside the Best Track on a map.

Test the CNN Model on new 2023 tropical cyclone data that it had not seen, using Modelling_2023.ipynb.

Results

The CNN Model achieved a statistically significant lower total track error than the GEFS Mean developed by NOAA. It also improves on existing AI models developed for tropical cyclone track forecasting. The GEFS Mean has a high ATE, which leads to a high TTE. The CNN Model detected this and accelerated the forecasts, as indicated by the Relative Improvement Percentage. When evaluated on previously unseen tropical cyclone data from 2023, the CNN Model outperformed the GEFS Mean.

