|
1 | 1 | # sevir_challenges |
2 | | -A collection of tasks and baseline models for the SEVIR weather dataset |
3 | | - |
| 2 | +A collection of challenges and baseline models for the SEVIR weather dataset. |
4 | 3 |
|
5 | 4 | ## Obtaining SEVIR data |
6 | 5 |
|
7 | | -To obtian the dataset used in each of the challenges, run the following command |
| 6 | +The challenges in this repo are based on the [SEVIR weather dataset](https://proceedings.neurips.cc//paper/2020/hash/fa78a16157fed00d7a80515818432169-Abstract.html). This dataset is made of up sequences of weather imagery sampled and aligned across radar and satellite. It was constucted as a benchmark dataset to support algorithm development in meterology. For a detailed tutorial on this dataset, see [the SEVIR tutorial.](https://nbviewer.jupyter.org/github/MIT-AI-Accelerator/eie-sevir/blob/master/examples/SEVIR_Tutorial.ipynb) |
| 7 | + |
| 8 | +SEVIR is currently available for download from the [AWS Open Data registry](https://registry.opendata.aws/sevir/). In total, the dataset is approximately 1TB in size, however smaller samples of the full dataset are provided for selected challenges (see `s3://sevir/data/processed/`). To construct larger datasets, you can download SEVIR using one of the following methods: |
| 9 | + |
| 10 | +### Using AWS CLI |
| 11 | + |
| 12 | +If you have [AWS CLI](https://docs.aws.amazon.com/cli/latest/userguide/install-cliv2.html), you can download SEVIR using the |
8 | 13 |
|
9 | | -To download, install AWS CLI, and download all of SEVIR (~1TB) to your current directory run |
10 | 14 | ``` |
11 | 15 | aws s3 sync --no-sign-request s3://sevir . |
12 | 16 | ``` |
13 | | -Each of the benchmarks in this repo use a subset of the full SEVIR dataset. |
14 | 17 |
|
| 18 | +To download only a specific modalitiy, e.g. `vil`, you can instead run |
| 19 | + |
| 20 | +``` |
| 21 | +aws s3 cp --no-sign-request s3://sevir/CATALOG.csv CATALOG.csv |
| 22 | +aws s3 sync --no-sign-request s3://sevir/data/vil . |
| 23 | +``` |
| 24 | + |
| 25 | +### Using `boto3` moduels |
| 26 | + |
| 27 | +Using the python `boto3` modules (`conda install boto3`) you can obtain SEVIR data by first connecting to the S3 bucket |
| 28 | + |
| 29 | +```python |
| 30 | +import boto3 |
| 31 | +from botocore.handlers import disable_signing |
| 32 | +resource = boto3.resource('s3') |
| 33 | +resource.meta.client.meta.events.register('choose-signer.s3.*', disable_signing) |
| 34 | +bucket=resource.Bucket('sevir') |
| 35 | +``` |
| 36 | + |
| 37 | +Then, get a list of files using |
| 38 | + |
| 39 | +``` |
| 40 | +objs=bucket.objects.filter(Prefix='') |
| 41 | +print([o.key for o in objs]) |
| 42 | +``` |
| 43 | + |
| 44 | +Finally, download files of interest from this list, e.g. |
| 45 | + |
| 46 | +```pthon |
| 47 | +bucket.download_file('CATALOG.csv','/home/data/SEVIR/CATALOG.csv') |
| 48 | +bucket.download_file('data/vil/2017/SEVIR_VIL_STORMEVENTS_2017_0701_1231.h5','/home/data/SEVIR/data/vil/2017/SEVIR_VIL_STORMEVENTS_2017_0701_1231.h5') |
| 49 | +#... etc |
| 50 | +``` |
15 | 51 |
|
16 | 52 |
|
17 | 53 |
|
|
0 commit comments