You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Complete list of packages can be installed with `pip install -r requirements.txt`.
25
25
26
+
For linting further install `pre-commit install --install-hooks`. Then you can run `pre-commit run --all-files`.
27
+
28
+
## Quickstart
29
+
30
+
A small subset of the data is available for easy experimentation. Download with:
31
+
```
32
+
from huggingface_hub import snapshot_download
33
+
34
+
snapshot_download(
35
+
repo_id="deinal/spacecast-data-small",
36
+
repo_type="dataset",
37
+
local_dir="data_small"
38
+
)
39
+
```
40
+
41
+
Training can then be run immediately on the preprocessed data with readily available graphs.
42
+
```
43
+
python -m neural_lam.train_model \
44
+
--config_path data_small/vlasiator_config.yaml \
45
+
--model graph_efm \
46
+
...
47
+
```
48
+
26
49
## Data
27
50
28
-
The data is stored in `Zarr` format on [Zenodo](https://zenodo.org/records/16930055). To create a training-ready dataset with [mllam-data-prep](https://github.com/mllam/mllam-data-prep), run:
51
+
The data is stored in [Zarr](https://zarr.dev) format on [Hugging Face](https://huggingface.co/datasets/deinal/spacecast-data).
52
+
53
+
It can be downloaded to a local `data` directory with:
29
54
```
30
-
mllam_data_prep data/vlasiator_mdp.yaml
55
+
from huggingface_hub import snapshot_download
56
+
57
+
snapshot_download(
58
+
repo_id="deinal/spacecast-data",
59
+
repo_type="dataset",
60
+
local_dir="data"
61
+
)
62
+
```
63
+
64
+
The folder will then follow the assumed structure of neural-lam:
65
+
```
66
+
data/
67
+
├── graph/ - Directory containing graphs for training
68
+
├── run_1.zarr/ - Vlasiator run 1 with ρ = 0.5 cm⁻³ solar wind
69
+
├── run_2.zarr/ - Vlasiator run 2 with ρ = 1.0 cm⁻³ solar wind
70
+
├── run_3.zarr/ - Vlasiator run 3 with ρ = 1.5 cm⁻³ solar wind
71
+
├── run_4.zarr/ - Vlasiator run 4 with ρ = 2.0 cm⁻³ solar wind
72
+
├── static.zarr/ - Static features x, z, r coordinates
73
+
├── vlasiator_config.yaml - Configuration file for neural-lam
74
+
├── vlasiator_run_1.yaml - Configuration file for datastore 1, referred to from vlasiator_config.yaml
75
+
├── vlasiator_run_2.yaml - Configuration file for datastore 2, referred to from vlasiator_config.yaml
76
+
├── vlasiator_run_3.yaml - Configuration file for datastore 3, referred to from vlasiator_config.yaml
77
+
└── vlasiator_run_4.yaml - Configuration file for datastore 4, referred to from vlasiator_config.yaml
31
78
```
32
79
33
-
Simple, multiscale, and hierarchical graphs are created and stored in `.pt` format using the following commands:
80
+
Preprocess the runs with [mllam-data-prep](https://github.com/mllam/mllam-data-prep), run:
with `--graph` as `simple`, `multiscale` or `hierarchcial` and `--save`is the name of the output file.
100
+
with `--graph` as `simple`, `multiscale` or `hierarchcial` and `--save`specifies the name of the output file.
45
101
46
102
## Logging
47
103
@@ -107,6 +163,19 @@ where a model checkpoint from a given path given to the `--load` in `.ckpt` form
107
163
108
164
## Cite
109
165
166
+
ML dataset
167
+
```
168
+
@misc{vlasiator2025mldata,
169
+
title={Vlasiator Dataset for Machine Learning Studies},
170
+
author={Zaitsev, Ivan and Holmberg, Daniel and Alho, Markku and Bouri, Ioanna and Franssila, Fanni and Jeong, Haewon and Palmroth, Minna and Roos, Teemu},
title={Graph-based Neural Space Weather Forecasting},
@@ -115,3 +184,4 @@ where a model checkpoint from a given path given to the `--load` in `.ckpt` form
115
184
year={2025}
116
185
}
117
186
```
187
+
This work is based on code using a single run dataloader at commit: https://github.com/fmihpc/spacecast/commit/937094079c1364ec484d3d1647e758f4a388ad97.
0 commit comments