Stacked Perceiver Variational Autoencoder
- Dataset class
- Patch embedding
- Encoder-Decoder
- Train
- Mixed precision computing
- Distributed computing
- Model checkpointing
- BUG: Loss values are all NaNs
- Weight loss by the area of each grid cell
- Per-variable weight for loss
- Refactor code; clean it up
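
The two loss-weighting items above could be combined roughly as follows. This is only a minimal sketch under stated assumptions, not the project's implementation: it assumes a regular latitude-longitude grid (so that cell area is proportional to the cosine of latitude), a mean-squared-error loss, and data indexed as [variable][lat][lon]. The names `latitude_area_weights`, `weighted_mse`, and `var_weights` are hypothetical.

```python
import math


def latitude_area_weights(lats_deg):
    """Return one weight per latitude row, proportional to cos(latitude).

    Assumption: regular lat-lon grid, where cell area scales with cos(lat).
    Weights are normalized so that their mean is 1.
    """
    raw = [math.cos(math.radians(lat)) for lat in lats_deg]
    mean = sum(raw) / len(raw)
    return [w / mean for w in raw]


def weighted_mse(pred, target, lats_deg, var_weights):
    """Area- and variable-weighted mean squared error.

    pred, target: nested lists indexed [variable][lat][lon] (assumed layout).
    var_weights:  one non-negative weight per variable (hypothetical values).
    """
    area_w = latitude_area_weights(lats_deg)
    total = 0.0
    for v, w_var in enumerate(var_weights):
        n_lon = len(pred[v][0])
        var_sum = 0.0
        for i, w_lat in enumerate(area_w):
            for j in range(n_lon):
                diff = pred[v][i][j] - target[v][i][j]
                var_sum += w_lat * diff * diff
        # Area-weighted mean over the grid for this variable.
        total += w_var * var_sum / (len(area_w) * n_lon)
    # Normalize so the result does not depend on the scale of var_weights.
    return total / sum(var_weights)
```

In a tensor framework the same idea would typically be written as a broadcast multiply of the squared error by a latitude weight vector and a per-variable weight vector; the loops here are only meant to make the weighting explicit.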