You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/models/anysat.md
+9-5Lines changed: 9 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# AnySat (`anysat`)
2
2
3
-
> Multi-frame Sentinel-2 time-series adapter that builds AnySat inputs (`s2` + `s2_dates`) from a temporal window and returns patch-grid features or pooled vectors.
3
+
> Multi-frame Sentinel-2 time-series adapter that builds AnySat inputs (`s2` + `s2_dates`) from a temporal window and returns dense sub-patch grids by default or pooled vectors.
4
4
5
5
## Quick Facts
6
6
@@ -53,11 +53,13 @@ For `input_chw`, the adapter accepts either `CHW` or `TCHW` with `C=10`. A `CHW`
53
53
<spanclass="pipeline-arrow">-></span> build AnySat side inputs
|`RS_EMBED_ANYSAT_CKPT`| unset | Local checkpoint override |
@@ -85,7 +89,7 @@ Important constraint:
85
89
86
90
## Output Semantics
87
91
88
-
AnySat follows the standard patch-grid pattern for multi-frame adapters. `pooled` applies spatial pooling over the patch grid, and `grid`returns `(D,H,W)`in model patch space rather than georeferenced raster pixels. The more distinctive AnySat details, such as frame packaging and `doy0_values`, are recorded in metadata rather than requiring a long per-page output section.
92
+
AnySat now uses two spatial output paths inside the adapter. `pooled`defaults to the historical rs-embed behavior and applies spatial pooling over the AnySat `patch` grid, which preserves the previous pooled vector dimensionality; pass `pooled_source="tile"` (or set `RS_EMBED_ANYSAT_POOLED_SOURCE=tile`) to use the native AnySat tile embedding instead. `grid`defaults to AnySat `dense`, so the returned `(D,H,W)`is a denser sub-patch feature map by default; pass `grid_feature_mode="patch"` to the public API (or set `RS_EMBED_ANYSAT_GRID_MODE=patch`) to recover the older patch-grid behavior. As with other on-the-fly models, this grid is model space rather than guaranteed georeferenced raster pixels.
Copy file name to clipboardExpand all lines: docs/models_reference.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -48,7 +48,7 @@ Use this table for a first-pass side-by-side comparison of input assumptions and
48
48
|`satmaepp`|`rshf.satmaepp.SatMAEPP`| 10m | S2 RGB (`B4,B3,B2`) | raw SR -> `/10000` -> RGB `uint8`; SatMAE++ fMoW eval preprocessing (`Normalize + Resize(short side) + CenterCrop`), default channel order `bgr`| default 224; source-aligned short-side resize + center crop; no pad | token sequence -> pooled or patch-token grid | High |
49
49
|`satmaepp_s2_10b`| SatMAE++ grouped-channel source branch (`models_mae_group_channels.py`, `base` / `large` runtime families) | 10m | S2 SR 10-band (`B2,B3,B4,B5,B6,B7,B8,B8A,B11,B12`) | clip `0..10000`; source Sentinel min/max mapping to `uint8`; `ToTensor + Resize(short side) + CenterCrop`| default 96 with patch size 8; source-style resize/crop; no pad | grouped token sequence -> pooled or group-reduced spatial token grid | High |
50
50
|`scalemae`|`rshf.scalemae.ScaleMAE` (ViT style) | 10m | S2 RGB (`B4,B3,B2`) + `input_res_m`| raw SR -> `/10000` -> RGB `uint8`; CLIP norm tensor; pass `input_res_m`| default 224; CLIP path has `Resize + CenterCrop`; no pad | token sequence or pooled vector depending on wrapper output | Medium |
51
-
|`anysat`| AnySat from upstream `hubconf.py` (`AnySat`, `tiny` / `small` / `base`) | 10m | S2 10-band TCHW (or CHW auto-expanded) | clip to `0..10000`; normalize mode default `per_tile_zscore`; builds per-frame `s2_dates`| resize TCHW to default 24; no crop, no pad | patch output `[D,H,W]`, pooled by spatial mean/max | Medium |
51
+
|`anysat`| AnySat from upstream `hubconf.py` (`AnySat`, `tiny` / `small` / `base`) | 10m | S2 10-band TCHW (or CHW auto-expanded) | clip to `0..10000`; normalize mode default `per_tile_zscore`; builds per-frame `s2_dates`| resize TCHW to default 24; no crop, no pad |grid defaults to dense sub-patch output `[D,H,W]`; pooled defaults to patch-grid mean/max, optional native tile vector| Medium |
52
52
|`galileo`|`Encoder` from official `single_file_galileo.py`| 10m | S2 10-band TCHW (or CHW auto-expanded) | clip to `0..10000`; normalize mode default `unit_scale`; constructs Galileo tensors with configurable `T` + per-frame `months`, optional NDVI channel | default 64 with patch 8; bilinear resize; no pad | pooled token vector and S2-group token grid | Medium |
53
53
|`wildsat`| WildSAT backbone + optional image head from checkpoint | 10m | S2 RGB CHW | clip to `0..10000` then `/10000`; default normalization `minmax`; convert to `uint8` then unit tensor | default 224; resize RGB; no pad | pooled branch output and optional grid (token or feature path) | Medium-Low |
54
54
|`prithvi`| Vendored `PrithviMAE` runtime with HF checkpoints | 30m | S2 6-band (`BLUE,GREEN,RED,NIR_NARROW,SWIR_1,SWIR_2`) | raw SR -> `/10000` -> clamp `[0,1]`; prep mode from env | default mode `resize` to 224; optional `pad` to patch multiple (legacy) | token sequence -> pooled or patch-token grid | Medium |
@@ -126,7 +126,7 @@ This table only lists env vars that materially change model input construction o
0 commit comments