You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+23-23Lines changed: 23 additions & 23 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -68,7 +68,6 @@ assume complete data availability across all modalities. We present **CrossOver*
68
68
69
69
70
70
# :newspaper: News
71
-
<!-- > 📡 Stay tuned for stronger checkpoint release trained on many more datasets! -->
72
71
-**Version 1.0** - **CrossOver is now stronger than ever**. We recommend updating to this version; changes include:
73
72
- More powerful pre-trained checkpoints; now available on Huggingface 👉 [here](https://huggingface.co/gradient-spaces/CrossOver/tree/main).
74
73
- Support for 2 additional datasets - ARKitScenes & MultiScan
@@ -132,26 +131,6 @@ See [DATA.MD](DATA.md) for detailed instructions on data download, preparation a
132
131
133
132
# :film_projector: Demo
134
133
135
-
## Scene Retrieval Demo
136
-
137
-
This demo script allows users to process a custom scene and retrieve the closest match from the supported datasets using different modalities. Detailed usage can be found inside the script. Example usage below:
138
-
139
-
```bash
140
-
$ python demo/demo_scene_retrieval.py
141
-
```
142
-
143
-
Various configurable parameters:
144
-
145
-
-`--query_path`: Path to the query scene file (eg: `./example_data/dining_room/scene_cropped.ply`).
146
-
-`--database_path`: Path to the precomputed embeddings of the database scenes downloaded before (eg: `./release_data/embed_scannet.pt`).
147
-
-`--query_modality`: Modality of the query scene, Options: `point`, `rgb`, `floorplan`, `referral`
148
-
-`--database_modality`: Modality used for retrieval. Same options as above.
149
-
-`--ckpt`: Path to the pre-trained scene crossover model checkpoint (details [here](#checkpoints)), example_path: `./checkpoints/scene_crossover_scannet+scan3r.pth/`).
150
-
151
-
For embedding and pre-trained model download, refer to [generated embedding data](DATA.md#generated-embedding-data) and [checkpoints](#checkpoints) sections.
152
-
153
-
> We also provide scripts for inference on a single scan of the supported datasets. Details in **Single Inference** section in [TRAIN.md](TRAIN.md).
154
-
155
134
## Instance Retrieval Demo
156
135
157
136
This demo script allows users to process a custom object and run cross-modal retrieval to find the closest matched object within a target scene . Detailed usage can be found inside the script. Example usage below:
@@ -173,6 +152,27 @@ Various configurable parameters:
173
152
-`--top_k`: Number of top results to return - default: `5`
174
153
175
154
155
+
## Scene Retrieval Demo
156
+
157
+
This demo script allows users to process a custom scene and retrieve the closest match from the supported datasets using different modalities. Detailed usage can be found inside the script. Example usage below:
158
+
159
+
```bash
160
+
$ python demo/demo_scene_retrieval.py
161
+
```
162
+
163
+
Various configurable parameters:
164
+
165
+
-`--query_path`: Path to the query scene file (eg: `./example_data/dining_room/scene_cropped.ply`).
166
+
-`--database_path`: Path to the precomputed embeddings of the database scenes downloaded before (eg: `./release_data/embed_scannet.pt`).
167
+
-`--query_modality`: Modality of the query scene, Options: `point`, `rgb`, `floorplan`, `referral`
168
+
-`--database_modality`: Modality used for retrieval. Same options as above.
169
+
-`--ckpt`: Path to the pre-trained scene crossover model checkpoint (details [here](#checkpoints)), example_path: `./checkpoints/scene_crossover_scannet+scan3r.pth/`).
170
+
171
+
For embedding and pre-trained model download, refer to [generated embedding data](DATA.md#generated-embedding-data) and [checkpoints](#checkpoints) sections.
172
+
173
+
> [!TIP]
174
+
> We also provide scripts for inference on a single scan of the supported datasets. Details in **Single Inference** section in [TRAIN.md](TRAIN.md).
175
+
176
176
177
177
# :weight_lifting: Training and Inference
178
178
@@ -194,13 +194,13 @@ We provide all available checkpoints on huggingface 👉 [here](https://huggingf
194
194
|Instance CrossOver trained on 3RScan |[3RScan](https://huggingface.co/gradient-spaces/CrossOver/tree/main/instance_crossover_scan3r.pth)|
195
195
|Instance CrossOver trained on ScanNet |[ScanNet](https://huggingface.co/gradient-spaces/CrossOver/tree/main/instance_crossover_scannet.pth)|
196
196
|Instance CrossOver trained on ScanNet + 3RScan |[ScanNet+3RScan](https://huggingface.co/gradient-spaces/CrossOver/tree/main/instance_crossover_scannet%2Bscan3r.pth)|
The files mentioned in the above command - ```download_data.py``` and ```3dod_train_val_splits.csv``` can be found in the official repository [here](https://github.com/apple/ARKitScenes), along with more detailed instructions and descriptions of the data.
125
125
@@ -157,14 +157,14 @@ ARKitScenes/
157
157
```
158
158
159
159
#### MultiScan
160
-
1. Download MultiScan data into MultiScan/scenes and run the following to extract MultiScan data
160
+
1. Download MultiScan data into MultiScan/scenes and run the following to extract MultiScan data.
161
161
162
162
```bash
163
163
cd MultiScan/scenes
164
164
unzip '*.zip'
165
165
rm -rf '*.zip'
166
166
```
167
-
3. To generate sequence of RGB images and corresponding camera poses from the ```.mp4``` file, run the follwing
167
+
3. To generate sequence of RGB images and corresponding camera poses from the ```.mp4``` file, run the following:
0 commit comments