Skip to content

Commit 1febd81

Browse files
committed
Fixes to the download doc
1 parent dbebdcf commit 1febd81

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

docs/4. repository/2. download-dataset.mdx

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ The export format has undergone a redesign to make it more efficient and easier
2121
- **File metadata** (checksums, sizes, timestamps) is provided directly by the Google Cloud Storage API.
2222
- Files use **zstd compression** built into the Parquet format.
2323

24-
The dataset is available at [export.sourcify.dev](https://export.sourcify.dev/). All files of the v2 format are stored under the `v2/` prefix.
24+
The dataset is available at [export.sourcify.dev/?prefix=v2/](https://export.sourcify.dev/?prefix=v2/). All files of the v2 format are stored under the `v2/` prefix.
2525

2626
### Downloading and Syncing the Dataset
2727

@@ -36,7 +36,7 @@ curl -s 'https://export.sourcify.dev/?prefix=v2/' | \
3636
Alternatively, the [AWS CLI](https://docs.aws.amazon.com/cli/latest/userguide/getting-started-install.html#getting-started-install-instructions) makes it easy to download and keep the dataset in sync. The following command downloads the entire dataset on the first run, and on subsequent runs only downloads new or modified files:
3737

3838
```bash
39-
aws s3 sync s3://sourcify-parquet-export/v2/ ./sourcify-dataset --endpoint-url https://storage.googleapis.com --no-sign-request
39+
aws s3 sync s3://sourcify-production-parquet-export/v2/ ./sourcify-dataset --endpoint-url https://storage.googleapis.com --no-sign-request
4040
```
4141

4242
### Note on `sourcify_matches`

0 commit comments

Comments
 (0)