Commit 42c5d6f

rdhyee and claude committed
Update Cesium demos to use Cloudflare R2 URLs for Zenodo parquet files
- narrow_vs_wide_performance.qmd: Update to use full iSamples dataset on R2 (844MB narrow, 282MB wide - all 4 sources)
- parquet_cesium_isamples_wide.qmd: Update URL to January 2026 wide file which fixes issue #8 ([null] array bug)

R2 URLs:
- Narrow: https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202512_narrow.parquet
- Wide: https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
1 parent 4acceea commit 42c5d6f

2 files changed: +14 -11 lines changed

tutorials/narrow_vs_wide_performance.qmd

Lines changed: 11 additions & 8 deletions
@@ -16,12 +16,14 @@ This page benchmarks the performance difference between **narrow** and **wide**
 
 The iSamples property graph data can be serialized in two different parquet formats:
 
-| Format | Description | File Size | Row Count |
-|--------|-------------|-----------|-----------|
-| **Narrow** | Stores relationships as separate edge rows (`otype='_edge_'`) | 691 MB | ~11.6M rows |
-| **Wide** | Stores relationships as `p__*` columns on entity rows | 275 MB | ~2.5M rows |
+| Format | Description | File Size | Row Count | Sources |
+|--------|-------------|-----------|-----------|---------|
+| **Narrow** | Stores relationships as separate edge rows (`otype='_edge_'`) | 844 MB | ~106M rows | All 4 sources |
+| **Wide** | Stores relationships as `p__*` columns on entity rows | 282 MB | ~20M rows | All 4 sources |
 
-Both formats represent the **same underlying data** with identical semantics, but the wide format is optimized for analytical queries by eliminating edge rows.
+Both formats represent the **same underlying data** (SESAR, OpenContext, GEOME, Smithsonian) with identical semantics, but the wide format is optimized for analytical queries by eliminating edge rows.
+
+**Data source**: Cloudflare R2 (updated January 2026)
 
 ### Why Performance Matters

@@ -57,9 +59,10 @@ import { DuckDBClient } from "https://cdn.jsdelivr.net/npm/@observablehq/duckdb@
 
 ```{ojs}
 //| echo: false
-// Define parquet URLs
-narrowUrl = "https://storage.googleapis.com/opencontext-parquet/oc_isamples_pqg.parquet"
-wideUrl = "https://storage.googleapis.com/opencontext-parquet/oc_isamples_pqg_wide.parquet"
+// Define parquet URLs - iSamples full dataset on Cloudflare R2 (all sources)
+// Updated 2026-01-14: Using Zenodo narrow/wide files on Cloudflare R2
+narrowUrl = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202512_narrow.parquet"
+wideUrl = "https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet"
 ```
 
 ### Environment Info

tutorials/parquet_cesium_isamples_wide.qmd

Lines changed: 3 additions & 3 deletions
@@ -44,7 +44,7 @@ Cesium.Ion.defaultAccessToken = 'eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJqdGkiOi
 //| echo: false
 viewof parquet_path = Inputs.text({
 label:"Source (iSamples Wide Format)",
-value:"https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202512_wide.parquet",
+value:"https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet",
 placeholder: "URL or file:///path/to/file.parquet",
 width:"100%",
 submit:true
@@ -94,13 +94,13 @@ Download the file locally, then serve it:
 
 ```bash
 # Download the wide parquet file (~242MB)
-curl -O https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202512_wide.parquet
+curl -O https://pub-a18234d962364c22a50c787b7ca09fa5.r2.dev/isamples_202601_wide.parquet
 
 # Serve it locally
 python3 -m http.server 8000
 ```
 
-Then use: `http://localhost:8000/isamples_202512_wide.parquet`
+Then use: `http://localhost:8000/isamples_202601_wide.parquet`
 
 **Benefits of wide format file:**
 - 66% smaller than narrow format (242 MB vs 709 MB)
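
The size claims in this commit can be checked with a little arithmetic. The sketch below is illustrative only: `percentReduction` is a hypothetical helper, not part of the repository, and the inputs are the figures quoted in the updated table (844 MB and ~106M rows for narrow, 282 MB and ~20M rows for wide). The row counts are approximate, so the row percentage is only a rough figure.

```javascript
// Figures copied from the updated table in narrow_vs_wide_performance.qmd.
// Row counts are the approximate values quoted there (~106M and ~20M).
const formats = {
  narrow: { sizeMB: 844, rows: 106e6 },
  wide: { sizeMB: 282, rows: 20e6 },
};

// Hypothetical helper: percentage by which `smaller` undercuts `larger`,
// rounded to one decimal place.
function percentReduction(larger, smaller) {
  return Math.round((1 - smaller / larger) * 1000) / 10;
}

const sizeReduction = percentReduction(formats.narrow.sizeMB, formats.wide.sizeMB);
const rowReduction = percentReduction(formats.narrow.rows, formats.wide.rows);

console.log(`Wide file is ${sizeReduction}% smaller than narrow`);
console.log(`Wide file has ${rowReduction}% fewer rows than narrow`);
```

With the new figures the wide file comes out roughly 66.6% smaller. The unchanged context line above still quotes 242 MB vs 709 MB, which gives about 65.9%, so the "66% smaller" claim holds under either pair of numbers.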
