adbc-drivers · eric-wang-1990 · Mar 1, 2026 · Mar 2, 2026 · Mar 4, 2026 · Copilot
diff --git a/rust/spec/cloudfetch-pipeline-redesign.md b/rust/spec/cloudfetch-pipeline-redesign.md
diff --git a/rust/spec/sprint-plan-cloudfetch-redesign.md b/rust/spec/sprint-plan-cloudfetch-redesign.md
@@ -0,0 +1,283 @@
+# Sprint Plan: CloudFetch Pipeline Redesign (DashMap → Channel-Based)
+
+**Date:** 2026-02-28
+**Sprint Duration:** 2 weeks
+**JIRA Story:** [PECO-2926](https://databricks.atlassian.net/browse/PECO-2926)
+**Epic:** [PECO-2888](https://databricks.atlassian.net/browse/PECO-2888) — Databricks Driver Test Framework
+**Spec:** `rust/spec/cloudfetch-pipeline-redesign.md`
+
+---
+
+## Sprint Goal
+
+Replace the `DashMap<i64, ChunkEntry>` coordination structure in `StreamingCloudFetchProvider` with a two-channel pipeline (`download_channel` + `result_channel`) aligned with the C# reference implementation, eliminating lock discipline issues, large data in concurrent maps, and poll-based consumer synchronisation.
+
+---
+
+## Background
+
+The current implementation uses a single `DashMap` as the central coordination point for three unrelated concerns: presigned URL storage, download state machine, and result buffering (potentially 100 MB+). This causes:
+
+- Lock discipline burden (must not hold DashMap shard locks across `.await` points)
+- Unnatural indirection for URL refresh (two-`get_mut` dance)
+- Large Arrow data stored in a concurrent map with `Clone`-derivation risk
+- Poll-based consumer synchronisation via `Notify` loop
+
+The C# driver avoids all of these by flowing each chunk through two channels. This sprint implements the same pattern in Rust.
+
+---
+
+## Corrections from C# Reference Comparison
+
+Before implementation, the spec was audited against `csharp/src/Reader/CloudFetch/`. The following discrepancies were found and corrected in the spec:
+
+| Area | Original Spec | Corrected |
+|---|---|---|
+| `fetch_links()` return type | Implied single link per call | Returns `Vec<CloudFetchLink>` (batch) — scheduler must iterate |
+| Retry backoff | Said "exponential backoff is a follow-up" | **Linear**: `retry_delay * (attempt + 1)` matching C# `Task.Delay(_retryDelayMs * (retry + 1))` |
+| `url_expiration_buffer_secs` | Hardcoded constant `LINK_EXPIRY_BUFFER_SECS = 30` | Configurable field, default **60s** (matching C# `UrlExpirationBufferSeconds = 60`) |
+| `max_refresh_retries` | Referenced in spec but not defined | New `CloudFetchConfig` field, default **3** (matching C# `MaxUrlRefreshAttempts = 3`) |
+| `num_download_workers` | "N workers" with no source for N | New `CloudFetchConfig` field, default **3** (matching C# `ParallelDownloads = 3`) |
+| `max_retries` default | 5 | **3** (matching C# `MaxRetries = 3`) |
+| `retry_delay` default | 1500ms | **500ms** (matching C# `RetryDelayMs = 500`) |
+| `chunk_ready_timeout` | Not mentioned for removal | **Removed** — only served the `wait_for_chunk` Notify loop (deleted); C# has no equivalent wait-layer timeout |
+
+---
+
+## Architecture Overview
+
+```
+ChunkLinkFetcher → Scheduler → download_channel → Worker 1
+                            ↘                  → Worker 2
+                              result_channel      Worker N
+                                    ↓               ↓ (oneshot)
+                               Consumer       result_channel
+```
+
+**Two channels replace the DashMap:**
+- `download_channel` — unbounded; scheduler pushes `ChunkDownloadTask`, workers pull
+- `result_channel` — bounded to `max_chunks_in_memory`; scheduler pushes `ChunkHandle` in chunk-index order; consumer reads sequentially
+
+**Key new types:**
+
+```rust
+struct ChunkDownloadTask {
+    chunk_index: i64,
+    link: CloudFetchLink,
+    result_tx: oneshot::Sender<Result<Vec<RecordBatch>>>,
+}
+
+struct ChunkHandle {
+    chunk_index: i64,
+    result_rx: oneshot::Receiver<Result<Vec<RecordBatch>>>,
+}
+```
+
+---
+
+## Sub-Tasks
+
+### PECO-2927 — Update `CloudFetchConfig` and remove legacy types ✅ COMPLETED
+
+**Scope:** `rust/src/types/cloudfetch.rs`
+
+**Status:** Completed. All changes implemented and tests passing.
+
+**Changes:**
+- Add 3 new fields to `CloudFetchConfig`:
+  - `max_refresh_retries: u32` — default `3`
+  - `num_download_workers: usize` — default `3`
+  - `url_expiration_buffer_secs: u32` — default `60`
+- Remove `chunk_ready_timeout: Option<Duration>`
+- Correct defaults: `max_retries` 5→3, `retry_delay` 1500ms→500ms
+- Remove `LINK_EXPIRY_BUFFER_SECS` constant (replaced by `url_expiration_buffer_secs`)
+- Remove `ChunkEntry` and `ChunkState` types from public API (legacy copies kept as private types within `streaming_provider.rs` until it is rewritten)
+- Update `CloudFetchLink::is_expired()` to accept a `buffer_secs: u32` parameter instead of using the hardcoded constant
+- Update all tests in this file
+- Update `database.rs` option setters: removed `chunk_ready_timeout_ms`, added `max_refresh_retries`, `num_download_workers`, `url_expiration_buffer_secs`
+
+**Implementation decisions:**
+- `ChunkEntry`/`ChunkState` were not fully deleted from the codebase—they were moved to be private types within `streaming_provider.rs` since that module still uses the DashMap-based architecture. They will be fully removed when `streaming_provider.rs` is rewritten in a later task.
+- `DEFAULT_CHUNK_READY_TIMEOUT_SECS` was replaced by a module-local `DEFAULT_CHUNK_WAIT_TIMEOUT_SECS` constant in `streaming_provider.rs` (30s) to preserve the existing timeout behavior until the rewrite.
+
+---
+
+### PECO-2928 — Implement `ChunkDownloadTask`, `ChunkHandle`, and Scheduler
+
+**Scope:** New types + new scheduler task in `rust/src/reader/cloudfetch/`
+
+**Changes (Types):** ✅ COMPLETED (as part of Task 1)
+- Add `ChunkDownloadTask` struct with `chunk_index`, `link`, `result_tx` (oneshot sender)
+- Add `ChunkHandle` struct with `chunk_index`, `result_rx` (oneshot receiver)
+- Add `create_chunk_pair()` helper function for creating connected task/handle pairs
+- Types defined in new module: `rust/src/reader/cloudfetch/pipeline_types.rs`
+- Types exported from `mod.rs`
+
+**Changes (Scheduler):**
+- Implemented scheduler as a `tokio::spawn` task in `rust/src/reader/cloudfetch/scheduler.rs`:
+  - `spawn_scheduler()` function creates channels and spawns scheduler task
+  - `scheduler_task()` calls `fetch_links()` which returns `Vec<CloudFetchLink>` (batch)
+  - For each link in the batch: creates `oneshot::channel()` via `create_chunk_pair()`, sends `ChunkHandle` to `result_channel`, then sends `ChunkDownloadTask` to `download_channel`
+  - `download_channel`: unbounded mpsc for `ChunkDownloadTask`
+  - `result_channel`: bounded mpsc (capacity = `max_chunks_in_memory`) for `ChunkHandle`
+  - Bounded `result_channel` provides automatic backpressure (blocks scheduler when `max_chunks_in_memory` reached)
+  - Exits when `has_more = false`
+  - Supports cancellation via `CancellationToken`
+- Returns `SchedulerChannels` struct containing both channel receivers
+
+**Key invariant:** `ChunkHandle` is enqueued to `result_channel` _before_ the corresponding `ChunkDownloadTask` is dispatched to `download_channel`, preserving sequential ordering even when downloads complete out of order.
+
+**Tests implemented:**
+- `scheduler_sends_handles_in_chunk_index_order` — verify handles received in sequence
+- `scheduler_processes_batch_links` — verify all tasks/handles enqueued from batch
+- `backpressure_blocks_scheduler_at_capacity` — verify bounded channel blocks scheduler
+- `scheduler_exits_when_has_more_false` — verify clean exit
+- `scheduler_cancellation` — verify early termination on cancel
+- `scheduler_handle_before_task` — verify ordering invariant
+- `scheduler_multiple_batches` — verify multiple fetch_links calls
+- `scheduler_empty_batch` — verify empty batch handling
+
+---
+
+### PECO-2929 — Implement Download Workers
+
+**Scope:** `rust/src/reader/cloudfetch/` (replaces `download_chunk_with_retry`)
+
+**Changes:**
+- Spawn `config.num_download_workers` long-lived `tokio::spawn` tasks
+- Each worker loops over `download_channel`:
+  1. **Proactive expiry check** using `url_expiration_buffer_secs` buffer — if link is expired or expiring soon, call `refetch_link()` before first HTTP request
+  2. `GET presigned_url`
+  3. On success: parse Arrow IPC → `result_tx.send(Ok(batches))`
+  4. On 401/403/404: call `refetch_link()`, retry immediately (no sleep), count against `max_refresh_retries`
+  5. On other errors: `sleep(retry_delay * (attempt + 1))` (linear backoff), retry, count against `max_retries`
+  6. On `max_retries` exceeded: `result_tx.send(Err(...))`
+- All sleeps use `tokio::select!` on `cancel_token.cancelled()` for cancellation
+
+**Retry contract:**
+
+| Error type | Sleep before retry | Counts against `max_retries` | Counts against `max_refresh_retries` |
+|---|---|---|---|
+| Network / 5xx | Yes — `retry_delay * (attempt + 1)` | Yes | No |
+| 401 / 403 / 404 | No | Yes | Yes |
+| Link proactively expired | No | No | Yes |
+
+---
+
+### PECO-2930 — Implement Consumer (`next_batch`)
+
+**Scope:** `rust/src/reader/cloudfetch/streaming_provider.rs`
+
+**Changes:**
+- Replace `wait_for_chunk` + Notify poll loop with:
+  ```rust
+  let handle = result_rx.recv().await?;  // get next ChunkHandle in order
+  let batches = handle.result_rx.await?;  // await the oneshot directly
+  ```
+- `Ok(None)` when `result_rx` returns `None` (channel closed = end of stream)
+- Consumer selects on `cancel_token.cancelled()` when awaiting `result_rx`
+
+---
+
+### PECO-2931 — Refactor `StreamingCloudFetchProvider` struct
+
+**Scope:** `rust/src/reader/cloudfetch/streaming_provider.rs`
+
+**Remove these fields:**
+
+| Field | Replaced by |
+|---|---|
+| `chunks: Arc<DashMap<i64, ChunkEntry>>` | `download_channel` + `result_channel` |
+| `chunks_in_memory: AtomicUsize` | Bounded `result_channel` capacity |
+| `max_chunks_in_memory: usize` | `mpsc::channel(max_chunks_in_memory)` bound |
+| `chunk_state_changed: Arc<Notify>` | `oneshot` receivers per chunk |
+| `next_download_index: AtomicI64` | Sequential counter owned by scheduler task |
+| `current_chunk_index: AtomicI64` | Implicit in sequential `result_rx.recv()` calls |
+| `end_of_stream: AtomicBool` | Implicit when `result_rx` returns `None` |
+
+**New struct:**
+```rust
+pub struct StreamingCloudFetchProvider {
+    result_rx: Mutex<mpsc::Receiver<ChunkHandle>>,
+    schema: OnceLock<SchemaRef>,
+    batch_buffer: Mutex<VecDeque<RecordBatch>>,
+    cancel_token: CancellationToken,
+}
+```
+
+---
+
+### PECO-2932 — Unit Tests (9 tests)
+
+| Test | What it verifies |
+|---|---|
+| `scheduler_sends_handles_in_chunk_index_order` | `result_channel` receives handles in sequence |
+| `scheduler_processes_batch_links` | `fetch_links()` returns 3 links; all 3 tasks enqueued in order |
+| `worker_retries_on_transient_error` | Mock downloader fails N times then succeeds |
+| `worker_uses_linear_backoff` | Sleep duration is `retry_delay * (attempt + 1)`, not constant or exponential |
+| `worker_refetches_url_on_401_403_404` | Auth error triggers `refetch_link` inline |
+| `worker_no_sleep_on_auth_error` | Auth error does not sleep before refetch |
+| `worker_gives_up_after_max_refresh_retries` | Terminal error propagated via `result_tx` |
+| `worker_proactively_refreshes_expiring_url` | Link expiring within buffer triggers `refetch_link` before first HTTP request |
+| `backpressure_blocks_scheduler_at_capacity` | Full `result_channel` blocks scheduler, not consumer |
+
+---
+
+### PECO-2933 — Integration Tests (3 tests)
+
+| Test | What it verifies |
+|---|---|
+| `end_to_end_sequential_consumption` | All chunks downloaded and read in order |
+| `end_to_end_cancellation_mid_stream` | Cancel during active download — no deadlock or panic |
+| `end_to_end_401_recovery` | Presigned URL expires mid-stream; driver refetches and continues |
+
+---
+
+## Concurrency Model
+
+| Component | Primitive | Reason |
+|---|---|---|
+| Scheduler | Single `tokio::spawn` | Sequential chunk ordering required |
+| Download workers | N `tokio::spawn` sharing `download_channel` | Parallel downloads |
+| Consumer | Caller's task (no spawn) | Sequential result consumption |
+| `result_channel` | Bounded `mpsc` (capacity = `max_chunks_in_memory`) | Backpressure without manual counter |
+| URL refresh in worker | Local variable mutation | No shared state, no lock needed |
+
+---
+
+## Error Handling
+
+| Scenario | Behaviour |
+|---|---|
+| Download fails after `max_retries` | `result_tx.send(Err(...))` — consumer sees error on `result_rx.await` |
+| URL refresh fails after `max_refresh_retries` | Same as above |
+| `refetch_link()` itself returns error | Propagated via `result_tx.send(Err(...))` |
+| Cancellation during sleep | `tokio::select!` on `cancel_token.cancelled()` interrupts sleep |
+| Cancellation during `result_rx.await` | Consumer selects on `cancel_token.cancelled()` |
+| Scheduler exits (all chunks sent) | `download_channel` sender drops → workers drain then exit |
+| All workers exit | `oneshot` senders drop → `result_rx.await` returns `Err(RecvError)` |
+
+---
+
+## Files to Modify
+
+| File | Change |
+|---|---|
+| `rust/src/types/cloudfetch.rs` | Update `CloudFetchConfig`, remove `ChunkEntry`/`ChunkState` |
+| `rust/src/reader/cloudfetch/streaming_provider.rs` | Full rewrite of struct + pipeline |
+| `rust/src/reader/cloudfetch/chunk_downloader.rs` | Replace with worker loop |
+| `rust/src/reader/cloudfetch/link_fetcher.rs` | No changes (internal DashMap cache retained) |
+
+---
+
+## Definition of Done
+
+- [ ] `cargo build` passes with no warnings
+- [ ] `cargo clippy -- -D warnings` passes
+- [ ] All 9 unit tests pass
+- [ ] All 3 integration tests pass
+- [ ] `cargo fmt` applied
+- [ ] `DashMap` dependency removed from `streaming_provider.rs`
+- [ ] `ChunkEntry`, `ChunkState` types deleted
+- [ ] `chunk_ready_timeout` config field removed
+- [ ] `LINK_EXPIRY_BUFFER_SECS` constant removed
diff --git a/rust/src/database.rs b/rust/src/database.rs
@@ -222,10 +222,25 @@ impl Optionable for Database {
                         Err(DatabricksErrorHelper::set_invalid_option(&key, &value).to_adbc())
                     }
                 }
-                "databricks.cloudfetch.chunk_ready_timeout_ms" => {
+                "databricks.cloudfetch.max_refresh_retries" => {
                     if let Some(v) = Self::parse_int_option(&value) {
-                        self.cloudfetch_config.chunk_ready_timeout =
-                            Some(Duration::from_millis(v as u64));
+                        self.cloudfetch_config.max_refresh_retries = v as u32;
+                        Ok(())
-                        self.cloudfetch_config.max_refresh_retries = v as u32;
-                        Ok(())
+                        if v >= 0 && v <= u32::MAX as i64 {
+                            self.cloudfetch_config.max_refresh_retries = v as u32;
+                            Ok(())
+                        } else {
+                            Err(DatabricksErrorHelper::set_invalid_option(&key, &value).to_adbc())
+                        }
-                        self.cloudfetch_config.max_refresh_retries = v as u32;
-                        Ok(())
+                        if v >= 0 && v <= u32::MAX as i64 {
+                            self.cloudfetch_config.max_refresh_retries = v as u32;
+                            Ok(())
+                        } else {
+                            Err(DatabricksErrorHelper::set_invalid_option(&key, &value).to_adbc())
+                        }
+                    } else {
+                        Err(DatabricksErrorHelper::set_invalid_option(&key, &value).to_adbc())
+                    }
+                }
+                "databricks.cloudfetch.num_download_workers" => {
+                    if let Some(v) = Self::parse_int_option(&value) {
+                        self.cloudfetch_config.num_download_workers = v as usize;
+                        Ok(())
+                    } else {
+                        Err(DatabricksErrorHelper::set_invalid_option(&key, &value).to_adbc())
+                    }
+                }
+                "databricks.cloudfetch.url_expiration_buffer_secs" => {
+                    if let Some(v) = Self::parse_int_option(&value) {
+                        self.cloudfetch_config.url_expiration_buffer_secs = v as u32;
                         Ok(())
                     } else {
                         Err(DatabricksErrorHelper::set_invalid_option(&key, &value).to_adbc())

diff --git a/rust/src/reader/cloudfetch/mod.rs b/rust/src/reader/cloudfetch/mod.rs
@@ -23,9 +23,11 @@
 pub mod arrow_parser;
 pub mod chunk_downloader;
 pub mod link_fetcher;
+pub mod pipeline_types;
 pub mod streaming_provider;
 
 pub use arrow_parser::parse_arrow_ipc;
 pub use chunk_downloader::ChunkDownloader;
 pub use link_fetcher::{ChunkLinkFetcher, SeaChunkLinkFetcher};
+pub use pipeline_types::{create_chunk_pair, ChunkDownloadTask, ChunkHandle};
 pub use streaming_provider::StreamingCloudFetchProvider;