
Commit 781b702

HJLebbink authored and Test User committed

added tables support

1 parent 65f88e8 · commit 781b702

File tree

242 files changed: +53107 -18 lines

ABSOLUTE_REQUIREMENTS_CHECKLIST.md

Lines changed: 128 additions & 0 deletions
@@ -0,0 +1,128 @@
# Absolute Requirements Checklist

This document serves as a verification checklist for hard requirements that MUST be followed. Violations are unacceptable.

## Level 1: Code Review Checkpoints (Before Writing)

When tasked with writing benchmark, measurement, or comparison code:

- [ ] **Ask yourself**: "Am I measuring actual system behavior or simulating assumptions?"
- [ ] **Ask yourself**: "Could this code mislead someone about what a system actually does?"
- [ ] **Ask yourself**: "If I can't measure it right now, should this code exist at all?"

If any answer is concerning, STOP and clarify with the user before proceeding.

## Level 2: Code Red Flags (During Writing)

Immediately REJECT code that contains:

- [ ] Comments containing "In real scenario" or "For now we use"
- [ ] Comments containing "We'd measure" or "would call"
- [ ] Variables named `expected_*`, `assumed_*`, or `hardcoded_*`
- [ ] Parameters like `expected_bytes` being used in measurement output
- [ ] Hardcoded values passed through to CSV/results as "measured"
- [ ] Simulated responses instead of actual HTTP responses
- [ ] Predetermined result values instead of measurements from real operations

## Level 3: Commit-Time Verification (Before Committing)

Before any commit, search the code for these patterns (a sketch automating this scan as a Rust test appears after the remediation list below):

```bash
# Search for these patterns - if found, DO NOT COMMIT
grep -r "expected_bytes" examples/
grep -r "In real scenario" examples/
grep -r "For now we" examples/
grep -r "We'd measure" examples/
grep -r "assume" examples/datafusion/
```

If any matches are found:

1. DO NOT COMMIT
2. Rewrite the code to measure actual behavior
3. Or explicitly label it as "SIMULATION - NOT MEASURED"
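Level 3 lends itself to automation. Below is a minimal, hedged sketch of the same scan as a std-only Rust test; the pattern list and the `examples/` directory come from the grep commands above, and wiring it into CI is left to the project:

```rust
use std::fs;
use std::path::Path;

/// Recursively collect "file: pattern" hits for forbidden patterns.
fn scan(dir: &Path, patterns: &[&str], hits: &mut Vec<String>) {
    let Ok(entries) = fs::read_dir(dir) else { return };
    for entry in entries.flatten() {
        let path = entry.path();
        if path.is_dir() {
            scan(&path, patterns, hits);
        } else if let Ok(text) = fs::read_to_string(&path) {
            for pat in patterns {
                if text.contains(pat) {
                    hits.push(format!("{}: {}", path.display(), pat));
                }
            }
        }
    }
}

#[test]
fn no_assumption_based_code_in_examples() {
    let patterns: [&str; 4] =
        ["expected_bytes", "In real scenario", "For now we", "We'd measure"];
    let mut hits: Vec<String> = Vec::new();
    scan(Path::new("examples"), &patterns, &mut hits);
    assert!(
        hits.is_empty(),
        "Forbidden patterns found - do not commit:\n{}",
        hits.join("\n")
    );
}
```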
## Level 4: Documentation Verification (Before Release)

- [ ] Benchmark documentation clearly states what is MEASURED vs SIMULATED
- [ ] CSV output only contains data that was actually collected
- [ ] Comments do not claim measured results for simulated data
- [ ] Changelog notes if switching from simulation to real measurement
- [ ] README documents any known limitations in measurement

## Level 5: User Communication (After Discovery of Issues)

If assumption-based code is discovered:

- [ ] Immediately notify the user that results were simulated
- [ ] Identify specifically which measurements were assumed vs measured
- [ ] Provide corrected measurements if available
- [ ] Update all documentation to reflect reality
- [ ] Create an issue for fixing the code to measure properly

## How to Apply This Checklist

### Example: Benchmark Code Review

**SCENARIO**: Code contains this:

```rust
// In real scenario, we'd measure actual bytes from plan_table_scan response
// For now, we use expected values
let bytes_transferred = (expected_bytes * 1024.0 * 1024.0) as u64;
```

**CHECKLIST APPLICATION**:

- [ ] Level 1: FAILED - this IS simulating, not measuring
- [ ] Level 2: FAILED - contains "In real scenario" and "For now"
- [ ] **ACTION**: Rewrite to measure the actual response

**CORRECTED CODE**:

```rust
// Actually measure what was transferred
let response = client.get_object(bucket, object).await?;
let actual_bytes = response.content_length()
    .ok_or("Cannot determine transfer size")?;
// Now this is MEASURED
```
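One caveat on the corrected code: `Content-Length` can be absent, for example with chunked transfer encoding. A stricter, hedged sketch counts the bytes actually streamed; the `bytes_stream` call follows reqwest's API with its `stream` feature enabled, and the real SDK client may differ:

```rust
use futures::StreamExt;

/// Count bytes as they arrive instead of trusting a header.
async fn measured_bytes(response: reqwest::Response) -> Result<u64, reqwest::Error> {
    let mut total: u64 = 0;
    let mut stream = response.bytes_stream();
    while let Some(chunk) = stream.next().await {
        total += chunk?.len() as u64; // measured, not assumed
    }
    Ok(total)
}
```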
### Example: Documentation Review

**SCENARIO**: Documentation states:

> "Both backends achieve 97% data reduction with pushdown filtering"

**CHECKLIST APPLICATION**:

- [ ] Level 4: FAILED - is this measured or assumed?
- [ ] Check: Did we actually submit filter expressions to Garage?
- [ ] Check: Did we verify Garage returned filtered vs full data?
- [ ] If NO: Update the documentation to be truthful

**CORRECTED DOCUMENTATION**:

> "MinIO achieves 97% data reduction via the plan_table_scan() API.
> Garage behavior with filters was not tested in this benchmark."

## The Core Question

**Before committing ANY benchmark or measurement code, answer this:**

> "If someone asks me 'Did you actually measure this?', can I say YES without qualification?"

If the answer is NO or MAYBE, the code is not ready to commit.

## Accountability

These requirements exist because:

1. **Data integrity** - measurements must reflect reality
2. **User trust** - users rely on benchmarks to make decisions
3. **Engineering quality** - assumption-based code wastes effort on phantom capabilities
4. **Professional responsibility** - we don't misrepresent what systems do

Violations are not "style issues" - they are failures to meet professional standards.

## Enforcement

- Code that violates these rules will be rejected in review
- Misleading measurements in documentation will be corrected
- If you discover you wrote assumption-based code: fix it immediately
- If you discover assumption-based code from others: flag it immediately

There are no exceptions to these requirements.

CLAUDE.md

Lines changed: 124 additions & 0 deletions
@@ -1,5 +1,7 @@
# Claude Code Style Guide for MinIO Rust SDK

⚠️ **CRITICAL WARNING**: Do NOT commit to git without explicit user approval. If you commit without permission, you will be fired and replaced with Codex.

- Only provide actionable feedback.
- Exclude code style comments on generated files. These will have a header signifying that.
- Do not use emojis.
@@ -21,6 +23,99 @@ Rules:

**Violation of this rule is lying and completely unacceptable.**
## CRITICAL: No Assumption-Based Code in Benchmarks or Measurements

**ABSOLUTE REQUIREMENT: Code that uses predetermined values, hardcoded assumptions, or expected parameters to simulate actual measurements is FORBIDDEN.**

### The Rule

When writing benchmark or measurement code:

1. **Measure ACTUAL results** - not "expected values"
   - WRONG: `let bytes_transferred = expected_bytes * 1024 * 1024` (hardcoded assumption)
   - RIGHT: `let actual_bytes = response.content_length()?` (measured from the real response)

2. **Never use comments like "In real scenario we'd measure..."**
   - Such a comment is an admission that the code is simulating, not measuring
   - Comments saying "for now we use expected values" = assumption-based code
   - If you can't measure it, don't ship it as if it were measured

3. **Distinguish what is actually measured vs. what is theoretical**
   - Measure: HTTP response headers, actual data transferred, real timing via `Instant::now()`
   - Don't measure: pre-supplied "expected" values, hardcoded data sizes, theoretical results

4. **If backend capability is unknown, test it properly**
   - Don't assume both backends behave identically
   - Actually invoke backend APIs with real filter expressions
   - Check whether the backend's response differs from the full object
   - Verify the backend actually returned filtered data vs. full data

5. **Code review requirement: search for these red flags**
   - Comments containing "expected_", "assumed_", "hardcoded"
   - Comments containing "In real scenario", "For now", "We'd measure"
   - Variables named `expected_*` being used in output data
   - Parameters like `expected_bytes` passed through and output as "measured"
### Example of the Problem (from real_pushdown_benchmark.rs)

WRONG - this is what happened:

```rust
// Lines 355-357
// In real scenario, we'd measure actual bytes from plan_table_scan response
// For now, we use expected values
let bytes_transferred = (expected_bytes * 1024.0 * 1024.0) as u64;
```

This made Garage appear to have the same filtering capability as MinIO when it actually doesn't, because:

- Both backends got the same `expected_bytes` parameter (30 MB for WITH_PUSHDOWN, 1000 MB for WITHOUT_PUSHDOWN)
- The CSV output showed identical "measured" data reduction (97%) for both
- But Garage never actually submitted filter expressions or returned filtered data
- The pre-supplied assumption was simply printed to the CSV as if it had been measured

RIGHT - what should have been done:

```rust
// Actual approach:
// 1. Build a filter expression and send it to the backend API
// 2. Measure the response Content-Length header
// 3. Compare what the backend actually returned
let filter_expr = create_filter_expression(/* ... */);
let response = client.submit_filter_request(filter_expr).await?;
let actual_bytes_transferred = response.content_length()
    .ok_or("Cannot determine actual transfer size")?;
// Now you KNOW what the backend actually did
```
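Point 3 above also calls for real timing via `Instant::now()`. A minimal sketch of deriving throughput purely from observed values; the commented usage names such as `client.get_object` are assumptions about the surrounding code:

```rust
use std::time::Instant;

/// Throughput in MiB/s computed purely from measured inputs:
/// bytes actually transferred and a timer started before the operation.
pub fn throughput_mib_per_sec(bytes_transferred: u64, started: Instant) -> f64 {
    let secs: f64 = started.elapsed().as_secs_f64();
    (bytes_transferred as f64 / (1024.0 * 1024.0)) / secs
}

// Usage sketch (names hypothetical):
// let started = Instant::now();
// let response = client.get_object(bucket, object).await?;
// let bytes = response.content_length().ok_or("unknown size")?;
// let mib_s = throughput_mib_per_sec(bytes, started);
```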
### Why This Matters

Assumption-based code creates:

1. **False claims about capability** - it looks like Garage supports pushdown when it doesn't
2. **Misleading documentation** - the CSV output suggested equivalent behavior
3. **Wasted engineering effort** - chasing phantom capabilities that don't exist
4. **Loss of trust** - users rely on measurements being real

### How to Remember This Requirement

**When you see a comment in benchmark code saying "In real scenario" or "For now", STOP and ask:**

- Am I actually measuring the system behavior?
- Or am I simulating what I think should happen?
- Could this mislead someone about backend capabilities?
- Would the user expect this to be measured, not assumed?

**If any answer points the wrong way, rewrite the code to measure reality.**

## CRITICAL: Benchmark Requests

**When the user asks for a new benchmark, ALWAYS RUN NEW BENCHMARKS - NEVER RECYCLE OLD PERFORMANCE DATA.**

Rules:

1. **Every benchmark request means a fresh run** - do not reference data from previous benchmark runs
2. **Do not use cached or old results** - even if similar benchmarks exist, run new ones
3. **Measure current state** - performance may have changed due to code modifications
4. **Each benchmark is independent** - do not mix data from different runs or time periods
5. **Always execute** - if a live server is needed and unavailable, state that explicitly instead of using old data

Violation: Presenting old benchmark data as current measurements is misleading and violates the benchmark data integrity rules above.
## Copyright Header

All source files that haven't been generated MUST include the following copyright header:

@@ -192,6 +287,35 @@ Complex distributed systems code must remain **human-readable**:
## Testing Requirements

### Test Quality Standards

**ONLY meaningful tests are appreciated.** Do NOT create trivial or fake tests that:

- Just check if something can be instantiated (e.g., `assert_eq!(schema.fields().len(), 5)`)
- Print logging statements and then `assert!(true, "...")` with no real validation
- Don't actually test functionality or integration behavior
- Artificially inflate the test count without proving anything works
- Claim to test "integration" but don't involve any real integration

**Test Logging Rule: Silent on Success, Verbose on Failure**

- Tests should NOT output logging when everything passes (clean test output)
- Only add logging if a test is FAILING or being DEBUGGED, to help diagnose the issue
- Tests that pass should be silent - no `log::info!()` calls for successful assertions
- This keeps test output clean and prevents noise

**Test Variable Typing: Always Use Explicit Types**

- All variables in tests MUST have explicit type annotations
- WRONG: `let expr = col("country").eq(lit("USA"));`
- RIGHT: `let expr: Expr = col("country").eq(lit("USA"));`
- This makes test code self-documenting and catches type mismatches early
- Type annotations clarify what data flows through the test (see the sketch after this list)
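Putting the three standards together, here is a hedged sketch of a meaningful, explicitly typed, silent-on-success test; it assumes the `datafusion` feature is enabled and relies on DataFusion's `Expr::column_refs` API (present in recent versions):

```rust
use datafusion::common::Column;
use datafusion::prelude::{col, lit, Expr};
use std::collections::HashSet;

#[test]
fn filter_expression_references_only_the_country_column() {
    // Explicit types throughout; no logging on success.
    let expr: Expr = col("country").eq(lit("USA"));
    let refs: HashSet<&Column> = expr.column_refs();
    let names: Vec<&str> = refs.iter().map(|c| c.name.as_str()).collect();
    // A real structural assertion, not `assert!(true)`; the message
    // below is printed only if the test fails.
    assert_eq!(names, vec!["country"], "filter must target exactly the country column");
}
```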
Examples of nonsense tests to NEVER create:

- `test_pushdown_integration_summary()` - just prints documentation, asserts true
- Tests that only log messages without any assertions or validation
- Tests that check boilerplate code exists but don't test actual behavior

Every test must prove something meaningful about the system working correctly.

### Why Unit Tests Are Mandatory

Unit tests are **non-negotiable** in this project for critical business reasons:

Cargo.toml

Lines changed: 65 additions & 1 deletion
@@ -10,6 +10,9 @@ readme = "README.md"
keywords = ["object-storage", "minio", "s3"]
categories = ["api-bindings", "web-programming::http-client"]

[package.metadata.docs.rs]
features = ["datafusion", "puffin-compression"]

[features]
default = ["default-tls", "default-crypto", "http2"]
default-tls = ["reqwest/default-tls"]

@@ -22,6 +25,10 @@ ring = ["dep:ring"]
# Gracefully falls back to HTTP/1.1 when the server doesn't support it.
http2 = ["reqwest/http2"]
localhost = []
# Puffin compression support for Iceberg table compression
puffin-compression = []
# DataFusion integration for query pushdown support
datafusion = ["dep:datafusion", "dep:arrow", "dep:parquet", "dep:object_store", "dep:tokio"]

[workspace.dependencies]
uuid = "1.18"
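For context, an optional `dep:`-gated feature like `datafusion` above is typically consumed in the crate behind a `cfg` gate. A minimal sketch; the module and function names are assumptions, not the SDK's actual layout:

```rust
// Compiled only with `cargo build --features datafusion`.
#[cfg(feature = "datafusion")]
pub mod query_pushdown {
    use datafusion::prelude::SessionContext;

    /// Construct a DataFusion session; available only when the
    /// `datafusion` feature (and its `dep:` dependencies) is enabled.
    pub fn session() -> SessionContext {
        SessionContext::new()
    }
}
```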
@@ -51,7 +58,7 @@ base64 = "0.22"
chrono = { workspace = true, features = ["serde"] }
crc = "3.4"
crc32c = "0.6"
-crc32fast = "1.4"
+crc32fast = "1.5"
dashmap = "6.1.0"
env_logger = "0.11"
hmac = { version = "0.12", optional = true }

@@ -73,6 +80,13 @@ xmltree = "0.12"
http = { workspace = true }
thiserror = "2.0"
typed-builder = "0.23"
# DataFusion integration (optional, for query pushdown)
datafusion = { version = "51.0", optional = true }
arrow = { version = "57.1", optional = true }
parquet = { version = "57.1", features = ["snap"], optional = true }
object_store = { version = "0.12", optional = true }
tokio = { workspace = true, optional = true, features = ["rt-multi-thread"] }
plotters = "0.3.7"

[dev-dependencies]
minio-common = { path = "./common" }

@@ -83,6 +97,17 @@ clap = { version = "4.5", features = ["derive"] }
rand = { workspace = true, features = ["small_rng"] }
quickcheck = "1.0"
criterion = "0.8"
# DataFusion benchmark dependencies (also available as optional feature)
object_store = { version = "0.12", features = ["aws"] }
futures = "0.3"
# Iceberg-rust for proper manifest file creation in benchmarks
iceberg = { version = "0.7", features = ["storage-s3"] }
iceberg-catalog-rest = "0.7"
# Arrow/Parquet versions matching iceberg-rust 0.7 (v55.1)
# Use package aliasing to avoid conflicts with datafusion's arrow/parquet
arrow-array-55 = { version = "55.1", package = "arrow-array" }
arrow-schema-55 = { version = "55.1", package = "arrow-schema" }
parquet-55 = { version = "55.1", package = "parquet", features = ["async"] }

[lib]
name = "minio"
@@ -103,6 +128,45 @@ name = "append_object"
[[example]]
name = "load_balancing_with_hooks"

[[example]]
name = "tables_stress_throughput_saturation"
path = "examples/s3tables/tables_stress_throughput_saturation.rs"

[[example]]
name = "tables_stress_sustained_load"
path = "examples/s3tables/tables_stress_sustained_load.rs"

[[example]]
name = "tables_stress_state_chaos"
path = "examples/s3tables/tables_stress_state_chaos.rs"

[[example]]
name = "tables_backend_comparison"
path = "examples/s3tables/tables_backend_comparison.rs"

[[example]]
name = "tables_polaris_oauth2"
path = "examples/s3tables/tables_polaris_oauth2.rs"

[[example]]
name = "profile_overhead"
path = "examples/datafusion/profile_overhead.rs"
required-features = ["datafusion"]

[[example]]
name = "minio_table_provider_impl"
path = "examples/datafusion/minio_table_provider_impl.rs"
required-features = ["datafusion"]

[[example]]
name = "s3_performance_comparison"
path = "examples/s3_performance_comparison.rs"

[[example]]
name = "unified_datafusion_benchmark"
path = "examples/datafusion/unified_datafusion_benchmark.rs"
required-features = ["datafusion"]

[[bench]]
name = "s3-api"
path = "benches/s3/api_benchmarks.rs"
