Skip to content

Conversation

zhyass
Copy link
Member

@zhyass zhyass commented May 8, 2025

I hereby agree to the terms of the CLA available at: https://docs.databend.com/dev/policies/cla/

Summary

This PR enhances Hilbert clustering in Databend by introducing block streaming write support and a new data exchange method: modulo-based Flight Scatter. Hilbert clustering now uses the expression

(range_id * node_num) / partition_num

to compute the scatter key, ensuring that adjacent data ranges are co-located on the same node and thus preserving their continuity.

  • Block Streaming Write Support: Enables writing data blocks in a streaming fashion during reclustering, improving efficiency and reducing memory usage.
  • Modulo-Based Data Exchange: Introduces a data exchange strategy that uses a modulo operation on a specified expression to distribute data across nodes.

Tests

  • Unit Test
  • Logic Test
  • Benchmark Test
  • No Test - Explain why

Type of change

  • Bug Fix (non-breaking change which fixes an issue)
  • New Feature (non-breaking change which adds functionality)
  • Breaking Change (fix or feature that could cause existing functionality not to work as expected)
  • Documentation Update
  • Refactoring
  • Performance Improvement
  • Other (please describe):

This change is Reviewable

@zhyass zhyass marked this pull request as draft May 8, 2025 18:53
@github-actions github-actions bot added the pr-feature this PR introduces a new feature to the codebase label May 8, 2025
@zhyass zhyass added ci-cloud Build docker image for cloud test and removed ci-cloud Build docker image for cloud test labels May 8, 2025
@zhyass zhyass added ci-cloud Build docker image for cloud test and removed ci-cloud Build docker image for cloud test labels May 13, 2025
@zhyass zhyass added ci-cloud Build docker image for cloud test and removed ci-cloud Build docker image for cloud test labels May 15, 2025
@zhyass zhyass added ci-cloud Build docker image for cloud test and removed ci-cloud Build docker image for cloud test labels May 19, 2025
@zhyass zhyass added ci-cloud Build docker image for cloud test and removed ci-cloud Build docker image for cloud test labels Jun 6, 2025
@databendlabs databendlabs deleted a comment from github-actions bot Jun 7, 2025
@zhyass zhyass added ci-cloud Build docker image for cloud test and removed ci-cloud Build docker image for cloud test labels Jun 7, 2025
@zhyass zhyass added ci-cloud Build docker image for cloud test and removed ci-cloud Build docker image for cloud test labels Jun 19, 2025
Copy link
Contributor

Docker Image for PR

  • tag: pr-17904-4528d21-1750331611

note: this image tag is only available for internal use.

Copy link
Contributor

github-actions bot commented Jun 24, 2025

🤖 Smart Auto-retry Analysis (Retry #1)

Workflow: 15840871716

📊 Summary

  • Failed Jobs: 1
  • Retryable: 0
  • Code Issues: 1

NO RETRY NEEDED

All failures appear to be code/test issues requiring manual fixes.

🔍 Job Details

  • linux / check: Not retryable (Code/Test)

🤖 About

Automated analysis using job annotations to distinguish infrastructure issues (auto-retried) from code/test issues (manual fixes needed).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci-cloud Build docker image for cloud test pr-feature this PR introduces a new feature to the codebase
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant