24.8.14 Backport of #87863: ArrowStream processing crash if non unique dictionary#1067
Merged
zvonand merged 2 commits intocustomizations/24.8.14from Oct 10, 2025
Merged
Conversation
ArrowStream processing crash if non unique dictionary
Collaborator
|
This is an automated comment for commit 0f43094 with description of existing statuses. It's updated for the latest CI running ❌ Click here to open a full report in a separate page
Successful checks
|
Collaborator
Author
|
Attn @zvonand |
Collaborator
|
@ilejn I would like to ask you to create PRs with proper names and descriptions (see other backports) Also, please, add necessary labels, so that the PR is easy to find in GitHub |
zvonand
approved these changes
Oct 10, 2025
Collaborator
Author
Collaborator
Author
|
Sorry for troubling anyway. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Changelog category (leave one):
Changelog entry (a [user-readable short description]
ClickHouse crashes if ArrowStream file has non-unique dictionary (ClickHouse#87863 by @ilejn)
Documentation entry for user-facing changes
The issue was observed in prod environment (no ideas how data was created).
I've created sample file by the script
The point is 'banana' is duplicated.
ClickHouse creates LowCardinality column for two meaningful elements ('apple' and 'banana') and index '2' is out of bound.
Backtrace