Performance investigations for slow cruise control rebalancing #7167
Unanswered
peterschmitzer
asked this question in
Q&A
Replies: 2 comments 5 replies
-
Maybe @kyguy or @ppatierno might have some ideas. But keep in mind that performance is heavily dependent on your infrastructure and might not be easily reproducible. |
Beta Was this translation helpful? Give feedback.
4 replies
-
It may be worth mentioning that the reassignment of partitions "manually" via the kafka-reassign-partitions.sh in kafka onboard tools worked flawlessly and was very fast. So we doubt it has something to do with our underlying infrastructure. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
We are currently practicing a scale out of our kafka cluster in our preprod and for unknown reason the rebalancing performs poorly and takes a lot of time. Unfortunately, we have not found a lot of documentation about performance considerations.
We are running strimzi/kafka:0.28.0-kafka-3.0.0
The cluster holds 4500 partitions with 30 replicas per topic and replication factor of 3 on 3 brokers when I scaled up to 4 brokers.
After applying the kafka rebalance with mode "add-brokers" it proposed to move 1000 replicas and 82MB. The whole process took more than one hour to complete. In our other cluster where we wanted to move only a little bit more data it seems to take forever (there we stoppe the rebalancing).
Does someone have experience with bad performance when applying a rebalance proposal? What may be possible root causes for the issues we see? In the past (with older strimzi and kafka versions) we have done a rebalance in a poc and moved 80GB of data which was completed in 15 minutes, so we are a little bit confused.
Beta Was this translation helpful? Give feedback.
All reactions