Description
Hi
We run Cruise Control in a Kubernetes pod (image built from the 2.5.138 tag) against a bare-metal Kafka cluster. When more than 80 brokers are passed as the bootstrap server parameter, startup often fails and the pod does not come up even after 30 minutes. During that time, Kafka connection messages are logged frequently, but no errors appear. The pod sometimes started after we repeatedly deleted the pod or the bootstrap metrics, or recreated the CC topics.
We found this number by repeatedly reducing the number of brokers passed until the pod came up. We started with 400, which didn't work, then went to 200, and so on, until we found that 80 works.
As a short-term workaround, we pass only the first 80 brokers to Cruise Control, and it starts fine. But this is not a clean solution. I wonder if anyone else has encountered this problem.
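For reference, the workaround amounts to trimming the broker list before handing it to Cruise Control. Below is a minimal sketch of that step, not our exact tooling: the function name `limit_bootstrap_servers`, the `max_brokers` parameter, and the `kafka-N.example.internal` host names are hypothetical and used only for illustration. The limit of 80 is just the value that happened to work in our cluster.

```python
def limit_bootstrap_servers(servers: str, max_brokers: int = 80) -> str:
    """Keep only the first max_brokers entries of a comma-separated host:port list."""
    if max_brokers < 1:
        raise ValueError("max_brokers must be at least 1")
    # Split on commas and drop empty entries and surrounding whitespace.
    hosts = [h.strip() for h in servers.split(",") if h.strip()]
    return ",".join(hosts[:max_brokers])


# Hypothetical broker names, for illustration only.
all_brokers = ",".join(f"kafka-{i}.example.internal:9092" for i in range(400))
trimmed = limit_bootstrap_servers(all_brokers)
print(len(trimmed.split(",")))
```

The trimmed string is what we then pass as the bootstrap server parameter.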