Volume Resize Failure (Azure) #8728
Replies: 1 comment 4 replies
-
I don't think your log makes it really clear what happened in your case. Strimzi will restart the pods when they get into the Also, based on previous discussion, Azure has some special behaviors when it comes to resizing. In the past it was reported that it does not use the |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi there, I was required to increase the storage space of our three node kafka cluster and ran into an issue on one of the nodes. Two nodes completed the resize with no issue (no kafka restarts, no special events), the third one however ran into a
FileSystemResizePending
error.From my understanding, strimzi is supposed to roll the cluster in this case, to facilitate the required (re-)start of the pod. This did not happen and I started seeing errors in the strimzi logs. It seems like strimzi was just waiting for the pod to be ready and not realising that it needed to restart the pod to finalise the resize. In a different discussion it seems like strimzi did actually do the pod roll, so I think the issue may be related to the "race condition" that was encountered on the CSI resizer (see PVC Events).
Sidenote: After a good while (>30mins), I decided to delete the offending PVC and then strimzi successfully created a new one and kafka rebuilt itself. Had this not worked I would have restored the backup from before the resize.
This was with strimzi version 0.35.0 on Azure AKS.
My actual questions now as follows:
PVC Resize
Status Conditions:
Events:
Strimzi Logs
Note that the PVC resize only begins after the first exception (
2023-06-25 10:49:40 INFO PvcReconciler:132
).Beta Was this translation helpful? Give feedback.
All reactions