Multi-Cluster Ceph issue with CephFS #6083
-
|
Hi, rather than jump in and raise an Issue, I thought I'd start here. I've read all the documentation and read threads in here, however I cannot get multi-cluster to work. Firstly my approach is slightly different, in that I am not using CephCSI to provision volumes. These are created externally on the Ceph Cluster, so I'm using a Static volume approach. The error we're seeing from the You'd expect this to be a misconfigured Storage Class because it says it can't find the controller publish secret although its not, see below. I've checked the RBAC, RolesBindings, Roles etc against the examples in this repo, all good except for the nodeplugin role, which isn't created for some reason. Apart from that I can find no other issues, the error we see in the pod description is at the bottom of this post. Current deployment configs and info. Configmaps - note hostnames are slightly redacted and are in use on other single config deployments without issues, I have also created the storage classes & secrets in the csi-ceph-system namespace but get the same results. Storage Classes PV PVCs Secrets A test pod. Pod errors. The not authorised is probably the culprit due to the error we see in the provisioner. If you consider this to be a potential bug please let me know and I will raise it as such. TIA. |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments
-
|
@iPraveenParihar PTAL |
Beta Was this translation helpful? Give feedback.
-
|
Hey @seanmcconkey, We're seeing two separate issues: 1. Controller Publish failure on The CSI driver is failing to decode the 2. MountDevice failure on The mount is failing because no CephFS Metadata Server (MDS) is currently active or reachable. Could you run the following and verify if the MDS is active? |
Beta Was this translation helpful? Give feedback.
-
|
@iPraveenParihar thanks for the reply and apologies for the late response. I am declaring the I did figure the issue out and it was me forgetting that the The only thing I'd remark on this, is why we got that error message about the secret, I can understand why we got the Thanks once again. |
Beta Was this translation helpful? Give feedback.
@iPraveenParihar thanks for the reply and apologies for the late response.
I am declaring the
controller-publish-secretin the SC as shown above so its nothing to do with that, even though the error suggest it is.I did figure the issue out and it was me forgetting that the
fsnameis notcephfson this cluster, we overlay the name change and I was doing my tests with the un-overlayed PV template.The only thing I'd remark on this, is why we got that error message about the secret, I can understand why we got the
mdserror messages though.Thanks once again.