Rook-Ceph Failure
Problem
Sometimes when you shut down a node or a node fails, the pod status gets stuck in initializing, the Ceph status might hang, or the Rook module fails with the following error:
Error: "Module 'rook' has failed: None: Max retries exceeded with url: /api/v1/nodes (Caused by None)"
Cause
The Ceph monitor isn't fully initialized or ready to serve requests, and the metadata server cannot complete its own initialization because it depends on the Ceph monitor.
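You can verify this dependency by checking whether the monitor and metadata server pods are Running and Ready. This sketch assumes the default labels Rook applies to its daemons (app=rook-ceph-mon and app=rook-ceph-mds):
# Monitor pods must be Ready before the MDS can finish initializing
kubectl -n rook-ceph get pods -l app=rook-ceph-mon
# Metadata server pods for the CephFS filesystem
kubectl -n rook-ceph get pods -l app=rook-ceph-mds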
Solution
Restart rook-ceph-operator and the metadata server using the kubectl rollout restart deployment -n rook-ceph rook-ceph-operator rook-ceph-mds-cephfs-a rook-ceph-mds-cephfs-b command.
root@pa1:~# kubectl rollout restart deployment -n rook-ceph rook-ceph-operator rook-ceph-mds-cephfs-a rook-ceph-mds-cephfs-b
Warning: would violate PodSecurity "restricted:latest": allowPrivilegeEscalation != false (container "rook-ceph-operator" must set securityContext.allowPrivilegeEscalation=false), seccompProfile (pod or container "rook-ceph-operator" must set securityContext.seccompProfile.type to "RuntimeDefault" or "Localhost")
deployment.apps/rook-ceph-operator restarted
Warning: would violate PodSecurity "restricted:latest": allowPrivilegeEscalation != false (containers "chown-container-data-dir", "mds", "log-collector" must set securityContext.allowPrivilegeEscalation=false), unrestricted capabilities (containers "chown-container-data-dir", "mds", "log-collector" must set securityContext.capabilities.drop=["ALL"]), restricted volume types (volumes "ceph-daemons-sock-dir", "rook-ceph-log", "rook-ceph-crash" use restricted volume type "hostPath"), runAsNonRoot != true (pod or containers "chown-container-data-dir", "mds", "log-collector" must set securityContext.runAsNonRoot=true), seccompProfile (pod or containers "chown-container-data-dir", "mds", "log-collector" must set securityContext.seccompProfile.type to "RuntimeDefault" or "Localhost")
deployment.apps/rook-ceph-mds-cephfs-a restarted
deployment.apps/rook-ceph-mds-cephfs-b restarted
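The PodSecurity "restricted:latest" messages are warnings only and do not block the operation; the deployment.apps/... restarted lines confirm the rollouts were accepted. After the restart, you can watch the rollouts complete and re-check the cluster health. This is a sketch that assumes the same deployment names used above and the optional rook-ceph-tools toolbox:
kubectl -n rook-ceph rollout status deployment/rook-ceph-operator
kubectl -n rook-ceph rollout status deployment/rook-ceph-mds-cephfs-a
kubectl -n rook-ceph rollout status deployment/rook-ceph-mds-cephfs-b
# Ceph should return to HEALTH_OK (or HEALTH_WARN while it recovers)
kubectl -n rook-ceph exec deploy/rook-ceph-tools -- ceph status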