Skip to content

Failure to set a real time scheduler in CTDB when CephFS backend is used #18

@xhernandez

Description

@xhernandez

When a SIT Environment is created using the CephFS backend, sometimes the CTDB service fails to start, logging this error:

Unable to set scheduler to SCHED_FIFO (Operation not permitted)

This seems related to BZ 1201952. The two workarounds proposed there work:

  1. Increase the value of /sys/fs/cgroup/cpu/system.slice/cpu.rt_runtime_us.
  2. Add the option realtime scheduling = false into the legacy section of ctdb.conf.

For now SIT Environment uses the second approach.

Upon further investigation, it seems that the system.slice subdirectory doesn't exist for other backends, but on CephFS, when cephadm is boostrapping the cluster, it appears (among others). Most probably this is related to container and/or systemd activity. This particular subdirectory seems to disappear automatically after some time, but sometimes it's already too late and CTDB has already attempted to start and failed.

Some additional related information provided by @phlogistonjohn (Thanks !!!):

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions