ceph - cephadm - 2024-09-25

Timestamp (UTC)Message
2024-09-25T16:20:00.268Z
<Yonatan Zaken> Hi All,

I am experiencing failures when starting Ceph services via the unit.run files, e.g:
/var/lib/ceph/f14c2a2d-97ed-435c-bb1d-4a30b617a31d/osd.0/unit.run

Failing on the following:
```Error: could not find cgroup mount in "/proc/self/cgroup"
systemd[1]: ceph-f14c2a2d-97ed-435c-bb1d-4a30b617a31d@osd.0.service: Control process exited, code=exited status=126```
Here is the part in the `podmun run` that actually seems to fail. Note I use: `--cgroups=enabled, --memory-limit=32g, --memory-request=16g, --cpuset-cpus=0,32,1,33,16,48,17,49`

```/bin/podman run --rm \
  --ipc=host \
  --stop-signal=SIGTERM \
  --net=host \
  --entrypoint /usr/bin/ceph-osd \
  --privileged \
  --group-add=disk \
  --init \
  --name ceph-f14c2a2d-97ed-435c-bb1d-4a30b617a31d-osd-0 \
  --pids-limit=-1 \
  -d \
  --log-driver journald \
  --conmon-pidfile /run/ceph-f14c2a2d-97ed-435c-bb1d-4a30b617a31d@osd.0.service-pid \
  --cidfile /run/ceph-f14c2a2d-97ed-435c-bb1d-4a30b617a31d@osd.0.service-cid \
  --cgroups=enabled \
  --memory-limit=32g \
  --memory-request=16g \
  --cpuset-cpus=0,32,1,33,16,48,17,49 \
  -e CONTAINER_IMAGE=172.91.0.1:8787/ceph/daemon:reef-rockylinux-8-x86_64 \
  -e NODE_NAME=gilad-dev-allinone-0 \
  -e CEPH_USE_RANDOM_NONCE=1 \
  -e TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES=134217728 \
  -v /var/run/ceph/f14c2a2d-97ed-435c-bb1d-4a30b617a31d:/var/run/ceph:z \
  -v /var/log/ceph/f14c2a2d-97ed-435c-bb1d-4a30b617a31d:/var/log/ceph:z \
  -v /var/lib/ceph/f14c2a2d-97ed-435c-bb1d-4a30b617a31d/crash:/var/lib/ceph/crash:z \
  -v /run/systemd/journal:/run/systemd/journal \
  -v /var/lib/ceph/f14c2a2d-97ed-435c-bb1d-4a30b617a31d/osd.0:/var/lib/ceph/osd/ceph-0:z \
  -v /var/lib/ceph/f14c2a2d-97ed-435c-bb1d-4a30b617a31d/osd.0/config:/etc/ceph/ceph.conf:z \
  -v /dev:/dev \
  -v /run/udev:/run/udev \
  -v /sys:/sys \
  -v /run/lvm:/run/lvm \
  -v /run/lock/lvm:/run/lock/lvm \
  -v /var/lib/ceph/f14c2a2d-97ed-435c-bb1d-4a30b617a31d/selinux:/sys/fs/selinux:ro \
  -v /:/rootfs \
  -v /etc/hosts:/etc/hosts:ro \
  172.91.0.1:8787/ceph/daemon:reef-rockylinux-8-x86_64 \
  -n osd.0 \
  -f \
  --setuser ceph \
  --setgroup ceph \
  --default-log-to-file=false \
  --default-log-to-journald=true \
  --default-log-to-stderr=false```
`podman info`  shows:
  cgroupManager: systemd
  cgroupVersion: v1

And `mount | grep cgroup`
```tmpfs on /sys/fs/cgroup type tmpfs (ro,nosuid,nodev,noexec,seclabel,mode=755)
cgroup on /sys/fs/cgroup/systemd type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,xattr,release_agent=/usr/lib/systemd/systemd-cgroups-agent,name=systemd)
cgroup on /sys/fs/cgroup/pids type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,pids)
cgroup on /sys/fs/cgroup/cpuset type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,cpuset)
cgroup on /sys/fs/cgroup/net_cls,net_prio type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,net_cls,net_prio)
cgroup on /sys/fs/cgroup/freezer type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,freezer)
cgroup on /sys/fs/cgroup/blkio type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,blkio)
cgroup on /sys/fs/cgroup/devices type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,devices)
cgroup on /sys/fs/cgroup/rdma type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,rdma)
cgroup on /sys/fs/cgroup/perf_event type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,perf_event)
cgroup on /sys/fs/cgroup/memory type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,memory)
cgroup on /sys/fs/cgroup/cpu,cpuacct type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,cpu,cpuacct)
cgroup on /sys/fs/cgroup/hugetlb type cgroup (rw,nosuid,nodev,noexec,relatime,seclabel,hugetlb) ```

Any issue? please create an issue here and use the infra label.