2024-12-20T03:13:11.162Z | <Dan Mick> omani001: idle (buster,jammy,arm64,huge,installed-os-jammy,xenial)
omani002: idle (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
omani003: idle (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
omani004: offline (jammy,focal,arm64,bionic,huge,installed-os-jammy,xenial)
omani005: idle (jammy,focal,arm64,bionic,huge,installed-os-jammy,xenial)
omani006: temporarily offline: Oct 27 Disconnected by akraitman: Hardware issue needs further investigation (jammy,focal,arm64,bionic,huge,installed-os-jammy,xenial)
omani007: offline (arm64,centos8,huge,installed-os-centos8)
omani008: idle (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
omani009: 1 active builds (jammy,arm64,bionic,huge,installed-os-jammy,xenial)
<https://jenkins.ceph.com/job/ceph-pull-requests-arm64/66152/>
omani010: idle (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
sulcata02: temporarily offline: Jul 10 Disconnected by dmick: Kernel crash dumps were filling up /var/crash. Suspect hardware issue. (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
confusa01: offline (sepia,arm64,gigantic,centos8,huge,installed-os-centos8)
confusa02: offline (sepia,arm64,gigantic,centos8,huge,installed-os-centos8)
confusa03: offline (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
confusa04: idle (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
confusa05: offline (sepia,arm64,gigantic,huge,centos7)
confusa06: offline (sepia,arm64,gigantic,huge,centos7,installed-os-centos7)
confusa07: idle (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
confusa08: temporarily offline: Aug 25 Disconnected by akraitman: Seeing this in the ipmi console
{94016}[Hardware Error]: section_type: memory error
[554312.388350] {94016}[Hardware Error]: error_status: Storage error in DRAM memory (0x0000000000000400)
[554312.397881] {94016}[Hardware Error]: node:4 card:1032 rank:10 bank:65535 row:0 column:0
[554312.406234] {94016}[Hardware Error]: error_type: 2, single-bit ECC
[554312.412661] {94016}[Hardware Error]: Error 1, type: info
[554312.418150] {94016}[Hardware Error]: fru_text: ecc_errc_count_l
[554312.424245] {94016}[Hardware Error]: section_type: general processor error
[554312.431367] {94016}[Hardware Error]: requestor_id: 0x000000000000ffff
[554312.438055] {94016}[Hardware Error]: Error 2, type: info
[554312.443528] {94016}[Hardware Error]: fru_text: ecc_errc_count_h (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
confusa10: idle (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
confusa11: 1 active builds (jammy,sepia,focal,arm64,bionic,gigantic,huge,installed-os-jammy,xenial)
<https://jenkins.ceph.com/job/ceph-pull-requests-arm64/66151/>
confusa12: temporarily offline: Nov 21 ?? (jammy,sepia,focal,arm64,bionic,gigantic,huge,installed-os-jammy,xenial)
confusa13: offline (sepia,arm64,gigantic,huge,centos9,installed-os-centos9)
confusa14: offline (jammy,sepia,focal,arm64,bionic,gigantic,huge,installed-os-jammy,xenial)
Idle: 9 Busy: 2 Offline: 13 |