ceph - sepia - 2024-11-22

Timestamp (UTC)Message
2024-11-22T02:36:54.576Z
<Rongqi Sun> Because confusa12 and confusa14 are offline, I can't access them. How can I bring them back to online then find out the root cause?
2024-11-22T02:36:59.708Z
<Rongqi Sun> @Dan Mick @David Galloway
2024-11-22T02:41:04.345Z
<David Galloway> I already found the root cause. It was fixed on 14. 12 never came back after a reboot and needs further investigation.
2024-11-22T02:55:08.447Z
<Rongqi Sun> OK, thanks for the help
2024-11-22T10:14:49.212Z
<Kyrylo Shatskyy> Does it mean we stop doing -Werror,-Wall? Because I am still trying to resolve those errors on similar modern systems.
2024-11-22T11:50:58.202Z
<Zac Dover> Error cloning the repo:

```Caused by: java.io.IOException: Cannot run program "git" (in directory "/home/jenkins-build/build/workspace/ceph-pull-requests"): error=0, Failed to exec spawn helper: pid: 2743079, exit value: 1
	at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1143)
	at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1073)
	at hudson.Proc$LocalProc.<init>(Proc.java:252)
	at hudson.Proc$LocalProc.<init>(Proc.java:221)
	at hudson.Launcher$LocalLauncher.launch(Launcher.java:994)
	at hudson.Launcher$ProcStarter.start(Launcher.java:506)
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl.launchCommandIn(CliGitAPIImpl.java:2841)
	... 16 more
Caused by: java.io.IOException: error=0, Failed to exec spawn helper: pid: 2743079, exit value: 1
	at java.base/java.lang.ProcessImpl.forkAndExec(Native Method)
	at java.base/java.lang.ProcessImpl.<init>(ProcessImpl.java:314)
	at java.base/java.lang.ProcessImpl.start(ProcessImpl.java:244)
	at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1110)
	... 22 more
ERROR: Error cloning remote repo 'origin'
[Cobertura] Skipping Cobertura coverage report as build was not SUCCESS or better ...```
<https://jenkins.ceph.com/job/ceph-pull-requests/147142/console>

Re: <https://github.com/ceph/ceph/pull/60754>
2024-11-22T15:21:26.019Z
<John Mulligan> As a group we don't need to do anything. I am not suggesting to change any of the current defaults. I was just trying to get past those compiler errors to see if there were any other build blockers.
2024-11-22T15:22:46.350Z
<John Mulligan> (there were not)
2024-11-22T16:30:44.485Z
<Casey Bodley> shall we target Alma when adding el10 support? once that's working, we could go back and convert el9 from centos to also
2024-11-22T16:31:07.364Z
<Casey Bodley> shall we target Alma when adding el10 support? once that's working, we could go back and convert el9 from centos to alma also
2024-11-22T21:24:41.813Z
<David Galloway> @Dan Mick You wanna look together?
2024-11-22T21:26:59.965Z
<David Galloway> @Adam Kraitman Does Grafana monitor the Jenkins Builders for constantly failing jobs?
2024-11-22T21:31:56.021Z
<David Galloway> Seems maybe just restarting the jenkins service (`systemctl restart jenkins`) on the Jenkins Builder fixes it.  That would make sense if Java was updated but the jenkins service wasn't restarted.
2024-11-22T22:08:52.074Z
<Dan Mick> so this was the magic thing that I was talking about
2024-11-22T22:09:33.419Z
<Dan Mick> the thing is: isn't the clone that failed running on the builder?
2024-11-22T22:11:59.023Z
<David Galloway> Yes.  The service on the Jenkins Builder is called "jenkins"
2024-11-22T22:12:02.632Z
<David Galloway> I blame Alfredo
2024-11-22T22:13:20.922Z
<Dan Mick> oh godddamnit you said Jenkins Builder.  I'm sorry.
2024-11-22T22:14:00.744Z
<Dan Mick> I was searching for a five-paragraph message I sent on Wednesday explaining nfs-ganesha to send to Yuri, who just said "I don't understand why the amd64 prerelease container built"
2024-11-22T22:14:32.171Z
<Dan Mick> do you have any idea if restarting the builder is safe if a job is running?
2024-11-22T22:14:45.616Z
<David Galloway> It is not
2024-11-22T22:15:18.731Z
<David Galloway> So
Mark offline
Wait for job to finish (if applicable)
`systemctl restart jenkins`
Mark builder back online
2024-11-22T22:15:43.130Z
<Dan Mick> <nod>  /me wonders if the builder.yml role tries to wait
2024-11-22T22:16:32.301Z
<Dan Mick> answer: no
2024-11-22T22:17:01.275Z
<Zac Dover> Error cloning the repo:

```Caused by: java.io.IOException: Cannot run program "git" (in directory "/home/jenkins-build/build/workspace/ceph-pull-requests"): error=0, Failed to exec spawn helper: pid: 2743079, exit value: 1
	at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1143)
	at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1073)
	at hudson.Proc$LocalProc.<init>(Proc.java:252)
	at hudson.Proc$LocalProc.<init>(Proc.java:221)
	at hudson.Launcher$LocalLauncher.launch(Launcher.java:994)
	at hudson.Launcher$ProcStarter.start(Launcher.java:506)
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl.launchCommandIn(CliGitAPIImpl.java:2841)
	... 16 more
Caused by: java.io.IOException: error=0, Failed to exec spawn helper: pid: 2743079, exit value: 1
	at java.base/java.lang.ProcessImpl.forkAndExec(Native Method)
	at java.base/java.lang.ProcessImpl.<init>(ProcessImpl.java:314)
	at java.base/java.lang.ProcessImpl.start(ProcessImpl.java:244)
	at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1110)
	... 22 more
ERROR: Error cloning remote repo 'origin'
[Cobertura] Skipping Cobertura coverage report as build was not SUCCESS or better ...```
<https://jenkins.ceph.com/job/ceph-pull-requests/147142/console>

Re: <https://github.com/ceph/ceph/pull/60754>
2024-11-22T22:20:00.414Z
<Dan Mick> oh you said Jenkins Builder.  I'm sorry.

Any issue? please create an issue here and use the infra label.