ceph - sepia - 2024-11-15

Timestamp (UTC) | Message
2024-11-15T01:59:35.818Z
<Rongqi Sun> Could you help figure it out? Because I do not have enough access
2024-11-15T05:19:30.017Z
<nizamial09> Started seeing the below in the jenkins job: <https://jenkins.ceph.com/job/ceph-api/85038/console>
```
Cloning repository https://github.com/ceph/ceph.git
 > git init /home/jenkins-build/build/workspace/ceph-api # timeout=10
ERROR: Error cloning remote repo 'origin'
hudson.plugins.git.GitException: Could not init /home/jenkins-build/build/workspace/ceph-api
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl$5.execute(CliGitAPIImpl.java:1079)
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl$2.execute(CliGitAPIImpl.java:825)
	at org.jenkinsci.plugins.gitclient.RemoteGitImpl$CommandInvocationHandler$GitCommandMasterToSlaveCallable.call(RemoteGitImpl.java:170)
	at org.jenkinsci.plugins.gitclient.RemoteGitImpl$CommandInvocationHandler$GitCommandMasterToSlaveCallable.call(RemoteGitImpl.java:161)
	at hudson.remoting.UserRequest.perform(UserRequest.java:211)
	at hudson.remoting.UserRequest.perform(UserRequest.java:54)
	at hudson.remoting.Request$2.run(Request.java:377)
	at hudson.remoting.InterceptingExecutorService.lambda$wrap$0(InterceptingExecutorService.java:78)
	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
	at hudson.remoting.Engine$1.lambda$newThread$0(Engine.java:137)
	at java.base/java.lang.Thread.run(Thread.java:840)
	Suppressed: hudson.remoting.Channel$CallSiteStackTrace: Remote call to JNLP4-connect connection from 8.43.84.3/8.43.84.3:58755
		at hudson.remoting.Channel.attachCallSiteStackTrace(Channel.java:1923)
		at hudson.remoting.UserRequest$ExceptionResponse.retrieve(UserRequest.java:384)
		at hudson.remoting.Channel.call(Channel.java:1112)
		at PluginClassLoader for git-client//org.jenkinsci.plugins.gitclient.RemoteGitImpl$CommandInvocationHandler.execute(RemoteGitImpl.java:153)
		at jdk.internal.reflect.GeneratedMethodAccessor558.invoke(Unknown Source)
		at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
		at java.base/java.lang.reflect.Method.invoke(Method.java:569)
		at PluginClassLoader for git-client//org.jenkinsci.plugins.gitclient.RemoteGitImpl$CommandInvocationHandler.invoke(RemoteGitImpl.java:138)
		at PluginClassLoader for git-client/jdk.proxy79/jdk.proxy79.$Proxy165.execute(Unknown Source)
		at PluginClassLoader for git//hudson.plugins.git.GitSCM.retrieveChanges(GitSCM.java:1220)
		at PluginClassLoader for git//hudson.plugins.git.GitSCM._checkout(GitSCM.java:1310)
		at PluginClassLoader for git//hudson.plugins.git.GitSCM.checkout(GitSCM.java:1277)
		at hudson.scm.SCM.checkout(SCM.java:540)
		at hudson.model.AbstractProject.checkout(AbstractProject.java:1250)
		at hudson.model.AbstractBuild$AbstractBuildExecution.defaultCheckout(AbstractBuild.java:649)
		at jenkins.scm.SCMCheckoutStrategy.checkout(SCMCheckoutStrategy.java:85)
		at hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:522)
		at hudson.model.Run.execute(Run.java:1831)
		at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:44)
		at hudson.model.ResourceController.execute(ResourceController.java:101)
		at hudson.model.Executor.run(Executor.java:445)
Caused by: hudson.plugins.git.GitException: Error performing git command: git init /home/jenkins-build/build/workspace/ceph-api
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl.launchCommandIn(CliGitAPIImpl.java:2864)
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl.launchCommandIn(CliGitAPIImpl.java:2768)
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl.launchCommandIn(CliGitAPIImpl.java:2763)
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl.launchCommand(CliGitAPIImpl.java:2052)
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl$5.execute(CliGitAPIImpl.java:1077)
	... 12 more
Caused by: java.io.IOException: Cannot run program "git" (in directory "/home/jenkins-build/build/workspace/ceph-api"): error=0, Failed to exec spawn helper: pid: 298658, exit value: 1
	at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1143)
	at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1073)
	at hudson.Proc$LocalProc.<init>(Proc.java:252)
	at hudson.Proc$LocalProc.<init>(Proc.java:221)
	at hudson.Launcher$LocalLauncher.launch(Launcher.java:994)
	at hudson.Launcher$ProcStarter.start(Launcher.java:506)
	at org.jenkinsci.plugins.gitclient.CliGitAPIImpl.launchCommandIn(CliGitAPIImpl.java:2841)
	... 16 more
Caused by: java.io.IOException: error=0, Failed to exec spawn helper: pid: 298658, exit value: 1
	at java.base/java.lang.ProcessImpl.forkAndExec(Native Method)
	at java.base/java.lang.ProcessImpl.<init>(ProcessImpl.java:314)
	at java.base/java.lang.ProcessImpl.start(ProcessImpl.java:244)
	at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1110)
	... 22 more
ERROR: Error cloning remote repo 'origin'
```
2024-11-15T13:22:10.603Z
<Casey Bodley> i changed the branch protection rules for main to require the arm64 check
2024-11-15T16:34:38.288Z
<Laura Flores> Great!
2024-11-15T18:45:21.179Z
<Dan Mick> I mean, ultimately, yes, it's on HDD, but it's across an iscsi link to the LRC like all the other VMs on RHEV at the moment.
2024-11-15T19:30:54.897Z
<Anthony D'Atri> ugh, that explains why it's so bad. When there were like 640 pending messages for ceph-users the web UI kept timing out as I tried to debulk them. Then something happened that broke the web UI more thoroughly, no idea what. So I patched together a Python shell strategy for the meantime, which is not at all sustainable.
2024-11-15T19:31:41.707Z
<Dan Mick> something is wrong.  it's not at all clear it's the storage.
2024-11-15T19:36:56.346Z
<Anthony D'Atri> Agreed, though the storage I think does contribute to perf / timeouts.
2024-11-15T19:37:04.776Z
<Anthony D'Atri> The OS and packages are ancient
2024-11-15T19:44:59.273Z
<Dan Mick> it hasn't seen love in a while, not the least of which being there appears to be no way to simply update the mailman software; it seems to require a complete backup/reinstall/restore, which is quirky because 1) containers and 2) mailman being largely undocumented
2024-11-15T19:45:40.587Z
<Dan Mick> I know it's way less than optimal but it's just been not first on the list for a while.  @Adam Kraitman was looking at it at one point but I don't remember his status/conclusion
2024-11-15T19:47:52.196Z
<Josh Durgin> I've got some spare time today, can I get access to the vm to investigate? Haven't been able to load the pending subscribers in the web ui for ceph-users either
2024-11-15T20:21:14.342Z
<Anthony D'Atri> Adam got me my current access.  I'll set you up, Josh, shoot me your SSH key?
2024-11-15T20:21:31.151Z
<Anthony D'Atri> Subscription via the CLI is the easy part
2024-11-15T20:24:11.976Z
<Josh Durgin> thanks, dm'd
2024-11-15T20:50:10.508Z
<Dan Mick> I'm working on it
2024-11-15T21:02:49.033Z
<Anthony D'Atri> I set him up
2024-11-15T21:03:08.618Z
<Anthony D'Atri> I set him up already as `jdurgin`
2024-11-15T21:03:24.275Z
<Dan Mick> and I just did it again, with ansible, so all the t's were crossed
2024-11-15T21:04:37.212Z
<Josh Durgin> thanks!
2024-11-15T21:04:40.371Z
<Anthony D'Atri> I set him up already as `jdurgin`, the `sudo` group etc.
2024-11-15T21:23:27.539Z
<yuriw> Can't schedule suites
2024-11-15T21:23:42.497Z
<yuriw> https://files.slack.com/files-pri/T1HG3J90S-F081T7EGSAU/download/untitled
2024-11-15T21:24:01.516Z
<yuriw> Can't schedule suites `"git clone failed!"`
2024-11-15T21:24:09.107Z
<yuriw> @Zack Cerza FYI
