ceph - cephfs - 2024-09-12

Timestamp (UTC) | Message
2024-09-12T07:26:53.833Z
<Venky Shankar> oh, this is a different crash
2024-09-12T07:26:58.160Z
<Venky Shankar> where is this? smithi017?
2024-09-12T07:29:03.847Z
<Xiubo Li> console_logs/smithi067.log
2024-09-12T07:29:34.486Z
<Xiubo Li> I believe just this node crashed and then caused the slow perf issue
2024-09-12T07:30:22.737Z
<Venky Shankar> kk
2024-09-12T07:30:24.016Z
<Venky Shankar> `RIP: 0010:usercopy_abort+0x74/0x76`
2024-09-12T07:30:36.334Z
<Xiubo Li> a known issue
2024-09-12T07:30:41.955Z
<Venky Shankar> did the kernel oops when copying data to/from user space?
2024-09-12T07:30:48.400Z
<Venky Shankar> that would explain the bus error.
2024-09-12T07:31:25.221Z
<Xiubo Li> Yeah, during data exchange between user space and kernel space
2024-09-12T07:32:46.646Z
<Venky Shankar> kk, and when you say it's a known issue, is there a tracker that I can have a look at?
2024-09-12T07:33:03.361Z
<Xiubo Li> let me check, wait for a moment
2024-09-12T07:34:08.709Z
<Xiubo Li> <https://tracker.ceph.com/issues/64471>
2024-09-12T07:37:01.096Z
<Venky Shankar> ah, this one.
2024-09-12T07:37:19.805Z
<Igor Golikov> Hi, can you point me to some meaningful metrics in the MDS? There are tons of them there, and I am sure from your experience there is a subset that is more meaningful
2024-09-12T07:37:30.379Z
<Venky Shankar> I see - kdb was still enabled for this run.
2024-09-12T07:37:51.881Z
<Venky Shankar> You can start looking at the ones marked USEFUL
2024-09-12T07:38:10.635Z
<Igor Golikov> perfect. thanks!
2024-09-12T07:38:17.701Z
<Xiubo Li> I just added one option in teuthology; it seems I haven't made any change in the qa suite yet.
2024-09-12T07:38:29.656Z
<Venky Shankar> i.e., PRIO_USEFUL and higher priority. those are the ones which are basically important to be able to make sense of where the MDS is spending time.
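
A minimal sketch of the filtering described above, assuming the counter schema is available as JSON in which each counter carries a numeric `priority` field (as the output of `ceph daemon mds.<id> perf schema` is expected to), and assuming PRIO_USEFUL corresponds to the value 5. The sample data and the helper name `useful_counters` are hypothetical and only illustrate the shape; check the priority values against your Ceph version.

```python
import json

# Assumption: PRIO_USEFUL maps to 5 in Ceph's perf counter priorities.
PRIO_USEFUL = 5


def useful_counters(schema, min_prio=PRIO_USEFUL):
    """Return sorted 'section.counter' names with priority >= min_prio."""
    names = []
    for section, counters in schema.items():
        for name, meta in counters.items():
            # Counters without a priority field are treated as lowest priority.
            if meta.get("priority", 0) >= min_prio:
                names.append(f"{section}.{name}")
    return sorted(names)


# Hypothetical sample, shaped like a perf schema dump (not real output).
SAMPLE = json.loads("""
{
  "mds": {
    "request":       {"priority": 8, "description": "Requests"},
    "reply_latency": {"priority": 5, "description": "Reply latency"},
    "dir_fetch":     {"priority": 0, "description": "Directory fetch"}
  },
  "mds_mem": {
    "ino": {"priority": 2, "description": "Inodes"}
  }
}
""")

if __name__ == "__main__":
    for counter in useful_counters(SAMPLE):
        print(counter)
```

On a live cluster the same function could be fed the parsed JSON from the admin socket instead of `SAMPLE`.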
2024-09-12T07:38:37.477Z
<Xiubo Li> maybe we can just add the option to disable it when running the jobs
2024-09-12T07:38:44.681Z
<Venky Shankar> yeh
2024-09-12T07:42:38.331Z
<Milind Changire> @Rishabh Dave fyi - this can be merged: <https://github.com/ceph/ceph/pull/57459>
2024-09-12T08:43:56.617Z
<Milind Changire> does [BZ 2218759](https://bugzilla.redhat.com/show_bug.cgi?id=2218759) seem like an aggressive MDLog trimming issue that was also seen elsewhere ?
2024-09-12T14:56:57.919Z
<Dhairya Parmar> When I tried to start a fuse client on a sub-path to the already existing mount, it created a sub-sub-path with the same name as sub-path.

Created a mount dir and started a fuse client on it:
```$ mkdir mnt
$ ./bin/ceph-fuse mnt
ceph-fuse[1062104]: starting ceph client
ceph-fuse[1062104]: starting fuse
$ tree mnt
mnt

0 directories, 0 files```
Created foo inside `mnt` :
```$ mkdir mnt/foo
$ tree mnt
mnt
└── foo

2 directories, 0 files```
Started a fuse client on `mnt/foo` :
```$ ./bin/ceph-fuse mnt/foo
ceph-fuse[1062221]: starting ceph client
ceph-fuse[1062221]: starting fuse
$ df -hT
Filesystem     Type            Size  Used Avail Use% Mounted on
...
ceph-fuse      fuse.ceph-fuse   99G     0   99G   0% /home/dparmar/ceph/build/mnt
ceph-fuse      fuse.ceph-fuse   99G     0   99G   0% /home/dparmar/ceph/build/mnt/foo```
Now there is `mnt/foo/foo` :
```$ tree mnt
mnt
└── foo
    └── foo

3 directories, 0 files```
Does this mean we're creating a ceph-fuse inside an already running ceph-fuse (`mnt`) and so the mount call to `mnt/foo` is treated as `mnt` which leads to creation of `mnt/foo/foo` ? Or am I interpreting this incorrectly?
2024-09-12T16:31:02.777Z
<Patrick Donnelly> You're layering mounts in the linux vfs
2024-09-12T16:31:07.376Z
<Patrick Donnelly> has nothing to do with cephfs
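
A toy model of the mount layering described above, as a sketch only: it assumes both ceph-fuse invocations attach the root of the same file system (neither command in the session names a different root), and that path lookup picks the longest matching mount point. The `FS` and `Namespace` classes are hypothetical and model nothing beyond directory names.

```python
class FS:
    """A toy file system: a set of directory paths relative to its root."""

    def __init__(self):
        self.dirs = {""}  # "" stands for the root directory

    def mkdir(self, rel):
        self.dirs.add(rel.strip("/"))

    def children(self, rel):
        rel = rel.strip("/")
        prefix = rel + "/" if rel else ""
        return sorted(
            d[len(prefix):]
            for d in self.dirs
            if d != rel and d.startswith(prefix) and "/" not in d[len(prefix):]
        )


class Namespace:
    """A toy mount table: mount point -> file system mounted there."""

    def __init__(self):
        self.mounts = {}

    def mount(self, point, fs):
        self.mounts[point.rstrip("/")] = fs

    def resolve(self, path):
        # The longest matching mount point wins, so a mount on /mnt/foo
        # hides whatever /mnt/foo held in the file system beneath it.
        path = path.rstrip("/")
        best = max(
            (m for m in self.mounts if path == m or path.startswith(m + "/")),
            key=len,
        )
        return self.mounts[best], path[len(best):].strip("/")

    def mkdir(self, path):
        fs, rel = self.resolve(path)
        fs.mkdir(rel)

    def ls(self, path):
        fs, rel = self.resolve(path)
        return fs.children(rel)


cephfs = FS()
ns = Namespace()
ns.mount("/mnt", cephfs)      # first ceph-fuse: file system root at /mnt
ns.mkdir("/mnt/foo")          # creates "foo" in the file system root
ns.mount("/mnt/foo", cephfs)  # second ceph-fuse: same root again, at /mnt/foo

print(ns.ls("/mnt"))      # ['foo']
print(ns.ls("/mnt/foo"))  # ['foo'], i.e. the root's own "foo" seen as mnt/foo/foo
```

In this model nothing new is created by the second mount: `mnt/foo/foo` is the existing `foo` directory of the file system root, reached through the mount layered on `mnt/foo`.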
2024-09-12T16:35:56.936Z
<Igor Golikov> Hi, has anyone tried to use the Tunnelblick VPN client on Mac? I am following their instructions (for Sepia access) and it doesn't work 😞
2024-09-12T16:49:04.776Z
<Patrick Donnelly> not sure who else uses mac around here but isn't there an openvpn client for mac?
2024-09-12T17:08:28.881Z
<Dhairya Parmar> yea, was a bit curious how this works/happens.
2024-09-12T17:08:34.998Z
<Dhairya Parmar> anyway, thanks !
2024-09-12T17:57:33.396Z
<gregsfortytwo> I don’t use tunnelblick on my work laptop but I used to, and it’s still on my personal one. Which bit is broken, and have you gone through the tunnelblick docs? I remember it being fairly straightforward :/
2024-09-12T17:58:06.948Z
<gregsfortytwo> Main issue is stacking VPNs is a pain
2024-09-12T17:58:27.407Z
<gregsfortytwo> So if you’re running the IBM vpn as well, it’s definitely making your life harder :(