Hi,

Has nobody else run into this ganesha failure yet?

I've run some additional configuration tests and found that ganesha.nfsd only crashes when the cluster is set up with "--ingress-mode haproxy-protocol". Ganesha.nfsd is stable with "--ingress-mode keepalive-only" and "--ingress-mode haproxy-standard".

The crashes with "--ingress-mode haproxy-protocol" occur at random times on the servers, even when no client has mounted the NFS export.
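For anyone not familiar with what this mode changes on the wire: with haproxy-protocol, haproxy prepends a PROXY protocol header to each TCP connection it opens to ganesha. The sketch below builds and parses the binary (v2) form of that header as described in the public PROXY protocol specification. It is only an illustration, not the code used by ntirpc, and the function names build_v2_header and parse_v2_header are made up for the example. The "rest_len" field is presumably what the "proxy header rest len failed" log message refers to, and the LOCAL command (typically used for haproxy health checks) is presumably what "proxy ignored for local" refers to; both are my reading of the log text, not something confirmed from the ganesha sources.

```python
import socket
import struct

# 12-byte signature that starts every PROXY protocol v2 header.
SIGNATURE = b"\r\n\r\n\x00\r\nQUIT\n"


def build_v2_header(src=None, dst=None, local=False):
    """Build a v2 header (hypothetical helper, for illustration only).

    local=True produces a LOCAL header with no address block.
    Otherwise src and dst are (ipv4_string, port) tuples for TCP over IPv4.
    """
    if local:
        # 0x20 = version 2, command LOCAL; family/protocol unspecified; length 0.
        return SIGNATURE + struct.pack("!BBH", 0x20, 0x00, 0)
    addresses = (
        socket.inet_aton(src[0])
        + socket.inet_aton(dst[0])
        + struct.pack("!HH", src[1], dst[1])
    )
    # 0x21 = version 2, command PROXY; 0x11 = AF_INET + STREAM (TCP over IPv4).
    return SIGNATURE + struct.pack("!BBH", 0x21, 0x11, len(addresses)) + addresses


def parse_v2_header(data):
    """Parse a v2 header and return its fields as a dict."""
    if len(data) < 16 or data[:12] != SIGNATURE:
        raise ValueError("not a PROXY protocol v2 header")
    ver_cmd, fam_proto, rest_len = struct.unpack("!BBH", data[12:16])
    if ver_cmd >> 4 != 2:
        raise ValueError("unsupported PROXY protocol version")
    rest = data[16:16 + rest_len]
    if len(rest) < rest_len:
        # The fixed part announced more bytes than were received.
        raise ValueError("truncated header: expected %d more bytes, got %d"
                         % (rest_len, len(rest)))
    command = "LOCAL" if (ver_cmd & 0x0F) == 0 else "PROXY"
    result = {"command": command, "rest_len": rest_len}
    if command == "PROXY" and fam_proto == 0x11:
        if rest_len < 12:
            raise ValueError("address block too short for TCP over IPv4")
        src_ip, dst_ip, src_port, dst_port = struct.unpack("!4s4sHH", rest[:12])
        result["src"] = (socket.inet_ntoa(src_ip), src_port)
        result["dst"] = (socket.inet_ntoa(dst_ip), dst_port)
    return result


if __name__ == "__main__":
    # Example addresses are placeholders from the documentation range.
    header = build_v2_header(("192.0.2.10", 700), ("192.0.2.20", 2049))
    print(parse_v2_header(header))
    print(parse_v2_header(build_v2_header(local=True)))
```

A receiver has to read the 16 fixed bytes first, then exactly rest_len more bytes, before any RPC data arrives, so a short or failed read at that second step is a separate error path from normal NFS traffic.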

Patrick

On 21/09/2025 at 17:27, Patrick Bégou wrote:
Hi,

Just to provide some additional information: it is the ganesha.nfsd process that dumps core:

sept. 19 17:06:12 whitaker02-ceph ceph-aa64f278-3ba8-11f0-b327-303ea701bc10-nfs-whitaker-nfs-4-0-whitaker02-ceph-wjynvn[85697]: 19/09/2025 15:06:12 : epoch 68cd5fb6 : whitaker02-ceph : ganesha.nfsd-2[svc_8] rpc :TIRPC :EVENT :svc_vc_recv: 0x7f99c400a6b0 fd 50 proxy header rest len failed header rlen = % (will set dead)
sept. 19 17:06:12 whitaker02-ceph ceph-aa64f278-3ba8-11f0-b327-303ea701bc10-nfs-whitaker-nfs-4-0-whitaker02-ceph-wjynvn[85697]: 19/09/2025 15:06:12 : epoch 68cd5fb6 : whitaker02-ceph : ganesha.nfsd-2[reaper] nfs_try_lift_grace :STATE :EVENT :check grace:reclaim complete(0) clid count(0)
sept. 19 17:06:12 whitaker02-ceph ceph-aa64f278-3ba8-11f0-b327-303ea701bc10-nfs-whitaker-nfs-4-0-whitaker02-ceph-wjynvn[85697]: 19/09/2025 15:06:12 : epoch 68cd5fb6 : whitaker02-ceph : ganesha.nfsd-2[svc_10] rpc :TIRPC :EVENT :svc_vc_recv: 0x7f99b0004060 fd 50 proxy ignored for local
sept. 19 17:06:13 whitaker02-ceph systemd-coredump[120951]: Process 85701 (ganesha.nfsd) of user 0 dumped core.

    Stack trace of thread 77:
    #0 0x00007f9a939ad32e n/a (/usr/lib64/libntirpc.so.5.8 + 0x2232e)
    ELF object binary architecture: AMD x86-64
sept. 19 17:06:14 whitaker02-ceph podman[120958]: 2025-09-19 17:06:14.123651716 +0200 CEST m=+0.029310490 container died 5cb71f9d8b8986292dd8916ad0f610b9c12f5c08a1de505cd6f7a11137446263 (image=quay.io/ceph/ceph@sha256:8214ebff6133ac27d20659038df6962dbf9d77da21c9438a296b2e2059a56af6, name=ceph-aa64f278-3ba8-11f0-b327-303ea701bc10-nfs-whitaker-nfs-4-0-whitaker02-ceph-wjynvn, org.opencontainers.image.authors=Ceph Release Team <[email protected]>, CEPH_SHA1=0eceb0defba60152a8182f7bd87d164b639885b8, org.label-schema.name=CentOS Stream 9 Base Image, OSD_FLAVOR=default, org.label-schema.build-date=20250303, io.buildah.version=1.39.3, CEPH_GIT_REPO=https://github.com/ceph/ceph.git, CEPH_REF=squid, FROM_IMAGE=quay.io/centos/centos:stream9, org.opencontainers.image.documentation=https://docs.ceph.com/, org.label-schema.license=GPLv2, GANESHA_REPO_BASEURL=https://buildlogs.centos.org/centos/$releasever-stream/storage/$basearch/nfsganesha-5/, org.label-schema.vendor=CentOS, org.label-schema.schema-version=1.0)
sept. 19 17:06:14 whitaker02-ceph podman[120958]: 2025-09-19 17:06:14.139752858 +0200 CEST m=+0.045411632 container remove 5cb71f9d8b8986292dd8916ad0f610b9c12f5c08a1de505cd6f7a11137446263 (image=quay.io/ceph/ceph@sha256:8214ebff6133ac27d20659038df6962dbf9d77da21c9438a296b2e2059a56af6, name=ceph-aa64f278-3ba8-11f0-b327-303ea701bc10-nfs-whitaker-nfs-4-0-whitaker02-ceph-wjynvn, FROM_IMAGE=quay.io/centos/centos:stream9, org.label-schema.vendor=CentOS, CEPH_SHA1=0eceb0defba60152a8182f7bd87d164b639885b8, CEPH_REF=squid, org.opencontainers.image.documentation=https://docs.ceph.com/, org.label-schema.name=CentOS Stream 9 Base Image, org.label-schema.license=GPLv2, CEPH_GIT_REPO=https://github.com/ceph/ceph.git, GANESHA_REPO_BASEURL=https://buildlogs.centos.org/centos/$releasever-stream/storage/$basearch/nfsganesha-5/, OSD_FLAVOR=default, org.label-schema.build-date=20250303, org.opencontainers.image.authors=Ceph Release Team <[email protected]>, io.buildah.version=1.39.3, org.label-schema.schema-version=1.0)
sept. 19 17:06:14 whitaker02-ceph systemd[1]: ceph-aa64f278-3ba8-11f0-b327-303ea701bc10@nfs.whitaker-nfs.4.0.whitaker02-ceph.wjynvn.service: Main process exited, code=exited, status=139/n/a
sept. 19 17:06:14 whitaker02-ceph systemd[1]: ceph-aa64f278-3ba8-11f0-b327-303ea701bc10@nfs.whitaker-nfs.4.0.whitaker02-ceph.wjynvn.service: Failed with result 'exit-code'.
sept. 19 17:06:14 whitaker02-ceph systemd[1]: ceph-aa64f278-3ba8-11f0-b327-303ea701bc10@nfs.whitaker-nfs.4.0.whitaker02-ceph.wjynvn.service: Consumed 3.485s CPU time.
sept. 19 17:06:24 whitaker02-ceph systemd[1]: ceph-aa64f278-3ba8-11f0-b327-303ea701bc10@nfs.whitaker-nfs.4.0.whitaker02-ceph.wjynvn.service: Scheduled restart job, restart counter is at 8.

Patrick

_______________________________________________
ceph-users mailing list -- [email protected]
To unsubscribe send an email to [email protected]
