intermittent memberOf performance issues

Paul B. Henson Fri, 04 Feb 2022 13:51:33 -0800

I've run into another problem with the memberOf implementation on my 2.5servers. After I sorted out the proper configuration, queries requestingmemberOf were very performant:

Feb 4 13:26:44 ldap-01 slapd[1207]: conn=23393 op=1 SRCHbase="ou=user,dc=cpp,dc=edu" scope=2 deref=3filter="(&(objectClass=person)(calstateEduPersonEmplID=013522522))"

Feb  4 13:26:44 ldap-01 slapd[1207]: conn=23393 op=1 SRCH attr=memberOf

Feb 4 13:26:44 ldap-01 slapd[1207]: conn=23393 op=1 SEARCH RESULTtag=101 err=0 qtime=0.000015 etime=0.191860 nentries=1 text=

However, intermittently the server gets into a state where the exactsame query takes over 30 seconds:

Feb 4 08:05:11 ldap-01 slapd[1425456]: conn=40797 op=1 SRCHbase="ou=user,dc=cpp,dc=edu" scope=2 deref=3filter="(&(objectClass=person)(calstateEduPersonEmplID=015559557))"

Feb  4 08:05:11 ldap-01 slapd[1425456]: conn=40797 op=1 SRCH attr=memberOf

Feb 4 08:05:50 ldap-01 slapd[1425456]: conn=40797 op=1 SEARCH RESULTtag=101 err=0 qtime=0.000019 etime=39.435523 nentries=1 text=

When this occurs, the only way to resolve the issue that I have found isto reboot the server. Simply restarting slapd results in the samedegraded performance on these queries.

Normally there is very low read I/O load on the servers duringoperation, probably averaging less than 1M/s, peaking up to maybe20-30M/s for just an instant occasionally. When the memberOf queryperformance is degraded, there is a very high read I/O load on theserver, continuously about 200-300M/s.

Any thoughts on this? It seems like for some reason the server gets intoa state where it is not using the cache or memory map for doing thesearch required to construct the memberOf results? But instead is doinga full disk read of the entire database?

It's also weird that restarting the service is not resolve this, butrebooting the server does. I'm not intimately familiar with theinternals of lmdb, is there some state that persists with theenvironment or memory map in between service runs that is only clearedby a reboot?

I initially thought I might have had a theory on it, relating to anunrelated bug in RHEL 8.5 that broke the "needs-rebooting" commandresulting in servers not properly rebooting after kernel/libraryupdates. The most recent occurrence of this issue started up after suchan update without the required reboot, but upon reviewing historicoccurrences it has occurred at times that don't meet that criteria, so Ifind myself clueless again as to what's going on.

Any advice on how to fix or do further debugging on this issue muchappreciated, thanks…

intermittent memberOf performance issues

Reply via email to