Hi,

Am 30.11.23 um 09:05 schrieb Peter Krefting:

we are experiencing that the Apache httpd locks up, filling the error.log with errors after the nightly maintenance (not every night, though):

  [Wed Nov 29 00:00:01.922731 2023] [mpm_event:notice] [pid 62346:tid 139841215223680] AH00489: Apache/2.4.57 (Debian) mod_wsgi/4.9.4 Python/3.11 configured -- resuming normal operations   [Wed Nov 29 00:00:01.922790 2023] [core:notice] [pid 62346:tid 139841215223680] AH00094: Command line: '/usr/sbin/apache2'   [Wed Nov 29 00:00:03.924683 2023] [mpm_event:error] [pid 62346:tid 139841215223680] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.   [Wed Nov 29 00:00:04.925780 2023] [mpm_event:error] [pid 62346:tid 139841215223680] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.

(etc)

I set up a nightly job to mail me the output of "apachectl status", and this is the contents reported at Nov 29 00:00:00, i.e. just before the nightly maintenance:

Some processes are in "stopping" state but are not dying. They accumulate until the scoreboard is full.

First you should verify with ps if the processes with the PIDs in the "(old gen)" lines still exist. If not, it is a bug in apache itself (probably in mpm_event).

If they do, one needs to find out why they are not dying. This is likely the fault of some module. Third-party modules that do not come with apache itself are more likely to be the culprit. Since you are using mod_wsgi, that is a likely candidate. Look in the error log for the last log lines of the processes with those PIDs. Maybe that gives some hint. Or maybe mod_wsgi writes some logs itself, I am not familiar with that one.





                   Apache Server Status for localhost (via ::1)

    Server Version: Apache/2.4.57 (Debian) mod_wsgi/4.9.4 Python/3.11
    Server MPM: event
    Server Built: 2023-04-13T03:26:51
      __________________________________________________________________

    Current Time: Wednesday, 29-Nov-2023 00:00:01 CET
    Restart Time: Tuesday, 21-Nov-2023 08:51:55 CET
    Parent Server Config. Generation: 8
    Parent Server MPM Generation: 7
    Server uptime: 7 days 15 hours 8 minutes 6 seconds
    Server load: 0.01 0.02 0.00
    Total accesses: 34860 - Total Traffic: 717.8 MB - Total Duration:
           2289464

    CPU Usage: u297.08 s30.61 cu.01 cs.07 - .0497% CPU load
    .0529 requests/sec - 1141 B/second - 21.1 kB/request - 65.676
           ms/request

    2 requests currently being processed, 48 idle workers

    Slot PID Stopping Connections Threads Async connections
    total accepting busy idle writing keep-alive closing
    0 62348 yes (old gen) 0 no 0 0 0 0 0
    1 62350 yes (old gen) 0 no 0 0 0 0 0
    2 66497 yes (old gen) 0 no 0 0 0 0 0
    3 66498 yes (old gen) 0 no 0 0 0 0 0
    4 73089 yes (old gen) 0 no 0 0 0 0 0
    5 73090 yes (old gen) 0 no 0 0 0 0 0
    6 79644 yes (old gen) 0 no 0 0 0 0 0
    7 79645 yes (old gen) 0 no 0 0 0 0 0
    8 86126 yes (old gen) 0 no 0 0 0 0 0
    9 86127 yes (old gen) 0 no 0 0 0 0 0
    10 92669 yes (old gen) 0 no 0 0 0 0 0
    11 92670 yes (old gen) 0 no 0 0 0 0 0
    12 99203 yes (old gen) 0 no 0 0 0 0 0
    13 99204 yes (old gen) 0 no 0 0 0 0 0
    14 105761 no 0 yes 0 25 0 0 0
    15 105762 no 0 yes 2 23 0 0 0
    Sum 16 14 0   2 48 0 0 0

................................................................
................................................................
......................_____________________________________W____
____W___

    Scoreboard Key:
    "_" Waiting for Connection, "S" Starting up, "R" Reading Request,
    "W" Sending Reply, "K" Keepalive (read), "D" DNS Lookup,
    "C" Closing connection, "L" Logging, "G" Gracefully finishing,
    "I" Idle cleanup of worker, "." Open slot with no current process
'/usr/bin/lynx -dump http://localhost:80/server-status' failed.
Maybe you need to install a package providing www-browser or you
need to adjust the APACHE_LYNX variable in /etc/apache2/envvars



Cheers,
Stefan

Reply via email to