Our modperl servers, running apache+modperl at versions >= 1.3.28
(apache) and 1.28 (modperl), crash randomly with:

[Tue Mar  9 18:02:17 2004] [notice] child pid 10940 exit signal Segmentation fault (11)

I'd say it happens on the order of 1 in 100 requests or fewer. Still,
after an hour a number of the above lines have collected in the error
log; it's the only exciting thing happening to the servers.
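
To put a rough number on that rate, the segfault notices can be counted
per hour straight from the error log. This is only a minimal sketch in
Python: it assumes every crash line has exactly the format shown above,
and the function name segv_per_hour is made up for illustration.

```python
import re
from collections import Counter

# Matches the notice format shown above, e.g.
# [Tue Mar  9 18:02:17 2004] [notice] child pid 10940 exit signal Segmentation fault (11)
# Assumption: all crash notices follow exactly this layout.
SEGV_RE = re.compile(
    r"^\[(\w{3}) (\w{3}) +(\d+) (\d{2}):\d{2}:\d{2} (\d{4})\] \[notice\] "
    r"child pid (\d+) exit signal Segmentation fault \(11\)"
)


def segv_per_hour(lines):
    """Count segfault notices per (year, month, day, hour) bucket."""
    counts = Counter()
    for line in lines:
        m = SEGV_RE.match(line)
        if m:
            _dow, month, day, hour, year, _pid = m.groups()
            counts[(year, month, int(day), int(hour))] += 1
    return counts
```

Feeding it an open error log (for example segv_per_hour(open(path)),
where path is wherever the error log lives) gives one counter entry per
hour, which can then be compared against the request count for the same
hour.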

The crash seems to happen somewhere after serving the page because the
access.log for the crashing process shows a last good request (no 500
error or anything). The last request processed before death is random;
there is no particular URL that triggers this segv. The age of the
child before it segvs also seems to be random.
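
For anyone wanting to repeat that check, here is a minimal Python sketch
of matching each crashed pid to its last logged request. It rests on
assumptions that are not part of the setup described here: that each
access-log line begins with the child's pid followed by a space (for
example via a LogFormat that starts with %P), and that pids are not
reused within the log being scanned. All function names are
hypothetical.

```python
import re

# Pulls the pid out of the segfault notice format shown in the error log.
SEGV_PID_RE = re.compile(
    r"\[notice\] child pid (\d+) exit signal Segmentation fault"
)


def crashed_pids(error_lines):
    """Return the pids (as strings) of children that died with a segfault."""
    pids = []
    for line in error_lines:
        m = SEGV_PID_RE.search(line)
        if m:
            pids.append(m.group(1))
    return pids


def last_request_by_pid(access_lines):
    """Map pid -> last access-log entry seen for that pid.

    Assumption: each access-log line starts with the pid and a space.
    """
    last = {}
    for line in access_lines:
        pid, _, rest = line.partition(" ")
        if pid.isdigit():
            last[pid] = rest.rstrip("\n")
    return last


def last_requests_before_crash(error_lines, access_lines):
    """For each crashed pid, look up the last request it logged (or None)."""
    last = last_request_by_pid(access_lines)
    return {pid: last.get(pid) for pid in crashed_pids(error_lines)}
```

Because it ignores timestamps, the sketch would give misleading results
if a pid is recycled by a later child; comparing the log times by hand
is still needed in that case.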

I've tried rebuilding/reinstalling everything on both releases, and the
error persists. Everything else works fine. Earlier versions of
apache+modperl (running on earlier Red Hat releases) do not crash like
this.

The boxes are recent hardware running Red Hat 9 and are otherwise well
behaved.

For some reason I can no longer strace -p httpd children (I get an
attach/detach and the child dies), although I can strace -p other
daemons. So I can't even follow a child in case it dies :( -- any reason
for this, at least?

How can I best debug this? Perhaps capture the segv and do a stack trace,
or write a core file? It only happens in production; if it can happen
in dev at all, I would need a production-like request rate to see it.

thanks

-justin


