Our modperl servers with apache+modperl >= versions 1.3.28 and modperl 1.28, crash randomly with
[Tue Mar 9 18:02:17 2004] [notice] child pid 10940 exit signal Segmentation fault (11) I'd say on the order of 1 in 100 requests or less. But after an hour, the above lines collect in the error log, its the only exciting thing happening to the servers. The crash seems to happen somewhere after serving the page because the access.log for the crashing process shows a last good request (no 500 error or anything). The last request processed before death is random, there is no URL that triggers this segv. The age of the child before it segvs also seems to be random. I've tried rebuilding/reinstalling everything on both releases, and the error persists. Everything else works ok otherwise. Earlier versions of apache+modperl (running on earlier redhats) do not crash like this. The box(es) are recent hardware, redhat9, otherwise well behaved. For some reason I can no longer strace -p httpd children (get attach/detach and the child dies), although I can strace -p other demons. So I can't even follow a child in case it dies :( -- any reason for this, at least? How can I best debug this? perhaps capture the segv and do a stack trace or write a core file? it only happens in production, or if it happens in dev, then I need a production-like request rate anyway. thanks -justin -- Report problems: http://perl.apache.org/bugs/ Mail list info: http://perl.apache.org/maillist/modperl.html List etiquette: http://perl.apache.org/maillist/email-etiquette.html