>-----Original Message----- >From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] [SNIP]
>cgid should _never_ exit without something in the error log. >That makes it >sound like a core problem, i.e. ap_process_child_status() or a >signal handler is >fubar, in addition to whatever made the cgi daemon die. > >But if that is in fact happening, I would trace syscalls & >signals for the cgid >process. (can't remember what the HPUX trace program is >called, but you want >something similar to truss/strace) To give a background, here's what I did : 1. Use a large timeout and keepalive timeout, and 100 threads / process. 2. Use HTTP/1.0 as the SPECweb99 client seems to have some problem with HTTP/1.1 (not much work done there) 3. Start SPECweb99 run nabled all the different dynamic tests - DYNAMIC_CONTENT, DYNAMIC_POST, DYNAMIC_CAD_GET, DYNAMIC_CGI_GET. - The run came back with a lot of "Can't connect" errors It's probably okay because some config was probably screwed up 4. The stipulated 20 min. warmup and the 20 min run happens The results are NOT posted even after 30 minutes 5. I get suspicious, and I try to do a simple GET to Apache - realized that Apache was hung. (telnet localhost 80... GET /foo etc stuff) 6. Attached gdb to each of the process - and found that a couple of processes were processing do_post (in mod_specweb99) and NO cgid process (YES - I backported Jeff's patch to restart cgid) !! I tried attaching tusc to the cgi daemon - but since the daemon dies at a random time, my log file was getting too full, and I had to just stop it. I tried resetting the log a couple of times - but then I got diverted and started thinking in a diffent angle : were the timeouts were too long, is the system was running out of sockets etc. I can reproduce the problem every single run of SPECweb99 (history: 2.0.43 ran just fine). I'll try to get the tusc for cgid when it dies - to see if it helps. BTW, one more thing I noticed : there's some problem with keeping the sockets alive for a long time. The SPECweb99 client logs a error on close socket (EBADF) when the keepalive times out. -Madhu