Hi Eric,
maybe your observation is in fact related to Henri's, since you also get
core dumps (segmentation faults), which is quite unusual. See further
comments inline.
Eric Wertman wrote:
Sorry to jump in, I'm new here. I started watching this list because of
a problem I'm having with the mod_jk 1.2.20 as well. I'm not getting
core files, but I do have problems that I can't reproduce at log level
trace or debug.
Apache logs segmentation faults. You might be able to produce a core
(which would be very helpful) by setting the coredumpdirectory in apache
and maybe tweaking your os config. One point that might help to produce
cores, would be to not run apache as root.
I'm running it on AIX 5.3 (ml05). I compiled the apache 2.2.4 and
apr-1.2.8 along with the mod_jk, and using tomcat 5.5.20 with an IBM 1.5
JRE.
I compiled them all using the IBM cc_r compiler. I've actually tried a
number of different flags trying to resolve my problem, assuming
initially that I was doing something incorrectly. The behavior
persists, though, and my compilations have all been clean. Apache
without mod_jk doesn't give me any problems.
Superficially it seems to work fine. Once I run tests with ab, it
starts to get a little ugly. I get a number of failed requests, and
these types of errors:
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c
(1504): Unknown AJP protocol code: 41
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c (970):
wrong message format 0x3837 from 127.0.0.1:8010
hose mean, taht something is fundamentally wrong in the answer mopd_jk
receives from tomcat. It doesn't really look like AJP/1.3. Is your MPM
worker or prefork? Could you try again with prefork, in case it is worker?
Do you get error message on the tomcat side?
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c
(1566): (PlatformServer) Tomcat is down or refused connection. No
response has been sent to the client (yet)
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1877): (PlatformServer) receiving from tomcat failed, recoverable
operation attempt=0
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1916): (PlatformServer) sending request to tomcat failed, recoverable
operation attempt=1
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1842): (PlatformServer) request failed, because of server error without
recovery in send loop attempt=0
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] mod_jk.c (2142):
Service error=-5 for worker=PlatformServer
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] mod_jk.c (401): Write
without start, starting with defaults
Ths one is very unusual, and it is the log statement, that is the only
one, that could have produced Henri's core dump, although we still do
not know why. It correlates to a protocol error.
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c (970):
wrong message format 0x031a from 127.0.0.1:8010
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c
(1592): (PlatformServer) Tomcat is down or network problems. Part of the
response has already been sent to the cli
ent
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1877): (PlatformServer) receiving from tomcat failed, recoverable
operation attempt=1
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c
(1504): Unknown AJP protocol code: D8
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1842): (PlatformServer) request failed, because of server error without
recovery in send loop attempt=0
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] mod_jk.c (2142):
Service error=-5 for worker=PlatformServer
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1916): (PlatformServer) sending request to tomcat failed, recoverable
operation attempt=2
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c (915):
sendfull returned -14 with errno=14
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c (948):
(PlatformServer) can't receive the response message from tomcat, network
problems or tomcat is down (127.0.
0.1:8010), err=-9
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c
(1566): (PlatformServer) Tomcat is down or refused connection. No
response has been sent to the client (yet)
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1877): (PlatformServer) receiving from tomcat failed, recoverable
operation attempt=0
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c
(1292): (PlatformServer) failed resending request body (812760852)
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1916): (PlatformServer) sending request to tomcat failed, recoverable
operation attempt=3
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1916): (PlatformServer) sending request to tomcat failed, recoverable
operation attempt=1
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c (915):
sendfull returned -9 with errno=9
[Tue Jan 30 03:03:53 2007] [2203720:0000] [info] jk_ajp_common.c
(1201): (PlatformServer) error sending request. Will try another pooled
connection
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c (915):
sendfull returned -14 with errno=14
[Tue Jan 30 03:03:53 2007] [2203720:0000] [error] jk_ajp_common.c (948):
(PlatformServer) can't receive the response message from tomcat, network
problems or tomcat is down (127.0.
And these in the apache error_log:
[Tue Jan 30 03:02:28 2007] [info] Shared memory session cache initialised
[Tue Jan 30 03:02:28 2007] [info] Init: Initializing (virtual) servers
for SSL
[Tue Jan 30 03:02:28 2007] [info] Server: Apache/2.2.4, Interface:
mod_ssl/2.2.4, Library: OpenSSL/0.9.8d
[Tue Jan 30 03:02:28 2007] [notice] Apache/2.2.4 (Unix) mod_jk/1.2.20
mod_ssl/2.2.4 OpenSSL/0.9.8d configured -- resuming normal operations
[Tue Jan 30 03:02:28 2007] [info] Server built: Jan 29 2007 23:44:30
[Tue Jan 30 03:02:28 2007] [debug] worker.c(1740): AcceptMutex: sysvsem
(default: sysvsem)
[Tue Jan 30 03:03:54 2007] [notice] child pid 2203720 exit signal
Segmentation fault (11)
[Tue Jan 30 03:04:00 2007] [debug] worker.c(1083): the listener thread
didn't exit
[Tue Jan 30 03:04:01 2007] [debug] worker.c(1083): the listener thread
didn't exit
[Tue Jan 30 03:04:13 2007] [debug] worker.c(1354): taking over
scoreboard slot from 1552520 (quiescing)
[Tue Jan 30 03:04:19 2007] [notice] child pid 2576392 exit signal
Segmentation fault (11)
[Tue Jan 30 03:04:23 2007] [notice] child pid 2310294 exit signal
Segmentation fault (11)
[Tue Jan 30 03:04:26 2007] [notice] child pid 2576394 exit signal
Segmentation fault (11)
[Tue Jan 30 03:29:30 2007] [notice] child pid 1552524 exit signal
Segmentation fault (11)
[Tue Jan 30 04:09:22 2007] [notice] child pid 2584796 exit signal
Segmentation fault (11)
Again, I thought I would be able to track them down by changing the log
level to "trace", but the problem disappears at that log level. I had
assumed it was slowing the connection rate down enough to mitigate the
problem. Not sure if any of that helps.. if you can help me or want
more info let me know.
With JkLogLevel debug (which logs more than info but less than trace) we
would see, which packets you receive from tomcat. Maybe debug still
shows the problem?
Could you please post your config (Jk* from apache, workers.roperties
and Connector from tomcat).
Thanks!
Eric
Regards,
Rainer
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]