Issues:
We've been experiencing some issues with our IIS --> Connector --> Tomcat
setups,
primarily intermittent 'white pages' where the HTTP response is blank, and
also
this morning an instance where I was forced to restart IIS / IIS admin
service
in order to get the connection to Tomcat working again, that instance showed
connector log messages of: factory for lb failed for wlb.
Additionally I'm seeing both connector and Tomcat log messages and stack
traces
that don't seem to manifest a visible problem but are concerning to me.
This email is structured with a layout of our system setup, followed by
examples
of the log messages of concern, followed by our configuration files. Any
assistance
in understanding what these log messages mean or any pointers on how to fix
them
would be most appreciated.
Thanks for your time,
Sean Overby
Solvepoint Corporation
System Setup:
- Front End:
Two IIS
- Web Server 1, IIS 6.0 on Windows 2003, Connector version 1.2.19
- Web Server 2, IIS 5.x on Windows 2000, Connector version 1.2.19
Web Server 1 and 2 sit inside a DMZ, firewall between both IIS and all
Tomcat Servers.
- Back End:
Two Tomcat Servers (in different network segments)
- Tomcat 1, Tomcat 5.5 on Windows 2003.
- Tomcat 2, Tomcat 5.5 on Windows 2003.
Connector Log Messages
----------------------
-- These message corresponded with a complete failure of all servlet
requests with a
-- 'Page Not Found' error, restarted IIS admin service / IIS website to fix.
[Fri Oct 13 08:04:41 2006] [3620:2044] [error] jk_worker.c (146): factory
for lb failed for wlb
[Fri Oct 13 08:04:41 2006] [3620:2044] [error] jk_worker.c (256): failed to
create worker wlb
[Fri Oct 13 08:22:51 2006] [2812:2300] [error] jk_worker.c (146): factory
for lb failed for wlb
[Fri Oct 13 08:22:51 2006] [2812:2300] [error] jk_worker.c (256): failed to
create worker wlb
[Fri Oct 13 08:28:50 2006] [2348:2164] [error] jk_worker.c (146): factory
for lb failed for wlb
[Fri Oct 13 08:28:50 2006] [2348:2164] [error] jk_worker.c (256): failed to
create worker wlb
-- These messages happen intermittently, and do not *seem* to cause any
visible issues
[Fri Oct 13 07:40:44 2006] [0332:2032] [error] jk_ajp_common.c (947):
(wlweb1) can't receive the response message from tomcat, network problems or
tomcat is down (192.168.1.215:8011), err=-54
[Fri Oct 13 07:40:44 2006] [0332:2032] [error] jk_ajp_common.c (1536):
(wlweb1) Tomcat is down or refused connection. No response has been sent to
the client (yet)
[Fri Oct 13 07:40:44 2006] [0332:1796] [error] jk_ajp_common.c (947):
(wlweb1) can't receive the response message from tomcat, network problems or
tomcat is down (192.168.1.215:8011), err=-54
[Fri Oct 13 07:40:44 2006] [0332:1796] [error] jk_ajp_common.c (1536):
(wlweb1) Tomcat is down or refused connection. No response has been sent to
the client (yet)
[Fri Oct 13 07:40:45 2006] [0332:2032] [error] jk_ajp_common.c (1879):
(wlweb1) Connecting to tomcat failed. Tomcat is probably not started or is
listening on the wrong port
[Fri Oct 13 07:40:45 2006] [0332:1796] [error] jk_ajp_common.c (1879):
(wlweb1) Connecting to tomcat failed. Tomcat is probably not started or is
listening on the wrong port
[Fri Oct 13 07:41:44 2006] [0332:2220] [error] jk_ajp_common.c (947):
(wlweb2) can't receive the response message from tomcat, network problems or
tomcat is down (192.168.1.195:8012), err=-54
[Fri Oct 13 07:41:44 2006] [0332:2220] [error] jk_ajp_common.c (1536):
(wlweb2) Tomcat is down or refused connection. No response has been sent to
the client (yet)
[Fri Oct 13 07:41:45 2006] [0332:2220] [error] jk_ajp_common.c (1879):
(wlweb2) Connecting to tomcat failed. Tomcat is probably not started or is
listening on the wrong port
Tomcat stdout.log messages:
---------------------------
- Interesting thing about this message is that the Tomcat 2 server gets
these
- messages at a rate 50 times higher then on Tomcat 1 server, Tomcat 2 is
also
- on a different LAN segment then Tomcat 1.
java.lang.ArrayIndexOutOfBoundsException: 8192
at org.apache.jk.common.MsgAjp.appendByte(MsgAjp.java:105)
at org.apache.jk.common.MsgAjp.appendByteChunk(MsgAjp.java:147)
at org.apache.jk.common.MsgAjp.appendBytes(MsgAjp.java:132)
at org.apache.jk.common.JkInputStream.appendHead(JkInputStream.java:302)
at org.apache.jk.core.MsgContext.action(MsgContext.java:258)
at org.apache.coyote.Response.action(Response.java:182)
at org.apache.coyote.Response.sendHeaders(Response.java:374)
at
org.apache.catalina.connector.OutputBuffer.doFlush(OutputBuffer.java:317)
at org.apache.catalina.connector.OutputBuffer.close(OutputBuffer.java:278)
at org.apache.catalina.connector.Response.finishResponse(Response.java:483)
at
org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:151)
at org.apache.jk.server.JkCoyoteHandler.invoke(JkCoyoteHandler.java:199)
at org.apache.jk.common.HandlerRequest.invoke(HandlerRequest.java:282)
at org.apache.jk.common.ChannelSocket.invoke(ChannelSocket.java:754)
at
org.apache.jk.common.ChannelSocket.processConnection(ChannelSocket.java:684)
at
org.apache.jk.common.ChannelSocket$SocketConnection.runIt(ChannelSocket.java
:876)
at
org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.jav
a:684)
at java.lang.Thread.run(Unknown Source)
Web Server 1, worker.properties:
--------------------------------
worker.list=wlb,wlx,jkstatus
worker.wlweb1.type=ajp13
worker.wlweb1.host=192.168.1.215
worker.wlweb1.port=8011
worker.wlweb1.socket_keepalive=true
worker.wlweb1.connection_pool_size=600
worker.wlweb1.lbfactor=1
worker.wlweb2.type=ajp13
worker.wlweb2.host=192.168.1.195
worker.wlweb2.port=8012
worker.wlweb2.socket_keepalive=true
worker.wlweb2.connection_pool_size=600
worker.wlweb2.connection_pool_timeout=20
worker.wlweb2.lbfactor=1
worker.wlxml.type=ajp13
worker.wlxml.host=192.168.1.213
worker.wlxml.port=8009
worker.wlxml.socket_keepalive=true
worker.wlxml.connection_pool_size=600
worker.wlxml.connection_pool_timeout=20
worker.wlxml.lbfactor=1
# Defining a load balancer
worker.wlb.type=lb
worker.wlb.balance_workers=wlweb1,wlweb2
worker.wlx.type=lb
worker.wlx.balance_workers=wlxml
worker.jkstatus.type=status
Web Server 1, uriworkermap.properties:
--------------------------------------
# uriworkermap.properties - IIS
/admin/*=wlb
/manager/*=wlb
/jsp-examples/*=wlb
/servlets-examples/*=wlb
/v4/*=wlb
#/r2u/*=wlb
/dhweb/*=wlx
!/servlets-examples/*.jpeg=wlb
Web Server 2, worker.properties:
--------------------------------
worker.list=wlb,jkstatus
worker.wlweb1.type=ajp13
worker.wlweb1.host=192.168.1.215
worker.wlweb1.port=8011
worker.wlweb1.socket_keepalive=true
worker.wlweb1.connection_pool_size=600
worker.wlweb1.lbfactor=1
worker.wlweb2.type=ajp13
worker.wlweb2.host=192.168.1.195
worker.wlweb2.port=8009
worker.wlweb2.socket_keepalive=true
worker.wlweb2.connection_pool_size=600
worker.wlweb2.lbfactor=1
worker.wlb.type=lb
worker.wlb.balance_workers=wlweb1,wlweb2
worker.jkstatus.type=status
Web Server 2, uriworkermap.properties:
--------------------------------------
/admin/*=wlb
/manager/*=wlb
/jsp-examples/*=wlb
/servlets-examples/*=wlb
/r2u/*=wlb
!/servlets-examples/*.jpeg=wlb
Tomcat Server 1 & 2, server.xml:
-------------------------------
- Note with the exception of the line directly below the server.xml is a
- 'vanilla' copy of the install server.xml, the entire server.xml file
- is included below however.
Server 1:
---------
<Connector port="8011" minProcessors="600" maxProcessors="600"
backlog="300" acceptCount="300"
enableLookups="false" redirectPort="8443" protocol="AJP/1.3"
/>
Server 2:
---------
<Connector port="8012" minProcessors="600" maxProcessors="600"
backlog="300" acceptCount="300"
enableLookups="false" redirectPort="8443" protocol="AJP/1.3"
/>
- Complete server.xml:
======================
<!-- Example Server Configuration File -->
<!-- Note that component elements are nested corresponding to their
parent-child relationships with each other -->
<!-- A "Server" is a singleton element that represents the entire JVM,
which may contain one or more "Service" instances. The Server
listens for a shutdown command on the indicated port.
Note: A "Server" is not itself a "Container", so you may not
define subcomponents such as "Valves" or "Loggers" at this level.
-->
<Server port="8005" shutdown="SHUTDOWN">
<!-- Comment these entries out to disable JMX MBeans support used for the
administration web application -->
<Listener className="org.apache.catalina.core.AprLifecycleListener" />
<Listener className="org.apache.catalina.mbeans.ServerLifecycleListener"
/>
<Listener
className="org.apache.catalina.mbeans.GlobalResourcesLifecycleListener" />
<Listener
className="org.apache.catalina.storeconfig.StoreConfigLifecycleListener"/>
<!-- Global JNDI resources -->
<GlobalNamingResources>
<!-- Test entry for demonstration purposes -->
<Environment name="simpleValue" type="java.lang.Integer" value="30"/>
<!-- Editable user database that can also be used by
UserDatabaseRealm to authenticate users -->
<Resource name="UserDatabase" auth="Container"
type="org.apache.catalina.UserDatabase"
description="User database that can be updated and saved"
factory="org.apache.catalina.users.MemoryUserDatabaseFactory"
pathname="conf/tomcat-users.xml" />
</GlobalNamingResources>
<!-- A "Service" is a collection of one or more "Connectors" that share
a single "Container" (and therefore the web applications visible
within that Container). Normally, that Container is an "Engine",
but this is not required.
Note: A "Service" is not itself a "Container", so you may not
define subcomponents such as "Valves" or "Loggers" at this level.
-->
<!-- Define the Tomcat Stand-Alone Service -->
<Service name="Catalina">
<!-- A "Connector" represents an endpoint by which requests are received
and responses are returned. Each Connector passes requests on to
the
associated "Container" (normally an Engine) for processing.
By default, a non-SSL HTTP/1.1 Connector is established on port
8080.
You can also enable an SSL HTTP/1.1 Connector on port 8443 by
following the instructions below and uncommenting the second
Connector
entry. SSL support requires the following steps (see the SSL
Config
HOWTO in the Tomcat 5 documentation bundle for more detailed
instructions):
* If your JDK version 1.3 or prior, download and install JSSE 1.0.2
or
later, and put the JAR files into "$JAVA_HOME/jre/lib/ext".
* Execute:
%JAVA_HOME%\bin\keytool -genkey -alias tomcat -keyalg RSA
(Windows)
$JAVA_HOME/bin/keytool -genkey -alias tomcat -keyalg RSA
(Unix)
with a password value of "changeit" for both the certificate and
the keystore itself.
By default, DNS lookups are enabled when a web application calls
request.getRemoteHost(). This can have an adverse impact on
performance, so you can disable it by setting the
"enableLookups" attribute to "false". When DNS lookups are
disabled,
request.getRemoteHost() will return the String version of the
IP address of the remote client.
-->
<!-- Define a non-SSL HTTP/1.1 Connector on port 8080 -->
<Connector
port="8080" maxHttpHeaderSize="8192"
maxThreads="150" minSpareThreads="25" maxSpareThreads="75"
enableLookups="false" redirectPort="8443" acceptCount="100"
connectionTimeout="20000" disableUploadTimeout="true" />
<!-- Note : To disable connection timeouts, set connectionTimeout value
to 0 -->
<!-- Note : To use gzip compression you could set the following properties
:
compression="on"
compressionMinSize="2048"
noCompressionUserAgents="gozilla, traviata"
compressableMimeType="text/html,text/xml"
-->
<!-- Define a SSL HTTP/1.1 Connector on port 8443 -->
<!--
<Connector port="8443" maxHttpHeaderSize="8192"
maxThreads="150" minSpareThreads="25" maxSpareThreads="75"
enableLookups="false" disableUploadTimeout="true"
acceptCount="100" scheme="https" secure="true"
clientAuth="false" sslProtocol="TLS" />
-->
<!-- Define an AJP 1.3 Connector on port 8009 is default -->
<!-- ADDED the following parms to this minProcessors="25"
maxProcessors="225" acceptCount="150" - sao -->
<Connector port="8011" minProcessors="600" maxProcessors="600"
backlog="300" acceptCount="300"
enableLookups="false" redirectPort="8443" protocol="AJP/1.3"
/>
<!-- Define a Proxied HTTP/1.1 Connector on port 8082 -->
<!-- See proxy documentation for more information about using this. -->
<!--
<Connector port="8082"
maxThreads="150" minSpareThreads="25" maxSpareThreads="75"
enableLookups="false" acceptCount="100"
connectionTimeout="20000"
proxyPort="80" disableUploadTimeout="true" />
-->
<!-- An Engine represents the entry point (within Catalina) that
processes
every request. The Engine implementation for Tomcat stand alone
analyzes the HTTP headers included with the request, and passes
them
on to the appropriate Host (virtual host). -->
<!-- You should set jvmRoute to support load-balancing via AJP ie :
<Engine name="Standalone" defaultHost="localhost" jvmRoute="jvm1">
-->
<!-- Define the top level container in our container hierarchy -->
<Engine name="Catalina" defaultHost="localhost">
<!-- The request dumper valve dumps useful debugging information about
the request headers and cookies that were received, and the
response
headers and cookies that were sent, for all requests received by
this instance of Tomcat. If you care only about requests to a
particular virtual host, or a particular application, nest this
element inside the corresponding <Host> or <Context> entry
instead.
For a similar mechanism that is portable to all Servlet 2.4
containers, check out the "RequestDumperFilter" Filter in the
example application (the source for this filter may be found in
"$CATALINA_HOME/webapps/examples/WEB-INF/classes/filters").
Request dumping is disabled by default. Uncomment the following
element to enable it. -->
<!--
<Valve className="org.apache.catalina.valves.RequestDumperValve"/>
-->
<!-- Because this Realm is here, an instance will be shared globally
-->
<!-- This Realm uses the UserDatabase configured in the global JNDI
resources under the key "UserDatabase". Any edits
that are performed against this UserDatabase are immediately
available for use by the Realm. -->
<Realm className="org.apache.catalina.realm.UserDatabaseRealm"
resourceName="UserDatabase"/>
<!-- Comment out the old realm but leave here for now in case we
need to go back quickly -->
<!--
<Realm className="org.apache.catalina.realm.MemoryRealm" />
-->
<!-- Replace the above Realm with one of the following to get a Realm
stored in a database and accessed via JDBC -->
<!--
<Realm className="org.apache.catalina.realm.JDBCRealm"
driverName="org.gjt.mm.mysql.Driver"
connectionURL="jdbc:mysql://localhost/authority"
connectionName="test" connectionPassword="test"
userTable="users" userNameCol="user_name"
userCredCol="user_pass"
userRoleTable="user_roles" roleNameCol="role_name" />
-->
<!--
<Realm className="org.apache.catalina.realm.JDBCRealm"
driverName="oracle.jdbc.driver.OracleDriver"
connectionURL="jdbc:oracle:thin:@ntserver:1521:ORCL"
connectionName="scott" connectionPassword="tiger"
userTable="users" userNameCol="user_name"
userCredCol="user_pass"
userRoleTable="user_roles" roleNameCol="role_name" />
-->
<!--
<Realm className="org.apache.catalina.realm.JDBCRealm"
driverName="sun.jdbc.odbc.JdbcOdbcDriver"
connectionURL="jdbc:odbc:CATALINA"
userTable="users" userNameCol="user_name"
userCredCol="user_pass"
userRoleTable="user_roles" roleNameCol="role_name" />
-->
<!-- Define the default virtual host
Note: XML Schema validation will not work with Xerces 2.2.
-->
<Host name="localhost" appBase="webapps"
unpackWARs="true" autoDeploy="true"
xmlValidation="false" xmlNamespaceAware="false">
<!-- Defines a cluster for this node,
By defining this element, means that every manager will be
changed.
So when running a cluster, only make sure that you have webapps
in there
that need to be clustered and remove the other ones.
A cluster has the following parameters:
className = the fully qualified name of the cluster class
name = a descriptive name for your cluster, can be anything
mcastAddr = the multicast address, has to be the same for all
the nodes
mcastPort = the multicast port, has to be the same for all the
nodes
mcastBindAddr = bind the multicast socket to a specific address
mcastTTL = the multicast TTL if you want to limit your
broadcast
mcastSoTimeout = the multicast readtimeout
mcastFrequency = the number of milliseconds in between sending
a "I'm alive" heartbeat
mcastDropTime = the number a milliseconds before a node is
considered "dead" if no heartbeat is received
tcpThreadCount = the number of threads to handle incoming
replication requests, optimal would be the same amount of threads as nodes
tcpListenAddress = the listen address (bind address) for TCP
cluster request on this host,
in case of multiple ethernet cards.
auto means that address becomes
InetAddress.getLocalHost().getHostAddress()
tcpListenPort = the tcp listen port
tcpSelectorTimeout = the timeout (ms) for the Selector.select()
method in case the OS
has a wakup bug in java.nio. Set to 0 for
no timeout
printToScreen = true means that managers will also print to
std.out
expireSessionsOnShutdown = true means that
useDirtyFlag = true means that we only replicate a session
after setAttribute,removeAttribute has been called.
false means to replicate the session after each
request.
false means that replication would work for the
following piece of code: (only for SimpleTcpReplicationManager)
<%
HashMap map =
(HashMap)session.getAttribute("map");
map.put("key","value");
%>
replicationMode = can be either 'pooled', 'synchronous' or
'asynchronous'.
* Pooled means that the replication happens
using several sockets in a synchronous way. Ie, the data gets replicated,
then the request return. This is the same as the 'synchronous' setting
except it uses a pool of sockets, hence it is multithreaded. This is the
fastest and safest configuration. To use this, also increase the nr of tcp
threads that you have dealing with replication.
* Synchronous means that the thread that
executes the request, is also the
thread the replicates the data to the other
nodes, and will not return until all
nodes have received the information.
* Asynchronous means that there is a specific
'sender' thread for each cluster node,
so the request thread will queue the
replication request into a "smart" queue,
and then return to the client.
The "smart" queue is a queue where when a
session is added to the queue, and the same session
already exists in the queue from a previous
request, that session will be replaced
in the queue instead of replicating two
requests. This almost never happens, unless there is a
large network delay.
-->
<!--
When configuring for clustering, you also add in a valve to
catch all the requests
coming in, at the end of the request, the session may or may not
be replicated.
A session is replicated if and only if all the conditions are
met:
1. useDirtyFlag is true or setAttribute or removeAttribute has
been called AND
2. a session exists (has been created)
3. the request is not trapped by the "filter" attribute
The filter attribute is to filter out requests that could not
modify the session,
hence we don't replicate the session after the end of this
request.
The filter is negative, ie, anything you put in the filter, you
mean to filter out,
ie, no replication will be done on requests that match one of
the filters.
The filter attribute is delimited by ;, so you can't escape out
; even if you wanted to.
filter=".*\.gif;.*\.js;" means that we will not replicate the
session after requests with the URI
ending with .gif and .js are intercepted.
The deployer element can be used to deploy apps cluster wide.
Currently the deployment only deploys/undeploys to working
members in the cluster
so no WARs are copied upons startup of a broken node.
The deployer watches a directory (watchDir) for WAR files when
watchEnabled="true"
When a new war file is added the war gets deployed to the local
instance,
and then deployed to the other instances in the cluster.
When a war file is deleted from the watchDir the war is
undeployed locally
and cluster wide
-->
<!--
<Cluster
className="org.apache.catalina.cluster.tcp.SimpleTcpCluster"
managerClassName="org.apache.catalina.cluster.session.DeltaManager"
expireSessionsOnShutdown="false"
useDirtyFlag="true"
notifyListenersOnReplication="true">
<Membership
className="org.apache.catalina.cluster.mcast.McastService"
mcastAddr="228.0.0.4"
mcastPort="45564"
mcastFrequency="500"
mcastDropTime="3000"/>
<Receiver
className="org.apache.catalina.cluster.tcp.ReplicationListener"
tcpListenAddress="auto"
tcpListenPort="4001"
tcpSelectorTimeout="100"
tcpThreadCount="6"/>
<Sender
className="org.apache.catalina.cluster.tcp.ReplicationTransmitter"
replicationMode="pooled"
ackTimeout="15000"/>
<Valve
className="org.apache.catalina.cluster.tcp.ReplicationValve"
filter=".*\.gif;.*\.js;.*\.jpg;.*\.htm;.*\.html;.*\.txt;"/>
<Deployer
className="org.apache.catalina.cluster.deploy.FarmWarDeployer"
tempDir="/tmp/war-temp/"
deployDir="/tmp/war-deploy/"
watchDir="/tmp/war-listen/"
watchEnabled="false"/>
</Cluster>
-->
<!-- Normally, users must authenticate themselves to each web app
individually. Uncomment the following entry if you would like
a user to be authenticated the first time they encounter a
resource protected by a security constraint, and then have that
user identity maintained across *all* web applications
contained
in this virtual host. -->
<!--
<Valve className="org.apache.catalina.authenticator.SingleSignOn" />
-->
<!-- Access log processes all requests for this virtual host. By
default, log files are created in the "logs" directory relative
to
$CATALINA_HOME. If you wish, you can specify a different
directory with the "directory" attribute. Specify either a
relative
(to $CATALINA_HOME) or absolute path to the desired directory.
-->
<!--
<Valve className="org.apache.catalina.valves.AccessLogValve"
directory="logs" prefix="localhost_access_log."
suffix=".txt"
pattern="common" resolveHosts="false"/>
-->
<!-- Access log processes all requests for this virtual host. By
default, log files are created in the "logs" directory relative
to
$CATALINA_HOME. If you wish, you can specify a different
directory with the "directory" attribute. Specify either a
relative
(to $CATALINA_HOME) or absolute path to the desired directory.
This access log implementation is optimized for maximum
performance,
but is hardcoded to support only the "common" and "combined"
patterns.
-->
<!--
<Valve
className="org.apache.catalina.valves.FastCommonAccessLogValve"
directory="logs" prefix="localhost_access_log."
suffix=".txt"
pattern="common" resolveHosts="false"/>
-->
</Host>
</Engine>
</Service>
</Server>