Cluster Configuration Questions

Mark Walker Sun, 17 Jun 2007 17:59:12 -0700

We run a cluster of Tomcat servers with Apache as a front end load balancerusing mod_jk configured for sticky sessions. Our primary applicationprovides users with access to their financial accounts. Fast response timesare as important as session replication.

We are starting to have problems with response times. The applicationbecomes virtually unresponsive. Based on research into how our clustering iscurrently set up, I believe the problem is that the servers are tied upreplicating session data. We have twelve instances of Tomcat spread acrossthree servers (9 in the production cluster, three in the test cluster). Hereis our current cluster definition (the only values that vary are thetcpListenAddress and tcpListenPort):


      <Cluster className="org.apache.catalina.cluster.tcp.SimpleTcpCluster"

managerClassName="org.apache.catalina.cluster.session.DeltaManager"

               expireSessionsOnShutdown="false"
               useDirtyFlag="true"
               notifyListenersOnReplication="true">

          <Membership
              className="org.apache.catalina.cluster.mcast.McastService"
              mcastAddr="228.0.0.4"
              mcastPort="45564"
              mcastFrequency="500"
              mcastDropTime="3000"/>

          <Receiver

className="org.apache.catalina.cluster.tcp.ReplicationListener"

              tcpListenAddress="10.9.100.2"
              tcpListenPort="4021"
              tcpSelectorTimeout="100"
              tcpThreadCount="6"/>

          <Sender

className="org.apache.catalina.cluster.tcp.ReplicationTransmitter"

              replicationMode="pooled"
              ackTimeout="15000"
              waitForAck="true"/>

<ValveclassName="org.apache.catalina.cluster.tcp.ReplicationValve"filter=".*\.gif;.*\.js;.*\.jpg;.*\.png;.*\.htm;.*\.html;.*\.css;.*\.txt;"/>

<ClusterListenerclassName="org.apache.catalina.cluster.session.ClusterSessionListener"/>

      </Cluster>

Based on what I have found in my research, It seems we need to either A)continue with replicationMode="pooled" and increase the tcpThreadCountsubstantially or B) switch to replicationMode="fastasyncqueue" with atcpTheadCount of "8". I would prefer to continue to use "pooled" to providethe best failover if we should need to stop or restart a cluster instance.However, I cannot afford to have the application "disappear" from the enduser's perspective, due to session replication demands.



Questions for a pooled cluster:

How high should the tcpThreadCount be set? Should the value be related tothe average number of sessions?Should the ackTimeout be altered to help prevent the application fromgetting stuck doing replication?



Questions for a fastasyncqueue cluster:

The javadocs for FastAsyncSocketSender say to "Limit the queue lockcontention under high load!" How?They also say "after one minute idle time, or number of request (100) theconnection is reconnected with next request. Change this for productionuse!" Change it higher or lower? Should the value be related to the averagenumber of sessions?Another concern is the comment about the ackTimeout default of 15 secondsis "very low for big all session replication messages after restart a node".That description seems to accurately describe our servers. I was concernedthat this value was might be too high for our servers in a "pooled"


Any other helpful suggestions will be greatly appreciated!


Thanks!


Mark

_________________________________________________________________

Need a break? Find your escape route with Live Search Maps.http://maps.live.com/default.aspx?ss=Restaurants~Hotels~Amusement%20Park&cp=33.832922~-117.915659&style=r&lvl=13&tilt=-90&dir=0&alt=-1000&scene=1118863&encType=1&FORM=MGAC01



---------------------------------------------------------------------
To start a new topic, e-mail: users@tomcat.apache.org
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Cluster Configuration Questions

Reply via email to