Re: [EXTERNAL] RE: Re: General Architecture Question for multiple websites on a single RedHat server

2012-07-10 Thread Simon, Leonard
We hit the leap second when it happened and rebooted the servers which
solved that problem.

Thanks anyway

On Mon, Jul 9, 2012 at 5:24 PM, Martin Gainty mgai...@hotmail.com wrote:


 apparently for RH 6.1 stopping and starting the ntp daemon works
 /etc/init.d/ntpd stop; date; date `date +%m%d%H%M%C%y.%S`; date ;
 /etc/init.d/ntpd starthttps://bugzilla.redhat.com/show_bug.cgi?id=836748

 Martin Gainty
 __
 Verzicht und Vertraulichkeitanmerkung/Note de déni et de confidentialité

 Diese Nachricht ist vertraulich. Sollten Sie nicht der vorgesehene
 Empfaenger sein, so bitten wir hoeflich um eine Mitteilung. Jede unbefugte
 Weiterleitung oder Fertigung einer Kopie ist unzulaessig. Diese Nachricht
 dient lediglich dem Austausch von Informationen und entfaltet keine
 rechtliche Bindungswirkung. Aufgrund der leichten Manipulierbarkeit von
 E-Mails koennen wir keine Haftung fuer den Inhalt uebernehmen.
 Ce message est confidentiel et peut être privilégié. Si vous n'êtes pas le
 destinataire prévu, nous te demandons avec bonté que pour satisfaire
 informez l'expéditeur. N'importe quelle diffusion non autorisée ou la copie
 de ceci est interdite. Ce message sert à l'information seulement et n'aura
 pas n'importe quel effet légalement obligatoire. Étant donné que les email
 peuvent facilement être sujets à la manipulation, nous ne pouvons accepter
 aucune responsabilité pour le contenu fourni.


  Date: Mon, 9 Jul 2012 22:42:21 +0200
  From: a...@ice-sa.com
  To: users@tomcat.apache.org
  Subject: Re: [EXTERNAL] Re: General Architecture Question for multiple
 websites on a single RedHat server
 
  Simon, Leonard wrote:
   Well our Tomcat went out to lunch again and we had to recycle the
 webserver
   to get things stablized. By this I mean we get reports from the users
 that
   screens become unresponsive and looking at a top we see tomcat process
   taking 100% CPU.
  
   Was able to do a thread dump captured with a kill -3 PID and here it
 is if
   anyone is so inclined to comment on it.
  
 
  Have not followed this thread and not looked at the thread dump (which
 would not tell me
  anything anyway), but by any chance have you googled for leap second
 bug ?
  Mentioning that because of your 100% CPU above.
  See e.g. https://bugzilla.redhat.com/show_bug.cgi?id=479765
 
 
  -
  To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
  For additional commands, e-mail: users-h...@tomcat.apache.org
 




Re: [EXTERNAL] Re: Re: General Architecture Question for multiple websites on a single RedHat server

2012-07-10 Thread Simon, Leonard
Chris,

Thanks for looking at this.

Tomcat version is 6.0.32.
mod_jk is at 1.2.31


Someone else did the thread dump so I'm assuming they did it on the right
process.

On Tue, Jul 10, 2012 at 12:19 PM, Christopher Schultz 
ch...@christopherschultz.net wrote:

 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1

 Simon,

 On 7/9/12 4:24 PM, Simon, Leonard wrote:
  Well our Tomcat went out to lunch again and we had to recycle the
  webserver to get things stablized. By this I mean we get reports
  from the users that screens become unresponsive and looking at a
  top we see tomcat process taking 100% CPU.

 Are you sure this is the right process?

  Was able to do a thread dump captured with a kill -3 PID and here
  it is if anyone is so inclined to comment on it.

 This thread dump shows a mostly-idle server with the exception of
 those threads in socketAccept() (not sure why these count as RUNNABLE
 when they are really blocking) and those executing reads from the
 client connection(s).

 What exact version of Tomcat are you using, and what version of mod_jk
 (or, if you are using mox_proxy_ajp, what httpd version)? IIRC, there
 have been some stability improvements in recent Tomcat versions around
 the worker threads being returned to their associated connectors.

 - -chris
 -BEGIN PGP SIGNATURE-
 Version: GnuPG/MacGPG2 v2.0.17 (Darwin)
 Comment: GPGTools - http://gpgtools.org
 Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

 iEYEARECAAYFAk/8VfwACgkQ9CaO5/Lv0PCgWwCfbN/E29q4DKee4q1A+IEMmED6
 8+0AnivucFDMS/7lhPiOb+0tv5I/6vim
 =ABXf
 -END PGP SIGNATURE-

 -
 To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
 For additional commands, e-mail: users-h...@tomcat.apache.org




Re: [EXTERNAL] Re: Re: General Architecture Question for multiple websites on a single RedHat server

2012-07-10 Thread Mark Eggers
- Original Message -

 From: Simon, Leonard leonard.si...@hsn.net
 To: Tomcat Users List users@tomcat.apache.org
 Cc: 
 Sent: Tuesday, July 10, 2012 9:54 AM
 Subject: Re: [EXTERNAL] Re: Re: General Architecture Question for multiple 
 websites on a single RedHat server
 
 Chris,
 
 Thanks for looking at this.
 
 Tomcat version is 6.0.32.
 mod_jk is at 1.2.31
 
 
 Someone else did the thread dump so I'm assuming they did it on the right
 process.
 
 On Tue, Jul 10, 2012 at 12:19 PM, Christopher Schultz 
 ch...@christopherschultz.net wrote:
 
  -BEGIN PGP SIGNED MESSAGE-
  Hash: SHA1
 
  Simon,
 
  On 7/9/12 4:24 PM, Simon, Leonard wrote:
   Well our Tomcat went out to lunch again and we had to recycle the
   webserver to get things stablized. By this I mean we get reports
   from the users that screens become unresponsive and looking at a
   top we see tomcat process taking 100% CPU.
 
  Are you sure this is the right process?
 
   Was able to do a thread dump captured with a kill -3 PID and here
   it is if anyone is so inclined to comment on it.
 
  This thread dump shows a mostly-idle server with the exception of
  those threads in socketAccept() (not sure why these count as RUNNABLE
  when they are really blocking) and those executing reads from the
  client connection(s).
 
  What exact version of Tomcat are you using, and what version of mod_jk
  (or, if you are using mox_proxy_ajp, what httpd version)? IIRC, there
  have been some stability improvements in recent Tomcat versions around
  the worker threads being returned to their associated connectors.
 
  - -chris


I didn't see much in the way that rang immediate alarm bells. It looks like 
you're processing about 18 client connections, and everything else is pretty 
quiet. These client connections are going through the AJP connector (as you've 
noted in your reply above).

A few things though:

As someone in this thread has already mentioned, permgen is pretty full. You 
might try increasing that with -XX:MaxPermSize=128m.

There are a lot of garbage collection threads. You can see this on a multi-core 
system. From digging around, it appears that the number of parallel garbage 
collection threads follows this formula:

8 + (5/8)X = GCT

You get one GCT (garbage collection thread) per core for the first 8 cores, and 
then 5/8 of a thread for every core after that. So in your case:

8 + (5/8)X = 18
X = 16

This means that your system has 24 cores. Are you running on a 24 core system, 
or have you tuned garbage collection with JVM arguments.

In general, if you're not running into GC issues, tuning GC parameters is 
counter-productive. If you do have to tune GC parameters, lots of testing is in 
order.

I noticed that you also have an MQ Trace monitor running. Are you using MQ? 
Directly accessing an MQ service without going through a pool configured for 
graceful restarts / retries can cause a system to become unresponsive. However, 
I don't see any evidence of that in this thread dump.

As I've said off line, it's really difficult to see what's consuming CPU from 
one thread dump. Here's how to start figuring out what is going on with your 
system.

1. Keep access logs

If you don't, then start. You'll want the access logs to replay on a test 
environment to see if you can recreate the problem. JMeter is a good tool for 
replaying information from access logs.

2. When the problem occurs

a. Multiple thread dumps, about 5 seconds apart. Use a tool like jstack so it's 
scriptable

   jstack -l [process-id]
   where [process-id] is the process id of the distressed Tomcat

The -l generates a long listing, and may not be necessary. You'll need to have 
the right permissions (either root or the user running the JVM being targeted 
with the process id).

b. At the same time use something like the following to see which thread is 
consuming CPU:

   ps -L -o pcpu,lwp -p [process-id]
   where [process-id] is the process id of the distressed Tomcat

This will show all the threads of the process, the percentage of CPU used for 
each thread, and the thread process ID. You can then correlate the thread 
process ID with the thread dump to see exactly what is consuming the CPU.

This will generate tons of output, so it's best to put both in a script and 
direct the output to files.

Now you'll end up with the following:

1. What requests were being made of your server when the problem occurred
2. Multiple thread dumps while the problem is occurring
3. The identity of the thread (or threads) that is consuming the CPU

Once you get this information, you'll be in a much better position to determine 
what is causing your problems.

. . . . just my two cents.
/mde/

-
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org



Re: [EXTERNAL] Re: Re: General Architecture Question for multiple websites on a single RedHat server

2012-07-10 Thread André Warnier

Mark Eggers wrote:
...



. . . . just my two cents.


As a consultant in my professional capacity, I'd charge just about 100,000 times more for 
that.



-
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org



Re: [EXTERNAL] Re: Re: General Architecture Question for multiple websites on a single RedHat server

2012-07-10 Thread Mark Eggers
- Original Message -

 From: André Warnier a...@ice-sa.com
 To: Tomcat Users List users@tomcat.apache.org
 Cc: 
 Sent: Tuesday, July 10, 2012 12:12 PM
 Subject: Re: [EXTERNAL] Re: Re: General Architecture Question for multiple 
 websites on a single RedHat server
 
 Mark Eggers wrote:
 ...
 
 
  . . . . just my two cents.
 
 As a consultant in my professional capacity, I'd charge just about 100,000 
 times more for that.


Writing quick how-to mail messages on a mailing list - cheap.

As a consultant in my professional capacity, writing the scripts, running the 
analysis, replaying the access log on a test system, helping / dealing with 
developers, system admins, system architects, management . . . agreed.

Oh, and document, document, document.
(but please to always call it research - apologies to Tom Lehrer)

/mde/

-
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org