RE: 100% CPU

2004-09-01 Thread Bill Grover
Thanks for the info.I agree 100% not to make the system do all that work.I'm going to change the datasource definitions on my development server today.Since they will continue to work I'll change my live server tomorrow.
__ 
file:///E:/EUColor.gif 	
Bill Grover 	
Manager, Information Systems 	Phone:	 301.424.3300 x3324 	
EU Services, Inc. 	FAX:	 301.424.3696	
649 North Horners Lane 	E-Mail:	mailto:[EMAIL PROTECTED] [EMAIL PROTECTED]	
Rockville, MD 20850-1299 	WWW:	http://www.euservices.com/ http://www.euservices.com 	
__ 

-Original Message-
From: Mark A Kruger [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 11:02 AM
To: CF-Talk
Subject: RE: 100% CPU

Bill,

FYI In regard to point 9 below. When you use a regular server name instead
of a FLQN name you cause the JDBC server system to do the following:

First it uses the unc service...

check the host file for that string to resolve (no dice)
check dns (also no dice)

.move to rnc service (windows locator - basically a handoff to
windows)

check lmhosts file (if enabled and configured to do so)...
check active directory created flqn on the nearest dc if you are on a domain
([EMAIL PROTECTED])
check Active dir on the master domain controller if nearest dc fails..
try to resolve the name using windows locator broadcasting

It's a lot of hoops just for a friendlier naming scheme.Mostly, you just
don't want windows to have to do this work for you. Windows networking is
more dns based, but it still has a locator service with a fairly complicated
heirarchy of resolution attempts. Instead, you want the JDBC service to
bypass the application layer and get that name resolved prior to it arriving
at the windows locator service. Simply ask yourself - how is the server
going to resolve this name.

Mark Minasi's mastering windows server 200x books have some great
information on how windows goes about this process.

Mark A. Kruger, CFG, MSCE
www.cfwebtools.com
www.necfug.com
http://blog.mxconsulting.com
-Original Message-
From: Bill Grover [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 9:34 AM
To: CF-Talk
Subject: RE: 100% CPU

Thanks for the response.I'll work on this list and see if it doesn't
help.Here is some responses to a few of your points.

1. On the server in question we don't have any access.Just SQL Server
2. I'll try the updated drivers.
8. SQL is not on the web server.Both boxes are separate servers.
9. Our names are not FQDN but are not IP addresses either.We just use
the server name (sql-server).I'll play with the host file. 
_
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




Re: 100% CPU

2004-09-01 Thread Ian Buzer
Hi all,

An interesting update to this thread.

I posted a message to the forums on the Macromedia site, and it got picked up by someone at Macromedia. Anyway, after sending him the contents of a few log files, it looks like he's traced it to a bug in the ColdFusion client variable purging system. Apparently there are bugs in this that cause connection leaks and it's slow which can cause a backup of requests.

There is a hotfix for this at:
http://www.macromedia.com/support/coldfusion/ts/documents/client_purge_hotfix.htm

Apparently this wasn't included in the Updater?

Also, there is a bug in the hotfix (!) which appears to mean that client variables are not deleted at all. This is to be fixed by another hotfix in the next few days but I am told it is better to apply the originalhotfix in the mean time.

So, if you are getting unexplained surges in CPU on an hourly basis (hourly from when the service is started) then this could be your problem.

Ian
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




Re: 100% CPU

2004-09-01 Thread jonese
got a link to the thread on MM forums so we can monitor that as well?

- Original Message -
From: Ian Buzer [EMAIL PROTECTED]
Date: Wed, 01 Sep 2004 12:36:15 -0400
Subject: Re: 100% CPU
To: CF-Talk [EMAIL PROTECTED]

Hi all,

An interesting update to this thread.

I posted a message to the forums on the Macromedia site, and it got
picked up by someone at Macromedia. Anyway, after sending him the
contents of a few log files, it looks like he's traced it to a bug in
the ColdFusion client variable purging system. Apparently there are
bugs in this that cause connection leaks and it's slow which can cause
a backup of requests.

There is a hotfix for this at:
http://www.macromedia.com/support/coldfusion/ts/documents/client_purge_hotfix.htm

Apparently this wasn't included in the Updater?

Also, there is a bug in the hotfix (!) which appears to mean that
client variables are not deleted at all. This is to be fixed by
another hotfix in the next few days but I am told it is better to
apply the originalhotfix in the mean time.

So, if you are getting unexplained surges in CPU on an hourly basis
(hourly from when the service is started) then this could be your
problem.

Ian
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




Re: 100% CPU

2004-09-01 Thread Ian Buzer
got a link to the thread on MM forums so we can monitor that as well?

Sorry - all the diagnosis happened via email.

Ian
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




Re: 100% CPU

2004-08-31 Thread vishnu prasad
hi 
jrun.xml
Jrun.xml file can be found in Jrun\server\server_instance_name\server-inf\jrun.xml

in the SchedulerService section overwrite this code

service class=jrunx.scheduler.SchedulerService name=SchedulerService
attribute name=bindToJNDItrue/attribute
attribute name=activeHandlerThreads25/attribute
attribute name=maxHandlerThreads1000/attribute
attribute name=minHandlerThreads20/attribute
attribute name=threadWaitTimeout180/attribute
attribute name=timeout600/attribute
/service
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




Re: 100% CPU

2004-08-31 Thread Ian Buzer
Hi vishnu,

Thanks for the reply. Just to understand what this does

Am I right in thinking this controls the cf scheduler? I'm fairly confident that our load problems are not related to scheduled requests as these occur overnight.

Or does the scheduler have other implications?

Thanks
Ian
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




Re: 100% CPU

2004-08-31 Thread Neculai Macarie
 I wonder if anyone can give me any clues on what else to look for. Surely we
 should be able to get more traffic out of this spec machine?

You can try and get a stack trace to see what's happenning with CF when the problem
occurs.
http://www.macromedia.com/support/coldfusion/ts/documents/tn18339.htm

-- 
mack /
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




RE: 100% CPU

2004-08-31 Thread Bill Grover
Ian,

 
We have seen this behavior as well.When we start experiencing the page slowdowns and notice the jrun CPU very high we will stop and restart the CF Application Server, this fixes the problem.Yes the pages are down for a few seconds but in our environment this is not as big a problem as having the slow page response, or the page never coming up at all.
__ 
file:///E:/EUColor.gif 	
Bill Grover 	
Manager, Information Systems 	Phone:	 301.424.3300 x3324 	
EU Services, Inc. 	FAX:	 301.424.3696	
649 North Horners Lane 	E-Mail:	mailto:[EMAIL PROTECTED] [EMAIL PROTECTED]	
Rockville, MD 20850-1299 	WWW:	http://www.euservices.com/ http://www.euservices.com 	
__ 

-Original Message-
From: Ian Buzer [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 5:00 AM
To: CF-Talk
Subject: 100% CPU

Hi all,

We're running a CFMX 6.1 / SQL server 2000 app on a 1.6GHz AMD server with
1.5Gb of RAM.The app gets about 2.5 million page requests a month, peaking 
at about 7000 page requests per hour.

Since yesterday morning it's suddenly started to misbehave, with the
processor regularly maxing out at 100%. It will run clean an fast for 20 
minutes or so, then suddenly requests will start to queue up and the 
whole thing grinds to a halt. Usually it recovers, but sometimes the
service needs restarting to get it back.

The app is well behaved, with most requests taking about half a second. 
There seem to be no particular long running requests that trigger it.

This happened a couple of months ago and I restricted the amount of memory
SQL server used and increased the RAM on the box to 1.5Gb. It currently 
appears to have about 3-400Mb free.

I've installed the CFMX updater (released a few days ago), reinstalled
ColdFusion, scandisked the drive and defragged.

I wonder if anyone can give me any clues on what else to look for. Surely we
should be able to get more traffic out of this spec machine?

Many thanks
Ian Buzer 
_
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




RE: 100% CPU

2004-08-31 Thread Andrew Dixon
Hi Ian.

 
We have had this problem as well, on both Windows 2003 and Redhat Linux
servers. There doesn't seems to be any logic cause of the problem and
therefore no fix. The only way to work around the problem is to restart the
server regularly. We have a scheduled task on both Windows and Linux to
restart the service once a day, which seems in the most part to have fixed
the problem, but it still happens from time to time.

 
Best Regards

 
Andrew.

_

From: Ian Buzer [mailto:[EMAIL PROTECTED] 
Sent: 31 August 2004 10:00
To: CF-Talk
Subject: 100% CPU

Hi all,

We're running a CFMX 6.1 / SQL server 2000 app on a 1.6GHz AMD server with
1.5Gb of RAM.The app gets about 2.5 million page requests a month, peaking

at about 7000 page requests per hour.

Since yesterday morning it's suddenly started to misbehave, with the
processor regularly maxing out at 100%. It will run clean an fast for 20 
minutes or so, then suddenly requests will start to queue up and the 
whole thing grinds to a halt. Usually it recovers, but sometimes the
service needs restarting to get it back.

The app is well behaved, with most requests taking about half a second. 
There seem to be no particular long running requests that trigger it.

This happened a couple of months ago and I restricted the amount of memory
SQL server used and increased the RAM on the box to 1.5Gb. It currently 
appears to have about 3-400Mb free.

I've installed the CFMX updater (released a few days ago), reinstalled
ColdFusion, scandisked the drive and defragged.

I wonder if anyone can give me any clues on what else to look for. Surely we
should be able to get more traffic out of this spec machine?

Many thanks
Ian Buzer 
_
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




Re: 100% CPU

2004-08-31 Thread Ian Buzer
Hi Mack,

You can try and get a stack trace to see what's happenning with CF when the problem
occurs.
http://www.macromedia.com/support/coldfusion/ts/documents/tn18339.htm

Good info - I'll check this out.

Thanks
Ian
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




Re: 100% CPU

2004-08-31 Thread Ian Buzer
Hi Andrew, Bill,

We've been running ServersAlive for the last couple of years, which has been cycling the service automatically everytime the system slows and like you say, this has worked well as a backup. 

Trouble is, it was doing this about every 10 minutes when we had our recent problems - not really acceptable!

I've been doing my sums - 7000 requests per hour is on average 2 requests per second. I guess in real life this could increase to say 4 requests per second for a few minutes at a time. Maybe this is as much as a single processor can handle?

What other loads are people getting out of a single processor machine?

Thanks
Ian
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




RE: 100% CPU

2004-08-31 Thread Mark A Kruger
Bill,

I went through a similar process with a client of ours. We did several
things to mitigate this issue.

1. Eliminate Access dbs.If you have anything running on Access get rid of
it.
2. Make sure and use the JDBC Merant 3.3 drivers for SQL - this is an update
from 6.1.
3. Check out long running db requests - maintenance tasks, utility tasks,
data imports, exports and dumps ect.Create a separate datasource for
these, restrict the number of connections and deselect the maintain db
connections for that db resource in cf admin.
4.Fiddle with the JVM maxperm memory, and heap sizes. Try to get them
large enough so that the heap is not being constantly resized and garbage
collection is efficient. There are a ton more JVM settings depending on the
JVM you are using.
5. Use Perfmon to figure out which resource is hanging.
6. Consider increasing the process isolation in IIS for the application - it
can tell you if IIS is having conflicts with other processes outside the ap.
7. Check the web logs for a burst of http requests at the point of failure.
One customer I know created a dynamic publishing system with constantly
updated content - then he used google adsense to serve ads and make himself
money. But every time a new page was published (new to google) the
googlebots attack to figure out which ads to serve.Since his server was
near capacity this sometimes surfaced as a denial of service type attack
(much like you are seeing) - and since many of his pages were driven by
database calls it looked like a possible db problem - or connection issue
etc.
8. Consider moving the SQL server Off of the web server.
9. Make sure and use a FQDN for the server name in the database
administrator. In other words, don't use 127.0.0.1 or any other IP
address.use sql.mydomain.com type syntax. In fact, put the domain in the
HOST file if you have a default win server configuration. Windows will check
the host file first to resolve the domain - which saves a dns request.JDBC
will attempt to resolve an ip into a domain name using reverse lookup before
making a connection. In many cases this results in a lengthy initial
connection request.Having a guaranteed resolution to a flqn (a la the host
file) eliminates this problem.

That's my short list - anyone else?

Mark A. Kruger, CFG, MSCE
www.cfwebtools.com
www.necfug.com
http://blog.mxconsulting.com
-Original Message-
From: Bill Grover [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 8:11 AM
To: CF-Talk
Subject: RE: 100% CPU

Ian,

We have seen this behavior as well.When we start experiencing the page
slowdowns and notice the jrun CPU very high we will stop and restart the CF
Application Server, this fixes the problem.Yes the pages are down for a
few seconds but in our environment this is not as big a problem as having
the slow page response, or the page never coming up at all.
__
 file:///E:/EUColor.gif
Bill Grover
Manager, Information Systems Phone: 301.424.3300 x3324
EU Services, Inc. FAX: 301.424.3696
649 North Horners Lane E-Mail:mailto:[EMAIL PROTECTED]
[EMAIL PROTECTED]
Rockville, MD 20850-1299 WWW:http://www.euservices.com/
http://www.euservices.com
__

-Original Message-
From: Ian Buzer [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 5:00 AM
To: CF-Talk
Subject: 100% CPU

Hi all,

We're running a CFMX 6.1 / SQL server 2000 app on a 1.6GHz AMD server with
1.5Gb of RAM.The app gets about 2.5 million page requests a month,
peaking
at about 7000 page requests per hour.

Since yesterday morning it's suddenly started to misbehave, with the
processor regularly maxing out at 100%. It will run clean an fast for 20
minutes or so, then suddenly requests will start to queue up and the
whole thing grinds to a halt. Usually it recovers, but sometimes the
service needs restarting to get it back.

The app is well behaved, with most requests taking about half a second.
There seem to be no particular long running requests that trigger it.

This happened a couple of months ago and I restricted the amount of memory
SQL server used and increased the RAM on the box to 1.5Gb. It currently
appears to have about 3-400Mb free.

I've installed the CFMX updater (released a few days ago), reinstalled
ColdFusion, scandisked the drive and defragged.

I wonder if anyone can give me any clues on what else to look for. Surely
we
should be able to get more traffic out of this spec machine?

Many thanks
Ian Buzer
 _
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




RE: 100% CPU

2004-08-31 Thread Mark A Kruger
One more I forgot

We have noted that the docs say the the simultaneous requests number in
CF admin should be roughly 2x the number of procs. While this number works
well on a busy cf 5 machine. It seems to be quite low for a cfmx platform.
We've increased this number to between 8 and 13 on a dual proc machine and
seen significant results. On a single proc machine I would try 6 and see
what you get.

-Mark

-Original Message-
From: Andrew Dixon [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 8:21 AM
To: CF-Talk
Subject: RE: 100% CPU

Hi Ian.

We have had this problem as well, on both Windows 2003 and Redhat Linux
servers. There doesn't seems to be any logic cause of the problem and
therefore no fix. The only way to work around the problem is to restart
the
server regularly. We have a scheduled task on both Windows and Linux to
restart the service once a day, which seems in the most part to have fixed
the problem, but it still happens from time to time.

Best Regards

Andrew.

 _

From: Ian Buzer [mailto:[EMAIL PROTECTED]
Sent: 31 August 2004 10:00
To: CF-Talk
Subject: 100% CPU

Hi all,

We're running a CFMX 6.1 / SQL server 2000 app on a 1.6GHz AMD server with
1.5Gb of RAM.The app gets about 2.5 million page requests a month,
peaking

at about 7000 page requests per hour.

Since yesterday morning it's suddenly started to misbehave, with the
processor regularly maxing out at 100%. It will run clean an fast for 20
minutes or so, then suddenly requests will start to queue up and the
whole thing grinds to a halt. Usually it recovers, but sometimes the
service needs restarting to get it back.

The app is well behaved, with most requests taking about half a second.
There seem to be no particular long running requests that trigger it.

This happened a couple of months ago and I restricted the amount of memory
SQL server used and increased the RAM on the box to 1.5Gb. It currently
appears to have about 3-400Mb free.

I've installed the CFMX updater (released a few days ago), reinstalled
ColdFusion, scandisked the drive and defragged.

I wonder if anyone can give me any clues on what else to look for. Surely
we
should be able to get more traffic out of this spec machine?

Many thanks
Ian Buzer
 _
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




Re: 100% CPU

2004-08-31 Thread Adam Haskell
Great info Mark, Didn;t knwo the ahile don't use IP for database bit.
I want to stress on he one Mark almost forgot. At the machine in
question's current proc speed I'd say 6 MAX and since you figure 4 max
request ever I would lower it to 5. Even if you get 10 requests if the
average load time is 1/2 second the longest wait would be about maybe
1 second

Adam H 

-Original Message-
From: Andrew Dixon [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 8:21 AM
To: CF-Talk
Subject: RE: 100% CPU
 
Hi Ian.
 
We have had this problem as well, on both Windows 2003 and Redhat Linux
servers. There doesn't seems to be any logic cause of the problem and
therefore no fix. The only way to work around the problem is to restart
 the
server regularly. We have a scheduled task on both Windows and Linux to
restart the service once a day, which seems in the most part to have fixed
the problem, but it still happens from time to time.
 
Best Regards
 
Andrew.
 
 _
 
From: Ian Buzer [mailto:[EMAIL PROTECTED]
Sent: 31 August 2004 10:00
To: CF-Talk
Subject: 100% CPU
 
Hi all,
 
We're running a CFMX 6.1 / SQL server 2000 app on a 1.6GHz AMD server with
1.5Gb of RAM.The app gets about 2.5 million page requests a month,
 peaking
 
at about 7000 page requests per hour.
 
Since yesterday morning it's suddenly started to misbehave, with the
processor regularly maxing out at 100%. It will run clean an fast for 20
minutes or so, then suddenly requests will start to queue up and the
whole thing grinds to a halt. Usually it recovers, but sometimes the
service needs restarting to get it back.
 
The app is well behaved, with most requests taking about half a second.
There seem to be no particular long running requests that trigger it.
 
This happened a couple of months ago and I restricted the amount of memory
SQL server used and increased the RAM on the box to 1.5Gb. It currently
appears to have about 3-400Mb free.
 
I've installed the CFMX updater (released a few days ago), reinstalled
ColdFusion, scandisked the drive and defragged.
 
I wonder if anyone can give me any clues on what else to look for. Surely
 we
should be able to get more traffic out of this spec machine?
 
Many thanks
Ian Buzer
 _
 

 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




RE: 100% CPU

2004-08-31 Thread Bill Grover
Thanks for the response.I'll work on this list and see if it doesn't help.Here is some responses to a few of your points.

 
1. On the server in question we don't have any access.Just SQL Server
2. I'll try the updated drivers.
8. SQL is not on the web server.Both boxes are separate servers.
9. Our names are not FQDN but are not IP addresses either.We just use the server name (sql-server).I'll play with the host file.

 
__ 
file:///E:/EUColor.gif 	
Bill Grover 	
Manager, Information Systems 	Phone:	 301.424.3300 x3324 	
EU Services, Inc. 	FAX:	 301.424.3696	
649 North Horners Lane 	E-Mail:	mailto:[EMAIL PROTECTED] [EMAIL PROTECTED]	
Rockville, MD 20850-1299 	WWW:	http://www.euservices.com/ http://www.euservices.com 	
__ 

-Original Message-
From: Mark A Kruger [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 9:41 AM
To: CF-Talk
Subject: RE: 100% CPU

Bill,

I went through a similar process with a client of ours. We did several
things to mitigate this issue.

1. Eliminate Access dbs.If you have anything running on Access get rid of
it.
2. Make sure and use the JDBC Merant 3.3 drivers for SQL - this is an update
from 6.1.
3. Check out long running db requests - maintenance tasks, utility tasks,
data imports, exports and dumps ect.Create a separate datasource for
these, restrict the number of connections and deselect the maintain db
connections for that db resource in cf admin.
4.Fiddle with the JVM maxperm memory, and heap sizes. Try to get them
large enough so that the heap is not being constantly resized and garbage
collection is efficient. There are a ton more JVM settings depending on the
JVM you are using.
5. Use Perfmon to figure out which resource is hanging.
6. Consider increasing the process isolation in IIS for the application - it
can tell you if IIS is having conflicts with other processes outside the ap.
7. Check the web logs for a burst of http requests at the point of failure.
One customer I know created a dynamic publishing system with constantly
updated content - then he used google adsense to serve ads and make himself
money. But every time a new page was published (new to google) the
googlebots attack to figure out which ads to serve.Since his server was
near capacity this sometimes surfaced as a denial of service type attack
(much like you are seeing) - and since many of his pages were driven by
database calls it looked like a possible db problem - or connection issue
etc.
8. Consider moving the SQL server Off of the web server.
9. Make sure and use a FQDN for the server name in the database
administrator. In other words, don't use 127.0.0.1 or any other IP
address.use sql.mydomain.com type syntax. In fact, put the domain in the
HOST file if you have a default win server configuration. Windows will check
the host file first to resolve the domain - which saves a dns request.JDBC
will attempt to resolve an ip into a domain name using reverse lookup before
making a connection. In many cases this results in a lengthy initial
connection request.Having a guaranteed resolution to a flqn (a la the host
file) eliminates this problem.

That's my short list - anyone else?

Mark A. Kruger, CFG, MSCE
www.cfwebtools.com
www.necfug.com
http://blog.mxconsulting.com
-Original Message-
From: Bill Grover [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 8:11 AM
To: CF-Talk
Subject: RE: 100% CPU

Ian,

We have seen this behavior as well.When we start experiencing the page
slowdowns and notice the jrun CPU very high we will stop and restart the CF
Application Server, this fixes the problem.Yes the pages are down for a
few seconds but in our environment this is not as big a problem as having
the slow page response, or the page never coming up at all.
__
 file:///E:/EUColor.gif
Bill Grover
Manager, Information Systems Phone: 301.424.3300 x3324
EU Services, Inc. FAX: 301.424.3696
649 North Horners Lane E-Mail:mailto:[EMAIL PROTECTED]
[EMAIL PROTECTED]
Rockville, MD 20850-1299 WWW:http://www.euservices.com/
http://www.euservices.com
__

-Original Message-
From: Ian Buzer [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 5:00 AM
To: CF-Talk
Subject: 100% CPU

Hi all,

We're running a CFMX 6.1 / SQL server 2000 app on a 1.6GHz AMD server with
1.5Gb of RAM.The app gets about 2.5 million page requests a month,
peaking
at about 7000 page requests per hour.

Since yesterday morning it's suddenly started to misbehave, with the
processor regularly maxing out at 100%. It will run clean an fast for 20
minutes or so, then suddenly requests will start to queue up and the
whole thing grinds to a halt. Usually it recovers, but sometimes the
service needs restarting to get it back.

The app is well behaved, with most requests taking about half a second.
There seem

RE: 100% CPU

2004-08-31 Thread Dave Watts
 We have noted that the docs say the the simultaneous 
 requests number in CF admin should be roughly 2x the number 
 of procs. While this number works well on a busy cf 5 
 machine. It seems to be quite low for a cfmx platform.
 We've increased this number to between 8 and 13 on a dual 
 proc machine and seen significant results. On a single proc 
 machine I would try 6 and see what you get.

Unfortunately, finding the optimal value for this parameter really requires
load testing. I do agree that the optimal value for CFMX is usually higher
than it was for CF 5, based on the load tests I've seen.

Dave Watts, CTO, Fig Leaf Software
http://www.figleaf.com/
phone: 202-797-5496
fax: 202-797-5444
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




RE: 100% CPU

2004-08-31 Thread Bill Grover
Thanks for the response.I'll work on this list and see if it doesn't help.Here is some responses to a few of your points.

 
1. On the server in question we don't have any access.Just SQL Server
2. I'll try the updated drivers.
8. SQL is not on the web server.Both boxes are separate servers.
9. Our names are not FQDN but are not IP addresses either.We just use the server name (sql-server).I'll play with the host file.

 
__ 
file:///E:/EUColor.gif 	
Bill Grover 	
Manager, Information Systems 	Phone:	 301.424.3300 x3324 	
EU Services, Inc. 	FAX:	 301.424.3696	
649 North Horners Lane 	E-Mail:	mailto:[EMAIL PROTECTED] [EMAIL PROTECTED]	
Rockville, MD 20850-1299 	WWW:	http://www.euservices.com/ http://www.euservices.com 	
__ 

-Original Message-
From: Mark A Kruger [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 9:41 AM
To: CF-Talk
Subject: RE: 100% CPU

Bill,

I went through a similar process with a client of ours. We did several
things to mitigate this issue.

1. Eliminate Access dbs.If you have anything running on Access get rid of
it.
2. Make sure and use the JDBC Merant 3.3 drivers for SQL - this is an update
from 6.1.
3. Check out long running db requests - maintenance tasks, utility tasks,
data imports, exports and dumps ect.Create a separate datasource for
these, restrict the number of connections and deselect the maintain db
connections for that db resource in cf admin.
4.Fiddle with the JVM maxperm memory, and heap sizes. Try to get them
large enough so that the heap is not being constantly resized and garbage
collection is efficient. There are a ton more JVM settings depending on the
JVM you are using.
5. Use Perfmon to figure out which resource is hanging.
6. Consider increasing the process isolation in IIS for the application - it
can tell you if IIS is having conflicts with other processes outside the ap.
7. Check the web logs for a burst of http requests at the point of failure.
One customer I know created a dynamic publishing system with constantly
updated content - then he used google adsense to serve ads and make himself
money. But every time a new page was published (new to google) the
googlebots attack to figure out which ads to serve.Since his server was
near capacity this sometimes surfaced as a denial of service type attack
(much like you are seeing) - and since many of his pages were driven by
database calls it looked like a possible db problem - or connection issue
etc.
8. Consider moving the SQL server Off of the web server.
9. Make sure and use a FQDN for the server name in the database
administrator. In other words, don't use 127.0.0.1 or any other IP
address.use sql.mydomain.com type syntax. In fact, put the domain in the
HOST file if you have a default win server configuration. Windows will check
the host file first to resolve the domain - which saves a dns request.JDBC
will attempt to resolve an ip into a domain name using reverse lookup before
making a connection. In many cases this results in a lengthy initial
connection request.Having a guaranteed resolution to a flqn (a la the host
file) eliminates this problem.

That's my short list - anyone else?

Mark A. Kruger, CFG, MSCE
www.cfwebtools.com
www.necfug.com
http://blog.mxconsulting.com
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




RE: 100% CPU

2004-08-31 Thread Mark A Kruger
Bill,

FYI In regard to point 9 below. When you use a regular server name instead
of a FLQN name you cause the JDBC server system to do the following:

First it uses the unc service...

check the host file for that string to resolve (no dice)
check dns (also no dice)

.move to rnc service (windows locator - basically a handoff to
windows)

check lmhosts file (if enabled and configured to do so)...
check active directory created flqn on the nearest dc if you are on a domain
([EMAIL PROTECTED])
check Active dir on the master domain controller if nearest dc fails..
try to resolve the name using windows locator broadcasting

It's a lot of hoops just for a friendlier naming scheme.Mostly, you just
don't want windows to have to do this work for you. Windows networking is
more dns based, but it still has a locator service with a fairly complicated
heirarchy of resolution attempts. Instead, you want the JDBC service to
bypass the application layer and get that name resolved prior to it arriving
at the windows locator service. Simply ask yourself - how is the server
going to resolve this name.

Mark Minasi's mastering windows server 200x books have some great
information on how windows goes about this process.

Mark A. Kruger, CFG, MSCE
www.cfwebtools.com
www.necfug.com
http://blog.mxconsulting.com
-Original Message-
From: Bill Grover [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 31, 2004 9:34 AM
To: CF-Talk
Subject: RE: 100% CPU

Thanks for the response.I'll work on this list and see if it doesn't
help.Here is some responses to a few of your points.

1. On the server in question we don't have any access.Just SQL Server
2. I'll try the updated drivers.
8. SQL is not on the web server.Both boxes are separate servers.
9. Our names are not FQDN but are not IP addresses either.We just use
the server name (sql-server).I'll play with the host file.
 [Todays Threads] 
 [This Message] 
 [Subscription] 
 [Fast Unsubscribe] 
 [User Settings]
 [Donations and Support]




RE: 100% CPU on CF5 - but patch has already applied!

2003-07-18 Thread DURETTE, STEVEN J (AIT)
Brian,

Don't take this the wrong way.  

Have you checked your cf code and any queries that you are running? I have
had problems of 100% usage and generally it was due to a coding error or
very poorly written query.  Since it is developer edition, you should know
which templates are running when it occurs, so I would start there.

Steve


-Original Message-
From: BRIAN MELOCHE [mailto:[EMAIL PROTECTED]
Sent: Friday, July 18, 2003 11:35 AM
To: CF-Talk
Subject: 100% CPU on CF5 - but patch has already applied!


Hello all...

I thought I would post this to the list, since my attempts to solve the
problem have failed.

I have run into an old problem.  Well, sort of.  I have the developer
version of CF5 installed on my desktop at work, which is running XP Pro on a
2.8 GHz P4...  and I am running into the 100% CPU usage caused by
NTconsoleJava.exe - the ColdFusion Management Repository Server.

But... here's the problem:  I have applied the patches from Macromedia and
deleted the logs, as according to:

http://www.macromedia.com/support/coldfusion/ts/documents/tn18000.htm

BUT I STILL HAVE THE PROBLEM!!!  It's still doing it!

Other than disabling the service, is there anything I can do to fix the
problem?  Has anyone run into this problem AFTER applying the patch?  Is
there another patch that I can't find?

Sincerely,

Brian Meloche,
4th Floor, ITO - 414
CSS
(304) 759-0585 x448
Macromedia Certified Advanced ColdFusion 5.0 Developer
130 pounds lost and counting!


~|
Archives: http://www.houseoffusion.com/cf_lists/index.cfm?forumid=4
Subscription: 
http://www.houseoffusion.com/cf_lists/index.cfm?method=subscribeforumid=4
FAQ: http://www.thenetprofits.co.uk/coldfusion/faq

Get the mailserver that powers this list at 
http://www.coolfusion.com

Unsubscribe: 
http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4



RE: 100% CPU on CF5 - but patch has already applied!

2003-07-18 Thread Dave Watts
 It's not a coding issue. There wasn't any code running.  
 This happens on startup.

No, the management service doesn't have anything to do with code. I know you
asked for ways to fix the problem, but unless you're actually using the
service, I'd just recommend that you turn it off. Perhaps you've enabled a
server probe? I think that's handled by the management service, but don't
recall for sure.

Dave Watts, CTO, Fig Leaf Software
http://www.figleaf.com/
voice: (202) 797-5496
fax: (202) 797-5444

~|
Archives: http://www.houseoffusion.com/cf_lists/index.cfm?forumid=4
Subscription: 
http://www.houseoffusion.com/cf_lists/index.cfm?method=subscribeforumid=4
FAQ: http://www.thenetprofits.co.uk/coldfusion/faq

This list and all House of Fusion resources hosted by CFHosting.com. The place for 
dependable ColdFusion Hosting.
http://www.cfhosting.com

Unsubscribe: 
http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4