Done all of that. But with the latest outage we have it narrowed down to the
restarting of the mid-tiers that is causing the issues. Sent to BMC on one
of the restarts a multi gig log file.

The strange item is that we refreshed our QA and staging system,  with
production data and the issue does not show up there.

Howard 

-----Original Message-----
From: Action Request System discussion list(ARSList)
[mailto:arslist@ARSLIST.ORG] On Behalf Of Longwing, LJ CTR MDA/IC
Sent: Friday, February 01, 2013 12:49 PM
To: arslist@ARSLIST.ORG
Subject: Re: Slow system restart/arsystem errors after restarting the
mid-tiers on 7.6.4

Howard,
I highly recommend turning on API/SQL logging to the same file, and then
startup the Mid-Tier.  At the end of the 'unavailable' period you should
have a VERY good log file that will be able to be analyzed by arwklga to
give you the reasons for your errors.  It will tell you what API calls were
taking the longest as well as the SQL in question, from that you should be
able to analyze what in your system is slow and in turn tune it.

-----Original Message-----
From: Action Request System discussion list(ARSList)
[mailto:arslist@ARSLIST.ORG] On Behalf Of Howard Richter
Sent: Friday, February 01, 2013 10:40 AM
To: arslist@ARSLIST.ORG
Subject: Slow system restart/arsystem errors after restarting the mid-tiers
on 7.6.4

** 

Happy Friday all,

 

We have been seeing a very strange error since our go-live with 7.6.4 (and
working with no luck with BMC support), when we restart our mid-tiers. We
are on windows server 2008 and using MSSQL 2008.

 

What we have seen is we will shut down our two customer facing arservers (as
well as our integration arserver), as well as our three mid-tiers. Then we
bring up our three arservers (integration first, then the two customer
facing ones), with no issues. After testing them with auto creation of an
incident via email (an integration we created) and some poking around using
the old user tool, we bring up one of our mid-tiers (i.e. start up the
tomcat engine). At that point, the arserver that the mid-tier starts to talk
to, starts to show database time out errors and you can no longer log in via
the user tool.

 

It takes about 30 minutes for the system (the arserver/mid-tier) to
stabilize and then we are able to bring up the other two mid-tiers.

 

We have not seen this issue in our QA system (which is all on VM) and our
staging system. They both have somewhat the same architecture and we have
verified the mid-tier configuration/patch releases. 

 

Since we are still working with BMC support (for over 3 months on this), I
just wanted to see if anyone had seen this type of issue on other 7.6.4
systems.

 

Take care,

 

Howard

_ARSlist: "Where the Answers Are" and have been for 20 years_

____________________________________________________________________________
___
UNSUBSCRIBE or access ARSlist Archives at www.arslist.org "Where the Answers
Are, and have been for 20 years"

_______________________________________________________________________________
UNSUBSCRIBE or access ARSlist Archives at www.arslist.org
"Where the Answers Are, and have been for 20 years"

Reply via email to