Ok... I went over this countless times over the weekend and can't figure out what's wrong. Here is my ar.cfg file for each server:

App1:
Server-Name: myserver-app
Server-Connect-Name: myserver-app1
IP-Name: myserver-app
IP-Name: myserver-app1

App2:
Server-Name: myserver-app
Server-Connect-Name: myserver-app2
IP-Name: myserver-app
IP-Name: myserver-app2

According to the Server Group doc, all servers in the group must have the same 'Server-Name'... so they do. The Server-Connect-Name is the value used by the operation ranking form, and must be DNS resolvable... which they are.

Here is the server group log:

<SGRP> /* Wed Jan 02 2008 08:33:25.2960 */ Server Group Trace Log -- ON (AR Server 7.1.00 Patch 001 200711161033)
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ The Administration operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ Starting Administration operation
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ The Administration operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ Starting Administration operation
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ The Escalation operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ Starting Escalation operation
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ The Escalation operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ Starting Escalation operation
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ The Archive operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ Starting Archive operation
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ The Archive operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ Starting Archive operation
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ The E-Mail Engine operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ Resuming E-Mail Engine operation
<SGRP> /* Wed Jan 02 2008 08:33:37.8120 */ The E-Mail Engine operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.8120 */ Resuming E-Mail Engine operation
<SGRP> /* Wed Jan 02 2008 08:33:37.8120 */ The Flashboards operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.8120 */ Resuming Flashboards operation
<SGRP> /* Wed Jan 02 2008 08:33:37.8120 */ The Flashboards operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.8120 */ Resuming Flashboards operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Server myserver-app1 assuming responsibility for unowned Administration operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Starting Administration operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Server myserver-app1 assuming responsibility for unowned Escalation operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Starting Escalation operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Server myserver-app1 assuming responsibility for unowned Archive operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Starting Archive operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Server myserver-app1 assuming responsibility for unowned E-Mail Engine operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Resuming E-Mail Engine operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Server myserver-app1 assuming responsibility for unowned Flashboards operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Resuming Flashboards operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Server myserver-app1 assuming responsibility for unowned Administration operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Starting Administration operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Server myserver-app1 assuming responsibility for unowned Escalation operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Starting Escalation operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Server myserver-app1 assuming responsibility for unowned Archive operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Starting Archive operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Server myserver-app1 assuming responsibility for unowned E-Mail Engine operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Resuming E-Mail Engine operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Server myserver-app1 assuming responsibility for unowned Flashboards operation
<SGRP> /* Wed Jan 02 2008 08:35:36.8610 */ Resuming Flashboards operation

As you see, there are two weird things going on... it seems to be doing each operation twice... and for that matter, each time I add the server back into the group it creates two records in the ranking form for each server... any ideas?

_____

From: Action Request System discussion list(ARSList) [mailto:[EMAIL PROTECTED] On Behalf Of Shyman, Jonathan
Sent: Friday, December 28, 2007 12:54 PM
To: arslist@ARSLIST.ORG
Subject: Re: Server Group Error 8745

Something may not have been set right when this server was originally added to the server group. It may be possible to track down what if you can compare the settings in ar.conf and in Remedy Administrator on both servers. If that fails to turn up anything, a next step might be to remove the primary server from the server group and then add it back. I'm not sure what impact that may have on operations when you do it, or if you'd have to stop/restart all the server group servers to make the change.

J.T.
Shyman
Column Technologies
[EMAIL PROTECTED]

_____

From: LJ LongWing (Head)
Sent: Fri 12/28/2007 2:48 PM
To: arslist@ARSLIST.ORG
Subject: Re: Server Group Error 8745

Ok... here is something odd. The primary server (the one having the issues with the escalation queue) says this on startup:

<SGRP> /* Fri Dec 28 2007 12:02:49.7240 */ The Administration operation is currently owned by server <ThisServer>
<SGRP> /* Fri Dec 28 2007 12:02:49.7240 */ Starting Administration operation

Then 30 seconds later, and every thirty seconds after that, it gives this message for each operation that it is an owner for:

<SGRP> /* Fri Dec 28 2007 12:03:18.6640 */ Server <ThisServer> assuming responsibility for unowned Administration operation
<SGRP> /* Fri Dec 28 2007 12:03:18.6640 */ Starting Administration operation

which tells me that server1 in this group isn't recognizing itself?

_____

From: Action Request System discussion list(ARSList) [mailto:[EMAIL PROTECTED] On Behalf Of Shyman, Jonathan
Sent: Friday, December 28, 2007 11:38 AM
To: arslist@ARSLIST.ORG
Subject: Re: Server Group Error 8745

Yes, what he said. Thanks, Joe. Good point.

J.T. Shyman
Column Technologies
[EMAIL PROTECTED]

_____

From: Joe D'Souza
Sent: Fri 12/28/2007 1:34 PM
To: arslist@ARSLIST.ORG
Subject: Re: Server Group Error 8745

Also turn on the server group logs on all the servers belonging to the server group. If there is a communication problem between the servers in the server group, it will show in these logs...

Joe

-----Original Message-----
From: Action Request System discussion list(ARSList) [mailto:[EMAIL PROTECTED] On Behalf Of Shyman, Jonathan
Sent: Friday, December 28, 2007 1:29 PM
To: arslist@ARSLIST.ORG
Subject: Re: Server Group Error 8745

Since this seems to be occurring during server group communication, I would wonder if the two servers can talk to each other. If these servers are in fact in a server group, here's what I'd do to start troubleshooting:

1. Make sure each server in the group can ping each other server both by short name and by fully-qualified domain name (FQDN).
2. Make sure each server in the group can ping the server group host name. I'm assuming you are using a hardware load-balancer of some kind to do this and that it has its own name and IP address.
3. Make sure the hosts file on each server points to the server group host name for its own IP.

J.T. Shyman
Column Technologies
[EMAIL PROTECTED]

_____

From: LJ LongWing (Head)
Sent: Fri 12/28/2007 1:24 PM
To: arslist@ARSLIST.ORG
Subject: Re: Server Group Error 8745

The first restart today was a complete reboot of the server, so there is no possibility of a thread not shutting down, and as you can see the error started almost exactly an hour afterwards. The second one you see is just a restart of the service, and again it ran for an hour before throwing the error... so I don't think it's a thread issue.

_____

From: Action Request System discussion list(ARSList) [mailto:[EMAIL PROTECTED] On Behalf Of Joe D'Souza
Sent: Friday, December 28, 2007 11:12 AM
To: arslist@ARSLIST.ORG
Subject: Re: Server Group Error 8745

This is probably because the escalation thread that was created previously has not been killed gracefully after the server was brought down. After stopping the services, could you check to see if the Escalation RPC is still in use by an rpcinfo -all?

Joe

-----Original Message-----
From: Action Request System discussion list(ARSList) [mailto:[EMAIL PROTECTED] On Behalf Of LJ LongWing (Head)
Sent: Friday, December 28, 2007 12:58 PM
To: arslist@ARSLIST.ORG
Subject: Re: Server Group Error 8745

Found some more information about this... the error appears to begin almost exactly 1 hour after I restart the server:

Fri Dec 28 07:06:46 2007 : Action Request System(R) Server Version 7.1.00 Patch 001 200711161033 (c) Copyright 1991-2007 BMC Software, Inc.
Fri Dec 28 08:06:58 2007 ServerGroup : Unable to create a queue to process RPC requests. (ARERR 8745)
Fri Dec 28 08:06:58 2007 CreateEscalationQueue
Fri Dec 28 09:51:10 2007 : Action Request System(R) Server Version 7.1.00 Patch 001 200711161033 (c) Copyright 1991-2007 BMC Software, Inc.
Fri Dec 28 10:51:00 2007 ServerGroup : Unable to create a queue to process RPC requests. (ARERR 8745)
Fri Dec 28 10:51:00 2007 CreateEscalationQueue

Any suggestions yet?

_____

From: LJ LongWing (Head) [mailto:[EMAIL PROTECTED]
Sent: Friday, December 28, 2007 7:04 AM
To: arslist@ARSLIST.ORG
Subject: Server Group Error 8745

I recently stood up my first server group, and one of the servers is giving the following error:

Fri Dec 28 06:54:10 2007 ServerGroup : Unable to create a queue to process RPC requests. (ARERR 8745)
Fri Dec 28 06:54:10 2007 CreateEscalationQueue

The strange part is that I can't find this error in the documentation: not in the Error guide or any of the other docs. I can't find any reference to it anywhere, not on Google, nor in the list archives... any suggestions?

ARS 7.1
Win2k3

__20060125_______________________This posting was submitted with HTML in it___
_______________________________________________________________________________
UNSUBSCRIBE or access ARSlist Archives at www.arslist.org
Platinum Sponsor: www.rmsportal.com
ARSlist: "Where the Answers Are"
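
For anyone comparing their own server group trace against the logs quoted in this thread, here is a minimal Python sketch that counts entries logged more than once and tallies the "assuming responsibility for unowned" messages per operation. It assumes entries follow the `<SGRP> /* timestamp */ message` layout seen in the thread; the function names and the `SAMPLE` text are made up for illustration and are not part of AR System.

```python
import re
from collections import Counter

# One trace entry: "<SGRP> /* <timestamp> */ <message>", up to the next
# "<SGRP>" marker or the end of the text. Layout assumed from the thread.
ENTRY = re.compile(
    r"<SGRP>\s*/\*\s*(.+?)\s*\*/\s*(.*?)\s*(?=<SGRP>|$)", re.DOTALL
)
UNOWNED = re.compile(r"assuming responsibility for unowned (.+?) operation")


def parse_entries(text):
    """Return (timestamp, message) pairs, with whitespace normalized."""
    return [(ts, " ".join(msg.split())) for ts, msg in ENTRY.findall(text)]


def repeated_entries(text):
    """Return {(timestamp, message): count} for entries seen more than once."""
    counts = Counter(parse_entries(text))
    return {entry: n for entry, n in counts.items() if n > 1}


def unowned_takeovers(text):
    """Count 'assuming responsibility for unowned X' messages per operation."""
    counts = Counter()
    for _, message in parse_entries(text):
        match = UNOWNED.search(message)
        if match:
            counts[match.group(1)] += 1
    return counts


# Illustrative sample: a few lines copied from the log quoted in the thread.
SAMPLE = """\
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ The Administration operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ Starting Administration operation
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ The Administration operation is currently owned by server myserver-app1
<SGRP> /* Wed Jan 02 2008 08:33:37.7810 */ Starting Administration operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Server myserver-app1 assuming responsibility for unowned Administration operation
<SGRP> /* Wed Jan 02 2008 08:34:36.8290 */ Starting Administration operation
"""

if __name__ == "__main__":
    for (timestamp, message), count in repeated_entries(SAMPLE).items():
        print(f"{count}x {timestamp} {message}")
    for operation, count in unowned_takeovers(SAMPLE).items():
        print(f"unowned takeover: {operation} ({count})")
```

Running the same functions over the full contents of a trace file from each server in the group would show whether the doubling and the repeated takeovers happen on one server or on all of them.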