---

** [tickets:#1081] Node went for reboot in the middle of switchover**

**Status:** unassigned
**Milestone:** 4.3.3
**Created:** Mon Sep 15, 2014 11:45 AM UTC by Sirisha Alla
**Last Updated:** Mon Sep 15, 2014 11:45 AM UTC
**Owner:** nobody

The issue is seen on SLES X86 VMs running with opensaf changeset 5697 +#946 
patch. IMM Db is loaded with 50k objects.

IMM Application along with switchovers is in progress.

Syslog on SC-2:

Sep 15 15:16:03 SLES-64BIT-SLOT2 osafamfnd[2452]: NO Assigned 
'safSi=SC-2N,safApp=OpenSAF' ACTIVE to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer disconnected 2 
<0, 2010f> (@OpenSafImmReplicatorA)
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer (applier) 
connected: 32 (@OpenSafImmReplicatorA) <0, 2010f>
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare 
from primary on PRTA update ccb:100000036
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE 
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare 
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still 
processing
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer (applier) 
connected: 33 (@OpenSafImmReplicatorB) <348, 2020f>
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafntfimcnd[4058]: NO Started
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare 
from primary on PRTA update ccb:100000036
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE 
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare 
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still 
processing
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare 
from primary on PRTA update ccb:100000036
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE 
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare 
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still 
processing
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare 
from primary on PRTA update ccb:100000036
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE 
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare 
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still 
processing
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare 
from primary on PRTA update ccb:100000036
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE 
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare 
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still 
processing
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare 
from primary on PRTA update ccb:100000036
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: WA update of PERSISTENT 
runtime attributes in object 'safNode=PL-3,safCluster=myClmCluster' REVERTED. 
PBE rc:20
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer locally 
disconnected. Marking it as doomed 28 <5, 2020f> (safClmService)
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer disconnected 
28 <5, 2020f> (safClmService)
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer connected: 34 
(safClmService) <353, 2020f>
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE 
slave ccbId:100000037/4294967351 numOps:1
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare 
ccb:100000037/4294967351 received at Pbe slave when Prior Ccb 22 still 
processing
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare 
from primary on PRTA update ccb:100000036
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafclmd[2417]: ER saImmOiImplementerSet 
failed rc:14, exiting
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafclmd[2417]: ER saImmOiImplementerSet 
failed rc:14, exiting
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafclmd[2417]: ER saImmOiImplementerSet 
failed rc:14, exiting
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafamfnd[2452]: NO 
'safComp=CLM,safSu=SC-2,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown' : 
Recovery is 'nodeFailfast'
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafamfnd[2452]: ER 
safComp=CLM,safSu=SC-2,safSg=2N,safApp=OpenSAF Faulted due to:avaDown Recovery 
is:nodeFailfast
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafamfnd[2452]: Rebooting OpenSAF NodeId = 
131599 EE Name = , Reason: Component faulted: recovery is node failfast, 
OwnNodeId = 131599, SupervisionTime = 60
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer locally 
disconnected. Marking it as doomed 34 <353, 2020f> (safClmService)
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer disconnected 
34 <353, 2020f> (safClmService)
Sep 15 15:16:06 SLES-64BIT-SLOT2 opensaf_reboot: Rebooting local node; 
timeout=60

Not sure whether this is reoccurance of behavior mentioned in #946 (note that 
#946 patch is incorporated) or that of clm needs to handle ERR_EXIST as err try 
again. Syslog and traces attached.


---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Want excitement?
Manually upgrade your production database.
When you want reliability, choose Perforce
Perforce version control. Predictably reliable.
http://pubads.g.doubleclick.net/gampad/clk?id=157508191&iu=/4140/ostg.clktrk
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to