---
**[tickets:#1081] Node went for reboot in the middle of switchover**
**Status:** unassigned
**Milestone:** 4.3.3
**Created:** Mon Sep 15, 2014 11:45 AM UTC by Sirisha Alla
**Last Updated:** Mon Sep 15, 2014 11:45 AM UTC
**Owner:** nobody
The issue is seen on SLES x86 VMs running OpenSAF changeset 5697 plus the #946
patch. The IMM DB is loaded with 50k objects.
An IMM application was running while switchovers were in progress.
Syslog on SC-2:
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafamfnd[2452]: NO Assigned
'safSi=SC-2N,safApp=OpenSAF' ACTIVE to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer disconnected 2
<0, 2010f> (@OpenSafImmReplicatorA)
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer (applier)
connected: 32 (@OpenSafImmReplicatorA) <0, 2010f>
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare
from primary on PRTA update ccb:100000036
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still
processing
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer (applier)
connected: 33 (@OpenSafImmReplicatorB) <348, 2020f>
Sep 15 15:16:03 SLES-64BIT-SLOT2 osafntfimcnd[4058]: NO Started
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare
from primary on PRTA update ccb:100000036
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still
processing
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare
from primary on PRTA update ccb:100000036
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:04 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still
processing
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare
from primary on PRTA update ccb:100000036
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still
processing
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare
from primary on PRTA update ccb:100000036
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE
slave ccbId:100000036/4294967350 numOps:1
Sep 15 15:16:05 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare
ccb:100000036/4294967350 received at Pbe slave when Prior Ccb 22 still
processing
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare
from primary on PRTA update ccb:100000036
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: WA update of PERSISTENT
runtime attributes in object 'safNode=PL-3,safCluster=myClmCluster' REVERTED.
PBE rc:20
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer locally
disconnected. Marking it as doomed 28 <5, 2020f> (safClmService)
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer disconnected
28 <5, 2020f> (safClmService)
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer connected: 34
(safClmService) <353, 2020f>
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmpbed: IN ccb-prepare received at PBE
slave ccbId:100000037/4294967351 numOps:1
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmpbed: NO Prepare
ccb:100000037/4294967351 received at Pbe slave when Prior Ccb 22 still
processing
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmpbed: IN PBE slave waiting for prepare
from primary on PRTA update ccb:100000036
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafclmd[2417]: ER saImmOiImplementerSet
failed rc:14, exiting
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafclmd[2417]: ER saImmOiImplementerSet
failed rc:14, exiting
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafclmd[2417]: ER saImmOiImplementerSet
failed rc:14, exiting
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafamfnd[2452]: NO
'safComp=CLM,safSu=SC-2,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown' :
Recovery is 'nodeFailfast'
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafamfnd[2452]: ER
safComp=CLM,safSu=SC-2,safSg=2N,safApp=OpenSAF Faulted due to:avaDown Recovery
is:nodeFailfast
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafamfnd[2452]: Rebooting OpenSAF NodeId =
131599 EE Name = , Reason: Component faulted: recovery is node failfast,
OwnNodeId = 131599, SupervisionTime = 60
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer locally
disconnected. Marking it as doomed 34 <353, 2020f> (safClmService)
Sep 15 15:16:06 SLES-64BIT-SLOT2 osafimmnd[2366]: NO Implementer disconnected
34 <353, 2020f> (safClmService)
Sep 15 15:16:06 SLES-64BIT-SLOT2 opensaf_reboot: Rebooting local node;
timeout=60
Not sure whether this is a recurrence of the behavior described in #946 (note
that the #946 patch is incorporated) or whether CLM needs to handle ERR_EXIST
the same way as ERR_TRY_AGAIN. Syslog and traces are attached.
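
To make the second possibility concrete, here is a minimal sketch of what "handle ERR_EXIST as ERR_TRY_AGAIN" could look like. It is not the clmd code and not a proposed patch: `implementer_set_with_retry`, `implementer_set_fn` and `fake_implementer_set` are hypothetical names, the callback stands in for the real `saImmOiImplementerSet()` call, and the return codes are redefined locally so the sketch compiles without the OpenSAF headers (rc:14 in the syslog is ERR_EXIST). Whether retrying is actually the right fix is exactly the open question of this ticket.

```c
#include <assert.h>
#include <stddef.h>

/* Local copies of the AIS return codes used here (assumption: the sketch is
 * built without saAis.h). rc:14 in the syslog above is SA_AIS_ERR_EXIST. */
typedef enum {
    SA_AIS_OK = 1,
    SA_AIS_ERR_TRY_AGAIN = 6,
    SA_AIS_ERR_EXIST = 14
} SaAisErrorT;

/* Hypothetical callback that wraps the real saImmOiImplementerSet() call. */
typedef SaAisErrorT (*implementer_set_fn)(void *ctx);

/* Hypothetical helper: retry while the result is TRY_AGAIN or EXIST, up to
 * max_attempts calls. backoff may be NULL; otherwise it runs between tries. */
static SaAisErrorT implementer_set_with_retry(implementer_set_fn try_set,
                                              void *ctx,
                                              unsigned max_attempts,
                                              void (*backoff)(void))
{
    SaAisErrorT rc = SA_AIS_ERR_TRY_AGAIN;
    unsigned attempt;

    assert(try_set != NULL);
    for (attempt = 0; attempt < max_attempts; attempt++) {
        rc = try_set(ctx);
        if (rc != SA_AIS_ERR_TRY_AGAIN && rc != SA_AIS_ERR_EXIST)
            break; /* success or a non-transient error: stop retrying */
        if (backoff != NULL && attempt + 1 < max_attempts)
            backoff();
    }
    return rc;
}

/* Stand-in for IMM during a switchover: the old implementer name is still
 * registered for the first two calls, then the set succeeds. ctx points to
 * an int that counts the calls. */
static SaAisErrorT fake_implementer_set(void *ctx)
{
    int *calls = ctx;

    *calls += 1;
    return (*calls <= 2) ? SA_AIS_ERR_EXIST : SA_AIS_OK;
}
```

With the stand-in above, a call such as `implementer_set_with_retry(fake_implementer_set, &calls, 5, NULL)` returns SA_AIS_OK on the third attempt. With a limit of two attempts it returns SA_AIS_ERR_EXIST, which is the point where clmd currently exits and AMF escalates to nodeFailfast.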
---
Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.