[tickets] [opensaf:tickets] #719 AMFD: Invalid read when deleting a node from a node group

2014-01-22 Thread Hans Feldt
- **status**: review --> assigned --- ** [tickets:#719] AMFD: Invalid read when deleting a node from a node group** **Status:** assigned **Created:** Wed Jan 15, 2014 05:35 AM UTC by Gary Lee **Last Updated:** Thu Jan 16, 2014 05:35 AM UTC **Owner:** Gary Lee When deleting a node from a node

[tickets] [opensaf:tickets] #601 AMF: restarted component is assigned a CSI with HA state QUIESCED

2014-01-22 Thread Hans Feldt
From Praveen: "While testing #601 getting crash. case: Node shutdown and FAILED_OPERATION in quiescing callback. Crash does not come without patch. (gdb) bt #0 0x0039ef679b60 in strlen () from /lib64/libc.so.6 #1 0x0039ef646cb9 in vfprintf () from /lib64/libc.so.6 #2 0x0039ef6

[tickets] [opensaf:tickets] #690 Opensaf start failed when RDE could not RESPAWN.

2014-01-22 Thread Anders Widell
changeset: 4823:987352974f01 branch: opensaf-4.2.x parent: 4819:c6106939fc4c user: Anders Widell date: Wed Jan 22 09:07:33 2014 +0100 summary: base: Use default scheduling policy when configured policy is invalid [#690] changeset: 4824:561f35399a85 branch: op

[tickets] [opensaf:tickets] #690 Opensaf start failed when RDE could not RESPAWN.

2014-01-22 Thread Anders Widell
- **status**: review --> fixed --- ** [tickets:#690] Opensaf start failed when RDE could not RESPAWN.** **Status:** fixed **Created:** Tue Dec 24, 2013 09:21 AM UTC by manu **Last Updated:** Fri Jan 17, 2014 02:50 PM UTC **Owner:** Anders Widell Changeset:- 4733 Opensaf is up and running. 1)

[tickets] [opensaf:tickets] #737 AdminOp sync request queued up for more than 25mins

2014-01-22 Thread Anders Bjornerstedt
- **status**: unassigned --> accepted - **assigned_to**: Anders Bjornerstedt - **Version**: 4.4.M0 --> Unofficial 4.4 version - **Milestone**: future --> 4.4.RC1 --- ** [tickets:#737] AdminOp sync request queued up for more than 25mins** **Status:** accepted **Created:** Wed Jan 22, 2014 07:02

[tickets] [opensaf:tickets] #722 payloads did not go for reboot when both the controllers rebooted

2014-01-22 Thread Mathi Naickan
Both 721 and 722 have not been reproduced. But yes, 722 happened immediately after 721 occurred in the same setup. However, the problems are different in 721 and 722. i.e. In 722, even though the controllers went for a reboot, the OpenSAF services are still alive on the payload and they should ha

[tickets] [opensaf:tickets] Re: #711 log record write FAILED with SA_AIS_ERR_TRY_AGAIN after failover

2014-01-22 Thread elunlen
The problem here is that the new root path is not saved correctly on standby when changed using an imm command. In the “old” logservice this was done in the log service imm OI both on active and standby (applier). This is changed in the “new” 4.4 variant so that the new root path is check-pointe

[tickets] [opensaf:tickets] Re: #722 payloads did not go for reboot when both the controllers rebooted

2014-01-22 Thread Anders Bjornerstedt
Well, #721 is not just a case of a "faster" failover, it is a case of an *incorrect* failover, or "unclean" failover. This is because the failover is done before MDS down has been received. We should revert the FM behavior back to how it was before unless someone can come up with some other way to g

[tickets] [opensaf:tickets] #738 INVALID_PARAM nees to be returned when a timeout in negative is passed for ClusterNodeGet_4

2014-01-22 Thread Sirisha Alla
--- ** [tickets:#738] INVALID_PARAM nees to be returned when a timeout in negative is passed for ClusterNodeGet_4** **Status:** unassigned **Created:** Wed Jan 22, 2014 08:35 AM UTC by Sirisha Alla **Last Updated:** Wed Jan 22, 2014 08:35 AM UTC **Owner:** nobody The issue is seen on changes

[tickets] [opensaf:tickets] #738 INVALID_PARAM needs to be returned when a timeout in negative is passed for ClusterNodeGet_4

2014-01-22 Thread Sirisha Alla
- **summary**: INVALID_PARAM nees to be returned when a timeout in negative is passed for ClusterNodeGet_4 --> INVALID_PARAM needs to be returned when a timeout in negative is passed for ClusterNodeGet_4 --- ** [tickets:#738] INVALID_PARAM needs to be returned when a timeout in negative is p

[tickets] [opensaf:tickets] #721 IMMD asserted when trying to become active during failover

2014-01-22 Thread Mathi Naickan
- **status**: unassigned --> assigned - **assigned_to**: Mathi Naickan --- ** [tickets:#721] IMMD asserted when trying to become active during failover** **Status:** assigned **Created:** Thu Jan 16, 2014 07:32 AM UTC by Sirisha Alla **Last Updated:** Fri Jan 17, 2014 11:03 AM UTC **Owner:** M

[tickets] [opensaf:tickets] #735 SMF: saAmfResponse send to early when quiesced

2014-01-22 Thread Ingvar Bergström
changeset: 4827:0ea0adfe07ff branch: opensaf-4.4.x parent: 4825:5c555032178c user: Ingvar Bergstrom date: Mon Jan 20 14:54:07 2014 +0100 summary: smfd: saAmfResponse invoked after quiesced_ack is received [#735] changeset: 4828:7957351d2292 tag: tip parent:

[tickets] [opensaf:tickets] #735 SMF: saAmfResponse send to early when quiesced

2014-01-22 Thread Ingvar Bergström
- **status**: review --> fixed --- ** [tickets:#735] SMF: saAmfResponse send to early when quiesced** **Status:** fixed **Created:** Mon Jan 20, 2014 07:45 AM UTC by Ingvar Bergström **Last Updated:** Tue Jan 21, 2014 12:23 PM UTC **Owner:** Ingvar Bergström SMFD send response (saAmfResponse)

[tickets] [opensaf:tickets] #663 unlock of payload node fails with time out

2014-01-22 Thread Praveen
changeset: 4829:0d30e0e32380 user: praveen.malv...@oracle.com date: Wed Jan 22 17:42:51 2014 +0530 summary: amfd: respond node level admin op at sufailover [#663] changeset: 4830:9d2955c86a96 branch: opensaf-4.4.x tag: tip parent: 4827:0ea0adfe07ff user:

[tickets] [opensaf:tickets] #663 unlock of payload node fails with time out

2014-01-22 Thread Praveen
- **status**: review --> fixed --- ** [tickets:#663] unlock of payload node fails with time out** **Status:** fixed **Created:** Tue Dec 17, 2013 11:28 AM UTC by surender khetavath **Last Updated:** Thu Jan 09, 2014 06:42 AM UTC **Owner:** Praveen changeset : 4733 model : 2n configuration : 1

[tickets] [opensaf:tickets] #601 AMF: restarted component is assigned a CSI with HA state QUIESCED

2014-01-22 Thread Hans Feldt
It is clear from the trace that AVND_COMP_CSI_REC instances are deleted in the chain: avnd_su_si_oper_done avnd_su_si_del avnd_su_si_csi_del and then the call chain unwinds back to avnd_comp_csi_remove where this freed memory is dereferenced because that is what is being iterated over
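The failure mode described here is the familiar one of freeing list records while a caller is still iterating over them. A minimal sketch in C, assuming a plain singly linked list: `struct csi_rec` and the helper names are hypothetical stand-ins, not the real AVND_COMP_CSI_REC or amfnd code, and the sketch only covers the simple case where the loop itself frees the current record.

```c
#include <stdlib.h>

/* Hypothetical stand-in for a per-component CSI record
 * (not the real AVND_COMP_CSI_REC layout). */
struct csi_rec {
    int id;
    struct csi_rec *next;
};

/* Push a new record on the front of the list and return the new head.
 * On allocation failure the list is returned unchanged. */
static struct csi_rec *csi_push(struct csi_rec *head, int id)
{
    struct csi_rec *rec = malloc(sizeof *rec);
    if (rec == NULL)
        return head;
    rec->id = id;
    rec->next = head;
    return rec;
}

/* Remove every record with an even id and return the new head.
 * The record is unlinked (its successor is read) before it is freed,
 * so the loop never dereferences freed memory. */
static struct csi_rec *csi_remove_even(struct csi_rec *head, int *removed)
{
    struct csi_rec **link = &head;

    *removed = 0;
    while (*link != NULL) {
        struct csi_rec *curr = *link;
        if (curr->id % 2 == 0) {
            *link = curr->next; /* unlink before freeing */
            free(curr);
            (*removed)++;
        } else {
            link = &curr->next;
        }
    }
    return head;
}

/* Free whatever is left in the list. */
static void csi_free_all(struct csi_rec *head)
{
    while (head != NULL) {
        struct csi_rec *next = head->next; /* save successor first */
        free(head);
        head = next;
    }
}
```

If a callee can free more than the record it was handed, saving the successor is not enough; the caller would then need to re-read the list head (or otherwise revalidate its position) after each call.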

[tickets] [opensaf:tickets] #601 AMF: restarted component is assigned a CSI with HA state QUIESCED

2014-01-22 Thread Hans Feldt
How about that the break in line 1763 only breaks the inner loop. I think it is supposed to break both loops. Now it will do a FIND_NEXT with curr_csi and start over again. Praveen please try to change this and verify. --- ** [tickets:#601] AMF: restarted component is assigned a CSI with HA sta
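For reference, a C `break` leaves only the innermost enclosing loop, which is the behavior questioned above. A minimal sketch, assuming a small fixed-width grid search; the function names and data are hypothetical and unrelated to the amfnd source. It counts how many cells are visited with an inner-only `break` versus a flag that ends both loops.

```c
#include <stdbool.h>
#include <stddef.h>

/* Inner-only break: after the target is found the outer loop keeps
 * going and scans the remaining rows. Returns the number of cells visited. */
static int visits_inner_break_only(const int grid[][3], size_t rows, int target)
{
    int visits = 0;

    for (size_t r = 0; r < rows; r++) {
        for (size_t c = 0; c < 3; c++) {
            visits++;
            if (grid[r][c] == target)
                break; /* leaves the inner loop only */
        }
    }
    return visits;
}

/* Flag-controlled exit: the inner break is paired with a flag that
 * the outer loop condition checks, so both loops stop at the target. */
static int visits_break_both(const int grid[][3], size_t rows, int target)
{
    int visits = 0;
    bool found = false;

    for (size_t r = 0; r < rows && !found; r++) {
        for (size_t c = 0; c < 3; c++) {
            visits++;
            if (grid[r][c] == target) {
                found = true;
                break;
            }
        }
    }
    return visits;
}
```

A `goto` to a label after the outer loop, or moving the nested loops into a function and using `return`, are common alternatives to the flag.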

Re: [tickets] [opensaf:tickets] #601 AMF: restarted component is assigned a CSI with HA state QUIESCED

2014-01-22 Thread praveen malviya
I will verify it. Thanks, Praveen On 22-Jan-14 10:14 PM, Hans Feldt wrote: How about that the break in line 1763 only breaks the inner loop. I think it is supposed to break both loops. Now it will do a FIND_NEXT with curr_csi and start over again. Praveen please try to change this and verify.

[tickets] [opensaf:tickets] #732 Amf crashed while performing LOCK-IN operation on the SU.

2014-01-22 Thread Nagendra Kumar
- **status**: review --> fixed --- ** [tickets:#732] Amf crashed while performing LOCK-IN operation on the SU.** **Status:** fixed **Created:** Mon Jan 20, 2014 06:51 AM UTC by manu **Last Updated:** Thu Jan 23, 2014 04:48 AM UTC **Owner:** Nagendra Kumar Changeset - 4733 SU is on the SC-1 n

[tickets] [opensaf:tickets] #732 Amf crashed while performing LOCK-IN operation on the SU.

2014-01-22 Thread Nagendra Kumar
changeset: 4831:4fb798a8a8b5 branch: opensaf-4.3.x parent: 4824:561f35399a85 user: Nagendra Kumar date: Thu Jan 23 10:13:26 2014 +0530 summary: amfnd: ignore healthcheck command response for terminated component [#732] changeset: 4832:6f9d8d676411 branch: open

[tickets] [opensaf:tickets] #362 AMF saflogs UNKNOWN...

2014-01-22 Thread Nagendra Kumar
changeset: 4834:426b6b6e71e5 branch: opensaf-4.2.x parent: 4823:987352974f01 user: Nagendra Kumar date: Thu Jan 23 10:21:24 2014 +0530 summary: amfd: remove UNKNOWN word from saflog [#362] changeset: 4835:0c89cdcbacfd branch: opensaf-4.3.x parent: 4831:4fb

[tickets] [opensaf:tickets] #362 AMF saflogs UNKNOWN...

2014-01-22 Thread Nagendra Kumar
- **status**: review --> fixed --- ** [tickets:#362] AMF saflogs UNKNOWN...** **Status:** fixed **Created:** Fri May 31, 2013 03:28 AM UTC by Nagendra Kumar **Last Updated:** Thu Jan 23, 2014 04:55 AM UTC **Owner:** Nagendra Kumar Migrated from http://devel.opensaf.org/ticket/1794 AMF logs s

[tickets] [opensaf:tickets] #657 SMF component got faulted during middleware rollback from CS-4667 to CS-3796

2014-01-22 Thread Ingvar Bergström
- **status**: unassigned --> accepted - **assigned_to**: Ingvar Bergström - **Version**: --> 4.4 --- ** [tickets:#657] SMF component got faulted during middleware rollback from CS-4667 to CS-3796** **Status:** accepted **Created:** Fri Dec 13, 2013 11:59 AM UTC by Hrishikesh **Last Updated:*

[tickets] [opensaf:tickets] Re: #601 AMF: restarted component is assigned a CSI with HA state QUIESCED

2014-01-22 Thread Praveen
I will verify it. Thanks, Praveen On 22-Jan-14 10:14 PM, Hans Feldt wrote: > > How about that the break in line 1763 only breaks the inner loop. I > think it is supposed to break both loops. Now it will do a FIND_NEXT > with curr_csi and start over again. Praveen please try to change this > and

[tickets] [opensaf:tickets] #324 amf: The operational state of a component stays DISABLED while that of SU gets ENABLED when the respective node gets added into the cluster.

2014-01-22 Thread Nagendra Kumar
- **status**: review --> assigned - **assigned_to**: Nagendra Kumar --> nobody --- ** [tickets:#324] amf: The operational state of a component stays DISABLED while that of SU gets ENABLED when the respective node gets added into the cluster.** **Status:** assigned **Created:** Fri May 24,

[tickets] [opensaf:tickets] #324 amf: The operational state of a component stays DISABLED while that of SU gets ENABLED when the respective node gets added into the cluster.

2014-01-22 Thread Nagendra Kumar
- **status**: assigned --> review - **assigned_to**: Nagendra Kumar --- ** [tickets:#324] amf: The operational state of a component stays DISABLED while that of SU gets ENABLED when the respective node gets added into the cluster.** **Status:** review **Created:** Fri May 24, 2013 09:19 AM U

[tickets] [opensaf:tickets] #739 amf: amfnd crashes during removal of assignments

2014-01-22 Thread Praveen
--- ** [tickets:#739] amf: amfnd crashes during removal of assignments ** **Status:** unassigned **Created:** Thu Jan 23, 2014 06:30 AM UTC by Praveen **Last Updated:** Thu Jan 23, 2014 06:30 AM UTC **Owner:** nobody The issue is observed in 2N model and in those configurations in which SUs h

[tickets] [opensaf:tickets] #740 MsgQueueOpen returns ERR_NO_RESOURCES randomly

2014-01-22 Thread Sirisha Alla
--- ** [tickets:#740] MsgQueueOpen returns ERR_NO_RESOURCES randomly** **Status:** unassigned **Created:** Thu Jan 23, 2014 07:45 AM UTC by Sirisha Alla **Last Updated:** Thu Jan 23, 2014 07:45 AM UTC **Owner:** nobody MsgQueueOpen fails randomly with ERR_NO_RESOURCES. This issue is seen on