[devel] Update on my Active Participation in OpenSAF project

2017-08-31 Thread A V Mahesh
Dear All, As you all know, yesterday was my last day of active participation in the OpenSAF project. The day I joined the OpenSAF community, an open source project was new to me, but the community's openness allowed me to discuss freely and participate in technical discussions and the ex

Re: [devel] [PATCH 1/1] smf: fix to avoid core dump during smf upgrade while imm sync [#2550]

2017-08-28 Thread A V Mahesh
Hi Vijay, ACK. This is better than the previous patch because it recycles the wait time, but it is not the final solution, so please write an enhancement ticket to fix the issue with a solution such as: smf (an IMM application) gets a callback or is notified by IMM, or in some other way, after completing the sync process

Re: [devel] [PATCH 1/1] dtm: Re-factor the inter-node receive logic [#2554]

2017-08-22 Thread A V Mahesh
Hi Anders, ACK. not tested. -AVM On 8/21/2017 2:25 PM, Anders Widell wrote: The DTM receive logic was unnecessarily complex: it called recv() in many different places and had conditional statements that executed different pieces of code depending on how many bytes that had been received on th

Re: [devel] [PATCH 1/1] smf: fix to avoid core dump during smf upgrade while imm sync [#2550]

2017-08-17 Thread A V Mahesh
On 8/18/2017 8:43 AM, A V Mahesh wrote: Hi Vijay Roy , If i have larger imm db to sync , you will end up with same problem again. One more point , if I have multiple node in queue waiting for sync , again you will end up with same problem . -AVM On 8/17/2017 5:34 PM, Vijay Roy wrote

Re: [devel] [PATCH 1/1] imm: change log level from error to warning in PBE [#2559]

2017-08-17 Thread A V Mahesh
ACK, not tested. -AVM On 8/17/2017 6:33 PM, Zoran Milinkovic wrote: In #2491, the log message is always logged with error log level. If a cluster goes headless, this case is very likely to happen with ERR_TRY_AGAIN, but that's expected behavior when the cluster goes headless. --- src/imm/c

Re: [devel] [PATCH 1/1] smf: fix to avoid core dump during smf upgrade while imm sync [#2550]

2017-08-17 Thread A V Mahesh
Hi Vijay Roy , If i have larger imm db to sync , you will end up with same problem again. -AVM On 8/17/2017 5:34 PM, Vijay Roy wrote: --- src/smf/smfd/SmfUpgradeStep.cc | 31 +++ 1 file changed, 27 insertions(+), 4 deletions(-) diff --git a/src/smf/smfd/SmfUpgr

Re: [devel] [PATCH 1/1] dtm: Support clusters with more than one hundred nodes [#2240]

2017-08-16 Thread A V Mahesh
Hi Anders Widell, ACK, overall the changes look good. The patch can be pushed now; if we find any issues on top of this patch, they can be addressed later. -AVM On 8/16/2017 3:49 PM, Anders Widell wrote: Re-factor the inter-node poll loop by replacing the 100 entry fixed size poll vector with

Re: [devel] [PATCH 1/1] dtmd: send broadcast messages after initial discovery [#2522]

2017-08-16 Thread A V Mahesh
ftransportd[177]: Rebooting OpenSAF NodeId = 0 EE Name = No EE Mapped, Reason: osafdtmd Process down, Rebooting the node, OwnNodeId = 0, SupervisionTime = 60 2017-08-15 10:41:20.570 PL-4 opensaf_reboot: Rebooting local node; timeout=60 On 08/16/2017 10:11 AM, A V Mahesh wrote: Hi Anders

Re: [devel] [PATCH 1/1] imm: regenerate PBE in verifyClassPBE only if database is corrupted [#2491]

2017-08-16 Thread A V Mahesh
Hi Zoran, ACK, not tested. -AVM On 8/14/2017 6:08 PM, Zoran Milinkovic wrote: In verifyClassPBE(), the patch distinguishes between an IMM issue and database corruption. For an IMM issue, PBE will not be regenerated, while for database corruption, PBE will be regenerated. --- src/imm/commo

Re: [devel] [PATCH 1/1] dtmd: send broadcast messages after initial discovery [#2522]

2017-08-16 Thread A V Mahesh
, Anders Widell On 08/16/2017 06:21 AM, A V Mahesh wrote: Hi Anders , On 8/15/2017 6:13 PM, Anders Widell wrote: Is it a pure performance optimization, or is there some potential problem that can happen if two nodes discover each other at the same time and establish TCP connections in both

Re: [devel] [PATCH 1/1] dtmd: send broadcast messages after initial discovery [#2522]

2017-08-15 Thread A V Mahesh
casting at a lower rate after the initial discovery phase. If we have only one configuration option and enable this new continuous broadcast mode by setting that option to zero as you suggest, we would need to hard-code the length of the initial phase (to five seconds?) as well as the interval between broadcast

Re: [devel] [PATCH 1/1] imm: include CLM in poll before CLM handle is initialized [#2544]

2017-08-15 Thread A V Mahesh
Hi Zoran, ACK, not tested. -AVM On 8/7/2017 8:33 PM, Zoran Milinkovic wrote: CLM selection object is initially set to -1. Included CLM selection object in poll will be ignored until CLM selection object is created and set to fds[FD_CLM]. --- src/imm/immnd/immnd_main.c | 4 +--- 1 file chan
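The patch summary above relies on poll() skipping entries whose descriptor is negative. A minimal sketch of that behaviour follows; the slot names `FD_PIPE` and `FD_CLM` and the `PollResult` type are hypothetical and only loosely mirror the `fds[FD_CLM]` naming quoted above, so this is not the immnd code itself.

```cpp
#include <poll.h>
#include <unistd.h>
#include <cassert>

// Hypothetical slot layout for illustration only.
enum { FD_PIPE = 0, FD_CLM = 1, FD_COUNT = 2 };

struct PollResult {
  int ready;          // return value of poll(), or -1 on setup failure
  short clm_revents;  // revents reported for the placeholder slot
};

// Polls one readable pipe plus a placeholder slot whose fd is still -1.
PollResult poll_with_placeholder() {
  int p[2];
  if (pipe(p) != 0) return {-1, 0};
  char byte = 'x';
  if (write(p[1], &byte, 1) != 1) {
    close(p[0]);
    close(p[1]);
    return {-1, 0};
  }

  struct pollfd fds[FD_COUNT];
  fds[FD_PIPE].fd = p[0];
  fds[FD_PIPE].events = POLLIN;
  fds[FD_PIPE].revents = 0;
  fds[FD_CLM].fd = -1;  // selection object not created yet
  fds[FD_CLM].events = POLLIN;
  fds[FD_CLM].revents = 0;

  // POSIX: entries with a negative fd are ignored and get revents == 0.
  int n = poll(fds, FD_COUNT, 1000);
  PollResult result = {n, fds[FD_CLM].revents};
  close(p[0]);
  close(p[1]);
  return result;
}
```

Because the placeholder slot is ignored, the array can keep a fixed layout and the real descriptor can be stored into the slot once it exists.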

Re: [devel] [PATCH 1/1] imm: fix immnd coredump due to initialized CLM handle [#2549]

2017-08-15 Thread A V Mahesh
Hi Zoran, ACK with the following, not tested. Please check that `immnd_cb->clm_hdl` is cleared in the headless case. -AVM On 8/9/2017 5:14 PM, Zoran Milinkovic wrote: Initially CLM handle is set to 0. When CLM handle is initialized, it can be initialized again only when saClmDispatch returns

Re: [devel] [PATCH 1/1] dtmd: send broadcast messages after initial discovery [#2522]

2017-08-13 Thread A V Mahesh
Hi Alex Jones, I don't think you need an additional configuration option (CONT_BCAST_INT) for continuing dtm discovery; you can use DTM_INI_DIS_TIMEOUT_SECS=5 (dtm_discovery_timeout_secs) itself, just by adding an additional comment saying: "# Comment the next line (DTM_INI_DIS

Re: [devel] [PATCH 2/2] dtmd: change log wording [#2030]

2017-08-09 Thread A V Mahesh
Hi Alex, ACK with the following (not tested). Please take care of alignment. -AVM On 8/9/2017 8:14 PM, Alex Jones wrote: return failure when unable to add new node --- src/dtm/dtmnd/dtm_node.c | 7 +-- 1 file changed, 5 insertions(+), 2 deletions(-) diff --git a/src/dtm/dtmnd/dtm_node.c

Re: [devel] [PATCH 1/1] log: fix cppcheck, cpplint and reorganize headers - part 1 [#2445]

2017-08-08 Thread A V Mahesh
Hi Canh Van, ACK, with the same comment as Lennart: change `SaAisErrorT om_rc` in `static void read_logsv_config_obj_2()` in the same way as you already did in `static SaAisErrorT amf_quiescing_state_handler()` in this patch itself. ===

Re: [devel] [PATCH 1/1] dtmd: fix uninitialized memory errors and crashes when deleting nodes [#2545]

2017-08-08 Thread A V Mahesh
Hi Alex , ACK, Not tested. -AVM On 8/9/2017 12:02 AM, Alex Jones wrote: ==27393== Thread 4: ==27393== Conditional jump or move depends on uninitialised value(s) ==27393==at 0x4C2F336: __memcmp_sse4_1 (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) ==27393==by 0x4E64ADF: nc

Re: [devel] [PATCH 1/1] dtmd: change log wording [#2030]

2017-08-08 Thread A V Mahesh
Hi Alex, You removed osafassert(0); and even `rc` is not updated to NCSCC_RC_FAILURE; why do you want to proceed when node_id=0? node->node_id can be zero because of other factors, please check. -AVM On 8/8/2017 11:42 PM, Alex Jones wrote: Sep 13 14:00:52 SC-2 local0.err osafdtmd[378]: ER

Re: [devel] [PATCH 1/1] log: fix referring to uninitialized variable caused assertion failed [#2546]

2017-08-08 Thread A V Mahesh
Hi Vu, ACK. -AVM On 8/8/2017 3:01 PM, Vu Minh Nguyen wrote: Log application got coredump due to failed assertion in Log agent. The problem was caused by referring to uninitialized variable `bool updated`. Previously, `updated` variable was initialized when the method `FetchAndIncreaseRefCoun

Re: [devel] [PATCH 1/1] clm: Include boot time and node address in join request message [#2489]

2017-08-07 Thread A V Mahesh
Hi Anders & praveen, On 8/7/2017 1:35 PM, Anders Widell wrote: A node can have more than one single network address. If you run the ifconfig command, you get a list of network interfaces. Each one of these interfaces can have several address assigned to it: IPv4 addresses, IPv6 addresses, and

Re: [devel] [PATCH 1/1] base: Add osaf_get_boot_time and osaf_timespec_average functions [#2535]

2017-08-04 Thread A V Mahesh
Hi Anders, ACK not tested. -AVM On 7/28/2017 1:45 PM, Anders Widell wrote: The osaf_get_boot_time function returns the time stamp when the node was booted. The osaf_timespec_average function returns the average of two time stamps. --- src/base/Makefile.am | 12 ++-
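The description above says osaf_timespec_average returns the average of two time stamps. The following is a minimal sketch of one way to compute such an average without overflowing the seconds field; `timespec_average_sketch` is a hypothetical name, it assumes normalized, non-negative inputs, and it is not the OpenSAF implementation.

```cpp
#include <ctime>
#include <cassert>

// Hypothetical helper: midpoint of two normalized, non-negative timestamps.
// Each field is halved before adding so tv_sec cannot overflow.
timespec timespec_average_sketch(const timespec& a, const timespec& b) {
  const long kNanosPerSec = 1000000000L;
  timespec r;
  r.tv_sec = a.tv_sec / 2 + b.tv_sec / 2;
  long nsec = a.tv_nsec / 2 + b.tv_nsec / 2;
  // Each odd tv_sec lost half a second in the division above.
  nsec += static_cast<long>((a.tv_sec % 2) + (b.tv_sec % 2)) * (kNanosPerSec / 2);
  // Two odd tv_nsec values together lost one whole nanosecond.
  nsec += (a.tv_nsec % 2 + b.tv_nsec % 2) / 2;
  if (nsec >= kNanosPerSec) {  // at most one carry is possible
    r.tv_sec += 1;
    nsec -= kNanosPerSec;
  }
  r.tv_nsec = nsec;
  return r;
}
```

Halving before adding trades a little extra arithmetic for safety near the upper end of the `time_t` range; a plain sum followed by a division would be simpler when overflow is not a concern.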

Re: [devel] [PATCH 0/1] Review Request for base: Add osaf_get_boot_time and osaf_timespec_average functions [#2535]

2017-08-03 Thread A V Mahesh
Hi Anders Widell , I will review and ACK by EOD. -AVM On 8/3/2017 2:08 PM, Anders Widell wrote: Hi Mahesh! I intend to push this tomorrow unless there are any comments. thanks, Anders Widell On 07/28/2017 10:15 AM, Anders Widell wrote: Summary: base: Add osaf_get_boot_time and osaf_timesp

Re: [devel] [PATCH 1/1] log: fix coredump generated by referencing to un-initialized pointer [#2534]

2017-07-25 Thread A V Mahesh
Hi Vu, ACK , Not tested. -AVM On 7/25/2017 1:39 PM, Vu Minh Nguyen wrote: Fix coredump generated by referencing to un-initialized pointer `o_msg` in LogAgent::saLogInitialize(). --- src/log/agent/lga_agent.cc | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/src/log/agent

Re: [devel] [PATCH 1/1] log: fix log supported maximum 2047 characters for long DN [#2525]

2017-07-20 Thread A V Mahesh
Hi Canh, ACK, not tested. -AVM On 7/12/2017 1:40 PM, Canh Van Truong wrote: Currently, log supports a maximum of 2047 characters for long DN. It should support a maximum of 2048 characters. The patch also fixes to check SA_AMF_COMPONENT_NAME maximum 2048 characters and refactor encode and decode for

Re: [devel] [PATCH 1/1] imm: regenerate PBE on SQL constraint error [#2527]

2017-07-18 Thread A V Mahesh
Hi Zoran, ACK, Not tested. Sorry for the delay I was out of office. -AVM On 7/12/2017 5:37 PM, Zoran Milinkovic wrote: PBE will be regenerated on SQL constraint error code. --- src/imm/common/immpbe_dump.cc | 17 + 1 file changed, 17 insertions(+) diff --git a/src/imm/comm

Re: [devel] [PATCH 1/1] nid: correct typo in svc_map [#2519]

2017-07-04 Thread A V Mahesh
Hi Hans N, ACK, Not tested. -AVM On 7/4/2017 7:53 PM, Hans Nordeback wrote: --- src/nid/nodeinit.cc | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/src/nid/nodeinit.cc b/src/nid/nodeinit.cc index 2f9c785cc..9eddd743d 100644 --- a/src/nid/nodeinit.cc +++ b/src/nid/nodeinit

Re: [devel] [PATCH 1/1] imm: send imm finalize message to immnd when dispatch returns ERR_BAD_HANDLE [#2504]

2017-07-03 Thread A V Mahesh
Hi Zoran, ACK, Tested below case. -AVM On 6/30/2017 8:39 AM, A V Mahesh wrote: Hi Zoran, Thanks for the test case, I will test like below. -AVM On 6/29/2017 3:53 PM, Zoran Milinkovic wrote: Hi Mahesh, Try to set an implementer on a class with disabled long DN support. When you create an

Re: [devel] [PATCH 1/1] log: update test case logtest 2 51 [#2515]

2017-07-03 Thread A V Mahesh
Hi Canh Van, ACK with following , code review only. On 6/30/2017 12:35 PM, Canh Van Truong wrote: +if ((ret = clock_gettime(CLOCK_REALTIME, &timeout_t)) == -1) { I suggest to use `osaf_clock_gettime(CLOCK_REALTIME, &timeout_t);` -AVM On 6/30/2017 12:35 PM, Canh Van Truong wrote: T
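The review comment above suggests replacing the explicit return-value check with a wrapper call. A minimal sketch of that pattern follows, assuming the wrapper aborts on failure so that callers need no error branch; `checked_clock_gettime` and `deadline_after` are hypothetical stand-ins, not the OpenSAF `osaf_clock_gettime` source.

```cpp
#include <cstdlib>
#include <ctime>
#include <cassert>

// Hypothetical stand-in for an abort-on-failure clock wrapper.
// A failing clock_gettime() is treated as a programming error.
inline void checked_clock_gettime(clockid_t clock_id, timespec* ts) {
  if (clock_gettime(clock_id, ts) != 0) std::abort();
}

// Example caller: builds an absolute deadline without any error handling.
timespec deadline_after(time_t seconds) {
  timespec t;
  checked_clock_gettime(CLOCK_REALTIME, &t);
  t.tv_sec += seconds;
  return t;
}
```

With such a wrapper, test code that only needs a timeout value stays free of `ret == -1` branches that can never be exercised in practice.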

Re: [devel] [PATCH 1/1] imm: send imm finalize message to immnd when dispatch returns ERR_BAD_HANDLE [#2504]

2017-06-29 Thread A V Mahesh
that, you will not be able to set a new implementer with the same name. BR, Zoran -Original Message- From: A V Mahesh [mailto:mahesh.va...@oracle.com] Sent: den 29 juni 2017 05:39 To: Zoran Milinkovic Cc: opensaf-devel@lists.sourceforge.net Subject: Re: [PATCH 1/1] imm: send imm finalize

Re: [devel] [PATCH 1/1] imm: send imm finalize message to immnd when dispatch returns ERR_BAD_HANDLE [#2504]

2017-06-28 Thread A V Mahesh
Hi Zoran, How were you able to catch this bug? Just by manually watching the IMMND run-time size grow while running a threaded application! -AVM On 6/28/2017 7:32 PM, Zoran Milinkovic wrote: Send IMM_FINALIZE message to immnd when dispatch functions returns ERR_BAD_HANDLE. IMM_FINALIZE will

Re: [devel] [PATCH 1/1] osaf: Fix build problems when using GCC 7.1.1 [#2502]

2017-06-28 Thread A V Mahesh
Hi Anders, ACK, not tested ( need to setup GCC 7.1.1). -AVM On 6/20/2017 2:27 PM, Anders Widell wrote: Fix build problems caused by the following warnings: * -Werror=format-overflow= * -Werror=parentheses * -Werror=unused-variable --- src/amf/amfd/imm.cc| 2

Re: [devel] [PATCH 1/1] amfnd: retry on ERR_NOT_EXIST [#2490]

2017-06-22 Thread A V Mahesh
not yet exist. Thanks Gary -Original Message- From: A V Mahesh Organization: Oracle Corporation Date: Thursday, 22 June 2017 at 12:32 pm To: gary Cc: praveen malviya , , Nagendra Kumar , minh chau , Subject: Re: [devel] [PATCH 1/1] amfnd: retry on ERR_NOT_EXIST [#2490] Hi Gary,

Re: [devel] [PATCH 1/1] amfnd: retry on ERR_NOT_EXIST [#2490]

2017-06-21 Thread A V Mahesh
can you please elaborate. -AVM Gary On 21 Jun 2017, at 6:55 pm, A V Mahesh wrote: Hi Gary, On 6/21/2017 12:47 PM, Gary Lee wrote: Hi Mahesh The IMM objects and admin op are issued on the same node. AMFD will ‘forward’ the admin op to AMFND on *another* node, where sometimes the object i

Re: [devel] [PATCH 1/1] amfnd: retry on ERR_NOT_EXIST [#2490]

2017-06-21 Thread A V Mahesh
can initiated admin op earlier than the IMM object sync why because AMF messages are having higher priority (MDS). Can you please explore option of both object create & admin operation can occur on same AMFND node ( not ‘forwarding’ ) -AVM Thanks Gary On 21/6/17, 5:14 pm, "A V Mahes

Re: [devel] [PATCH 1/1] amfnd: retry on ERR_NOT_EXIST [#2490]

2017-06-21 Thread A V Mahesh
Hi Praveen/Gary, On 6/21/2017 12:13 PM, praveen malviya wrote: Hi Gary, Is there any ticket in IMM for any related issue? Both creation of SU and admin operation on it will go through IMM only. I do not know how IMM handles it and how IMM spec talks about it. [AVM] Even we have problem of new

Re: [devel] [PATCH 1/1] imm: Allow stale clients to be discarded when going to headless [#2487]

2017-06-15 Thread A V Mahesh
Hi Hung Nguyen, Ok, ACK from me. -AVM On 6/15/2017 12:03 PM, Hung Nguyen wrote: Hi Mahesh, Stale clients are clients that are not discarded (yet) due to problem with sending messages to IMMD. In case of headless, we don't care about IMMD, we just want to get rid of all clients locally, so

Re: [devel] [PATCH 1/1] imm: Allow stale clients to be discarded when going to headless [#2487]

2017-06-14 Thread A V Mahesh
Hi Hung, Thanks for the explanation. I still don't see the advantage of unsetting the mIsStale flag in the headless case; can you please clarify the points below tagged with [AVM]? On 6/15/2017 10:03 AM, Hung Nguyen wrote: Hi Mahesh, In imma_discard_connection(), we mark the client as stale when having problem

Re: [devel] [PATCH 1/1] imm: Allow stale clients to be discarded when going to headless [#2487]

2017-06-14 Thread A V Mahesh
Hi Hung, Just trying to understand the logic of unsetting the mIsStale flag of the clients before immnd invokes immnd_proc_imma_discard_connection(): immnd also invokes immnd_proc_imma_discard_connection() in multiple other cases, for example immnd_proc_imma_down(), in addition to immnd_

Re: [devel] [PATCH 1/1] imm: fix counting timeouts for old critical CCBs [#2495]

2017-06-14 Thread A V Mahesh
or not. -1 is not 0 (FALSE), and IF block will be executed. I don't see problem here. Thanks, Zoran -Original Message- From: A V Mahesh [mailto:mahesh.va...@oracle.com] Sent: den 14 juni 2017 12:13 To: Zoran Milinkovic Cc: opensaf-devel@lists.sourceforge.net Subject: Re: [PATCH

Re: [devel] [PATCH 1/1] imm: fix counting timeouts for old critical CCBs [#2495]

2017-06-14 Thread A V Mahesh
Hi Zoran, ACK from me. Note: osaf_timespec_compare() can also return -1; please cross-check once the logic at the very first if() condition of void ImmModel::getOldCriticalCcbs(). Looks OK to me. -AVM On 6/14/2017 3:03 PM, Zoran Milinkovic wrote: The patch fixes counting timeouts for old cr
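The exchange above turns on what happens when a three-way comparison result is used directly as an if() condition. The sketch below only illustrates that general behaviour and makes no claim about the actual ImmModel code; `timespec_compare_sketch`, `used_as_boolean`, `is_later` and `make_ts` are hypothetical names.

```cpp
#include <ctime>
#include <cassert>

// Hypothetical helper to build a timestamp field by field.
timespec make_ts(time_t sec, long nsec) {
  timespec t;
  t.tv_sec = sec;
  t.tv_nsec = nsec;
  return t;
}

// Hypothetical three-way compare: -1 if a < b, 0 if equal, 1 if a > b.
int timespec_compare_sketch(const timespec& a, const timespec& b) {
  if (a.tv_sec != b.tv_sec) return a.tv_sec < b.tv_sec ? -1 : 1;
  if (a.tv_nsec != b.tv_nsec) return a.tv_nsec < b.tv_nsec ? -1 : 1;
  return 0;
}

// Treating the result as a boolean is true for both -1 and 1.
bool used_as_boolean(const timespec& a, const timespec& b) {
  return timespec_compare_sketch(a, b) != 0;
}

// An explicit comparison states which ordering is meant.
bool is_later(const timespec& a, const timespec& b) {
  return timespec_compare_sketch(a, b) > 0;
}
```

Whether "not equal" or "strictly later" is the intended condition depends on the caller, which is why the review asks for the first if() to be cross-checked.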

Re: [devel] [PATCH 1/1] log: fix log agent does not use client version when recovery [#2457]

2017-06-11 Thread A V Mahesh
Hi Canh Van, ACK from me. Sorry for the delay, I was out of office last week. -AVM On 5/24/2017 5:29 PM, Canh Van Truong wrote: When log agent recover the client, it uses the highest version that log agent supported to re-initialize client. It should use the client version to re-initialize

Re: [devel] [PATCH 1/1] immtest: Explicitly clear implementer name in OI long DN testcases [#2484]

2017-06-06 Thread A V Mahesh
Hi Hung Nguyen, Ok , ACK from me. -AVM On 6/7/2017 10:19 AM, Hung Nguyen wrote: Hi Mahesh, Because the longDn testsuite use the same implementer for the testcases and the other testsuites don't. So we don't have this problem in other testsuites. BR, Hung Nguyen - DEK Technologies -

Re: [devel] [PATCH 1/1] immtest: Explicitly clear implementer name in OI long DN testcases [#2484]

2017-06-06 Thread A V Mahesh
Hi Hung Nguyen, ACK with following : The saImmOiFinalize() function does not release the objects and the implementer name that will still be associated with the immOiHandle, that is why we now Explicitly called saImmOiImplementerClear(), if that is the case why only long DN testcases ? -AV

Re: [devel] [PATCH 1/1] imm: remove vector clearing for dead implementers, nodes and admin owners in objectSync [#2481]

2017-06-05 Thread A V Mahesh
;. During the sync, IMM DB is locked. Veteran nodes are read-only (R_AVAILABLE state), and syncing nodes are write-only (W_AVAILABLE state). While veteran nodes are in R_AVAILABLE state, they only check their data with syncing data, but do not update any data. BR, Zoran -----Original

Re: [devel] [PATCH 1/1] imm: remove vector clearing for dead implementers, nodes and admin owners in objectSync [#2481]

2017-06-05 Thread A V Mahesh
lears that vector on each sync object, and reexecuting discard node is not possible when sync is done. The same is for dead implementers and admin owners. Thanks, Zoran -Original Message- From: A V Mahesh [mailto:mahesh.va...@oracle.com] Sent: den 5 juni 2017 07:13 To: Zoran Milinkovi

Re: [devel] [PATCH 1/1] imm: remove vector clearing for dead implementers, nodes and admin owners in objectSync [#2481]

2017-06-04 Thread A V Mahesh
Hi Zoran, On 6/2/2017 8:45 PM, Zoran Milinkovic wrote: After removing the clearing of dead implementer, node and admin owner vectors, re-executing on vectors will be done after the node is fully synced. Just Trying to understand the problem more , I understand problem as follows please confi

[devel] [PATCH 1/1] mds: clear mds lib valgrind warning [#2474]

2017-06-01 Thread A V Mahesh
--- src/mds/mds_c_api.c | 4 +++- src/mds/mds_c_db.c| 12 ++-- src/mds/mds_dt_tcp.c | 1 + src/mds/mds_dt_tipc.c | 1 + src/mds/mds_log.cc| 3 +++ 5 files changed, 18 insertions(+), 3 deletions(-) diff --git a/src/mds/mds_c_api.c b/src/mds/mds_c_api.c index a5986c2..f5ba318

[devel] [PATCH 0/1] Review Request for mds: clear mds lib valgrind warning [#2474]

2017-06-01 Thread A V Mahesh
" above): - *** EXPLAIN/COMMENT THE PATCH SERIES HERE *** revision 9fc4e0c96a82b5670408747b06d38a22c0f967fd Author: A V Mahesh Date: Thu, 1 Jun 2017 14:22:30 +0530 mds: clear mds lib valgrind warning [#2474] Fixed MDS valgrind-3.12.0 report Complete diffstat: -- src/mds/m

Re: [devel] [PATCH 1/1] imm: Discard Adm Impl continuation when peer IMMND is down [#2461]

2017-05-17 Thread A V Mahesh
Hi Hung Nguyen, On 5/17/2017 1:10 PM, Hung Nguyen wrote: > + /* Discard Adm Impl continuation */ > + for (ci3 = sAdmImplContinuationMap.begin(); > + ci3 != sAdmImplContinuationMap.end();) { If possible convert iterator ci3 to auto. -AVM On 5/17/2017 1:10 PM, Hung Nguyen wrote: > Discard
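The suggestion above is to declare the loop iterator with auto. A minimal sketch of an erase-while-iterating loop written that way follows; `ContinuationMap` and `discard_for_node` are hypothetical and simplified (invocation id mapped to a node id), so they only stand in for the real `sAdmImplContinuationMap` handling.

```cpp
#include <cstddef>
#include <map>
#include <cassert>

// Hypothetical, simplified map: invocation id -> owning node id.
using ContinuationMap = std::map<unsigned long long, unsigned int>;

// Removes every entry that belongs to dead_node and returns how many were removed.
std::size_t discard_for_node(ContinuationMap& continuations, unsigned int dead_node) {
  std::size_t removed = 0;
  // 'auto' deduces ContinuationMap::iterator, so the loop header stays short.
  for (auto it = continuations.begin(); it != continuations.end();) {
    if (it->second == dead_node) {
      it = continuations.erase(it);  // erase() returns the next valid iterator
      ++removed;
    } else {
      ++it;
    }
  }
  return removed;
}
```

The increment is left out of the for header on purpose, since the iterator advances either through erase() or through ++it, never both.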

Re: [devel] [PATCH 1/1] log: process TRY AGAIN in logtest 2 40 and logtest 2 41 [#2453]

2017-05-17 Thread A V Mahesh
Hi Canh Van, ACK, not tested. -AVM On 5/8/2017 1:08 PM, Canh Van Truong wrote: > Re-write again in case test cases receive write ack with TRY AGAIN > error > --- > src/log/apitest/tet_saLogStreamOpen_2.c | 1 - > src/log/apitest/tet_saLogWriteLogCallbackT.c | 49 >

[devel] [PATCH 1/1] log: correct mutex lock and unlock sequence in logd [#2452]

2017-05-10 Thread A V Mahesh
issue: logd is crashing because a mutex is unlocked twice without a lock. fix: corrected the lgs_ftcom_mutex lock and unlock sequence --- src/log/logd/lgs_filehdl.cc | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/src/log/logd/lgs_filehdl.cc b/src/log/logd/lgs_filehdl.cc in
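The issue described above is an unlock without a matching lock. The sketch below shows how an error-checking pthread mutex reports that mistake instead of leaving the behaviour undefined; `UnlockResult` and `unlock_twice` are hypothetical names, and nothing here is taken from lgs_filehdl.cc.

```cpp
#include <pthread.h>
#include <cerrno>
#include <cassert>

struct UnlockResult {
  int first;   // result of the unlock that matches the lock
  int second;  // result of the extra unlock
};

// Locks once and unlocks twice on an error-checking mutex.
UnlockResult unlock_twice() {
  pthread_mutexattr_t attr;
  pthread_mutex_t mutex;
  pthread_mutexattr_init(&attr);
  // ERRORCHECK makes a stray unlock return an error code.
  pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_ERRORCHECK);
  pthread_mutex_init(&mutex, &attr);
  pthread_mutexattr_destroy(&attr);

  pthread_mutex_lock(&mutex);
  UnlockResult result;
  result.first = pthread_mutex_unlock(&mutex);   // balanced: succeeds
  result.second = pthread_mutex_unlock(&mutex);  // unbalanced: reports EPERM
  pthread_mutex_destroy(&mutex);
  return result;
}
```

With a default mutex type the second unlock is undefined behaviour, which is consistent with a crash; an error-checking mutex is one way to surface such sequence bugs during testing.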

[devel] [PATCH 0/1] Review Request for log: correct mutex lock and unlock sequence in logd [#2452]

2017-05-10 Thread A V Mahesh
THE PATCH SERIES HERE *** revision ea3ba1360df02fe44e3b472daed5beebfcbb8d65 Author: A V Mahesh Date: Thu, 11 May 2017 10:28:36 +0530 log: correct mutex lock and unlock sequence in logd [#2452] issue : logd is crashing because of unlocking a mutex twice without lock fix :corrected lgs_ftcom_mute

Re: [devel] [PATCH 1/1] log: fix agent gets TRY_AGAIN instead TIMEOUT during failover [#2411]

2017-05-10 Thread A V Mahesh
"no active" check in > the agent. > Note: This will only work if there is a "graceful" role change meaning that > the server leaving its active role is still working. [AVM] why we want give different reply to Agent graceful (TRY AGAIN) & no-graceful (TIMEO

Re: [devel] [PATCH 1/1] log: fix agent gets TRY_AGAIN instead TIMEOUT during failover [#2411]

2017-05-10 Thread A V Mahesh
Hi Canh Van, Please check my comments as [AVM]. -AVM On 5/10/2017 12:06 PM, Canh Van Truong wrote: > > Hi Mahesh, > > Thanks for your update. > > Please check my comments. > > Regards > > Canh > > -----Original Message- > From: A V Mahesh [mailto:mah

Re: [devel] [PATCH 1/1] log: fix agent gets TRY_AGAIN instead TIMEOUT during failover [#2411]

2017-05-09 Thread A V Mahesh
Hi Canh Van, The update one: On 5/5/2017 6:37 PM, Canh Van Truong wrote: > During testing, e.g: use saflogger in loop to send a log record to log service > during failover, we frequently encounter saLogStreamClose() or saLogFinalize() > get SA_AIS_ERR_TIMEOUT as active LOG service is shutdown whi

Re: [devel] [PATCH 1/1] log: fix agent gets TRY_AGAIN instead TIMEOUT during failover [#2411]

2017-05-09 Thread A V Mahesh
Hi Canh Van, On 5/5/2017 6:37 PM, Canh Van Truong wrote: > During testing, e.g: use saflogger in loop to send a log record to log service > during failover, we frequently encounter saLogStreamClose() or saLogFinalize() > get SA_AIS_ERR_TIMEOUT as active LOG service is shutdown while agent request

Re: [devel] [PATCH 1/1] amfnd: Ignore second NCSMDS_DOWN [#2436]

2017-05-03 Thread A V Mahesh
; The patch looks ok to me. >> >> Thanks >> -Nagu >> >>> -Original Message- >>> From: minh chau [mailto:minh.c...@dektech.com.au] >>> Sent: 28 April 2017 15:24 >>> To: A V Mahesh; Suryanarayana.Garlapati; hans.nordeb...@ericsson.c

Re: [devel] [PATCH 1/1] log: generate hash only if having destination name set [#2438]

2017-05-02 Thread A V Mahesh
Hi Vu, I ACKed early today; haven't you received it? On 5/2/2017 12:32 PM, A V Mahesh wrote: > Hi Vu, > > Ack. > > -AVM > > > On 4/26/2017 12:25 PM, Vu Minh Nguyen wrote: >> rfc5424_msgid is only referred when streaming and >> having destination

Re: [devel] [PATCH 1/1] log: generate hash only if having destination name set [#2438]

2017-05-02 Thread A V Mahesh
Hi Vu, Ack. -AVM On 4/26/2017 12:25 PM, Vu Minh Nguyen wrote: > rfc5424_msgid is only referred when streaming and > having destination name set on that log stream. > > It means it does meaningless job sometimes, > that is do hash calculation even there is no destination name set. > > This ticke

Re: [devel] [PATCH 1/1] log: fix checkpoint dest_names in open stream request [#2434]

2017-04-28 Thread A V Mahesh
----Original Message- >> From: A V Mahesh [mailto:mahesh.va...@oracle.com] >> Sent: Friday, April 28, 2017 4:56 AM >> To: Canh Van Truong ; >> lennart.l...@ericsson.com; vu.m.ngu...@dektech.com.au >> Cc: opensaf-devel@lists.sourceforge.net >> Subject: Re: [PATCH

Re: [devel] [PATCH 1/1] log: fix checkpoint dest_names in open stream request [#2434]

2017-04-27 Thread A V Mahesh
laces. Maybe we could > refactor to handle in one place to make clean code. > But it is not related to this ticket. We could refactor it in later ? > > Regards > Canh > > -Original Message- > From: Canh Van Truong [mailto:canh.v.tru...@dektech.com.au] > Sent: Th

Re: [devel] [PATCH 1/1] base: Blocking send causes AMF health check time-out [#2278]

2017-04-27 Thread A V Mahesh
Hi Hans N, ACK. This patch will also resolve another problem of mine, which I have been debugging, related to clock_gettime(CLOCK_MONOTONIC). -AVM On 4/27/2017 7:25 PM, Hans Nordeback wrote: > --- > src/base/sysf_tmr.c | 20 +--- > 1 file changed, 17 insertions(+), 3 deletio

Re: [devel] [PATCH 1/1] log: fix checkpoint dest_names in open stream request [#2434]

2017-04-27 Thread A V Mahesh
ient is associated with the stream. The > cold sync is the same reason. > > I am not sure if I understand your question? Could you clear it to me? > > Thanks > > Canh > > -Original Message- > From: A V Mahesh [mailto:mahesh.va...@oracle.com] > Sent: Thurs

Re: [devel] [PATCH 1/1] log: fix checkpoint dest_names in open stream request [#2434]

2017-04-26 Thread A V Mahesh
Hi Canh Van, Since you are updating client_id in the lgs_ckpt_stream_open() checkpoint, isn't client_id also required as a reference in the ckpt_stream_close() checkpoint? Currently it is assigned -1. -AVM On 4/25/2017 6:58 PM, Canh Van Truong wrote: > Handling of checkpoint for stream open is

Re: [devel] [PATCH 1/1] amfnd: Ignore second NCSMDS_DOWN [#2436]

2017-04-26 Thread A V Mahesh
> <143>1 2017-04-26T21:16:52.955775+10:00 PL-4 osafimmnd 395 mds.log > [meta sequenceId="30060"] << mds_mcm_await_active_tmr_expiry > > I guess the other node-director services also receive the 2nd > NCSMDS_DOWN(Vdest), but those services have no problem b

Re: [devel] [PATCH 1/1] amfnd: Ignore second NCSMDS_DOWN [#2436]

2017-04-26 Thread A V Mahesh
Hi Minh Chau, On 4/26/2017 12:05 PM, minh chau wrote: > amfnd will receive another NCSMDS_DOWN you mean amfnd is receiving NCSMDS_DOWN for same amfd twice ? or amfnd is receiving NCSMDS_DOWN for both active amfd & standby amfd ? -AVM On 4/26/2017 12:05 PM, minh chau wrote: > > @Suryanara

Re: [devel] [PATCH 1 of 1] cpd: to correct failover behavior of cpsv [#1765] V5

2017-04-14 Thread A V Mahesh
gs. > > Dear Zoran, > > Do you have any extra comment about this patch? > If not, I will request pushing it at start of next week. > > Sincerely, > Hoang > > -Original Message- > From: A V Mahesh [mailto:mahesh.va...@oracle.com] > Sent: Thursday, Apri

Re: [devel] [PATCH 1 of 1] cpd: to correct failover behavior of cpsv [#1765] V5

2017-04-13 Thread A V Mahesh
check that it may be redundant call . -AVM On 4/12/2017 2:19 PM, A V Mahesh wrote: > Hi Hoang, > > On 2/10/2017 3:09 PM, Vo Minh Hoang wrote: >> If cpnd is temporary down only, we don't need clean up anything. >> If cpnd is permanently down, the bad effect of this proposal

Re: [devel] [PATCH 1 of 1] cpd: to correct failover behavior of cpsv [#1765] V5

2017-04-12 Thread A V Mahesh
6f0500cf0, arg=0x7f86f0501ef0 === -AVM On 4/12/2017 11:08 AM, A V Mahesh wrote: > Hi Hoang, > > On 2/10/2017 3:09 PM, Vo Minh Hoang wrote: >> Dear Mahesh, >> >> Based on what I saw, in this case, retention time cannot detect CPND >>

Re: [devel] [PATCH 1 of 1] cpd: to correct failover behavior of cpsv [#1765] V5

2017-04-11 Thread A V Mahesh
consider it. > > Thank you and best regards, > Hoang > > -Original Message- > From: A V Mahesh [mailto:mahesh.va...@oracle.com] > Sent: Friday, February 10, 2017 10:40 AM > To: Hoang Vo ; zoran.milinko...@ericsson.com > Cc: opensaf-devel@lists.sourceforge.net > Subj

Re: [devel] [PATCH 1 of 1] log: fix log file size is not reset to zero (0) when log file is created [#1439]

2017-04-11 Thread A V Mahesh
Hi Canh Van, ACK not tested. -AVM On 3/22/2017 8:26 PM, Canh Van Truong wrote: > src/log/apitest/tet_LogOiOps.c | 92 > ++ > src/log/logd/lgs_stream.cc | 3 + > 2 files changed, 95 insertions(+), 0 deletions(-) > > > When updating attributes o

Re: [devel] [PATCH 1 of 1] ckpt: Increase limit for number of file desciptors in CKPTND [#2415]

2017-04-09 Thread A V Mahesh
Hi Anders Widell, ACK not tested . -AVM On 4/8/2017 2:39 PM, Anders Widell wrote: > src/ckpt/ckptnd/cpnd_main.c | 18 ++ > 1 files changed, 18 insertions(+), 0 deletions(-) > > > The CKPT node director needs one file descriptor per open replica. Since the > maximum number of
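The patch description above says the node director needs one file descriptor per open replica, which is why the process limit is raised. A minimal sketch of raising the soft RLIMIT_NOFILE limit follows; `raise_nofile_limit` is a hypothetical helper, the target value is the caller's choice, and this is not the cpnd_main.c change itself.

```cpp
#include <sys/resource.h>
#include <cassert>

// Hypothetical helper: raises the soft descriptor limit to 'wanted',
// capped at the hard limit. Returns false if a system call fails.
bool raise_nofile_limit(rlim_t wanted) {
  struct rlimit lim;
  if (getrlimit(RLIMIT_NOFILE, &lim) != 0) return false;
  if (lim.rlim_cur >= wanted) return true;  // already high enough
  rlim_t target = wanted;
  if (lim.rlim_max != RLIM_INFINITY && target > lim.rlim_max) {
    target = lim.rlim_max;  // an unprivileged process cannot exceed the hard limit
  }
  lim.rlim_cur = target;
  return setrlimit(RLIMIT_NOFILE, &lim) == 0;
}
```

Capping at the hard limit keeps the call from failing outright for unprivileged processes; a caller that needs the full value would have to check the resulting limit afterwards.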

Re: [devel] [PATCH 1 of 1] ckpt: Add option OSAF_CKPT_SHM_ALLOC_GUARANTEE=2 for backwards compatibility [#2395]

2017-04-07 Thread A V Mahesh
Ok, ACK. -AVM On 4/7/2017 1:05 PM, Anders Widell wrote: > See reply below. > > thanks, > > Anders Widell > > > On 04/07/2017 05:17 AM, A V Mahesh wrote: >> Hi Anders Widell, >> >> Ack with following comments : >> >> On

Re: [devel] [PATCH 1 of 1] ckpt: Add option OSAF_CKPT_SHM_ALLOC_GUARANTEE=2 for backwards compatibility [#2395]

2017-04-06 Thread A V Mahesh
One more comment added to previous mail. On 4/7/2017 8:47 AM, A V Mahesh wrote: > Hi Anders Widell, > > Ack with following comments : > > On 4/6/2017 4:55 PM, Anders Widell wrote: >> The setting OSAF_CKPT_SHM_ALLOC_GUARANTEE=0 results in increased >> memory us

Re: [devel] [PATCH 1 of 1] ckpt: Add option OSAF_CKPT_SHM_ALLOC_GUARANTEE=2 for backwards compatibility [#2395]

2017-04-06 Thread A V Mahesh
Hi Anders Widell, Ack with following comments : On 4/6/2017 4:55 PM, Anders Widell wrote: > The setting OSAF_CKPT_SHM_ALLOC_GUARANTEE=0 results in increased memory usage, > whereas the setting OSAF_CKPT_SHM_ALLOC_GUARANTEE=1 results in lower > performance. Thus, there was no way to configure CKPT

Re: [devel] [PATCH 1 of 1] log: finalize accessor handle when getting scAbsenceAllowed [#2406]

2017-04-02 Thread A V Mahesh
Hi Vu, I don't think a fix is required in LOG, unless we have a bug in IMM saImmOmInitialize(). According to the IMM spec, the saImmOmFinalize() function closes the association represented by the immHandle parameter between the invoking process and the IMM Service. If the saImmOmFinalize() function co

Re: [devel] [PATCH 1 of 1] log: finalize accessor handle when getting scAbsenceAllowed [#2404]

2017-04-02 Thread A V Mahesh
Hi Vu, I don't think a fix is required in LOG, unless we have a bug in IMM saImmOmInitialize(). According to the IMM spec, the saImmOmFinalize() function closes the association represented by the immHandle parameter between the invoking process and the IMM Service. If the saImmOmFinalize() function co

[devel] [PATCH 0 of 1] Review Request for smf: Fix all Cppcheck 1.77 issues [#2342] V1

2017-03-30 Thread A V Mahesh
n Comments (indicate scope for each "y" above): - changeset 6155b9372e7d740761052e7d26722bc7ff40b3b5 Author: A V Mahesh Date: Fri, 31 Mar 2017 09:59:32 +0530 smf: Fix all Cppcheck 1.77 issues [#2342] V1 V1 update :

Re: [devel] [PATCH 1 of 1] log: unit test failed due to not updating new rfc5424 msgid handling [#2399]

2017-03-27 Thread A V Mahesh
Hi Vu, ACK , Not tested. -AVM On 3/27/2017 7:57 PM, Vu Minh Nguyen wrote: > src/log/tests/lgs_dest_test.cc | 22 -- > 1 files changed, 12 insertions(+), 10 deletions(-) > > > Ticket #2397 has updated new handling on rf5424 msgid, > but forgot reflecting it to unit test.

Re: [devel] [PATCH 1 of 1] log: cache RFC5424 msgid when log stream is created [#2397]

2017-03-26 Thread A V Mahesh
Hi Vu, ACK not tested . -AVM On 3/24/2017 2:05 PM, Vu Minh Nguyen wrote: > src/log/logd/lgs_dest.cc | 4 +--- > src/log/logd/lgs_dest.h | 1 + > src/log/logd/lgs_evt.cc | 1 + > src/log/logd/lgs_imm.cc | 10 ++ > src/log/logd/lgs_mbcsv.cc | 5 + > src/log/logd

Re: [devel] [PATCH 1 of 1] log: fix both active and standby sites own the same log file name [#2383]

2017-03-26 Thread A V Mahesh
s ensured that just the stream in one node open the log file. > > Thanks > Canh > > -Original Message- > From: A V Mahesh [mailto:mahesh.va...@oracle.com] > Sent: Thursday, March 23, 2017 4:40 AM > To: Canh Van Truong ; > lennart.l...@ericsson.com; vu.m.ngu...

Re: [devel] [PATCH 1 of 1] log: fix both active and standby sites own the same log file name [#2383]

2017-03-22 Thread A V Mahesh
Hi Canh, On 3/17/2017 6:44 PM, Canh Van Truong wrote: > Fix: > When the lgs has HA state up to quiesced and receive mds callback with > MDS_CALLBACK_QUIESCED_ACK, lgs check all stream, if any stream are > opening log file, closing all log file. Let us consider a case where : HA state quiesce

Re: [devel] [PATCH 0 of 3] Review Request for amf: Fix all Cppcheck 1.77 issues [#2341] V3

2017-03-22 Thread A V Mahesh
Hi AMF dev, A gentle reminder for review. So far I have received comments from Gary. If you don't have any other comments, I will push by tomorrow EOD. -AVM On 3/20/2017 10:49 AM, A V Mahesh wrote: > Summary: amf: Fix all Cppcheck 1.77 issues [#2341] V3 > Review reque

[devel] [PATCH 1 of 3] amfd: Fix all Cppcheck 1.77 issues [#2341] V3

2017-03-19 Thread A V Mahesh
src/amf/amfd/app.cc|4 +- src/amf/amfd/apptype.cc| 21 ++ src/amf/amfd/chkop.cc |3 +- src/amf/amfd/ckpt_dec.cc | 15 ++--- src/amf/amfd/ckpt_edu.cc |4 +- src/amf/amfd/ckpt_enc.cc | 60 src/amf/amfd/clm.cc

[devel] [PATCH 0 of 3] Review Request for amf: Fix all Cppcheck 1.77 issues [#2341] V3

2017-03-19 Thread A V Mahesh
n Comments (indicate scope for each "y" above): - changeset efb3364a0779447b858cbd0cdae1b92f0a2d2716 Author: A V Mahesh Date: Mon, 20 Mar 2017 10:33:35 +0530 amfd: Fix all Cppcheck 1.77 issues [#2341] V3 V3 fixe

[devel] [PATCH 2 of 3] amfnd: Fix all Cppcheck 1.77 issues [#2341] V3

2017-03-19 Thread A V Mahesh
src/amf/amfnd/amfnd.cc | 16 ++-- src/amf/amfnd/cbq.cc | 10 -- src/amf/amfnd/chc.cc | 30 -- src/amf/amfnd/clc.cc | 38 +++--- src/amf/amfnd/comp.cc| 9 - src/amf/amfnd/compdb.cc | 18

[devel] [PATCH 3 of 3] amfa: Fix all Cppcheck 1.77 issues [#2341] V3

2017-03-19 Thread A V Mahesh
src/amf/agent/ava_hdl.cc | 14 ++ src/amf/agent/ava_mds.cc | 13 + src/amf/agent/ava_op.cc | 12 +--- src/amf/amfwd/amf_wdog.c | 3 ++- src/amf/common/d2nmsg.c | 8 +++- src/amf/common/n2avamsg.c | 2 +- src/amf/tools/amf_pm.c| 5 ++--- 7 f

[devel] [PATCH 0 of 1] Review Request for smf: Fix all Cppcheck 1.77 issues [#2342]

2017-03-16 Thread A V Mahesh
n Comments (indicate scope for each "y" above): - changeset 13f8239a95e09caaf2a92cdf716fc08c4511ab8c Author: A V Mahesh Date: Thu, 16 Mar 2017 14:35:53 +0530 smf: Fix all Cppcheck 1.77 issues [#2342] [src/smf/agen

Re: [devel] [PATCH 0 of 3] Review Request for amf: Fix all Cppcheck 1.77 issues [#2341] V2

2017-03-14 Thread A V Mahesh
* remove the corresponding element from mon_req list */ rc = avnd_mon_req_del(cb, pid); ===== -AVM On 3/15/2017 10:00 AM, A V Mahesh wrote: > Summary:amf: Fix all Cppcheck 1.77 issues [#2341] V2 > Review

Re: [devel] [PATCH 2 of 3] amfnd: Fix all Cppcheck 1.77 issues [#2341] V2

2017-03-14 Thread A V Mahesh
nt8_t > *)&rec->pid); > if (NCSCC_RC_SUCCESS != rc) { > LOG_NO("PM Rec doesn't exist in Comp '%s' of pid %llu", > comp->name.c_str(), pid); > } > -rec = nullptr; /* rec is

Re: [devel] [PATCH 0 of 3] Review Request for amf: Fix all Cppcheck 1.77 issues [#2341] V1

2017-03-14 Thread A V Mahesh
haps it's easier if you pushed V1 first. Otherwise the patches get even > bigger and harder to review. I was referring to regression tests failing > without the changes I proposed, when I said legacy tests failed. > > thanks > >> On 14 Mar 2017, at 6:01 pm, A V Mahesh wro

[devel] [PATCH 0 of 3] Review Request for amf: Fix all Cppcheck 1.77 issues [#2341] V2

2017-03-14 Thread A V Mahesh
n Comments (indicate scope for each "y" above): - changeset 9401812b47ad5a079e6e82daad0ff83c1cf635f2 Author: A V Mahesh Date: Wed, 15 Mar 2017 09:51:41 +0530 amfd: Fix all Cppcheck 1.77 issues [#2341] V2 V2 Re-based changes on OpenSAF

[devel] [PATCH 2 of 3] amfnd: Fix all Cppcheck 1.77 issues [#2341] V2

2017-03-14 Thread A V Mahesh
src/amf/amfnd/amfnd.cc | 15 +-- src/amf/amfnd/cbq.cc | 9 +++-- src/amf/amfnd/chc.cc | 29 +++-- src/amf/amfnd/clc.cc | 37 ++--- src/amf/amfnd/comp.cc| 8 +++- src/amf/amfnd/compdb.cc | 17

[devel] [PATCH 3 of 3] amfa: Fix all Cppcheck 1.77 issues [#2341] V2

2017-03-14 Thread A V Mahesh
src/amf/agent/ava_hdl.cc | 13 + src/amf/agent/ava_mds.cc | 12 src/amf/agent/ava_op.cc | 11 --- src/amf/amfwd/amf_wdog.c | 2 +- src/amf/common/d2nmsg.c | 7 ++- src/amf/common/n2avamsg.c | 1 - src/amf/tools/amf_pm.c| 4 +--- 7 files ch

[devel] [PATCH 1 of 3] amfd: Fix all Cppcheck 1.77 issues [#2341] V2

2017-03-14 Thread A V Mahesh
src/amf/amfd/app.cc|3 +- src/amf/amfd/apptype.cc| 20 ++ src/amf/amfd/chkop.cc |3 +- src/amf/amfd/ckpt_dec.cc | 15 ++--- src/amf/amfd/ckpt_edu.cc |3 +- src/amf/amfd/ckpt_enc.cc | 60 src/amf/amfd/clm.cc

Re: [devel] [PATCH 0 of 3] Review Request for amf: Fix all Cppcheck 1.77 issues [#2341] V1

2017-03-14 Thread A V Mahesh
Hi Gary, Previously you found an old application issue and resolved it; is that related to this patch or is it a different issue? -AVM On 3/14/2017 12:20 PM, A V Mahesh wrote: > Hi Gar, > > Thanks for the review. > > On 3/14/2017 11:47 AM, Gary Lee wrote: >> By the way,

Re: [devel] [PATCH 0 of 3] Review Request for amf: Fix all Cppcheck 1.77 issues [#2341] V1

2017-03-13 Thread A V Mahesh
id %llu", > comp->name.c_str(), pid); > } > > - delete rec; /* rec is no more, dont use it */ > + rec = nullptr; /* rec is no more, dont use it */ > > /* remove the corresponding element from mon_req list */ > rc = avnd_m

Re: [devel] [PATCH 1 of 1] base: Improve trace by using tid instead of pid [#2370]

2017-03-13 Thread A V Mahesh
Hi Hans N, It seems that in some flavors of Linux `gettid` is available, so should it be under #ifndef? -AVM On 3/13/2017 5:46 PM, Hans Nordeback wrote: > src/base/logtrace.c | 7 ++- > 1 files changed, 6 insertions(+), 1 deletions(-) > > > Even though not being LSB it would be good to use t
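
For readers following the `gettid` question above, here is a minimal sketch of one way to obtain a thread id portably on Linux. It is not the code from the patch to src/base/logtrace.c; the helper name `trace_tid` is hypothetical. The sketch assumes that the raw `syscall(SYS_gettid)` form is used, because a libc `gettid()` wrapper is a function and so cannot be detected with a plain `#ifndef gettid`, whereas the `SYS_gettid` macro can be tested by the preprocessor.

```c
#ifndef _GNU_SOURCE
#define _GNU_SOURCE /* makes syscall() visible in <unistd.h> on glibc */
#endif
#include <sys/syscall.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper for illustration only, not the OpenSAF function. */
static pid_t trace_tid(void)
{
#ifdef SYS_gettid
	/* Linux: ask the kernel for the id of the calling thread. */
	return (pid_t)syscall(SYS_gettid);
#else
	/* Assumed fallback: no thread id available, report the process id. */
	return getpid();
#endif
}
```

Calling `trace_tid()` from different threads of one process would be expected to yield different values on Linux, which is what makes it more useful than the pid in trace output.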

Re: [devel] [PATCH 1 of 2] log: fix to remove a redundant double quotation [#1463]

2017-03-13 Thread A V Mahesh
Hi Canh, ACK for series, not tested. -AVM On 3/8/2017 6:43 PM, Canh Van Truong wrote: > src/log/logd/lgs_fmt.cc | 3 --- > 1 files changed, 0 insertions(+), 3 deletions(-) > > > When sending a log record which is longer than limited quota, > there will be a double quotation mark although ther

Re: [devel] [PATCH 1 of 1] log: fix logd crash on Active side [#2362]

2017-03-12 Thread A V Mahesh
Hi Canh, ACK, not tested. -AVM On 3/10/2017 9:04 PM, Canh Van Truong wrote: > src/log/logd/lgs_filehdl.cc | 12 ++-- > 1 files changed, 6 insertions(+), 6 deletions(-) > > > The cause of issue is free cfg_namelist while struct dirent **cfg_namelist > unallocated > > diff --git a/src/
