Hi Reka,

I am not clear on the 2 properties you mention below, are they supposed to be 
set in the stratos.sh ? I just picked up the latest code and from the apache 
stratos repo and don’t see them ?

Btw,  read.write.lock.monitor.enabled=false  is disabled in our production code 
(I assume it is set to false by default if not specified) , I only enable it to 
provide additional information

Thanks

Martin

From: Reka Thirunavukkarasu [mailto:r...@wso2.com]
Sent: Monday, June 22, 2015 7:30 AM
To: Martin Eppel (meppel)
Cc: dev@stratos.apache.org; Lasindu Charith (lasi...@wso2.com); Ryan Du Plessis 
(rdupless)
Subject: Re: Testing Stratos 4.1: Application undeployment: application fails 
to undeploy (nested grouping, group scaling)

Hi Martin,
I have verified the fix by enabling read.write.lock.monitor.enabled=true. The 
fix worked fine with it. Since we are using concurrency and delegated some flow 
to Threads, i had to provide the thread values to below values in the 
stratos.sh.

    -Dapplication.monitor.thread.pool.size=50 \
    -Dgroup.monitor.thread.pool.size=50 \
Please note that it is recommended to have 
read.write.lock.monitor.enabled=false as it will consume more footprint in the 
production. This property introduce only for the testing purpose.

We are in the process of analyzing the thread size and will come up with a 
recommended values for it.
Also, i have fixed a small issue in the REST endpoint as it returns some 
default value whenever application run time is not found. Now that if runtime 
is not found, the below message will get populated.

{"status":"error","message":"Application runtime not found"}
I have also verified the undeployment with group scaling. Didn't find any 
issues with the above fixes.
Please find the latest commit as below:

0a969200d11228158606f011ca7e5e795f336d92.
Please note that below error was only observed which is harmless for now. I 
have verified it with a workaround and working fine. But will check on the 
severity and decide on a proper fix or will go with the workaround.

[1]. TID: [0] [STRATOS] [2015-06-22 14:22:01,872] ERROR 
{org.apache.stratos.common.concurrent.locks.ReadWriteLockMonitor} -  System 
error, lock has not released for 30 seconds: [lock-name] topology [lock-type] 
Write [thread-id] 117 [thread-name] pool-24-thread-2 [stack-trace]
java.lang.Thread.getStackTrace(Thread.java:1589)
org.apache.stratos.common.concurrent.locks.ReadWriteLock.acquireWriteLock(ReadWriteLock.java:123)
org.apache.stratos.messaging.message.processor.topology.updater.TopologyUpdater.acquireWriteLockForService(TopologyUpdater.java:123)
org.apache.stratos.messaging.message.processor.topology.ApplicationClustersCreatedMessageProcessor.doProcess(ApplicationClustersCreatedMessageProcessor.java:78)
org.apache.stratos.messaging.message.processor.topology.ApplicationClustersCreatedMessageProcessor.process(ApplicationClustersCreatedMessageProcessor.java:59)
org.apache.stratos.messaging.message.processor.topology.ServiceRemovedMessageProcessor.process(ServiceRemovedMessageProcessor.java:64)
org.apache.stratos.messaging.message.processor.topology.ServiceCreatedMessageProcessor.process(ServiceCreatedMessageProcessor.java:65)
org.apache.stratos.messaging.message.processor.topology.CompleteTopologyMessageProcessor.process(CompleteTopologyMessageProcessor.java:74)
org.apache.stratos.messaging.message.processor.MessageProcessorChain.process(MessageProcessorChain.java:61)
org.apache.stratos.messaging.message.receiver.topology.TopologyEventMessageDelegator.run(TopologyEventMessageDelegator.java:73)
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
java.lang.Thread.run(Thread.java:745)
Thanks,
Reka



On Mon, Jun 22, 2015 at 12:24 PM, Reka Thirunavukkarasu 
<r...@wso2.com<mailto:r...@wso2.com>> wrote:
Hi Martin,
Found the reason why we didn't encounter these locking issue as we were testing 
with default stratos pack which has read.write.lock.monitor.enabled=false. The 
locking warning or issue is raised only when you use 
read.write.lock.monitor.enabled=true. That's why you were only facing these 
locking issue as you use this configuration in your setup.
Since I'm able to reproduce the issue, i will test with the fix that i already 
pushed and update the thread.
We will discuss and try to make this read.write.lock.monitor.enabled=true by 
default with stratos. So that we can find issues as early and fix them.

Thanks,
Reka

On Mon, Jun 22, 2015 at 12:16 AM, Reka Thirunavukkarasu 
<r...@wso2.com<mailto:r...@wso2.com>> wrote:
Sorry Martin..I have only locally fixed the issue. I have pushed it now only. 
Can you test with 1c21daaeea7b27ad0a0141a70b32e9443e78e309 when you get chance? 
I will also continue testing with this fix.
Thanks,
Reka

On Mon, Jun 22, 2015 at 12:07 AM, Martin Eppel (meppel) 
<mep...@cisco.com<mailto:mep...@cisco.com>> wrote:
Btw,

This is my last commit I picked up from the stratos master:

commit 58bea52be814269416f70391fef50859aa5ae0a1
Author: lasinducharith 
<lasinduchar...@gmail.com<mailto:lasinduchar...@gmail.com>>
Date:   Fri Jun 19 19:40:27 2015 +0530

From: Martin Eppel (meppel)
Sent: Sunday, June 21, 2015 10:28 AM
To: dev@stratos.apache.org<mailto:dev@stratos.apache.org>; Reka Thirunavukkarasu
Cc: Lasindu Charith (lasi...@wso2.com<mailto:lasi...@wso2.com>); Ryan Du 
Plessis (rdupless)
Subject: RE: Testing Stratos 4.1: Application undeployment: application fails 
to undeploy (nested grouping, group scaling)

Hi Reka,

Here is another example which fails, see application at [1.], attached log 
files and jsons.  I run a few scenarios, the one which is failing is 
application with the name “s-g-c1-c2-c3” (last scenario). All members get 
removed but application remains deployed,

“s-g-c1-c2-c3: applicationInstances 0, groupInstances 0, clusterInstances 0, 
members 0 ()”


Thanks


Martin




[cid:image001.png@01D0ACDD.5F28BE80]




From: Imesh Gunaratne [mailto:im...@apache.org]
Sent: Sunday, June 21, 2015 1:32 AM
To: Reka Thirunavukkarasu
Cc: dev; Lasindu Charith (lasi...@wso2.com<mailto:lasi...@wso2.com>); Ryan Du 
Plessis (rdupless)
Subject: Re: Testing Stratos 4.1: Application undeployment: application fails 
to undeploy (nested grouping, group scaling)

Great! Thanks Reka!

On Sun, Jun 21, 2015 at 8:34 AM, Reka Thirunavukkarasu 
<r...@wso2.com<mailto:r...@wso2.com>> wrote:
Hi Martin/Imesh,
Sure..I will have a look on the logs. I will also go through the recent commits 
and try to revert the fix which added for nested group scaling as it is not 
needed for this release.  As Martin mentioned that after the fixes, there are 
more issues. Otherwise, we will have to go through another major effort in 
testing it.
I will update the progress of it...

Thanks,
Reka

On Sun, Jun 21, 2015 at 8:14 AM, Imesh Gunaratne 
<im...@apache.org<mailto:im...@apache.org>> wrote:
Hi Martin,

Thanks for the quick response. Yes we will definitely go through the logs and 
investigate this.

Thanks

On Sun, Jun 21, 2015 at 8:09 AM, Martin Eppel (meppel) 
<mep...@cisco.com<mailto:mep...@cisco.com>> wrote:
Hi Isuru,

No, the issue does not seem to be resolved. With the latest code I see issues 
in test cases which used to work before  (beyond the latest example I posted 
the log files for - see below), not sure yet what is going on.  I will be 
investigating further (making sure I am not mistaken) and following up with 
some examples after the weekend but if you guys can take a look at the log 
files on Monday I provided with the previous email that would be great,

Thanks

Martin

From: Imesh Gunaratne [mailto:im...@apache.org<mailto:im...@apache.org>]
Sent: Saturday, June 20, 2015 7:29 PM
To: dev
Cc: Lasindu Charith (lasi...@wso2.com<mailto:lasi...@wso2.com>); Reka 
Thirunavukkarasu (r...@wso2.com<mailto:r...@wso2.com>); Ryan Du Plessis 
(rdupless)
Subject: Re: Testing Stratos 4.1: Application undeployment: application fails 
to undeploy (nested grouping, group scaling)

Hi All,

I'm sorry I could not follow the entire discussion.
Can someone explain the latest status please? Have we resolved the initial 
group scaling issue and now seeing an application removal problem?

Thanks

On Sat, Jun 20, 2015 at 2:06 AM, Martin Eppel (meppel) 
<mep...@cisco.com<mailto:mep...@cisco.com>> wrote:
Hi Lasindu, Reka,


Just run into the issue with removing the application again: (with the fix for 
the issue included)

Please see [1a., 1b.] for the application structure (group scaling defined at 
only one group level). See also the respective artifacts and log file attached.

Please advise if we should reopen the JIRA

Thanks

Martin


Application [1a.]

[cid:image002.png@01D0ACDD.5F28BE80]

[1b.] application after “starting application remove”

[cid:image003.png@01D0ACDD.5F28BE80]









--
Imesh Gunaratne

Senior Technical Lead, WSO2
Committer & PMC Member, Apache Stratos



--
Imesh Gunaratne

Senior Technical Lead, WSO2
Committer & PMC Member, Apache Stratos


--
Reka Thirunavukkarasu
Senior Software Engineer,
WSO2, Inc.:http://wso2.com,
Mobile: +94776442007<tel:%2B94776442007>




--
Imesh Gunaratne

Senior Technical Lead, WSO2
Committer & PMC Member, Apache Stratos



--
Reka Thirunavukkarasu
Senior Software Engineer,
WSO2, Inc.:http://wso2.com,
Mobile: +94776442007<tel:%2B94776442007>




--
Reka Thirunavukkarasu
Senior Software Engineer,
WSO2, Inc.:http://wso2.com,
Mobile: +94776442007<tel:%2B94776442007>




--
Reka Thirunavukkarasu
Senior Software Engineer,
WSO2, Inc.:http://wso2.com,
Mobile: +94776442007

Reply via email to