[jira] [Updated] (CURATOR-355) Curator client fails when connecting to read-only ensemble

Benjamin Jaton (JIRA) Tue, 01 Nov 2016 14:58:24 -0700

     [ 
https://issues.apache.org/jira/browse/CURATOR-355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Benjamin Jaton updated CURATOR-355:
-----------------------------------
    Description: 
          ZK is 3.5.1-alpha

I have a 3 nodes ZK cluster , readonly mode is enabled.
2 nodes are down, so one of them (QA-E8WIN11) is in read-only (verified by 
using the ZK API manually). All the machines of the ensemble can be pinged from 
the client.

I'm using this piece of code:

{code}
                Builder curatorClientBuilder = CuratorFrameworkFactory.builder()
                                
.connectString("QA-E8WIN11:2181,QA-E8WIN12:2181")
                                
.sessionTimeoutMs(45000).connectionTimeoutMs(15000)
                                .retryPolicy(new RetryNTimes(3, 
5000)).canBeReadOnly(true);

                CuratorFramework client = curatorClientBuilder.build();
                client.start();
                client.getZookeeperClient().blockUntilConnectedOrTimedOut();
                System.out.println("Successfully established the connection 
with ZooKeeper");
                
                client.getData().forPath("/");
                System.out.println("Done.");{code}

When curator pick the host that is UP first, it goes through very quickly. When 
it picks the host that is down first (QA-E8WIN12), it seems to be stuck at the 
getData() call for a very long time, and then eventually fail with a 
ConnectionLossException. (see attached log)

  was:
ZK is 3.5.1-alpha

I have a 3 nodes ZK cluster , readonly mode is enabled.
2 nodes are down, so one of them (QA-E8WIN11) is in read-only (verified by 
using the ZK API manually). All the machines of the ensemble can be pinged from 
the client.

I'm using this piece of code:

{code}
                Builder curatorClientBuilder = CuratorFrameworkFactory.builder()
                                
.connectString("QA-E8WIN11:2181,QA-E8WIN12:2181")
                                
.sessionTimeoutMs(45000).connectionTimeoutMs(15000)
                                .retryPolicy(new RetryNTimes(3, 
5000)).canBeReadOnly(true);

                CuratorFramework client = curatorClientBuilder.build();
                client.start();
                client.getZookeeperClient().blockUntilConnectedOrTimedOut();
                System.out.println("Successfully established the connection 
with ZooKeeper");
                
                client.getData().forPath("/");
                System.out.println("Done.");{code}

When curator pick the host that is UP first, it goes through very quickly. When 
it picks the host that is down first (QA-E8WIN12), it seems to be stuck at the 
getData() call for a very long time, and then eventually fail with a 
ConnectionLossException. (see attached log)


> Curator client fails when connecting to read-only ensemble
> ----------------------------------------------------------
>
>                 Key: CURATOR-355
>                 URL: https://issues.apache.org/jira/browse/CURATOR-355
>             Project: Apache Curator
>          Issue Type: Bug
>          Components: Client
>    Affects Versions: 2.11.0
>            Reporter: Benjamin Jaton
>            Priority: Critical
>         Attachments: test2.log
>
>
>           ZK is 3.5.1-alpha
> I have a 3 nodes ZK cluster , readonly mode is enabled.
> 2 nodes are down, so one of them (QA-E8WIN11) is in read-only (verified by 
> using the ZK API manually). All the machines of the ensemble can be pinged 
> from the client.
> I'm using this piece of code:
> {code}
>               Builder curatorClientBuilder = CuratorFrameworkFactory.builder()
>                               
> .connectString("QA-E8WIN11:2181,QA-E8WIN12:2181")
>                               
> .sessionTimeoutMs(45000).connectionTimeoutMs(15000)
>                               .retryPolicy(new RetryNTimes(3, 
> 5000)).canBeReadOnly(true);
>               CuratorFramework client = curatorClientBuilder.build();
>               client.start();
>               client.getZookeeperClient().blockUntilConnectedOrTimedOut();
>               System.out.println("Successfully established the connection 
> with ZooKeeper");
>               
>               client.getData().forPath("/");
>               System.out.println("Done.");{code}
> When curator pick the host that is UP first, it goes through very quickly. 
> When it picks the host that is down first (QA-E8WIN12), it seems to be stuck 
> at the getData() call for a very long time, and then eventually fail with a 
> ConnectionLossException. (see attached log)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (CURATOR-355) Curator client fails when connecting to read-only ensemble

Reply via email to