Sure.  I will add my notes to this wiki page and we can fix them later as
necessary.
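
As a starting point for such notes, here is a minimal Python sketch of the three-server configuration discussed in the thread below. It renders one zoo.cfg that is copied unchanged to every instance, plus the per-instance myid file that ZooKeeper reads from dataDir. The host names are the internal EC2 names that appear in the quoted log; the dataDir path and the timing values are assumptions made for illustration, not settings taken from the thread.

```python
# Sketch only: renders a zoo.cfg for a three-server ensemble.
# data_dir and the tickTime/initLimit/syncLimit values are illustrative
# assumptions; adjust them for a real deployment.

def render_zoo_cfg(hosts, data_dir="/ebs1/zookeeper-data", client_port=2181):
    """Return zoo.cfg text; the same file is placed on every instance."""
    lines = [
        "tickTime=2000",
        "initLimit=10",
        "syncLimit=5",
        f"dataDir={data_dir}",
        f"clientPort={client_port}",
    ]
    # server.N=host:2888:3888 -> 2888 is the quorum port, 3888 the election port
    for server_id, host in enumerate(hosts, start=1):
        lines.append(f"server.{server_id}={host}:2888:3888")
    return "\n".join(lines) + "\n"


def render_myid(hosts, this_host):
    """Return the contents of <dataDir>/myid for this_host (its 1-based id)."""
    return f"{hosts.index(this_host) + 1}\n"


# Internal host names as they appear in the log quoted in this thread.
HOSTS = [
    "domU-12-31-38-01-B8-47.compute-1.internal",
    "domU-12-31-38-01-B8-A3.compute-1.internal",
    "domU-12-31-38-00-94-A6.compute-1.internal",
]
```

Calling render_zoo_cfg(HOSTS) gives the shared file, and render_myid(HOSTS, HOSTS[1]) gives the myid contents for the second instance. After that, bin/zkServer.sh start still has to be run on each of the three instances.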

On Thu, Dec 10, 2009 at 11:16 AM, Patrick Hunt <ph...@apache.org> wrote:

> Would you be willing to create a "zk on ec2" page on the wiki?
> http://wiki.apache.org/hadoop/ZooKeeper/ZooKeeperOnEC2
>
> Not sure on the exact content -- basically outline what you did to get ZK
> running. Something you think would be helpful to other users trying to
> deploy ZK on ec2 for the first time. :-)
>
> Patrick
>
>
>
> Something Something wrote:
>
>> Sorry, ignore this.  My bad.
>>
>> These messages disappeared when I started ZooKeeper on the other two
>> instances.
>> I tested this by killing 1 or 2 processes at a time.  These messages start
>> popping up when a process gets killed - implying that these instances are
>> indeed talking to each other.
>>
>> Thanks.
>>
>>
>> On Wed, Dec 9, 2009 at 10:22 PM, Something Something <
>> mailinglist...@gmail.com> wrote:
>>
>>> I keep getting "Connection refused" on EC2.  Seems like it's a security
>>> issue on Amazon's EC2 environment.  I have tried specifying machine names
>>> in the following 4 ways, but nothing seems to help:
>>>
>>> server.1=domU-12-31-38-01-B8-47.compute-1.internal:2888:3888
>>> server.1=domU-12-31-38-01-B8-47:2888:3888
>>> server.1=ec2-72-44-42-160.compute-1.amazonaws.com:2888:3888
>>> server.1=10.253.191.181:2888:3888
>>>
>>> Please help.  Thanks.
>>>
>>> Here's the message:
>>>
>>>
>>> ---------------------------------------------------------------------------------------------------------------------
>>> r...@domu-12-31-38-01-b8-47:/ebs1/zookeeper-3.2.1/bin# 2009-12-10 06:03:42,577 - WARN  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorumcnxmana...@336] - Cannot open channel to 2 at election address domU-12-31-38-01-B8-A3.compute-1.internal/10.253.191.81:3888
>>> java.net.ConnectException: Connection refused
>>>    at sun.nio.ch.Net.connect(Native Method)
>>>    at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:507)
>>>    at java.nio.channels.SocketChannel.open(SocketChannel.java:146)
>>>    at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectOne(QuorumCnxManager.java:323)
>>>    at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectAll(QuorumCnxManager.java:356)
>>>    at org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:603)
>>>    at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:488)
>>> 2009-12-10 06:03:42,578 - WARN  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorumcnxmana...@336] - Cannot open channel to 3 at election address domU-12-31-38-00-94-A6.compute-1.internal/10.252.155.84:3888
>>> java.net.ConnectException: Connection refused
>>>    at sun.nio.ch.Net.connect(Native Method)
>>>    at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:507)
>>>    at java.nio.channels.SocketChannel.open(SocketChannel.java:146)
>>>    at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectOne(QuorumCnxManager.java:323)
>>>    at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectAll(QuorumCnxManager.java:356)
>>>    at org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:603)
>>>    at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:488)
>>> 2009-12-10 06:03:42,578 - INFO  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@612] - Notification time out: 60000
>>>
>>>
>>>
>>> On Wed, Dec 9, 2009 at 11:08 AM, Henry Robinson <he...@cloudera.com> wrote:
>>>
>>>> Nearly! 1+2 are correct, but you also need to start ZooKeeper on all three
>>>> instances with bin/zkServer.sh start.
>>>>
>>>> Henry
>>>>
>>>> On Wed, Dec 9, 2009 at 11:00 AM, Something Something <
>>>> mailinglist...@gmail.com> wrote:
>>>>
>>>>> Now that I have your attention... next question... :)
>>>>>
>>>>> Now I would like to start a ZooKeeper quorum on 3 EC2 instances.  Read the
>>>>> doc regarding "Running Replicated ZooKeeper".  It says "all servers in
>>>>> the quorum should have the same configuration file".  Does this mean I
>>>>> should:
>>>>>
>>>>> 1)  Download & install ZooKeeper on all 3 instances (at the same location).
>>>>> 2)  Save the same zoo.cfg in /conf for all 3 instances.
>>>>> 3)  On one instance (Master?), run...
>>>>>
>>>>> bin/zkServer.sh start
>>>>>
>>>>> Would that start ZooKeeper on all 3 instances?  Thanks for the help.
>>>>>
>>>>>
>>>>> On Wed, Dec 9, 2009 at 10:24 AM, Something Something <
>>>>> mailinglist...@gmail.com> wrote:
>>>>>
>>>>>> Switched to 3.2.1.  Much better.  Got a command prompt.  Thank you both.
>>>>>>
>>>>>> On Wed, Dec 9, 2009 at 10:09 AM, Henry Robinson <he...@cloudera.com> wrote:
>>>>>>
>>>>>>> The 3.2.1 command line is a lot nicer (has an actual prompt, tab
>>>>>>> auto-completion, shows your connection status etc) - if you can upgrade to
>>>>>>> 3.2.1 which is a good deal more modern, I would recommend it. If I recall
>>>>>>> correctly, there was no prompt in 3.1.1...
>>>>>>>
>>>>>>> Henry
>>>>>>>
>>>>>>> On Wed, Dec 9, 2009 at 9:36 AM, Something Something <
>>>>>>> mailinglist...@gmail.com> wrote:
>>>>>>>
>>>>>>>> Without -server made some progress, but don't see a command prompt.
>>>>>>>> Shouldn't I see one?
>>>>>>>>
>>>>>>>> This is what I see:
>>>>>>>> 2009-12-09 17:27:56,709 - INFO  [main:zookee...@341] - Initiating client connection, host=127.0.0.1:2181 sessionTimeout=5000 watcher=org.apache.zookeeper.zookeepermain$mywatc...@32fb4f
>>>>>>>> 2009-12-09 17:27:56,710 - INFO  [main:clientc...@91] - zookeeper.disableAutoWatchReset is false
>>>>>>>> 2009-12-09 17:27:56,792 - INFO  [main-SendThread:clientcnxn$sendthr...@800] - Attempting connection to server /127.0.0.1:2181
>>>>>>>> 2009-12-09 17:27:56,802 - INFO  [main-SendThread:clientcnxn$sendthr...@716] - Priming connection to java.nio.channels.SocketChannel[connected local=/127.0.0.1:49619 remote=/127.0.0.1:2181]
>>>>>>>> 2009-12-09 17:27:56,806 - INFO  [main-SendThread:clientcnxn$sendthr...@868] - Server connection successful
>>>>>>>> WatchedEvent: Server state change. New state: SyncConnected
>>>>>>>>
>>>>>>>>
>>>>>>>> Should I just use 3.2.1 version?
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On Wed, Dec 9, 2009 at 9:20 AM, Mahadev Konar <maha...@yahoo-inc.com> wrote:
>>>>>>>>
>>>>>>>>> Hi,
>>>>>>>>>  Can you try this?
>>>>>>>>>
>>>>>>>>> bin/zkCli.sh 127.0.0.1:2181
>>>>>>>>>
>>>>>>>>> The -server option was added later as far as I remember.
>>>>>>>>>
>>>>>>>>> Thanks
>>>>>>>>> mahadev
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On 12/9/09 9:05 AM, "Something Something" <mailinglist...@gmail.com> wrote:
>>>>>>>>>
>>>>>>>>>> I am trying to start ZooKeeper on an EC2 instance.  Here's what I did:
>>>>>>>>>>
>>>>>>>>>> 1)  Downloaded & Unpacked ZooKeeper 3.1.1 on EC2 instance.
>>>>>>>>>> 2)  cp /conf/zoo_sample.cfg /conf/zoo.cfg
>>>>>>>>>> 3)  Changed the dataDir path to point to my EBS volume.
>>>>>>>>>> 4)  In one command window, ran /bin/zkServer.sh start
>>>>>>>>>> (The last message I see is... "Snapshotting: 0")
>>>>>>>>>>
>>>>>>>>>> 5)  Opened another command window, and ran jps
>>>>>>>>>> (This shows a new process called QuorumPeerMain.  That's the only one I
>>>>>>>>>> see.)
>>>>>>>>>>
>>>>>>>>>> 6)  As per documentation, tried
>>>>>>>>>>
>>>>>>>>>> bin/zkCli.sh -server 127.0.0.1:2181
>>>>>>>>>>
>>>>>>>>>> (This gives me IOException: USAGE)
>>>>>>>>>>
>>>>>>>>>> 7) So I ran:
>>>>>>>>>>
>>>>>>>>>> bin/zkCli.sh -server 127.0.0.1:2181 ls
>>>>>>>>>>
>>>>>>>>>> Got UnknownHostException: -server
>>>>>>>>>>
>>>>>>>>>> 8)  So I tried various ways of specifying IP address in EC2, such as:
>>>>>>>>>>
>>>>>>>>>> 10.xx.xx.xx
>>>>>>>>>> ec2-xx-xx-xx-xxx.compute-1.amazonaws.com
>>>>>>>>>> domU-12-31-xx-xx-xx-xx.compute-1.internal
>>>>>>>>>> domU-12-31-xx-xx-xx-xx
>>>>>>>>>>
>>>>>>>>>> None of them worked.  Keep getting UnknownHostException.
>>>>>>>>>>
>>>>>>>>>> What am I doing wrong?  Please help.  Thanks.
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>
>>>
>>
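
For anyone who hits the same "Connection refused" warnings quoted above: they turned out to mean only that the other two servers were not running yet, so nothing was listening on the election port (3888). A quick way to tell "peer not started" apart from a real network problem is to try a plain TCP connection to each peer's election port. The sketch below is a hypothetical helper, not part of ZooKeeper; the host names come from the quoted log, and the timeout is an arbitrary choice.

```python
import socket

# Peers and election port as they appear in the quoted log and zoo.cfg lines.
PEERS = [
    "domU-12-31-38-01-B8-A3.compute-1.internal",
    "domU-12-31-38-00-94-A6.compute-1.internal",
]
ELECTION_PORT = 3888


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False


def unreachable_peers(peers, port=ELECTION_PORT):
    """Return the subset of peers whose election port cannot be reached."""
    return [host for host in peers if not can_connect(host, port)]
```

Running unreachable_peers(PEERS) from the first instance should return an empty list once bin/zkServer.sh start has been run on the other two. If a peer is still listed after its server is up, the EC2 security group rules for ports 2888 and 3888 would be the next thing to check.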
