Re: zookeeper on ec2

2009-09-01 Thread Patrick Hunt
What is your client timeout? It may be too low. also see this section on handling recoverable errors: http://wiki.apache.org/hadoop/ZooKeeper/ErrorHandling connection loss in particular needs special care since: When a ZooKeeper client loses a connection to the ZooKeeper server there may be

Re: zookeeper on ec2

2009-09-01 Thread Mahadev Konar
Hi Satish, Connectionloss is a little trickier than just retrying blindly. Please read the following sections on this - http://wiki.apache.org/hadoop/ZooKeeper/ErrorHandling And the programmers guide: http://hadoop.apache.org/zookeeper/docs/r3.1.1/zookeeperProgrammers.html To learn more

Re: zookeeper on ec2

2009-09-01 Thread Patrick Hunt
I'm not very familiar with ec2 environment, are you doing any monitoring? In particular network connectivity btw nodes? Sounds like networking issues btw nodes (I'm assuming you've also looked at stuff like this http://wiki.apache.org/hadoop/ZooKeeper/Troubleshooting and verified that you are

Re: zookeeper on ec2

2009-09-01 Thread Satish Bhatti
For my initial testing I am running with a single ZooKeeper server, i.e. the ensemble only has one server. Not sure if this is exacerbating the problem? I will check out the trouble shooting link you sent me. On Tue, Sep 1, 2009 at 5:01 PM, Patrick Hunt ph...@apache.org wrote: I'm not very

Re: zookeeper on ec2

2009-09-01 Thread Patrick Hunt
Depends on what your tests are. Are they pretty simple/light? then probably network issue. Heavy load testing? then might be the server/client, might be the network. easiest thing is to run a ping test while running your zk test and see if pings are getting through (and latency). You should

Open for registration now. Re: CFP of 3rd Hadoop in China event (Hadoop World:Beijing 2009)

2009-09-01 Thread 何 永强
Hi all, The 3rd Hadoop in China event (Hadoop World:Beijing 2009) is open for registration now. http://hadoop-world-beijing.eventbrite.com/ Please register as early as possible. Thanks, Yongqiang On 09-8-22 上午12:21, He Yongqiang heyongqi...@software.ict.ac.cn wrote:

Re: zookeeper on ec2

2009-09-01 Thread Ted Dunning
Can you enable verboseGC and look at the tenuring distribution and times for GC? On Tue, Sep 1, 2009 at 5:54 PM, Satish Bhatti cthd2...@gmail.com wrote: Parallel/Serial. inf...@domu-12-31-39-06-3d-d1:/opt/ir/agent/infact-installs/aaa/infact$ iostat Linux 2.6.18-xenU-ec2-v1.0