Why does suddenly ha switching?

2014-08-11 Thread juil
hadoop version 2.4.1 Network is in good condition. What causes the session timeout? ** DFSZKFailoverController logs2014-08-11 17:20:13,226 DEBUG org.apache.zookeeper.ClientCnxn: Got ping response for sessionid: 0x147a0162d86000c after 0ms2014-08-11 17:20:27,012 INFO org.apache.zookeeper

Re: Tez part of YARN?

2014-08-11 Thread Nicolas Maillard
Hello Your are correct Tez is a yarn app or client to be more precise. It is not bundled by default in hadoop 2, same as pig for example it is up to you to add it. Tez and pig are actually close in that sense that they are not services to be installed exactly but rather clients to be added and use

Re: Yarn, MRv1, MRv2 lots of newbie doubts and questions

2014-08-11 Thread Nicolas Maillard
Hello As the hadoop ecosystem moves fast and the yarn part was a mini revolution I understand your confusion. To make it simple in hadoop 1 there were two main things Hadoop MapReduce and Hadoop HDFS. Hadoop MR was actually two things: A compute paradigme, map-reduce and a distribution process of

How to get specific rowkey from hbase

2014-08-11 Thread leiwang...@gmail.com
Hi, I have an input which has about 10M records,each recored is a rowkey in hbase. How can i get these data from HBase with MapReduce job? Thanks, Lei leiwang...@gmail.com

Hadoop 2.4.1 Verifying Automatic Failover Failed: ResourceManager

2014-08-11 Thread arthur.hk.c...@gmail.com
Hi I am running Hadoop 2.4.1 with YARN HA enabled (two name nodes, NM1 and NM2). When verifying ResourceManager failover, I use “kill -9” to terminate the ResourceManager in name node 1 (NM1), if I run the the test job, it seems that the failover of ResourceManager keeps trying NM1 and NM2 non

Examining effect of changing block placement policy.

2014-08-11 Thread Arjun Bakshi
Hi, I've made some changed to the default block placement policy and want to see how if affects a cluster. Any suggestions on how I can test the before and after of a cluster after making these changes? I read up a bit on Rumen and GridMix in my search for tools that would help me benchmark

Re: Hadoop 2.4.1 Verifying Automatic Failover Failed: ResourceManager

2014-08-11 Thread arthur.hk.c...@gmail.com
Hi, If I have TWO nodes for ResourceManager HA, what should be the correct steps and commands to start and stop ResourceManager in a ResourceManager HA cluster ? Unlike ./sbin/start-dfs.sh (which can start all NNs from a NN), it seems that ./sbin/start-yarn.sh can only start YARN in a node at a

Re: Hadoop 2.4.1 Verifying Automatic Failover Failed: ResourceManager

2014-08-11 Thread Xuan Gong
Hey, Arthur: Did you use single node cluster or multiple nodes cluster? Could you share your configuration file (yarn-site.xml) ? This looks like a configuration issue. Thanks Xuan Gong On Mon, Aug 11, 2014 at 9:45 AM, arthur.hk.c...@gmail.com < arthur.hk.c...@gmail.com> wrote: > Hi, > >

ulimit for Hive

2014-08-11 Thread Ana Gillan
Hi, I¹ve been reading a lot of posts about needing to set a high ulimit for file descriptors in Hadoop and I think it¹s probably the cause of a lot of the errors I¹ve been having when trying to run queries on larger data sets in Hive. However, I¹m really confused about how and where to set the lim

Re: Hadoop 2.4.1 Verifying Automatic Failover Failed: ResourceManager

2014-08-11 Thread arthur.hk.c...@gmail.com
Hi, it is a multiple-node cluster, two master nodes (rm1 and rm2), below is my yarn-site.xml. At the moment, the ResourceManager HA works if: 1) at rm1, run ./sbin/start-yarn.sh yarn rmadmin -getServiceState rm1 active yarn rmadmin -getServiceState rm2 14/08/12 07:47:59 INFO ipc.Client: Retry

Synchronization among Mappers in map-reduce task

2014-08-11 Thread saurabh jain
Hi Folks , I have been writing a map-reduce application where I am having an input file containing records and every field in the record is separated by some delimiter. In addition to this user will also provide a list of columns that he wants to lookup in a master properties file (stored in HDFS

Re: Hadoop 2.4.1 Verifying Automatic Failover Failed: ResourceManager

2014-08-11 Thread Xuan Gong
Some questions: Q1) I need start yarn in EACH master separately, is this normal? Is there a way that I just run ./sbin/start-yarn.sh in rm1 and get the STANDBY ResourceManager in rm2 started as well? No, need to start multiple RMs separately. Q2) How to get alerts (e.g. by email) if the ACTIVE R

Re: Hadoop 2.4.1 Verifying Automatic Failover Failed: ResourceManager

2014-08-11 Thread arthur.hk.c...@gmail.com
Hi, Thank y very much! At the moment if I run ./sbin/start-yarn.sh in rm1, the standby STANDBY ResourceManager in rm2 is not started accordingly. Please advise what would be wrong? Thanks Regards Arthur On 12 Aug, 2014, at 1:13 pm, Xuan Gong wrote: > Some questions: > Q1) I need start