Re: issue about run MR job use system user in CDH5

2014-07-21 Thread Alexander Alten-Lorenz
Please post vendor specific questions to the mailinglists of the vendor: https://groups.google.com/a/cloudera.org/forum/#!forum/cdh-user Look closer at: security.UserGroupInformation: PriviledgedActionException as:hbase (auth:SIMPLE) cause:org.apache.hadoop.security.AccessControlException: Perm

issue about run MR job use system user in CDH5

2014-07-21 Thread ch huang
hi,maillist: i set up CDH5 yarn cluster ,and set the following option in my mapred-site.xml file yarn.app.mapreduce.am.staging-dir /data mapreduce history server will set history dir in the directory /data ,but if i submit MR job use other user ,i get

Subscribe user hadoop user

2014-07-21 Thread Liu, Yi A
Regards, Yi Liu

Re: why i can not use '*' in remove hadoop directory?

2014-07-21 Thread Mapred Learn
Can you paste fs -ls output on these ? > On Jul 21, 2014, at 5:51 PM, ch huang wrote: > > hi,maillist: > i use "sudo -u hdfs hadoop fs -rm -r -skipTrash > /user/hive/warehouse/adx.db/dsp_request/2014-03*/*" in CDH4.4,but i find it > can not work in CDH5,why? > > # sudo -u hdfs hadoop

Re: Replace a block with a new one

2014-07-21 Thread Zesheng Wu
Mmm, it seems that the facebook branch https://github.com/facebook/hadoop-20/ has implemented reed-solomon codes, what I was checking earlier were the following two is

Re: Replace a block with a new one

2014-07-21 Thread Zesheng Wu
Thank Bertrand, I've checked these information earlier. There's only XOR implementation, and missed blocks are reconstructed by creating new files. 2014-07-22 3:47 GMT+08:00 Bertrand Dechoux : > And there is actually quite a lot of information about it. > > > https://github.com/facebook/hadoop-

why i can not use '*' in remove hadoop directory?

2014-07-21 Thread ch huang
hi,maillist: i use "sudo -u hdfs hadoop fs -rm -r -skipTrash /user/hive/warehouse/adx.db/dsp_request/2014-03*/*" in CDH4.4,but i find it can not work in CDH5,why? # sudo -u hdfs hadoop fs -rm -r -skipTrash /user/hive/warehouse/dsp.db/dsp_request/2014-01*/* rm: `/user/hive/warehouse/dsp.db

how to reduce delay in HDFS restart

2014-07-21 Thread Anfernee Xu
Hi, For some reason, all PIDs file are missing in my cluster, I have to manually kill all java processes on all machines, then I restarted the HDFS, but it took so long time in applying changes in edit log file, so my question is how can I reduce the delay? My understanding is as follows, could so

yarn container memory setting

2014-07-21 Thread Chen Song
I read a bit on documentation on yarn memory tuning and found that It is suggested to set mapreduce.map.java.opts = 0.8 * mapreduce.map.memory.mb. I am wondering why is 0.8, but not 0.9 or higher? -- Chen Song

Re: Replace a block with a new one

2014-07-21 Thread Bertrand Dechoux
And there is actually quite a lot of information about it. https://github.com/facebook/hadoop-20/blob/master/src/contrib/raid/src/java/org/apache/hadoop/hdfs/DistributedRaidFileSystem.java http://wiki.apache.org/hadoop/HDFS-RAID https://issues.apache.org/jira/browse/MAPREDUCE/component/12313416/

Re: Is it a good idea to delete / move the default configuration xml file ?

2014-07-21 Thread Chris Mawata
I suspect someone wanted to read through them to know tge defaults. Chris On Jul 21, 2014 2:16 PM, "Chris Nauroth" wrote: > That's a good point. I'm not sure how bare *-default.xml files would be > showing up on a deployment outside the jars. > > Chris Nauroth > Hortonworks > http://hortonworks.

planning a cluster

2014-07-21 Thread Adaryl "Bob" Wakefield, MBA
What is the rule for determining how many nodes should be in your initial cluster? B.

Re: Is it a good idea to delete / move the default configuration xml file ?

2014-07-21 Thread Chris Nauroth
That's a good point. I'm not sure how bare *-default.xml files would be showing up on a deployment outside the jars. Chris Nauroth Hortonworks http://hortonworks.com/ On Mon, Jul 21, 2014 at 11:12 AM, Chris Mawata wrote: > Aren't the *-default.xml files supposed to be inside the jars rather

Re: Is it a good idea to delete / move the default configuration xml file ?

2014-07-21 Thread Chris Mawata
Aren't the *-default.xml files supposed to be inside the jars rather than loose files? Cheers Chris Mawata On Jul 21, 2014 12:59 PM, "Chris Nauroth" wrote: > I recommend against deleting or moving *-default.xml, because these files > may be supplying reasonable default values for configuration pr

Re: Is it a good idea to delete / move the default configuration xml file ?

2014-07-21 Thread Chris Nauroth
I recommend against deleting or moving *-default.xml, because these files may be supplying reasonable default values for configuration properties that you haven't set in *-site.xml. We also put defaults into the code itself in case a configuration property is found to be completely missing, but I'

Re: Replace a block with a new one

2014-07-21 Thread Bertrand Dechoux
I wrote my answer thinking about the XOR implementation. With reed-solomon and single replication, the cases that need to be considered are indeed smaller, simpler. It seems I was wrong about my last statement though. If the machine hosting a single-replicated block is lost, it isn't likely that t

Re: Difference between different tar

2014-07-21 Thread Shahab Yunus
The '-bin' file does not have the source code (bin for binaries) while the other does. You can check and see the major difference in the 'src' folders under the top-level directory after unzipping/untarring. Regards, Shahab On Mon, Jul 21, 2014 at 3:54 AM, Vimal Jain wrote: > Hi, > On download

Re: Replace a block with a new one

2014-07-21 Thread Zesheng Wu
If a block is corrupted but hasn't been detected by HDFS, you could delete the block from the local filesystem (it's only a file) then HDFS will replicate the good remaining replica of this block. We only have one replica for each block, if a block is corrupted, HDFS cannot replicate it. 2014-0

Re: Replace a block with a new one

2014-07-21 Thread Zesheng Wu
Thanks Bertrand, my reply comments inline following. So you know that a block is corrupted thanks to an external process which in this case is checking the parity blocks. If a block is corrupted but hasn't been detected by HDFS, you could delete the block from the local filesystem (it's only a fil

Re: Replace a block with a new one

2014-07-21 Thread Bertrand Dechoux
So you know that a block is corrupted thanks to an external process which in this case is checking the parity blocks. If a block is corrupted but hasn't been detected by HDFS, you could delete the block from the local filesystem (it's only a file) then HDFS will replicate the good remaining replica

Re: Replace a block with a new one

2014-07-21 Thread Zesheng Wu
We want to implement a RAID on top of HDFS, something like facebook implemented as described in: https://code.facebook.com/posts/536638663113101/saving-capacity-with-hdfs-raid/ 2014-07-21 17:19 GMT+08:00 Bertrand Dechoux : > You want to implement a RAID on top of HDFS or use HDFS on top of RAID?

Re: Progress indicator should not be negative.

2014-07-21 Thread Nitin Pawar
Smita, Any chance you can print the value of containers waiting and total before the exception. From http://svn.apache.org/viewvc/hadoop/common/branches/branch-2.4.1/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-client/src/main/java/org/apache/hadoop/yarn/client/api/impl/AMRMClientImpl.java?view=ma

Re: Replace a block with a new one

2014-07-21 Thread Bertrand Dechoux
You want to implement a RAID on top of HDFS or use HDFS on top of RAID? I am not sure I understand any of these use cases. HDFS handles for you replication and error detection. Fine tuning the cluster wouldn't be the easier solution? Bertrand Dechoux On Mon, Jul 21, 2014 at 7:25 AM, Zesheng Wu

RE: Progress indicator should not be negative.

2014-07-21 Thread Smita Deshpande
Hi Guys, Can anyone please provide some suggestions/solutions on it? Thanks, Smita From: Smita Deshpande Sent: Thursday, July 17, 2014 11:41 AM To: 'user@hadoop.apache.org' Subject: Progress indicator should not be negative. Hi, I am running the distributed shell example of YARN

Is it a good idea to delete / move the default configuration xml file ?

2014-07-21 Thread Chris MacKenzie
Hi All, I have just realised that my implementation of hadoop-2.4.1 is pulling in all the default.xml files. I have three copies of each in different directories, obviously at least one of those is on the class path. Anyway with all the effort to set up a site, it seems strange to me that I woul

Re: Configuration set up questions - Container killed on request. Exit code is 143

2014-07-21 Thread Chris MacKenzie
Thanks Ozawa Regards, Chris MacKenzie Expert in all aspects of photography telephone: 0131 332 6967 email: stu...@chrismackenziephotography.co.uk corporate: www.chrismackenziephotography.co.uk weddings:

Difference between different tar

2014-07-21 Thread Vimal Jain
Hi, On download page of hadoop ( e.g. http://apache.arvixe.com/hadoop/common/stable1/ ) , I see lots of tars. Whats the difference between hadoop-1.2.1-bin.tar.gz and hadoop-1.2.1.tar.gz