Re: Building Hadoop on Windows, SDK 7.1, Error An Ant BuildException has occured: input file hadoop-2.6.0-src\hadoop-common-project\hadoop-common\target\findbugsXml.xml does not exist

2015-04-05 Thread Umesh Kant
Forgot to mention: I am using the following Maven command: mvn package -X -Pdist -Pdocs -Psrc -Dtar -DskipTests -Pnative-win findbugs:findbugs Thanks, Umesh From: Umesh Kant kantum...@yahoo.com To: user@hadoop.apache.org Sent: Friday, April 3, 2015 11:35 PM Subject:

How to run the gridmix2 example in YARN?

2015-04-05 Thread xeonmailinglist-gmail
Hi, I would like to run the |Webdatascan| example from Gridmix2, but I can’t find the classes in MapReduce 2.6. How do I run the gridmix example in YARN? [1] http://www.programdevelop.com/2458744/ -- Thanks

Re: How will Hadoop handle it when a datanode server with total hardware failure?

2015-04-05 Thread Josh Baer
Three spindles failing on three different machines could potentially cause data loss if they fail simultaneously, which would be pretty uncommon and would affect only a small percentage of data. Otherwise, if one machine or individual spindle fails, the blocks that are stored on those machines
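
The point about simultaneous spindle failures can be illustrated with a small sketch. It is not HDFS code. It is a toy model, assuming a replication factor of 3 and a hypothetical block-to-disk placement map (the block IDs, datanode names, and disk paths are made up for illustration). A block counts as lost only when every one of its replicas sits on a failed disk. A block that loses only some replicas is merely under-replicated and can be copied again from a surviving replica.

```python
from typing import Dict, Set


def lost_blocks(placement: Dict[str, Set[str]], failed: Set[str]) -> Set[str]:
    """Blocks whose every replica is on a failed disk (unrecoverable in this model)."""
    return {block for block, disks in placement.items() if disks <= failed}


def under_replicated(placement: Dict[str, Set[str]], failed: Set[str]) -> Set[str]:
    """Blocks that lost at least one replica but still have a surviving copy."""
    return {
        block
        for block, disks in placement.items()
        if disks & failed and not disks <= failed
    }


# Hypothetical placement: each block has three replicas on different machines.
placement = {
    "blk_1": {"dn1:/d1", "dn2:/d1", "dn3:/d1"},
    "blk_2": {"dn1:/d1", "dn2:/d2", "dn4:/d1"},
    "blk_3": {"dn2:/d1", "dn3:/d2", "dn4:/d1"},
    "blk_4": {"dn1:/d2", "dn3:/d1", "dn4:/d2"},
}

# Three spindles on three different machines fail at the same time.
failed = {"dn1:/d1", "dn2:/d1", "dn3:/d1"}

print(sorted(lost_blocks(placement, failed)))       # ['blk_1']
print(sorted(under_replicated(placement, failed)))  # ['blk_2', 'blk_3', 'blk_4']
```

In this toy placement only the one block whose three replicas all landed on the failed spindles is lost. The others keep at least one copy.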

Re: compress data in hadoop

2015-04-05 Thread Shahab Yunus
Your package seems different. Have you tried the following package and class? org.apache.hadoop.io.compress.BZip2Codec Regards, Shahab On Sun, Apr 5, 2015 at 9:45 AM, xeonmailinglist-gmail xeonmailingl...@gmail.com wrote: Hi, I have run the command [1] to create compressed data from my
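
For reference, here is a minimal sketch of how the fully qualified codec class name could be passed as job configuration. The original command is truncated in the archive, so how these flags fit into it is an assumption. The helper name `compression_flags` is hypothetical. The two property names are the standard Hadoop 2.x output-compression settings.

```python
from typing import List

# Fully qualified class name suggested in the reply above.
BZIP2_CODEC = "org.apache.hadoop.io.compress.BZip2Codec"


def compression_flags(codec: str = BZIP2_CODEC) -> List[str]:
    """Build -D options that enable compressed job output with the given codec.

    Hypothetical helper for illustration; it only assembles strings and does
    not check that the codec class exists on the cluster.
    """
    return [
        "-D", "mapreduce.output.fileoutputformat.compress=true",
        "-D", f"mapreduce.output.fileoutputformat.compress.codec={codec}",
    ]


print(" ".join(compression_flags()))
```

Generic `-D` options like these normally go right after the jar name and before tool-specific options such as `-input` and `-output`.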

compress data in hadoop

2015-04-05 Thread xeonmailinglist-gmail
Hi, I have run the command [1] to create compressed data from my Sequence files that are in the |/user/root/out1| dir, but I got the error [2]. How do I compress data in Hadoop? [1] |hadoop jar ./share/hadoop/tools/lib/hadoop-streaming-2.6.0.jar -D mapreduce.job.reduces=30 -D

CPU utilization in map function

2015-04-05 Thread Ravikant Dindokar
I am a newbie learning Hadoop. I am running Apache Giraph on Hadoop 2.2.0. I want to find out the CPU utilization as well as the time spent sending messages in each superstep of a Giraph application. I am not familiar with the Hadoop code. Can you suggest the functions I should look into to get