Re: rack awareness and safemode

2012-03-22 Thread John Meagher
03 >>>> As far as I play with test cluster. The balancer would take care of >>>> replica placement. >>>> I just don't want to fall into the situation that the hdfs sit in the >>>> safemode >>>> for hours and users can't use hadoop

Re: rack awareness and safemode

2012-03-20 Thread John Meagher
Unless something has changed recently it won't automatically relocate the blocks. When I did something similar I had a script that walked through the whole set of files that were misreplicated and increased the replication factor then dropped it back down. This triggered relocation of blocks to m

Re: How to find out whether a node is Overloaded from Cpu utilization ?

2012-01-18 Thread John Meagher
The problem I've run into more than memory is having the system CPU time get out of control. My guess is that the threshold for what is considered "overloaded" is going to be dependent on your system setup, what you're running on it, and what bounds your jobs. On Tue, Jan 17, 2012 at 22:06, Arun

Re: Fixing Mis-replicated blocks

2011-10-21 Thread John Meagher
t; Of course, the little script only works if the replication factor is 3 on > all the files. If it's a variable amount you should use the java API to get > the existing factor and then increase by one and then decrease. > > Jeff > > On Thu, Oct 20, 2011 at 8:44 AM, John Meagher w

Fixing Mis-replicated blocks

2011-10-20 Thread John Meagher
After a hardware move with an unfortunate mis-setup rack awareness script our hadoop cluster has a large number of mis-replicated blocks. After about a week things haven't gotten better on their own. Is there a good way to trigger the name node to fix the mis-replicated blocks? Here's what I'm u

Re: How do I diagnose IO bounded errors using the framework counters?

2011-10-05 Thread John Meagher
The counter names are created dynamically in mapred.Task /** * Counters to measure the usage of the different file systems. * Always return the String array with two elements. First one is the name of * BYTES_READ counter and second one is of the BYTES_WRITTEN counter. */ protected

Re: Applications creates bigger output than input?

2011-04-29 Thread John Meagher
Another case is augmenting data. This is sometimes done outside of MR in an ETL flow, but can be done as an MR job. Doing something like this is using Hadoop to handle the scaling issues, but really isn't what MR is intended for. A real example of this is: * Input: standard apache weblog * Data