once per day, starting at 1:00AM EDT. I have changed it to use
> a fewer number of reducers just to see how that effects the distribution.
>
> Dave Shine
> Sr. Software Engineer
> 321.939.5093 direct | 407.314.0122 mobile CI Boost(tm) Clients Outperform
> Online(tm) www.
Just a thought, but can you deal with the problem with increased granularity by
simply making the jobs smaller?
If you have enough jobs, when one takes twice as long there will be plenty of
other small jobs to employ the other nodes, right?
- Tim.
F
Have you considered deflate or bzip?
- Tim.
From: Marek Miglinski [mmiglin...@seven.com]
Sent: Thursday, June 14, 2012 1:39 AM
To: mapreduce-user@hadoop.apache.org
Subject: codec compression ratio
When procession 65billion records and using LZO or Sna