Erwan,

Faced with a similar situation last week I found that decreasing

mapred.max.split.size

Increased my parallelism by 6x. Yes mapred even though it was a Tez job. I
reduced it to 10mb from 256mb which I believe is the default.

The other variables to try are:
tez.grouping.min-size (make it smaller)
tez.grouping.max-size (smaller as well)


Good luck.


On 4/6/15, 2:57 PM, "Erwan MAS" <[email protected]> wrote:

>On Mon, Apr 06, 2015 at 12:15:05PM -0500, max scalf wrote:
>> Try setting the below in Hive and see what happens..btw what are you
>> configs in hive if any?
>> 
>> set mapred.map.tasks = 20;
>> 
>
>Does not change the behavior :(
>
>--
>     ____________________________________________________________
>    / Erwan MAS                                                 /\
>   | mailto:[email protected]                                   |_/
>___|________________________________________________________   |
>\___________________________________________________________\__/

Reply via email to