Re: hive.tez.auto.reducer.parallelism can slove skew join problem?

Rajesh Balamohan Mon, 25 May 2015 02:48:05 -0700

As of today, Tez autoparallelism can only decrease the number of reducers
allocated. It can not increase the number of tasks at runtime (could be
there in future releases).


- If the ratio of REDUCE_INPUT_GROUPS / REDUCE_INPUT_RECORDS is
approximately 1.0, you can possibly increase the number of reducers for the
vertex.
- If the ratio of REDUCE_INPUT_GROUPS / REDUCE_INPUT_RECORDS is lot less
than 0.2 (~20%), this could potentially mean single reducer taking up most
of the records.  In this case, you might want to consider increasing the
amount of memory allocated (try increasing the container size to check if
it is helping the situation)

~Rajesh.B

On Mon, May 25, 2015 at 2:41 PM, David Ginzburg <davidginzb...@gmail.com>
wrote:

> Thank you,
> Already tried this with no effect on number of reducers
>
> On Mon, May 25, 2015 at 3:51 AM, r7raul1...@163.com <r7raul1...@163.com>
> wrote:
>
>>
>> when one reduce process too many data(skew join)  set 
>> hive.tez.auto.reducer.parallelism
>> =true can slove this problem?
>>
>> ------------------------------
>> r7raul1...@163.com
>>
>
>


-- 
~Rajesh.B

Re: hive.tez.auto.reducer.parallelism can slove skew join problem?

Reply via email to