Re: Redistribute intermediate table default not by rand()

2018-11-22 Thread ShaoFeng Shi
che.org/jira/browse/KYLIN-3388 > >> > >> > >> > >> > >> ------ Original Message ------ > >> From: "liuzhixin"; > >> Sent: Friday, November 2, 2018, 12:53 PM > >> To: "dev"; > >> Cc: "ShaoFeng Shi

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
; -- Original Message -- >> From: "liuzhixin"; >> Sent: Friday, November 2, 2018, 12:53 PM >> To: "dev"; >> Cc: "ShaoFeng Shi"; >> Subject: Re: Redistribute intermediate table default not by rand() >> >> >> >>

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
Original Message -- > From: "liuzhixin"; > Sent: Friday, November 2, 2018, 3:11 PM > To: "dev"; > Cc: "Chao Long"; > Subject: Re: Redistribute intermediate table default not by rand() > > > > Hi Chao Long, > > Thank you for the answer. > # &

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread Chao Long
n"; Sent: Friday, November 2, 2018, 3:11 PM To: "dev"; Cc: "Chao Long"; Subject: Re: Redistribute intermediate table default not by rand() Hi Chao Long, Thank you for the answer. # Step1: Create Intermediate Flat Hive Table Step2: Redistribute intermediate table # Perhaps, Ky

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
> From: "liuzhixin"; > Sent: Friday, November 2, 2018, 12:53 PM > To: "dev"; > Cc: "ShaoFeng Shi"; > Subject: Re: Redistribute intermediate table default not by rand() > > > > Hi kylin team: > > Step: Redistribute intermediate table > # > By default

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
se/KYLIN-3388 >>> >>> >>> >>> >>> ------ Original Message -- >>> From: "liuzhixin"; >>> Sent: Friday, November 2, 2018, 12:53 PM >>> To: "dev"; >>> Cc: "ShaoFeng Shi"; >>>

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread ShaoFeng Shi
From: "liuzhixin"; > > Sent: Friday, November 2, 2018, 12:53 PM > > To: "dev"; > > Cc: "ShaoFeng Shi"; > > Subject: Re: Redistribute intermediate table default not by rand() > > > > > > > > Hi kylin team: > > > > Step: Redistribute

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
rg/jira/browse/KYLIN-3388 > > > > > -- Original Message -- > From: "liuzhixin"; > Sent: Friday, November 2, 2018, 12:53 PM > To: "dev"; > Cc: "ShaoFeng Shi"; > Subject: Re: Redistribute intermediate table default not by rand() > > > > Hi

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
- Original Message -- >> From: "liuzhixin"; >> Sent: Friday, November 2, 2018, 12:53 PM >> To: "dev"; >> Cc: "ShaoFeng Shi"; >> Subject: Re: Redistribute intermediate table default not by rand() >> >> >> >> Hi kylin team: >

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread ShaoFeng Shi
a/browse/KYLIN-3388 > > > > > -- Original Message -- > From: "liuzhixin"; > Sent: Friday, November 2, 2018, 12:53 PM > To: "dev"; > Cc: "ShaoFeng Shi"; > Subject: Re: Redistribute intermediate table default not by rand() > > > > Hi

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread Chao Long
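oFeng Shi"; Subject: Re: Redistribute intermediate table default not by rand() Hi kylin team: Step: Redistribute intermediate table # By default, the first three dimension fields are used as the DISTRIBUTE BY keys, rather than DISTRIBUTE BY RAND(). If there are no suitable dimension fields, this default strategy makes the data even more skewed. Best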

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
Hi kylin team: Step: Redistribute intermediate table # By default, the first three dimension fields are used as the DISTRIBUTE BY keys, rather than DISTRIBUTE BY RAND(). If there are no suitable dimension fields, this default strategy makes the data even more skewed. Best Regards! > On November 2, 2018, at 12:03 PM, liuzhixin wrote: > > Hi kylin team: > > Version: Kylin2.5-hadoop3.1 for hdp3.0 > # > Step: Redistribute intermed

Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
Hi kylin team: Version: Kylin2.5-hadoop3.1 for hdp3.0 # Step: Redistribute intermediate table # The DISTRIBUTE BY statement is: INSERT OVERWRITE TABLE table_intermediate SELECT * FROM table_intermediate DISTRIBUTE BY Field1, Field2, Field3; # Not DISTRIBUTE BY RAND() # Is this default DISTRIBUTE BY Field1
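
The skew concern raised in this thread can be illustrated with a small simulation. This is only a sketch under stated assumptions, not Kylin's or Hive's actual code: the hash function (CRC32), the reducer count, the row generator, and the sample values for Field1..Field3 are all hypothetical. It compares how many rows each reducer would receive when rows are routed by a hash of three low-cardinality columns versus routed at random.

```python
import random
import zlib
from collections import Counter

FIELDS = ["field1", "field2", "field3"]  # stand-ins for Field1..Field3 in the thread


def partition_by_columns(rows, columns, num_reducers):
    """Rows per reducer when routing by a hash of the given columns.

    Roughly mimics DISTRIBUTE BY col1, col2, col3. CRC32 is an assumption
    made for determinism; it is not Hive's real hash function.
    """
    counts = Counter()
    for row in rows:
        key = "\x01".join(str(row[c]) for c in columns)
        counts[zlib.crc32(key.encode("utf-8")) % num_reducers] += 1
    return [counts.get(i, 0) for i in range(num_reducers)]


def partition_by_rand(rows, num_reducers, seed=0):
    """Rows per reducer when each row is routed at random (like DISTRIBUTE BY RAND())."""
    rng = random.Random(seed)
    counts = Counter()
    for _ in rows:
        counts[rng.randrange(num_reducers)] += 1
    return [counts.get(i, 0) for i in range(num_reducers)]


def make_rows(n, seed=0):
    """Hypothetical data: about 90% of rows share one combination of the three fields."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        if rng.random() < 0.9:
            rows.append({"field1": "CN", "field2": "web", "field3": "2018-11-01"})
        else:
            rows.append({
                "field1": rng.choice(["US", "DE", "JP"]),
                "field2": rng.choice(["web", "app"]),
                "field3": "2018-11-01",
            })
    return rows


if __name__ == "__main__":
    rows = make_rows(10_000)
    by_cols = partition_by_columns(rows, FIELDS, num_reducers=10)
    by_rand = partition_by_rand(rows, num_reducers=10)
    print("largest reducer, by columns:", max(by_cols))
    print("largest reducer, by rand   :", max(by_rand))
```

With data like this, all rows sharing the dominant field combination land on one reducer under column-based routing, while random routing spreads rows almost evenly. The sketch only shows the balance side of the question; it says nothing about why the default was chosen.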