Hi zhixin, Data may become not correct if use "distribute by rand()". https://issues.apache.org/jira/browse/KYLIN-3388
------------------ ???????? ------------------ ??????: "liuzhixin"<[email protected]>; ????????: 2018??11??2??(??????) ????12:53 ??????: "dev"<[email protected]>; ????: "ShaoFeng Shi"<[email protected]>; ????: Re: Redistribute intermediate table default not by rand() Hi kylin team: Step: Redistribute intermediate table # ??????????????????????????????DISTRIBUTE BY????????????????DISTRIBUTE BY RAND() ???????????????????????????????????????????????????????????????????? Best Regards?? > ?? 2018??11??2????????12:03??liuzhixin <[email protected]> ?????? > > Hi kylin team: > > Version: Kylin2.5-hadoop3.1 for hdp3.0 > # > Step: Redistribute intermediate table > # > DISTRIBUTE BY is that: > INSERT OVERWRITE TABLE table_intermediate SELECT * FROM table_intermediate > DISTRIBUTE BY Field1, Field2, Field3; > # > Not DISTRIBUTE BY RAND() > # > Is this default DISTRIBUTE BY Field1, Field2, Field3? how to DISTRIBUTE BY > RAND()? > > Best wishes. >
