Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
HI ShaoFeng Shi: 数据表中高基数维度(例如request_id或者timestamp)会带来维度膨胀,引起了OOM; 而其他的一些偏低的高基数维度本身数据分布就不均衡,导致数据也分布不均衡; # 数据本身有很多分布就不均衡,没有了rand(),Kylin该如何处理? Best Wishes > 在 2018年11月2日,下午1:42,ShaoFeng Shi 写道: > > Please move the high cardinality dimensions to the leading position of > rowkey, that will make

[jira] [Created] (KYLIN-3665) Partition time column may never be added

2018-11-02 Thread Chao Long (JIRA)
Chao Long created KYLIN-3665: Summary: Partition time column may never be added Key: KYLIN-3665 URL: https://issues.apache.org/jira/browse/KYLIN-3665 Project: Kylin Issue Type: Bug

Re: Re: Re: [DISCUSS] New Kylin Streaming Solution From eBay

2018-11-02 Thread ShaoFeng Shi
Hi Gang, I appreciate your hard work! Ma Gang 于2018年11月1日周四 下午3:29写道: > Hi ShaoFeng, > For streaming ingest/query performance, there is a doc: > https://drive.google.com/file/d/1GSBMpRuVQRmr8Ev2BWvssfMd-Rck9vsH/view?ths=true > , it is also in the design doc's 'performance' section attached in

[jira] [Created] (KYLIN-3664) Hive metrics reporter HiveProducer doesn't support multiple instances on one host

2018-11-02 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3664: --- Summary: Hive metrics reporter HiveProducer doesn't support multiple instances on one host Key: KYLIN-3664 URL: https://issues.apache.org/jira/browse/KYLIN-3664

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
Hi Chao Long, Yes! # So I said “has provided”, below, > At the same time, Kylin should support the custom column for shard. (has > provided) # Bug, Kylin can insert one rand column in the intermediate hive table for the next shard, (as default). Best Wishes! > 在 2018年11月2日,下午4:03,Chao Long

?????? Redistribute intermediate table default not by rand()

2018-11-02 Thread Chao Long
Hi zhixin, As I remember If you set "shard by" column in cube design page, Kylin will use this column as the condition of "distribute by", rather than the first three field of rowkey. -- -- ??: "liuzhixin"; : 2018??11??2??(??)

答复: [VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-02 Thread Na Zhai
+1 mvn test passed 发送自 Windows 10 版邮件应用 发件人: Cheng Wang 发送时间: Friday, November 2, 2018 3:29:12 PM 收件人: dev@kylin.apache.org 主题: Re: [VOTE] Release apache-kylin-2.5.1 (RC1) +1 (binding) Best Regards,

Re: [VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-02 Thread Cheng Wang
+1 (binding) Best Regards, Cheng On 11/2/18, 2:09 PM, "ShaoFeng Shi" wrote: >Hi all, > >I have created a build for Apache Kylin 2.5.1, release candidate 1. > >Changes highlights: > >[KYLIN-3531] - Login failed with case-insensitive username >[KYLIN-3604] - Can't build cube with spark in

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
Hi Chao Long, Thank you for the answer. # Step1: Create Intermediate Flat Hive Table Step2: Redistribute intermediate table # Perhaps, Kylin can insert one rand column in the intermediate hive table for the next shard, (as default). At the same time, Kylin should support the custom column for

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
Hi Chao Long, Thank you for the answer. # Step1: Create Intermediate Flat Hive Table Step2: Redistribute intermediate table # Perhaps, Kylin can insert one rand column in the intermediate hive table for the next shard, (as default). At the same time, Kylin should support the custom column for

Re: [VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-02 Thread JiaTao Tao
 Here is my vote: +1 (binding) ShaoFeng Shi 于2018年11月2日周五 下午2:10写道: > Hi all, > > I have created a build for Apache Kylin 2.5.1, release candidate 1. > > Changes highlights: > > [KYLIN-3531] - Login failed with case-insensitive username > [KYLIN-3604] - Can't build cube with spark in HBase

[jira] [Created] (KYLIN-3663) Failed to delete project when project has more than one table

2018-11-02 Thread rongchuan.jin (JIRA)
rongchuan.jin created KYLIN-3663: Summary: Failed to delete project when project has more than one table Key: KYLIN-3663 URL: https://issues.apache.org/jira/browse/KYLIN-3663 Project: Kylin

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
Hi ShaoFeng Shi, Thank you for the answer. # Step1: Create Intermediate Flat Hive Table Step2: Redistribute intermediate table # Perhaps, Kylin can insert one rand column for the next shard, (as default). At the same time, Kylin should support the custom column for shard. Best Wishes. > 在

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
Hi ShaoFeng Shi, Thank you for the answer. # Step1: Create Intermediate Flat Hive Table Step2: Redistribute intermediate table # Perhaps, Kylin can insert one rand column for the next shard, (as default). At the same time, Kylin should support the custom column for shard. Best Wishes. > 在

Re: [VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-02 Thread zhan shaoxiong
+1 On [DATE], "[NAME]" <[ADDRESS]> wrote: Hi all, I have created a build for Apache Kylin 2.5.1, release candidate 1. Changes highlights: [KYLIN-3531] - Login failed with case-insensitive username [KYLIN-3604] - Can't build cube with spark in HBase standalone

Re: 回复:[VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-02 Thread zhan shaoxiong
+1 On [DATE], "[NAME]" <[ADDRESS]> wrote: +1 -- 原始邮件 -- 发件人: "ShaoFeng Shi"; 发送时间: 2018年11月2日(星期五) 下午2:09 收件人: "dev"; 主题: [VOTE] Release apache-kylin-2.5.1 (RC1) Hi all, I have created a

??????[VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-02 Thread Chao Long
+1 -- -- ??: "ShaoFeng Shi"; : 2018??11??2??(??) 2:09 ??: "dev"; : [VOTE] Release apache-kylin-2.5.1 (RC1) Hi all, I have created a build for Apache Kylin 2.5.1, release candidate 1. Changes highlights: [KYLIN-3531] -

[VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-02 Thread ShaoFeng Shi
Hi all, I have created a build for Apache Kylin 2.5.1, release candidate 1. Changes highlights: [KYLIN-3531] - Login failed with case-insensitive username [KYLIN-3604] - Can't build cube with spark in HBase standalone mode [KYLIN-3613] - Kylin with Standalone HBase Cluster could not find the

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread ShaoFeng Shi
Hi Zhixin, Kylin 2.5.1 will add some tips in the advanced step, hope that can help. liuzhixin 于2018年11月2日周五 下午2:05写道: > Hi Chao Long: > > Thank you for the answer. > # > Maybe kylin should provide config for every build step > > Best wishes. > > > 在 2018年11月2日,下午1:38,Chao Long 写道: > > > > Hi

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
Hi Chao Long: Thank you for the answer. # Maybe kylin should provide config for every build step Best wishes. > 在 2018年11月2日,下午1:38,Chao Long 写道: > > Hi zhixin, > Data may become not correct if use "distribute by rand()". > https://issues.apache.org/jira/browse/KYLIN-3388 > > > > >

Re: Redistribute intermediate table default not by rand()

2018-11-02 Thread liuzhixin
Hi ShaoFeng Shi OK, thank you for the answer. # Perhaps Kylin should provide the tips or notes for the default shard. Best Wishes. > 在 2018年11月2日,下午1:42,ShaoFeng Shi 写道: > > Please move the high cardinality dimensions to the leading position of > rowkey, that will make the data distribution