Unsubscribe
Unsubscribe, thanks
Re: use timestamp for cube partition
Unsubscribe

At 2017-10-10 16:04:18, "ShaoFeng Shi" wrote:
> The "shard-by" column is used to distribute the cube data to different
> shards (each shard is an HBase region). Usually the "shard-by" column needs
> to be a high-cardinality column, such as user_id or order_id, which ensures
> that the shards are of similar size. A partition column's cardinality is
> usually not high enough, so using it for this purpose is not suggested.
>
> The default streaming parser in Kylin accepts JSON format. If you have
> another format, you need to implement that by extending the StreamingParser
> class.
>
> 2017-10-10 16:30 GMT+08:00 崔苗:
>
>> Thanks for your suggestion. Finally we changed the timestamp into date
>> format by SQL and it worked. Some other questions:
>> 1. What is the meaning of the 'shard by' column? Is it proper to set the
>>    partition column as the 'shard by' column?
>> 2. Are there limitations on the data format in Kafka when building
>>    streaming cubes? We succeeded in building a streaming cube on the
>>    sample data supplied by Kylin, but failed on our own data, which is in
>>    Avro format, not JSON format.
>>
>> At 2017-10-10 14:47:30, ShaoFeng Shi wrote:
>>> Hi Miao,
>>>
>>> It doesn't understand your time format. You need to use the standard Date
>>> format in Hive. Or you can implement your own logic with the interface
>>> "IPartitionConditionBuilder".
>>>
>>> 2017-10-10 11:33 GMT+08:00 崔苗:
>>>
>>>> Well, the timestamp column was a bigint such as 1507547479434 in the
>>>> Hive table. When I defined the end time to build the cube, I found the
>>>> timestamp 1507547479434 was converted to '20171009', and the log shows
>>>> that Kylin loaded data from Hive with the condition
>>>> "WHERE (USER_REG.REG_TIME 20171009)", so the Intermediate Flat Hive
>>>> Table was empty. I want to know: could Kylin derive other time values
>>>> like “year_start”, “day_start” from the bigint timestamp in Hive as it
>>>> does for a Kafka table? Or must we change the bigint timestamp into a
>>>> date format such as "2017-10-09" in Hive?
>>>>
>>>> At 2017-10-09 22:04:56, ShaoFeng Shi wrote:
>>>>> Hi Miao,
>>>>>
>>>>> What's the error you mentioned: "kylin failed to load data from hive
>>>>> tables"?
>>>>>
>>>>> In my opinion, it is not recommended to use a timestamp as the
>>>>> partition column, since its granularity is too fine. Usually the cube
>>>>> is partitioned by day/week/month; in some cases by the hour; in the
>>>>> streaming case it might be partitioned by the minute; but in no case
>>>>> by timestamp. I put some comments about this in this document:
>>>>> https://kylin.apache.org/docs21/tutorial/cube_streaming.html
>>>>>
>>>>> 2017-10-09 14:27 GMT+08:00 崔苗:
>>>>>
>>>>>> Hi,
>>>>>> we want to use tables in Kafka as fact tables and tables in MySQL as
>>>>>> lookup tables, so we put all the tables into Hive and want to join
>>>>>> them as cubes.
>>>>>>
>>>>>> The time column in the fact table was a timestamp, so does Kylin 2.1
>>>>>> support timestamp for cube partition?
>>>>>> I found this: https://issues.apache.org/jira/browse/KYLIN-633 ,
>>>>>>
>>>>>> it seems Kylin already supports Timestamp for cube partition, but
>>>>>> when we define the timestamp as the partition, Kylin fails to load
>>>>>> data from the Hive tables.
>>>>>>
>>>>>> Thanks in advance for your reply.
>>>>>
>>>>> --
>>>>> Best regards,
>>>>>
>>>>> Shaofeng Shi 史少锋
>>>
>>> --
>>> Best regards,
>>>
>>> Shaofeng Shi 史少锋
>
> --
> Best regards,
>
> Shaofeng Shi 史少锋
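The workaround reported in this thread (turning the bigint epoch-millisecond column into a date string before Kylin reads it) can be sketched as below. This is only an illustration in Python, not Kylin or Hive code: the function name `epoch_ms_to_day` is hypothetical, and UTC is an assumption (the poster's cluster may use another time zone). In Hive the same conversion would be written in SQL, for example in a view over the fact table.

```python
from datetime import datetime, timezone


def epoch_ms_to_day(epoch_ms: int, fmt: str = "%Y-%m-%d") -> str:
    """Convert epoch milliseconds to a day string (UTC is assumed here)."""
    seconds = epoch_ms // 1000  # drop the millisecond part
    return datetime.fromtimestamp(seconds, tz=timezone.utc).strftime(fmt)


# The sample value quoted in the thread.
day = epoch_ms_to_day(1507547479434)                # standard date format
compact = epoch_ms_to_day(1507547479434, "%Y%m%d")  # form seen in the log
print(day, compact)
```

A column holding the first form is what the thread calls the standard Date format; the second form matches the value that appeared in the generated WHERE condition.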
[jira] [Created] (KYLIN-2958) SAML logout issue
Pan, Julian created KYLIN-2958:
----------------------------------

             Summary: SAML logout issue
                 Key: KYLIN-2958
                 URL: https://issues.apache.org/jira/browse/KYLIN-2958
             Project: Kylin
          Issue Type: Bug
            Reporter: Pan, Julian
            Assignee: Pan, Julian
            Priority: Minor

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
Review Request 63234: KYLIN-2934 Write user guide for KYLIN-2656
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/63234/
-----------------------------------------------------------

Review request for kylin, Dayue Gao, Dong Li, hongbin ma, and Wang Xiaoyu.


Bugs: KYLIN-2934
    https://issues.apache.org/jira/browse/KYLIN-2934


Repository: kylin


Description
-------

Write user guide for KYLIN-2656


Diffs
-----

  website/_data/docs21.yml 90acd705c
  website/_docs21/howto/howto_enable_zookeeper_acl.md PRE-CREATION


Diff: https://reviews.apache.org/r/63234/diff/1/


Testing
-------


Thanks,

pengjianhua
Re: data format when building increment cube
Oh, I'm using 2.2 (to be released soon). In that case you can run
"bin/metastore.sh backup" and then find the model JSON file in the
"model_desc" folder; in it you will see the "partition_condition_builder"
property. After modifying the partition class, you can restore from that
folder back to HBase, and then restart Kylin.

2017-10-23 18:00 GMT+08:00 op <520075...@qq.com>:

> On Kylin web there is no "Edit JSON". My version is 2.0.

--
Best regards,

Shaofeng Shi 史少锋
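The edit described above (changing "partition_condition_builder" in the backed-up model JSON) can be sketched as follows. This is a minimal illustration that works on a JSON string only; it does not call Kylin or touch the metastore. The helper name `set_condition_builder` and the class name `com.example.MonthDayPartitionConditionBuilder` are hypothetical, and the sample model is trimmed to the fields that matter here.

```python
import json


def set_condition_builder(model_json: str, builder_class: str) -> str:
    """Return the model JSON with partition_condition_builder replaced."""
    model = json.loads(model_json)
    # Only this one property is changed; everything else is kept as loaded.
    model["partition_desc"]["partition_condition_builder"] = builder_class
    return json.dumps(model, indent=2)


# Trimmed-down stand-in for a file found in the "model_desc" backup folder.
sample_model = """
{
  "name": "test_parition",
  "partition_desc": {
    "partition_date_column": "KYLIN_SALES.PART_DT",
    "partition_type": "APPEND",
    "partition_condition_builder":
      "org.apache.kylin.metadata.model.PartitionDesc$DefaultPartitionConditionBuilder"
  }
}
"""

# Hypothetical custom class; a real one must exist on Kylin's classpath.
patched = set_condition_builder(
    sample_model, "com.example.MonthDayPartitionConditionBuilder"
)
print(patched)
```

After such an edit, the thread's remaining steps still apply: restore the folder to HBase and restart Kylin.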
Re: data format when building increment cube
On Kylin web there is no "Edit JSON". My version is 2.0.

------------------ Original Message ------------------
From: "ShaoFeng Shi"
Sent: Monday, October 23, 2017, 5:53 PM
To: "dev"
Subject: Re: data format when building increment cube

You can back up the metastore to local disk; then you will see the model
JSON file.

On Kylin web, move the mouse over the model and click "Action" ->
"Edit JSON" to see the JSON content.
Re: data format when building increment cube
thx!

------------------ Original Message ------------------
From: "ShaoFeng Shi"
Sent: Monday, October 23, 2017, 5:53 PM
To: "dev"
Subject: Re: data format when building increment cube

You can back up the metastore to local disk; then you will see the model
JSON file.

On Kylin web, move the mouse over the model and click "Action" ->
"Edit JSON" to see the JSON content.
Re: data format when building increment cube
You can back up the metastore to local disk; then you will see the model
JSON file.

On Kylin web, move the mouse over the model and click "Action" ->
"Edit JSON" to see the JSON content.

2017-10-23 17:23 GMT+08:00 op <520075...@qq.com>:

> Thank you Shaofeng, but I cannot find the JSON file.
> Or do you mean changing it on Kylin web?

--
Best regards,

Shaofeng Shi 史少锋
Re: data format when building increment cube
Thank you Shaofeng, but I cannot find the JSON file.
Or do you mean changing it on Kylin web?

------------------ Original Message ------------------
From: "ShaoFeng Shi"
Sent: Monday, October 23, 2017, 5:09 PM
To: "dev"
Subject: Re: data format when building increment cube

If you create a new model from Kylin web, the model has a property
"partition_condition_builder" which refers to the class that
implements the PartitionConditionBuilder.
You can manually change it to your customized implementation:

{
  "uuid" : "ad40c5a2-a32f-4f27-84cd-85ea43af2307",
  "last_modified" : 1508748727953,
  "name" : "test_parition",
  "owner" : "admin",
  "is_draft" : false,
  "description" : "",
  "fact_table" : "DEFAULT.KYLIN_SALES",
  "lookups" : [ ],
  "dimensions" : [ {
    "table" : "KYLIN_SALES",
    "columns" : [ "TRANS_ID", "PART_DT", "LSTG_FORMAT_NAME",
      "LEAF_CATEG_ID", "LSTG_SITE_ID", "SLR_SEGMENT_CD", "SELLER_ID",
      "BUYER_ID", "OPS_USER_ID", "OPS_REGION" ]
  } ],
  "metrics" : [ "KYLIN_SALES.PRICE", "KYLIN_SALES.ITEM_COUNT" ],
  "filter_condition" : "",
  "partition_desc" : {
    "partition_date_column" : "KYLIN_SALES.PART_DT",
    "partition_time_column" : null,
    "partition_date_start" : 0,
    "partition_date_format" : "yyyy-MM-dd",
    "partition_time_format" : "HH:mm:ss",
    "partition_type" : "APPEND",
    "partition_condition_builder" : "org.apache.kylin.metadata.model.PartitionDesc$DefaultPartitionConditionBuilder"
  },
  "capacity" : "MEDIUM"
}
Re: data format when building increment cube
If you create a new model from Kylin web, the model has a property
"partition_condition_builder" which refers to the class that
implements the PartitionConditionBuilder.
You can manually change it to your customized implementation:

{
  "uuid" : "ad40c5a2-a32f-4f27-84cd-85ea43af2307",
  "last_modified" : 1508748727953,
  "name" : "test_parition",
  "owner" : "admin",
  "is_draft" : false,
  "description" : "",
  "fact_table" : "DEFAULT.KYLIN_SALES",
  "lookups" : [ ],
  "dimensions" : [ {
    "table" : "KYLIN_SALES",
    "columns" : [ "TRANS_ID", "PART_DT", "LSTG_FORMAT_NAME",
      "LEAF_CATEG_ID", "LSTG_SITE_ID", "SLR_SEGMENT_CD", "SELLER_ID",
      "BUYER_ID", "OPS_USER_ID", "OPS_REGION" ]
  } ],
  "metrics" : [ "KYLIN_SALES.PRICE", "KYLIN_SALES.ITEM_COUNT" ],
  "filter_condition" : "",
  "partition_desc" : {
    "partition_date_column" : "KYLIN_SALES.PART_DT",
    "partition_time_column" : null,
    "partition_date_start" : 0,
    "partition_date_format" : "yyyy-MM-dd",
    "partition_time_format" : "HH:mm:ss",
    "partition_type" : "APPEND",
    "partition_condition_builder" : "org.apache.kylin.metadata.model.PartitionDesc$DefaultPartitionConditionBuilder"
  },
  "capacity" : "MEDIUM"
}

2017-10-23 16:19 GMT+08:00 op <520075...@qq.com>:

> Hi Shaofeng,
> then how do I use a new PartitionConditionBuilder?
> Thanks!
>
> ------------------ Original Message ------------------
> From: "ShaoFeng Shi"
> Sent: Tuesday, October 17, 2017, 6:16 PM
> To: "dev"
> Subject: Re: data format when building increment cube
>
> It is not supported. You need to develop a new PartitionConditionBuilder.
> Please refer to
> org.apache.kylin.metadata.model.PartitionDesc.YearMonthDayPartitionConditionBuilder
>
> 2017-10-17 18:06 GMT+08:00 op <520075...@qq.com>:
>
>> Hello everyone.
>> How can I define my own data format?
>> Our Hive tables are partitioned by month_id (format :mm) and
>> subpartitioned by day_id (format :dd).
>>
>> Thanks!
>
> --
> Best regards,
>
> Shaofeng Shi 史少锋

--
Best regards,

Shaofeng Shi 史少锋
Re: data format when building increment cube
Hi Shaofeng,
then how do I use a new PartitionConditionBuilder?
Thanks!

------------------ Original Message ------------------
From: "ShaoFeng Shi"
Sent: Tuesday, October 17, 2017, 6:16 PM
To: "dev"
Subject: Re: data format when building increment cube

It is not supported. You need to develop a new PartitionConditionBuilder.
Please refer to
org.apache.kylin.metadata.model.PartitionDesc.YearMonthDayPartitionConditionBuilder

2017-10-17 18:06 GMT+08:00 op <520075...@qq.com>:

> Hello everyone.
> How can I define my own data format?
> Our Hive tables are partitioned by month_id (format :mm) and
> subpartitioned by day_id (format :dd).
>
> Thanks!

--
Best regards,

Shaofeng Shi 史少锋
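To make the suggestion above more concrete, here is a rough sketch of the kind of WHERE condition a custom partition condition builder might generate for a table partitioned by month_id and day_id. It is written in Python purely to show the logic; a real builder would be a Java class along the lines of the YearMonthDayPartitionConditionBuilder mentioned above. The function name `build_month_day_condition` is hypothetical, and the formats are assumptions: month_id is taken to be a yyyyMM string and day_id a zero-padded dd string, which the thread does not confirm.

```python
from datetime import date


def build_month_day_condition(month_col: str, day_col: str,
                              start: date, end: date) -> str:
    """Build a [start, end) condition over two string partition columns.

    Assumes month_col holds 'yyyyMM' and day_col holds zero-padded 'dd',
    so that their concatenation sorts like a 'yyyyMMdd' date string.
    """
    key = f"concat({month_col}, {day_col})"
    return f"{key} >= '{start:%Y%m%d}' AND {key} < '{end:%Y%m%d}'"


# Segment covering 2017-10-01 (inclusive) to 2017-10-17 (exclusive).
condition = build_month_day_condition(
    "MONTH_ID", "DAY_ID", date(2017, 10, 1), date(2017, 10, 17)
)
print(condition)
```

The half-open range mirrors how incremental segments are usually described (start inclusive, end exclusive); whether the comparison can prune Hive partitions efficiently depends on the table layout and would need checking on real data.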
[jira] [Created] (KYLIN-2957) Fix copy ACL issue when migrate cube
Zhong Yanghong created KYLIN-2957:
-------------------------------------

             Summary: Fix copy ACL issue when migrate cube
                 Key: KYLIN-2957
                 URL: https://issues.apache.org/jira/browse/KYLIN-2957
             Project: Kylin
          Issue Type: Bug
          Components: Tools, Build and Test
            Reporter: Zhong Yanghong
            Assignee: Zhong Yanghong

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)