Re: Re: does kylin support top-N on a count or count distinct measure?

2016-08-08 Thread ShaoFeng Shi
Hi Tiansheng, The less post-aggregation, the better query performance; So for a specific query, if the "signle groupby column topN" need further aggregation to get the final result, but "multiple groupby column topN" doesn't, then the later one would have better performance. I didn't compare tha

Re: Re: does kylin support top-N on a count or count distinct measure?

2016-08-08 Thread 张天生
Btw, i have a cube, there are topN(sum(clicks),groupby appid), count(distinct deviceid), ... measures. According to the blog http://www.infoq.com/cn/news/2016/08/Apache-Kylin-Top-N?utm_campaign=infoq_content&utm_source=infoq&utm_medium=feed&utm_term=global from shaofeng, it will combine appid and s

Re: Re: does kylin support top-N on a count or count distinct measure?

2016-08-08 Thread 张天生
I have a question: whether multiple column groupby is better performance than a single column groupby in topN measure. As i known it all can agg other dimensions. Whether it there was performance optimization in mulitple column groupby in topN measure. ShaoFeng Shi 于2016年8月8日周一 下午6:20写道: > Albert

Re: No implementation for getValueFromId in AppendTrieDictionary.java

2016-08-08 Thread big data
For the global dicionary, in my opinion, getVauefromId is as same important as getIdfromvalue. And global dictionary must be AppendTrieDictonary(AM I right?), how it deal with the above problem? And how global dictionary store huge dimensions value and encode them? Thanks. 在 16/8/9 上午10:41, Ye

Re: No implementation for getValueFromId in AppendTrieDictionary.java

2016-08-08 Thread Yerui Sun
The dict id in AppendTrieDicitionary was stored in serialized bytes, not calculated by position. The cost is very high when get value by id from AppendTrieDictionary since it’s difficult to locate the dict id. That’s why disabled the interface AppendTrieDictionary.getValueFromId(). It’s also why

No implementation for getValueFromId in AppendTrieDictionary.java

2016-08-08 Thread big data
Hi, Why there is no implementation for getValueFromId() in AppendTrieDictionary. [cid:part1.A434B669.39CF7030@outlook.com]

Mesures on merged dimension

2016-08-08 Thread Sandeep Khurana
Hello I have a big dimension table which I merged into fact table. If it were not merged then I could create measures on the join key. Now since it is merged with fact table, how can I still create the measure within the same merged fact ? e.g. if sales is fact table and there are 2 dimension

[Draft][REPORT] Apache Kylin - Aug 2016

2016-08-08 Thread Luke Han
Dear community, I have drafted below board report for review, please help to check and let me know if there's any issue. Feel free to reply here if there's more activities, community development and so on which should included in this report. Will submit this report to board later.

Re: Tableau & Kylin Manual

2016-08-08 Thread Luke Han
Looks great, thanks Alberto Best Regards! - Luke Han On Fri, Aug 5, 2016 at 2:21 PM, Alberto Ramón wrote: > @hongbin ma: > can we repost it to our blog http://kylin.apache.org/blog/? do you have > any concerns? no problem, do it ¡¡ > > I think Kylie is a great work, it's w

MR job id as a part of REST API response

2016-08-08 Thread Abhishek Sinha
Currently the job status api returns the information of all steps involved in the build, overall progress of the cube etc. There is no way currently to track the mr job associated with each step. It can be a part of the response as well. Regards, Abhishek

Re: Re: Scan row count exceeded threshold

2016-08-08 Thread ShaoFeng Shi
Cool, you get it :-) 2016-08-08 11:30 GMT+08:00 赵天烁 : > already sovled by change those two date dims with UHC into yyMMdd formated > integer ,trim HHmmss, which dramatically reduce the cardinality,then put > mandatory filter dim at the begining of row key.thx > > -- >

Re: Time partitioning across multiple fields

2016-08-08 Thread ShaoFeng Shi
Use a view to concate them into a -MM-dd format, or follow the YearMonthDayPartitionConditionBuilder in PartitionDesc.java (need manually edit cube metadata). 2016-08-08 15:14 GMT+08:00 热爱大发挥 <385639...@qq.com>: > Time partitioning across multiple fields, as follows > year stirng > month stri

Re: Re: does kylin support top-N on a count or count distinct measure?

2016-08-08 Thread ShaoFeng Shi
Alberto is correct; SUM(1) and multiple columns are implemented in Kylin core, but from UI you couldn't define that; You need manually edit metadata for that. 2016-08-08 18:02 GMT+08:00 赵天烁 : > ok,I'll have a try > > -- > > 赵天烁 > > Kevin Zhao > > *zhaotians...@meizu.co

Re: Re: does kylin support top-N on a count or count distinct measure?

2016-08-08 Thread 赵天烁
ok,I'll have a try 赵天烁 Kevin Zhao zhaotians...@meizu.com 珠海市魅族科技有限公司 MEIZU Technology Co., Ltd. 广东省珠海市科技创新海岸魅族科技楼 MEIZU Tech Bldg., Technology & Innovation Coast Zhuhai, 519085, Guangdong, China meizu.com From: Alberto Ramón

Re: Re: does kylin support top-N on a count or count distinct measure?

2016-08-08 Thread Alberto Ramón
In teorical en v1.5.3, you can Group by 'n' columns: https://issues.apache.org/jira/browse/KYLIN-1693 I don't tested 1.5.3 yet, and I don't know if has been implemented in UI Kylin, perhaps you can add this columns to JSON manually :) BR, Alberto 2016-08-08 11:37 GMT+02:00 赵天烁 : > SUM(1)? you

Re: Re: does kylin support top-N on a count or count distinct measure?

2016-08-08 Thread 赵天烁
SUM(1)? you mean just left ORDER|SUM by Column empty? ,then another prob is I can't configure more than one group by column to it,how to walk around that? 赵天烁 Kevin Zhao zhaotians...@meizu.com 珠海市魅族科技有限公司 MEIZU Technology Co., Ltd.

Time partitioning across multiple fields

2016-08-08 Thread ??????????
Time partitioning across multiple fields, as follows year stirng month string day string than how to set the "Partition Date Column"?