Yes! The results of "select count(*) from {fact_table}" both in hive and
kylin are the same!
but the result of "select sum(4g),sum(wifi) from {fact_table}" in hive is
"295763957458,456897813426"
the result of "select sum(4g),sum(wifi) from {fact_table}" in kylin is
"89605527250,164840037298",
Hi, have you try the parameter `kylin.query.enable-dynamic-column=true` which
can count the column without NULL values.
BRs,
May
-- --
??: "George Ni";
: 2018??11??15??(??) 6:10
??: "dev";
: Re: ??kylin??
Haha, maybe you have two cube with same implement, and one of them has the
newest data but another has the expired, please check if the cube using by the
query is your concerned.
Original Message
Sender:George nince...@alumni.sjtu.edu.cn
Recipient:dev...@kylin.apache.org
Date:Thursday, Nov 15,
Hi, have you queried with "select count(*) from {fact_table}" both in hive and
kylin, and please check they have the same result.
I doubt the segments you have built do not include all your hive data.
Best regards,
Chun’en Ni(George)
- 原始邮件 -
发件人: 742093...@qq.com
收件人: dev
已发送邮件: Th
大神们好!
请容我慢慢描述我的问题
在kylin使用中,我建立了一个model,暂且叫他 big_int_sum_model,Dimensions有11个,
Measures 有三个 分别是 求count, SUM, SUM, sum 是对 两个bigint字段求和,分别为 sum_4g 和sum_wifi
,用以统计4g总流量和wifi总流量
hive表中的数据有400万,在cube之后,可以从kylin查出数据。
但是出现了一个奇怪的现象:
从 hive表中统计的4g流量总数为 295763957458 ,但是kylin中统计出来的是89605527250,