consultation for kylin2.0 parameters

2017-07-04 Thread
Hi ,all: I have a question about “kylin.dictionary.growing-enabled”, Can you tell me the function of this parameter? Thanks!

Detailed query of "RAW"

2017-07-03 Thread
Hi,all There is a business scenario is the detailed query. There are 45 metric for design to “RAW”, one dimension and high base,but in “Rowkey” of “Advanced Setting”,the Encoding is fixed._length,not dict. But when build the cube, there are some errors in “#4 Step Name: Build Dimension

答复: How kylin2.0 open distributed build data dictionary

2017-07-03 Thread
Thank you ,I will have a try! -邮件原件- 发件人: ShaoFeng Shi [mailto:shaofeng...@apache.org] 发送时间: 2017年7月3日 22:02 收件人: dev 主题: Re: How kylin2.0 open distributed build data dictionary I think it is enabled by default: kylin.engine.mr.build-dict-in-reducer=true 2017-07-03 14:03 GMT+08:00 仇同

How kylin2.0 open distributed build data dictionary

2017-07-03 Thread
Hi,all Now,I’m using the version 0f kylin 2.0,so how kylin2.0 open distributed build dictionary? Is itthat to configure a parameter in “kylin.properties”? Thanks!

java.lang.RuntimeException: Too big dictionary, dictionary cannot be bigger than 2GB

2017-02-14 Thread
Hi ,all The first step in cube merge, an error : java.lang.RuntimeException: Too big dictionary, dictionary cannot be bigger than 2GB at org.apache.kylin.dict.TrieDictionaryBuilder.buildTrieBytes(TrieDictionaryBuilder.java:421) at

create dictionary error

2017-02-09 Thread
Hi,all Building operation error on the of Step Name: Build Dimension Dictionary: java.lang.RuntimeException: Failed to create dictionary on DMT.DMT_KYLIN_JDMALL_ORDR_DTL_I_D.SALE_ORD_ID at org.apache.kylin.dict.DictionaryManager.buildDictionary(DictionaryManager.java:325)

答复: 答复: 回复:答复: Kylin1.6.0流式Cube查询时间错误

2016-12-07 Thread
here some threads about getting the right Date from Kylin > http://apache-kylin.74782.x6.nabble.com/JDBC-query-result- > Date-column-get-wrong-value-td5370.html > > 在 2016年12月8日 上午11:09,仇同心 <qiutong...@jd.com>写道: > > > 是的,构建好的segment进行SQL查询的时候,发现这些查询记录的时间也都是早了8小时 > &

答复: 回复:答复: Kylin1.6.0流式Cube查询时间错误

2016-12-07 Thread
,但是我还没有进行验证。 除此之外,我在对构建好的segment进行SQL查询的时候,发现这些查询记录的时间也都是早了8小时,不知道你有没有碰到这个问题? 在2016年12月8日 10:57, 仇同心<qiutong...@jd.com>写道: 我也遇到了一样的问题,cube的Last Build Time 是正确的:2016-12-08 10:48:37 GMT+8 但是segment的时间早8个小时: Start Time: 2016-12-08 02:44:00 End Time: 2016-12-08 02:45:00 请问这个问题是kylin哪里造成

答复: Kylin1.6.0流式Cube查询时间错误

2016-12-07 Thread
我也遇到了一样的问题,cube的Last Build Time 是正确的:2016-12-08 10:48:37 GMT+8 但是segment的时间早8个小时: Start Time: 2016-12-08 02:44:00 End Time: 2016-12-08 02:45:00 请问这个问题是kylin哪里造成的? 发件人: 汪胜 [mailto:sky...@163.com] 发送时间: 2016年12月6日 21:17 收件人: dev 主题: Re: Kylin1.6.0流式Cube查询时间错误 你好, 非常感谢您的回答,但是我仍然有两个地方不太理解,望指教: 1

Consulting "EXTENDED_COLUMN"

2016-11-30 Thread
Hi ,all I don’t understand the usage scenarios of EXTENDED_COLUMN,although I saw this article “https://issues.apache.org/jira/browse/KYLIN-1313”. What,s the means about parameters of “Host Column” and “Extended Column”? Why use this expression,and what aspects of optimization that this

Cube optimization for help

2016-11-27 Thread
Hi,all There is a cube optimization for help. Cuhe has 15 dimensions, including 14 normal dimensions and 1 derived dimension, and the cardinality of all dimensions is not high;And this cube also has 10 measures, including 1 count expression,2 sum expressions and 7 COUNT_DISTINCT expressions,

答复: About hybrid model of Kylin

2016-11-14 Thread
Thanks! 发件人: Dong Li [mailto:lid...@apache.org] 发送时间: 2016年11月14日 18:13 收件人: dev@kylin.apache.org; u...@kylin.apache.org 主题: Re: About hybrid model of Kylin Here’s the document about hybrid: http://kylin.apache.org/blog/2015/09/25/hybrid-model/ Thanks, Dong Li Original Message Sender: 仇同心

About hybrid model of Kylin

2016-11-14 Thread
Hi ,all When I designed a cube, and have built a period of time, if Iwant to modify the cube dimensions or measurements, It is suggested to use hybrid mode: Create a new cube, and then build back from now;Kylin has a hybrid model, can put the new cube cube and history to make a

答复: Cube Merge Error

2016-11-10 Thread
dictionary to do a string -> integer convertion. To support ultra high cardinality, the "global dictionary" should be selected as it has less memory footprint, which can support up to 2 billion cardinality: https://kylin.apache.org/blog/2016/08/01/count-distinct-in-kylin/ 2016-11-10 14:35

Cube 构建优化咨询

2016-11-09 Thread
大家好: 目前在构建cube时遇到问题:cube维度的基数不是很高,但是度量里的字段基数很高,Build Dimension Dictionary就非常的占用本机内存,选取的度量的基数有千万、亿,甚至是十亿左右的,度量大多都是SUM,Count_distinct的精确计算。数据量是10个月的数据,我们是打算一次跑完10个月历史数据,然后在按日增跑作业。 服务器的内存配置为125G,#4 Step Name: Build Dimension Dictionary 会一直在跑很长时间,最后到导致内存溢出。 对于这种度量基数高的问题,有什么好的优化方案吗?

Cube Merge Error

2016-11-09 Thread
Hi, The first step in the cube to merge, #1 Step Name: Merge Cuboid Dictionary Error Log info: 2016-11-10 14:08:00,798 DEBUG [pool-7-thread-1] dict.DictionaryGenerator:91 : Dictionary value samples: 10101001120172=> 479, 10101003212212=>480, 10101003812579=>481, 10101005033448=>482,

Cube Merge Error

2016-11-09 Thread
Hi, The first step in the cube to merge, #1 Step Name: Merge Cuboid Dictionary Error Log info: 2016-11-10 14:08:00,798 DEBUG [pool-7-thread-1] dict.DictionaryGenerator:91 : Dictionary value samples: 10101001120172=> 479, 10101003212212=>480, 10101003812579=>481, 10101005033448=>482,

Error of migrating cubes

2016-10-17 Thread
Hi, Throw an exception information when using org.apache.kylin.storage.hbase.util .CubeMigrationCLI migration cube, kylin version is kylin-1.5.4.1-HBase1.x. 2016-10-17 14:57:25,595 INFO [main CubeMigrationCLI:325]: Executing operation: ADD_INTO_PROJECT:CUBE[name=testc], testc, test,

kylin cube migrating

2016-10-14 Thread
大家好: 目前在两台服务器上分别搭建了一个kylin servr,想做cube迁移。 能否用案例说明下kylin不同环境下cube迁移工具的参数说明 org.apache.kylin.storage.hbase.util.CubeMigrationCLI 特别是以下两个参数: srcKylinConfigUri: The KylinConfig of the cube’s source dstKylinConfigUri: The KylinConfig of the cube’s new home 谢谢!

cube详细页的SQL标签页报错

2016-10-08 Thread
大家好: 线上环境kylin 1.5.4.1版本,在cube详细页的SQL标签页报错,不能正常显示cube的sql语句,后台日志错误信息为: ERROR [http-bio-8070-exec-2] controller.BasicController:44 : java.lang.NullPointerException at org.apache.kylin.engine.EngineFactory.batchEngine(EngineFactory.java:44) at

kylin-1.5.4同步hive元数据报错

2016-09-19 Thread
大家好: 今天在使用kylin1.5.4版本时,在同步hive元数据时报错: Load Hive Table Metadata From Tree页面一直显示:Loading Databases. 错误信息打印在kylin.out文件 SEVERE: Servlet.service() for servlet [kylin] in context with path [/kylin] threw exception [Handler processing failed; nested exception is java.lang.NoClassDefFoundError:

答复: cube build error in "4 Step Name: Build Dimension Dictionary"

2016-09-12 Thread
非常感谢~ 发件人: 赵天烁 [mailto:zhaotians...@meizu.com] 发送时间: 2016年9月12日 19:30 收件人: 仇同心; user; dev 主题: 回复: cube build error in "4 Step Name: Build Dimension Dictionary" maybe is this issue ,already been solved at 1.5.4 https://issues.apache.org/jira/browse/KYLIN-1834 __

cube build error in "4 Step Name: Build Dimension Dictionary"

2016-09-12 Thread
大家好: 在cube 构建的第四步:Build Dimension Dictionary 报错: java.lang.IllegalArgumentException: Value not exists! at org.apache.kylin.common.util.Dictionary.getIdFromValueBytes(Dictionary.java:162) at org.apache.kylin.dict.TrieDictionary.getIdFromValueImpl(TrieDictionary.java:167)

kylin-1.5.3版本页面登录报错

2016-08-02 Thread
大家好: 升级到1.5.3版本后,启动kylin,登录时kylin.log文件没生成,在kylin.out文件里报错: INFO: Initializing log4j from [classpath:kylin-server-log4j.properties] Aug 02, 2016 2:26:53 PM org.apache.catalina.core.StandardContext listenerStart SEVERE: Exception sending context initialized event to listener instance of class

答复: 答复: kylin查询,报超时异常:Timeout visiting cube!

2016-07-04 Thread
we know; from Day to Month, it need aggregate 30 times data in memory to result set; For Quarter it need more; So when the measure is "memory-hungry" measure (like distinct count, raw, top-n), it is likely to get the out of memory error; you can try to define "month" and "

关于cube build和merge的问题

2016-07-01 Thread
大家好: 目前在kylin的使用过程中遇到以下一个问题: Cube构建时,开始时间设置为:2016-06-27 00:00:00结束时间为:2016-06-27 16:00:00 这样build 任务时,第一步从hive 抽取数据时where 条件是 >=2016-06-27 and <2016-06-27 导致没有抽取到数据,最后保存的hbase里也只是一个空表而已。 为了预计算27号的数据,也build另一个job, 开始时间设置为:2016-06-27 16:00:00结束时间为:2016-06-28 00:00:00 这样build 任务时,第一步从hive

答复: cube drop problem

2016-07-01 Thread
p./bin/metastore.sh reset./bin/metastore.sh > restore $KYLIN_HOME/meta_backups/meta__xx_xx_xx_xx_xx > > > ------ 原始邮件 -- > 发件人: "仇同心";<qiutong...@jd.com>; > 发送时间: 2016年6月30日(星期四) 下午2:18 > 收件人: "u...@kylin.apache.org"<u...@k

答复: kylin查询,报超时异常:Timeout visiting cube!

2016-06-30 Thread
count" measures, are they HyperLogLog counter or Bitmap counter? 2016-06-30 18:09 GMT+08:00 仇同心 <qiutong...@jd.com>: > 大家好: > Kylin查询时报超时异常,sql是: > select b.dim_month_name,sum(a.ordr_amt) as 订单金额, > sum(a.pay_amt) as 支付金额,count(*) as 订单数, > count(distinct

kylin查询,报超时异常:Timeout visiting cube!

2016-06-30 Thread
大家好: Kylin查询时报超时异常,sql是: select b.dim_month_name,sum(a.ordr_amt) as 订单金额, sum(a.pay_amt) as 支付金额,count(*) as 订单数, count(distinct a.user_pin)as 用户数,count(distinct a.is_new) as 新用户数 from dmt.dmt_mem_vip_tx_ordr_det_i_d a left join dim.dim_day b on a.pay_time=b.dim_day_txdate left join

kylin查询,报超时异常:Timeout visiting cube!

2016-06-30 Thread
大家好: Kylin查询时报超时异常,sql是: select b.dim_month_name,sum(a.ordr_amt) as 订单金额, sum(a.pay_amt) as 支付金额,count(*) as 订单数, count(distinct a.user_pin)as 用户数,count(distinct a.is_new) as 新用户数 from dmt.dmt_mem_vip_tx_ordr_det_i_d a left join dim.dim_day b on a.pay_time=b.dim_day_txdate left join

kylin查询,报超时异常:Timeout visiting cube!

2016-06-30 Thread
大家好: Kylin查询时报超时异常,sql是: select b.dim_month_name,sum(a.ordr_amt) as 订单金额, sum(a.pay_amt) as 支付金额,count(*) as 订单数, count(distinct a.user_pin)as 用户数,count(distinct a.is_new) as 新用户数 from dmt.dmt_mem_vip_tx_ordr_det_i_d a left join dim.dim_day b on a.pay_time=b.dim_day_txdate left join

cube drop problem

2016-06-30 Thread
大家好: 在cubes list页面drop掉某些cube后,为啥每次loadAllCubeInstance时报错 2016-06-30 10:47:19,773 ERROR [localhost-startStop-1] cube.CubeManager:862 : Error during load cube instance /cube/cube1 .json java.lang.IllegalStateException: CubeInstance desc not found 'CUBE1', at /cube/cube1.json at

cube drop problem

2016-06-29 Thread
大家好: 在cubes list页面drop掉某些cube后,为啥每次loadAllCubeInstance时报错 2016-06-30 10:47:19,773 ERROR [localhost-startStop-1] cube.CubeManager:862 : Error during load cube instance /cube/cube1 .json java.lang.IllegalStateException: CubeInstance desc not found 'CUBE1', at /cube/cube1.json at

kylin Remote JDBC Driver problem

2016-06-23 Thread
大家好: 用kylin的JDBC查询,查询条件是date类型的,但是查询不出数据,以下是代码: driver = (Driver) Class.forName("org.apache.kylin.jdbc.Driver").newInstance(); Properties info = new Properties(); info.put("user", "ADMIN"); info.put("password", "KYLIN"); conn =

kylin Remote JDBC Driver problem

2016-06-23 Thread
大家好: 用kylin的JDBC查询,查询条件是date类型的,但是查询不出数据,以下是代码: driver = (Driver) Class.forName("org.apache.kylin.jdbc.Driver").newInstance(); Properties info = new Properties(); info.put("user", "ADMIN"); info.put("password", "KYLIN"); conn =

答复: 回复:DISTINCT_COUNT精确计算问题

2016-06-21 Thread
您好: Hash值是否会出现重复呢? 谢谢! 发件人: Weatherpop [mailto:623891...@qq.com] 发送时间: 2016年6月21日 15:11 收件人: u...@kylin.apache.org; dev@kylin.apache.org 主题: 回复:DISTINCT_COUNT精确计算问题 可以自己做一张映射表,把值hash成int后就可以用精确的count distinct了 我们这边目前实践暂时是这样的 -- 原始邮件 -- 发件人: "仇同心"

DISTINCT_COUNT精确计算问题

2016-06-21 Thread
大家好: Hive字段类型为varchar,字段内容也包含英文字母和中文,对这样的字段能否做DISTINCT_COUNT精确计算?如果不能,有什么好的建议吗? 谢谢!

关于measure预计算

2016-06-17 Thread
大家好: 在cube构建时,根据cube 设计时,measure可以有不同的聚合函数。我想找到根据不同的聚合函数来做计算的源码,但是在 // Phase 3: Build Cube addLayerCubingSteps(result, jobId, cuboidRootPath); // layer cubing, only selected algorithm will execute result.addTask(createInMemCubingStep(jobId, cuboidRootPath)); // inmem cubing, only selected

Extract Fact Table Distinct Columns problem

2016-06-16 Thread
Kylin crew: “Extract Fact Table Distinct Columns”, In this step will often happen OOM,I don't know what this step is doing, Can do a simple function description? In this FactDistinctColumnsJob.class , Line 80 ,List columnsNeedDict = cubeMgr.getAllDictColumnsOnFact(cubeDesc) : The method to

Extract Fact Table Distinct Columns problem

2016-06-16 Thread
Kylin crew: “Extract Fact Table Distinct Columns”, In this step will often happen OOM,I don't know what this step is doing, Can do a simple function description? In this FactDistinctColumnsJob.class , Line 80 ,List columnsNeedDict = cubeMgr.getAllDictColumnsOnFact(cubeDesc) : The method to

答复: Questions about kylin page and js modify packaging

2016-06-14 Thread
- 赵天烁 Kevin Zhao zhaotians...@meizu.com<mailto:zhaotians...@meizu.com> 珠海市魅族科技有限公司 MEIZU Technology Co., Ltd. 广东省珠海市科技创新海岸魅族科技楼 MEIZU Tech Bldg., Technology & Innovation Coast Zhuhai, 519085, Guangdong, China meizu.com 发件人: 仇同心 [mailto:qiutong...@jd

Questions about kylin page and js modify packaging

2016-06-14 Thread
您好: 由于根据公司项目使用需要,修改了部分js和html,但是这些修改的js和html要打到生产环境的哪个路径下? 在/kylin/tomcat/路径下没找到相关文件。 附件是相关js贴图。 [cid:image001.png@01D1C659.8536DFD0]