Kylin 2.2.0 cluster installation steps guide.

2017-11-29 Thread Prasanna
Hi all, if anybody is using Kylin 2.2.0 in cluster mode, please guide me on how to set it up. I tried the cluster setup document for older Kylin versions, and I am able to start the Kylin service, but it is giving as , On another node test_cube is in READY status, but on this node it is showing as

Re: availableVirtualCores

2017-11-29 Thread Alberto Ramón
Yes, sorry. When you execute ${KYLIN_HOME}/bin/check-env.sh, it creates a file, ${KYLIN_HOME}/logs/cluster.info, with this text: availableMB=40460 <- correct; availableVirtualCores=3 <- not correct. This file is used by check-spark.sh in lines:

Re: how big cardinal of a column if we want to code a column as dict?

2017-11-29 Thread ShaoFeng Shi
Hi Hao, it depends on how big the dictionary will be and how fast you expect it to be. You can run some profiling tests to verify it. Kylin uses EHCache; if it cannot meet your needs, you can extend it to other cache implementations. 2017-11-29 19:12 GMT+08:00 杨浩 : > Thank

count distinct

2017-11-29 Thread 崔苗
Hi, we want to get count(distinct user) grouped by hour/day/week/month/year. We have a question: what does Kylin keep for count(distinct user), the set of distinct users or just a count? If we want count(distinct user) by year, do we need to keep a year of data in Hive?
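The distinction raised in this question can be illustrated with a minimal sketch. It is not Kylin's implementation; the event data and function names are hypothetical. It only shows why a stored per-day count cannot be rolled up to a longer period, while a stored per-day set of users can.

```python
from collections import defaultdict

# Hypothetical event log of (day, user) pairs; illustrative data only.
events = [
    ("2017-11-27", "alice"),
    ("2017-11-27", "bob"),
    ("2017-11-28", "alice"),
    ("2017-11-28", "carol"),
    ("2017-11-29", "bob"),
]

def daily_user_sets(rows):
    """Group users into one set per day."""
    per_day = defaultdict(set)
    for day, user in rows:
        per_day[day].add(user)
    return dict(per_day)

def rollup_from_counts(per_day):
    """Sum the per-day distinct counts; users seen on several days are counted repeatedly."""
    return sum(len(users) for users in per_day.values())

def rollup_from_sets(per_day):
    """Union the per-day sets, then count; each user is counted once."""
    merged = set()
    for users in per_day.values():
        merged |= users
    return len(merged)

per_day = daily_user_sets(events)
print(rollup_from_counts(per_day))  # 5
print(rollup_from_sets(per_day))    # 3
```

Summing the daily counts gives 5, while the true number of distinct users over the three days is 3, so a roll-up needs something mergeable (a set or a set-like summary) rather than a plain number.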

Re: how big cardinal of a column if we want to code a column as dict?

2017-11-29 Thread 杨浩
Thank you, it has been enabled, as the count_distinct column has been stored in HDFS. What I worry about is that if a big dict is not in the cache, a query may be very slow because it has to fetch data from HBase or HDFS. 2017-11-29 16:58 GMT+08:00 ShaoFeng Shi : > Hi

Re: how big cardinal of a column if we want to code a column as dict?

2017-11-29 Thread ShaoFeng Shi
Hi Hao, Kylin will automatically detect whether a resource's size exceeds the HBase cell's maximum size; if so, it will save the resource to HDFS: https://github.com/apache/kylin/blob/master/storage-hbase/src/main/java/org/apache/kylin/storage/hbase/HBaseResourceStore.java#L419 Please check whether it works on

Re: how big cardinal of a column if we want to code a column as dict?

2017-11-29 Thread 杨浩
I have generated dict data sizes with TrieDictionaryForestBenchmark. If the cardinality is less than 2, the dict size will be less than 802KB. Should the cardinality be less than 2 for a column to be encoded as dict, if we want to speed up queries and the cell size is limited by HBase (less than 1MB)