[jira] [Issue Comment Deleted] (KYLIN-1122) Kylin support detail data query from fact table

2016-01-12 Thread Xiaoyu Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiaoyu Wang updated KYLIN-1122:
---
Comment: was deleted

(was: For "3)", it would be better if adjustSqlDigest also took a "cubeDesc" 
parameter, since the raw custom function scenario will use all columns or 
dimensions.
)

> Kylin support detail data query from fact table
> ---
>
> Key: KYLIN-1122
> URL: https://issues.apache.org/jira/browse/KYLIN-1122
> Project: Kylin
>  Issue Type: New Feature
>  Components: Query Engine
>Affects Versions: v1.2
>Reporter: Xiaoyu Wang
>Assignee: liyang
> Fix For: v2.0, v1.3
>
> Attachments: 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(2.x-staging).patch, 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(update-v2-1.x-staging).patch,
>  
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact-new-impl-under-refactoring-2.x-staging.patch
>
>
> Kylin currently cannot return correct detail rows from the fact table for a 
> query like:
> select column1,column2,column3 from fact_table
> KYLIN-1075 added the "SUM" function on the measure column if one is defined,
> but only numeric column types are supported.
> I changed some code to address this:
> Add a "VALUE" measure function: its output has the same value and datatype as 
> its input.
> To query detail data from the fact table, the following is *required*:
> 1. Configure every column that is not a dimension as a "VALUE" or "SUM" 
> measure. (A column with no measure function configured returns NULL.)
> 2. The source table must have a unique-value column, configured as a 
> dimension.
> If you have a better solution, please comment here.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KYLIN-1122) Kylin support detail data query from fact table

2016-01-12 Thread Xiaoyu Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095778#comment-15095778
 ] 

Xiaoyu Wang commented on KYLIN-1122:


For "3)", it would be better if adjustSqlDigest also took a "cubeDesc" 
parameter, since the raw custom function scenario will use all columns or 
dimensions.


> Kylin support detail data query from fact table
> ---
>
> Key: KYLIN-1122
> URL: https://issues.apache.org/jira/browse/KYLIN-1122
> Project: Kylin
>  Issue Type: New Feature
>  Components: Query Engine
>Affects Versions: v1.2
>Reporter: Xiaoyu Wang
>Assignee: liyang
> Fix For: v2.0, v1.3
>
> Attachments: 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(2.x-staging).patch, 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(update-v2-1.x-staging).patch,
>  
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact-new-impl-under-refactoring-2.x-staging.patch
>
>
> Kylin currently cannot return correct detail rows from the fact table for a 
> query like:
> select column1,column2,column3 from fact_table
> KYLIN-1075 added the "SUM" function on the measure column if one is defined,
> but only numeric column types are supported.
> I changed some code to address this:
> Add a "VALUE" measure function: its output has the same value and datatype as 
> its input.
> To query detail data from the fact table, the following is *required*:
> 1. Configure every column that is not a dimension as a "VALUE" or "SUM" 
> measure. (A column with no measure function configured returns NULL.)
> 2. The source table must have a unique-value column, configured as a 
> dimension.
> If you have a better solution, please comment here.





[jira] [Commented] (KYLIN-1122) Kylin support detail data query from fact table

2016-01-12 Thread Xiaoyu Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095779#comment-15095779
 ] 

Xiaoyu Wang commented on KYLIN-1122:


For "3)", it would be better if adjustSqlDigest also took a "cubeDesc" 
parameter, since the raw custom function scenario will use all columns or 
dimensions.


> Kylin support detail data query from fact table
> ---
>
> Key: KYLIN-1122
> URL: https://issues.apache.org/jira/browse/KYLIN-1122
> Project: Kylin
>  Issue Type: New Feature
>  Components: Query Engine
>Affects Versions: v1.2
>Reporter: Xiaoyu Wang
>Assignee: liyang
> Fix For: v2.0, v1.3
>
> Attachments: 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(2.x-staging).patch, 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(update-v2-1.x-staging).patch,
>  
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact-new-impl-under-refactoring-2.x-staging.patch
>
>
> Kylin currently cannot return correct detail rows from the fact table for a 
> query like:
> select column1,column2,column3 from fact_table
> KYLIN-1075 added the "SUM" function on the measure column if one is defined,
> but only numeric column types are supported.
> I changed some code to address this:
> Add a "VALUE" measure function: its output has the same value and datatype as 
> its input.
> To query detail data from the fact table, the following is *required*:
> 1. Configure every column that is not a dimension as a "VALUE" or "SUM" 
> measure. (A column with no measure function configured returns NULL.)
> 2. The source table must have a unique-value column, configured as a 
> dimension.
> If you have a better solution, please comment here.





[jira] [Created] (KYLIN-1313) Enable deriving dimensions on non PK/FK

2016-01-12 Thread hongbin ma (JIRA)
hongbin ma created KYLIN-1313:
-

 Summary: Enable deriving dimensions on non PK/FK
 Key: KYLIN-1313
 URL: https://issues.apache.org/jira/browse/KYLIN-1313
 Project: Kylin
  Issue Type: Improvement
Reporter: hongbin ma
Assignee: hongbin ma


Currently a derived column has to be a column on a lookup table, and the host 
column it is derived from has to be a PK/FK (which is also a problem when the 
lookup table grows very large). Sometimes columns on the fact table exhibit a 
deriving relationship too. Here's an example fact table:

(dt date, seller_id bigint, seller_name varchar(100), item_id bigint, item_url 
varchar(1000), count decimal, price decimal)

seller_name is uniquely determined by seller_id, and item_url is uniquely 
determined by item_id. Users do not expect to filter on columns like 
seller_name or item_url; they just want to retrieve them when they group or 
filter on other dimensions like seller_id, item_id, or even dt.
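
The deriving relationship described above can be sketched in a few lines of 
Java. This is only an illustration under stated assumptions, not Kylin code: 
the class DerivedColumnSketch and the method buildLookup are hypothetical 
names, and the sketch simply maps each host value (seller_id) to its derived 
value (seller_name), rejecting data where one host value maps to two different 
derived values.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; none of these names exist in Kylin.
class DerivedColumnSketch {

    // Builds host value -> derived value (e.g. seller_id -> seller_name).
    // Assumes derived values are non-null.
    static Map<Long, String> buildLookup(long[] hostValues, String[] derivedValues) {
        if (hostValues.length != derivedValues.length) {
            throw new IllegalArgumentException("column lengths differ");
        }
        Map<Long, String> lookup = new HashMap<>();
        for (int i = 0; i < hostValues.length; i++) {
            String previous = lookup.putIfAbsent(hostValues[i], derivedValues[i]);
            // The relationship only holds if each host value has exactly one derived value.
            if (previous != null && !previous.equals(derivedValues[i])) {
                throw new IllegalStateException(
                        "host value " + hostValues[i] + " maps to more than one derived value");
            }
        }
        return lookup;
    }

    public static void main(String[] args) {
        long[] sellerIds = {1L, 2L, 1L};
        String[] sellerNames = {"alice", "bob", "alice"};
        Map<Long, String> lookup = buildLookup(sellerIds, sellerNames);
        // After grouping by seller_id, seller_name is retrieved by lookup, not stored as a dimension.
        System.out.println(lookup.get(2L));
    }
}
```

With such a lookup, only seller_id needs to be a real dimension; seller_name 
is fetched after grouping or filtering on seller_id.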






[jira] [Closed] (KYLIN-1309) cordova-plugin-inappbrowser v1.1.1 android PDF rendering

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma closed KYLIN-1309.
-
Resolution: Invalid

> cordova-plugin-inappbrowser v1.1.1 android PDF rendering
> 
>
> Key: KYLIN-1309
> URL: https://issues.apache.org/jira/browse/KYLIN-1309
> Project: Kylin
>  Issue Type: Bug
>  Components: Web 
>Reporter: Wade Cantrell
>Assignee: Zhong,Jason
>






[jira] [Created] (KYLIN-1312) Enhance DeployCoprocessorCLI to support Cube level filter

2016-01-12 Thread Dong Li (JIRA)
Dong Li created KYLIN-1312:
--

 Summary: Enhance DeployCoprocessorCLI to support Cube level filter
 Key: KYLIN-1312
 URL: https://issues.apache.org/jira/browse/KYLIN-1312
 Project: Kylin
  Issue Type: Improvement
  Components: Tools, Build and Test
Affects Versions: v2.0
Reporter: Dong Li
Assignee: Dong Li
Priority: Minor


Currently DeployCoprocessorCLI can only filter on tables; we need to enhance it 
to support filtering by cube.





[jira] [Updated] (KYLIN-299) add service to differentiate the deploy env

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-299:
-
Assignee: Dong Li  (was: ZhouQianhao)

> add service to differentiate the deploy env
> ---
>
> Key: KYLIN-299
> URL: https://issues.apache.org/jira/browse/KYLIN-299
> Project: Kylin
>  Issue Type: Wish
>  Components: Tools, Build and Test
>Reporter: Luke Han
>Assignee: Dong Li
>  Labels: github-import
> Fix For: v0.6.5
>
>
> Different deploy environments (dev, qa, prod) may need different configs;
> this feature needs to be added.
>  Imported from GitHub 
> Url: https://github.com/KylinOLAP/Kylin/issues/206
> Created by: [janzhongi|https://github.com/janzhongi]
> Labels: newfeature, 
> Assignee: [janzhongi|https://github.com/janzhongi]
> Created at: Wed Dec 10 10:46:23 CST 2014
> State: open





[jira] [Updated] (KYLIN-480) Add client information into Job

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-480:
-
Assignee: Dong Li  (was: ZhouQianhao)

> Add client information into Job
> ---
>
> Key: KYLIN-480
> URL: https://issues.apache.org/jira/browse/KYLIN-480
> Project: Kylin
>  Issue Type: Wish
>  Components: Job Engine
>Reporter: Luke Han
>Assignee: Dong Li
>  Labels: github-import
> Fix For: v2.0
>
>
> As an administrator and cube owner, I would like to know which user 
> triggered a job, when, and through which client: crontab, web, 3rd-party 
> client...
>  Imported from GitHub 
> Url: https://github.com/KylinOLAP/Kylin/issues/25
> Created by: [lukehan|https://github.com/lukehan]
> Labels: enhancement, 
> Milestone: v0.6.5 Release
> Assignee: [Kejia-Wang|https://github.com/Kejia-Wang]
> Created at: Wed Oct 29 10:09:22 CST 2014
> State: open





[jira] [Updated] (KYLIN-557) HBase KV pair check before and after Bulkload

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-557:
-
Assignee: Dong Li  (was: ZhouQianhao)

> HBase KV pair check before and after Bulkload
> -
>
> Key: KYLIN-557
> URL: https://issues.apache.org/jira/browse/KYLIN-557
> Project: Kylin
>  Issue Type: Wish
>  Components: Job Engine
>Reporter: Luke Han
>Assignee: Dong Li
> Fix For: v2.0
>
>
> Data consistency is key for a cube to serve queries.
> Verify the number of KV pairs before and after bulkload to ensure all data 
> is stored correctly in HBase.





[jira] [Updated] (KYLIN-227) Support "Pause" on Kylin Job

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-227:
-
Assignee: Dong Li  (was: ZhouQianhao)

> Support "Pause" on Kylin Job
> 
>
> Key: KYLIN-227
> URL: https://issues.apache.org/jira/browse/KYLIN-227
> Project: Kylin
>  Issue Type: Wish
>  Components: Job Engine
>Reporter: Luke Han
>Assignee: Dong Li
>  Labels: github-import
> Fix For: Future
>
>
> Add an action called "Pause" to stop the current job; the user can resume 
> the job later.
> ![image|https://cloud.githubusercontent.com/assets/1104017/5556023/54ae27e2-8d07-11e4-8efb-a22c041243ba.png]
>  Imported from GitHub 
> Url: https://github.com/KylinOLAP/Kylin/issues/278
> Created by: [lukehan|https://github.com/lukehan]
> Labels: newfeature, 
> Created at: Fri Dec 26 13:59:03 CST 2014
> State: open





[jira] [Updated] (KYLIN-196) Support Job Priority

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-196:
-
Assignee: Dong Li  (was: ZhouQianhao)

> Support Job Priority
> 
>
> Key: KYLIN-196
> URL: https://issues.apache.org/jira/browse/KYLIN-196
> Project: Kylin
>  Issue Type: New Feature
>  Components: Job Engine
>Reporter: Luke Han
>Assignee: Dong Li
>  Labels: github-import, newbie
> Fix For: v2.0
>
>
> Support Job Priority in cube build engine.
>  Imported from GitHub 
> Url: https://github.com/KylinOLAP/Kylin/issues/309
> Created by: [lukehan|https://github.com/lukehan]
> Labels: newfeature, 
> Milestone: Backlog
> Created at: Fri Dec 26 15:04:45 CST 2014
> State: open





[jira] [Updated] (KYLIN-741) Read data from SparkSQL

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-741:
-
Assignee: Dong Li  (was: ZhouQianhao)

> Read data from SparkSQL
> ---
>
> Key: KYLIN-741
> URL: https://issues.apache.org/jira/browse/KYLIN-741
> Project: Kylin
>  Issue Type: New Feature
>  Components: Job Engine, SparkSQL
>Reporter: Luke Han
>Assignee: Dong Li
> Fix For: Backlog
>
>
> Read data from SparkSQL directly.
> Some deployments expose a SparkSQL interface for consuming data; it would be 
> great if Kylin could read data directly from SparkSQL. 
> This feature does not require the Spark Cube Build Engine to be ready. It 
> could continue to leverage the existing MR cube build engine, process data on 
> the Hadoop cluster, and then persist the cube to HBase.





[jira] [Updated] (KYLIN-1057) No email notification on job failure in some rare cases

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-1057:
--
Assignee: Dong Li  (was: ZhouQianhao)

> No email notification on job failure in some rare cases
> ---
>
> Key: KYLIN-1057
> URL: https://issues.apache.org/jira/browse/KYLIN-1057
> Project: Kylin
>  Issue Type: Bug
>  Components: Job Engine
>Reporter: Shaofeng SHI
>Assignee: Dong Li
>
> Usually Kylin sends an email notification (if configured) on job failure, 
> but today we found that no email was sent for two failed jobs. This slows 
> down the admin's response; the root cause needs to be checked.





[jira] [Updated] (KYLIN-559) put different build job's log into different files

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-559:
-
Assignee: Dong Li  (was: ZhouQianhao)

> put different build job's log into different files
> --
>
> Key: KYLIN-559
> URL: https://issues.apache.org/jira/browse/KYLIN-559
> Project: Kylin
>  Issue Type: Wish
>Reporter: hongbin ma
>Assignee: Dong Li
>Priority: Minor
>
> Currently the logs of all build jobs are mixed together in a single file 
> (kylin_job.log?), which is very unfriendly to those who want to check logs.





[jira] [Updated] (KYLIN-745) Generic Data Reader

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-745:
-
Assignee: Dong Li  (was: ZhouQianhao)

> Generic Data Reader
> ---
>
> Key: KYLIN-745
> URL: https://issues.apache.org/jira/browse/KYLIN-745
> Project: Kylin
>  Issue Type: Wish
>  Components: Job Engine, Spark Engine
>Reporter: Luke Han
>Assignee: Dong Li
>
> When data is stored in an existing DW such as Oracle, Kylin cannot read it 
> directly to build a cube. 
> Many requirements about this are coming from different teams, such as 
> Candor.
> There are two options:
> #1, copy your data to Hive and then build the cube through Kylin. Some users 
> already run this model to bring data from the DW into Hive, and it works 
> very well with Kylin. 
> #2, rewrite the data read module to pull data from Oracle directly. The 
> first step of a cube build already generates a Hive query to read the data 
> into a temp table in Hive, so this should not be too complicated (but it 
> depends on the network and other factors; otherwise #1 will be the more 
> efficient option). Then the cube build proceeds as normal. A generic reader 
> that reads data from any SQL RDBMS through JDBC or another protocol would be 
> the ideal solution, since the cube could be built without an ETL process.
> Scope:
> Only read data directly from an existing RDBMS and store the joined result 
> in Hive (temp table) for further processing, with no other transformation. 
> By design, Kylin is an OLAP system, not an ETL one.





[jira] [Updated] (KYLIN-580) Auto resume jobs after server restart

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-580:
-
Assignee: Dong Li  (was: ZhouQianhao)

> Auto resume jobs after server restart
> -
>
> Key: KYLIN-580
> URL: https://issues.apache.org/jira/browse/KYLIN-580
> Project: Kylin
>  Issue Type: New Feature
>  Components: Job Engine
>Reporter: Luke Han
>Assignee: Dong Li
>Priority: Minor
> Fix For: Backlog
>
>
> Once the Kylin service is re-deployed or restarted, all running jobs are set 
> to pending status. 
> There is a requirement to auto-resume such jobs after these actions.
> Please also provide a setting in kylin.properties that lets the 
> administrator enable or disable this feature.





[jira] [Updated] (KYLIN-1094) improve performance of spark cubing

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-1094:
--
Assignee: Dong Li  (was: ZhouQianhao)

> improve performance of spark cubing
> ---
>
> Key: KYLIN-1094
> URL: https://issues.apache.org/jira/browse/KYLIN-1094
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Spark Engine
>Affects Versions: v2.0
>Reporter: ZhouQianhao
>Assignee: Dong Li
> Fix For: v2.1
>
>
> POC results of Spark cubing show that, on a dataset of 150 million records, 
> MR is about 100% faster than Spark. However, we believe that Spark could be 
> at least as fast as MR, so optimization is needed here.
> We are asking the Spark community for help now.
> The cluster info:
> vm: 8 nodes * (128G mem + 64 core)
> hadoop cluster: hdp 2.2.6
> spark running mode: yarn-client
> spark version: 1.5.1





[jira] [Updated] (KYLIN-744) Spark Cube Build Engine

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-744:
-
Assignee: Dong Li  (was: ZhouQianhao)

> Spark Cube Build Engine
> ---
>
> Key: KYLIN-744
> URL: https://issues.apache.org/jira/browse/KYLIN-744
> Project: Kylin
>  Issue Type: New Feature
>  Components: Spark Engine
>Reporter: Luke Han
>Assignee: Dong Li
>
> An efficient cube build engine using Spark instead of MapReduce jobs.





[jira] [Updated] (KYLIN-1221) Allow user to kill Hadoop job in the job list page

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-1221:
--
Assignee: Dong Li  (was: ZhouQianhao)

> Allow user to kill Hadoop job in the job list page
> --
>
> Key: KYLIN-1221
> URL: https://issues.apache.org/jira/browse/KYLIN-1221
> Project: Kylin
>  Issue Type: Wish
>  Components: Job Engine
>Affects Versions: v1.1, v1.0, v0.7.2, v0.7.1
>Reporter: Shaofeng SHI
>Assignee: Dong Li
>  Labels: newbie
> Fix For: Backlog
>
>
> Sometimes a user wants to abandon the currently running Hadoop job; they 
> should be able to kill the related job directly from the Kylin job page, 
> instead of manually running the yarn or mapred CLI to kill it.
> Besides, when a user discards a Kylin job, the underlying running Hadoop job 
> should also be killed automatically, to release the computing resources in 
> time.





[jira] [Updated] (KYLIN-947) streaming fail at fetching timestamp from kafka

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-947:
-
Assignee: Dong Li  (was: ZhouQianhao)

> streaming fail at fetching timestamp from kafka
> ---
>
> Key: KYLIN-947
> URL: https://issues.apache.org/jira/browse/KYLIN-947
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.0
>Reporter: hongbin ma
>Assignee: Dong Li
>
> It seems Kafka is failing to fetch an offset that lies strictly between 
> startoffset and endoffset, which should not happen at all:
> 15/08/13 21:25:21 INFO streaming.StreamingUtil: topic: 
> Trkng.druid-sessionEvent, partitionId: 1, try to find closest offset with 
> timestamp: 143949810 between offset {4476901430, 4479189591}
> 15/08/13 21:26:25 ERROR streaming.StreamingBootstrap: fail to get start 
> offset partitionId: 1, target timestamp: 143949810
> java.lang.IllegalStateException: try to get timestamp of topic:, 
> partitionId: 1, offset: 4478045510, failed to get StreamMessage from kafka
> at 
> org.apache.kylin.streaming.StreamingUtil.getKafkaMessage(StreamingUtil.java:72)
> at 
> org.apache.kylin.streaming.StreamingUtil.getDataTimestamp(StreamingUtil.java:135)
> at 
> org.apache.kylin.streaming.StreamingUtil.getDataTimestamp(StreamingUtil.java:127)
> at 
> org.apache.kylin.streaming.StreamingUtil.binarySearch(StreamingUtil.java:100)
> at 
> org.apache.kylin.streaming.StreamingUtil.findClosestOffsetWithDataTimestamp(StreamingUtil.java:80)
> at 
> org.apache.kylin.job.streaming.StreamingBootstrap$4.run(StreamingBootstrap.java:265)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:745)
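
The log and stack trace above suggest a binary search over the offset range 
that probes the message timestamp at each midpoint. A minimal sketch of that 
idea follows. It is an assumption-laden illustration, not the actual 
StreamingUtil code: the class and method names are hypothetical, timestamps 
are assumed to be non-decreasing in offset, and the Kafka fetch is replaced by 
a plain function from offset to timestamp (java.util.function requires Java 8).

```java
import java.util.function.LongUnaryOperator;

// Hypothetical illustration only; not the Kylin StreamingUtil implementation.
class ClosestOffsetSketch {

    // Returns the smallest offset in [start, end] whose timestamp is >= targetTs,
    // assuming timestamps never decrease as the offset grows.
    // If every probed timestamp is below targetTs, the result is end.
    static long findFirstOffsetAtOrAfter(long start, long end, long targetTs,
                                         LongUnaryOperator timestampAt) {
        long lo = start;
        long hi = end;
        while (lo < hi) {
            long mid = lo + (hi - lo) / 2; // avoids overflow for large offsets
            // In a real system this probe is a remote fetch and may fail,
            // which is the situation reported in this issue.
            if (timestampAt.applyAsLong(mid) < targetTs) {
                lo = mid + 1;
            } else {
                hi = mid;
            }
        }
        return lo;
    }

    public static void main(String[] args) {
        // Stand-in for the message store: the timestamp is 10 times the offset.
        long offset = findFirstOffsetAtOrAfter(100, 200, 1505, o -> o * 10);
        System.out.println(offset);
    }
}
```

Every midpoint in such a search lies inside the start and end offsets, so a 
failed fetch at a midpoint points at the message store or the fetch call 
rather than at the search bounds.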





[jira] [Updated] (KYLIN-903) automate metadata cleanup job

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-903:
-
Assignee: Dong Li  (was: ZhouQianhao)

> automate metadata cleanup job
> -
>
> Key: KYLIN-903
> URL: https://issues.apache.org/jira/browse/KYLIN-903
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouQianhao
>Assignee: Dong Li
> Fix For: v2.0
>
>






[jira] [Updated] (KYLIN-1241) support machine resource isolation in project level with different kylin_job_conf.xml file

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-1241:
--
Assignee: Dong Li  (was: ZhouQianhao)

> support machine resource isolation in project level with different 
> kylin_job_conf.xml file
> --
>
> Key: KYLIN-1241
> URL: https://issues.apache.org/jira/browse/KYLIN-1241
> Project: Kylin
>  Issue Type: New Feature
>  Components: Job Engine
>Affects Versions: v1.1.1
>Reporter: Wang,Dong
>Assignee: Dong Li
>Priority: Minor
> Fix For: v1.3
>
> Attachments: KYLIN-1241.patch
>
>






[jira] [Updated] (KYLIN-1302) java.lang.NullPointerException

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-1302:
--
Assignee: Dong Li  (was: ZhouQianhao)

> java.lang.NullPointerException
> --
>
> Key: KYLIN-1302
> URL: https://issues.apache.org/jira/browse/KYLIN-1302
> Project: Kylin
>  Issue Type: Bug
>  Components: Job Engine
>Affects Versions: v1.2
> Environment: CDH-5.4.8-1,hadoop-2.6.0,hive-1.1.0,hbase-1.0.0
>Reporter: gewenhui
>Assignee: Dong Li
> Fix For: 1.2
>
>
> In the UI on port 7070, the job ran up to here:
> #2 Step Name: Extract Fact Table Distinct Columns
> 2016-01-10 07:44:35.222 - State of Hadoop job: job_1449729554133_9107:RUNNING 
> - UNDEFINED
> 2016-01-10 07:44:45.236 - State of Hadoop job: 
> job_1449729554133_9107:FINISHED - SUCCEEDED
> org.apache.hadoop.ipc.RemoteException(java.lang.NullPointerException): 
> java.lang.NullPointerException
>   at 
> org.apache.hadoop.mapreduce.v2.hs.HistoryClientService$HSClientProtocolHandler.getCounters(HistoryClientService.java:228)
>   at 
> org.apache.hadoop.mapreduce.v2.api.impl.pb.service.MRClientProtocolPBServiceImpl.getCounters(MRClientProtocolPBServiceImpl.java:159)
>   at 
> org.apache.hadoop.yarn.proto.MRClientProtocol$MRClientProtocolService$2.callBlockingMethod(MRClientProtocol.java:281)
>   at 
> org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:619)
>   at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1060)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2044)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2040)
>   at java.security.AccessController.doPrivileged(Native Method)
>   at javax.security.auth.Subject.doAs(Subject.java:415)
>   at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671)
>   at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2038)





[jira] [Updated] (KYLIN-1269) Can not get last error message in email

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-1269:
--
Assignee: Dong Li  (was: ZhouQianhao)

> Can not get last error message in email
> ---
>
> Key: KYLIN-1269
> URL: https://issues.apache.org/jira/browse/KYLIN-1269
> Project: Kylin
>  Issue Type: Bug
>  Components: Job Engine
>Affects Versions: v1.0, 2.0
>Reporter: fengYu
>Assignee: Dong Li
> Fix For: v2.0
>
> Attachments: 
> 0002-KYLIN-1269-Can-not-get-last-error-message-in-email.patch
>
>
> When I deployed kylin-2.0 in my environment and a job failed, sending the 
> job status change email threw this exception: 
> java.lang.IllegalArgumentException: Illegal group reference
> at java.util.regex.Matcher.appendReplacement(Matcher.java:808)
> at java.util.regex.Matcher.replaceAll(Matcher.java:906)
> at java.lang.String.replaceAll(String.java:2162)
> at 
> org.apache.kylin.engine.mr.CubingJob.formatNotifications(CubingJob.java:140)
> at 
> org.apache.kylin.job.execution.AbstractExecutable.notifyUserStatusChange(AbstractExecutable.java:213)
> at 
> org.apache.kylin.job.execution.DefaultChainedExecutable.onExecuteFinished(DefaultChainedExecutable.java:100)
> at 
> org.apache.kylin.engine.mr.CubingJob.onExecuteFinished(CubingJob.java:166)
> at 
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:113)
> at 
> org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:124)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:745)
> This is because the error message contains special characters like '$' or 
> '\'; replaceAll throws an exception when it encounters them in the 
> replacement string. 
> http://docs.oracle.com/javase/6/docs/api/java/lang/String.html#replaceAll(java.lang.String, java.lang.String)
> What is more, Kylin cannot send the error message of the last executable 
> job; I modified that too...
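
The failure described above can be reproduced and avoided with the standard 
library alone. The sketch below is not the attached patch: the class name, the 
method names, and the ${error_log} placeholder are assumptions made for 
illustration. It shows that String.replaceAll treats '$' and '\' in the 
replacement as special, and that Matcher.quoteReplacement makes the 
replacement literal.

```java
import java.util.regex.Matcher;

// Hypothetical illustration only; the placeholder and names are not taken from Kylin.
class NotificationFormatSketch {

    // Regex that matches the literal text ${error_log} in a template.
    static final String PLACEHOLDER = "\\$\\{error_log\\}";

    // Unsafe: '$' and '\' inside errorLog are interpreted as group references and escapes.
    static String fillUnsafe(String template, String errorLog) {
        return template.replaceAll(PLACEHOLDER, errorLog);
    }

    // Safe: quoteReplacement escapes '$' and '\' so errorLog is inserted verbatim.
    static String fillSafe(String template, String errorLog) {
        return template.replaceAll(PLACEHOLDER, Matcher.quoteReplacement(errorLog));
    }

    public static void main(String[] args) {
        String template = "Job failed: ${error_log}";
        String errorLog = "cannot read $HADOOP_HOME\\conf";
        System.out.println(fillSafe(template, errorLog));
        try {
            fillUnsafe(template, errorLog);
        } catch (IllegalArgumentException e) {
            // '$' followed by a non-digit is rejected as an illegal group reference.
            System.out.println("unsafe variant failed: " + e.getMessage());
        }
    }
}
```

String.replace(CharSequence, CharSequence) is another option when the 
placeholder is a fixed literal, since it performs no regex processing at all.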





[jira] [Created] (KYLIN-1311) Stream cubing auto assignment and load balance

2016-01-12 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1311:
---

 Summary: Stream cubing auto assignment and load balance
 Key: KYLIN-1311
 URL: https://issues.apache.org/jira/browse/KYLIN-1311
 Project: Kylin
  Issue Type: Sub-task
  Components: Job Engine
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.1


This is a sub-task of KYLIN-1117: stream cubing jobs need to be assigned 
across the cluster for load balancing and failover.





[jira] [Issue Comment Deleted] (KYLIN-1302) java.lang.NullPointerException

2016-01-12 Thread gewenhui (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

gewenhui updated KYLIN-1302:

Comment: was deleted

(was: I use CDH-5.4.8-1,hadoop-2.6.0,hive-1.1.0,hbase-1.0.0,kylin-1.2)

> java.lang.NullPointerException
> --
>
> Key: KYLIN-1302
> URL: https://issues.apache.org/jira/browse/KYLIN-1302
> Project: Kylin
>  Issue Type: Bug
>  Components: Job Engine
>Affects Versions: v1.2
> Environment: CDH-5.4.8-1,hadoop-2.6.0,hive-1.1.0,hbase-1.0.0
>Reporter: gewenhui
>Assignee: ZhouQianhao
> Fix For: 1.2
>
>
> In the UI on port 7070, the job ran up to here:
> #2 Step Name: Extract Fact Table Distinct Columns
> 2016-01-10 07:44:35.222 - State of Hadoop job: job_1449729554133_9107:RUNNING 
> - UNDEFINED
> 2016-01-10 07:44:45.236 - State of Hadoop job: 
> job_1449729554133_9107:FINISHED - SUCCEEDED
> org.apache.hadoop.ipc.RemoteException(java.lang.NullPointerException): 
> java.lang.NullPointerException
>   at 
> org.apache.hadoop.mapreduce.v2.hs.HistoryClientService$HSClientProtocolHandler.getCounters(HistoryClientService.java:228)
>   at 
> org.apache.hadoop.mapreduce.v2.api.impl.pb.service.MRClientProtocolPBServiceImpl.getCounters(MRClientProtocolPBServiceImpl.java:159)
>   at 
> org.apache.hadoop.yarn.proto.MRClientProtocol$MRClientProtocolService$2.callBlockingMethod(MRClientProtocol.java:281)
>   at 
> org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:619)
>   at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1060)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2044)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2040)
>   at java.security.AccessController.doPrivileged(Native Method)
>   at javax.security.auth.Subject.doAs(Subject.java:415)
>   at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671)
>   at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2038)





[jira] [Commented] (KYLIN-1302) java.lang.NullPointerException

2016-01-12 Thread gewenhui (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095559#comment-15095559
 ] 

gewenhui commented on KYLIN-1302:
-

I use CDH-5.4.8-1,hadoop-2.6.0,hive-1.1.0,hbase-1.0.0,kylin-1.2

> java.lang.NullPointerException
> --
>
> Key: KYLIN-1302
> URL: https://issues.apache.org/jira/browse/KYLIN-1302
> Project: Kylin
>  Issue Type: Bug
>  Components: Job Engine
>Affects Versions: v1.2
> Environment: CDH-5.4.8-1,hadoop-2.6.0,hive-1.1.0,hbase-1.0.0
>Reporter: gewenhui
>Assignee: ZhouQianhao
> Fix For: 1.2
>
>
> In the UI on port 7070, the job ran up to here:
> #2 Step Name: Extract Fact Table Distinct Columns
> 2016-01-10 07:44:35.222 - State of Hadoop job: job_1449729554133_9107:RUNNING 
> - UNDEFINED
> 2016-01-10 07:44:45.236 - State of Hadoop job: 
> job_1449729554133_9107:FINISHED - SUCCEEDED
> org.apache.hadoop.ipc.RemoteException(java.lang.NullPointerException): 
> java.lang.NullPointerException
>   at 
> org.apache.hadoop.mapreduce.v2.hs.HistoryClientService$HSClientProtocolHandler.getCounters(HistoryClientService.java:228)
>   at 
> org.apache.hadoop.mapreduce.v2.api.impl.pb.service.MRClientProtocolPBServiceImpl.getCounters(MRClientProtocolPBServiceImpl.java:159)
>   at 
> org.apache.hadoop.yarn.proto.MRClientProtocol$MRClientProtocolService$2.callBlockingMethod(MRClientProtocol.java:281)
>   at 
> org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:619)
>   at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1060)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2044)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2040)
>   at java.security.AccessController.doPrivileged(Native Method)
>   at javax.security.auth.Subject.doAs(Subject.java:415)
>   at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671)
>   at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2038)





[jira] [Commented] (KYLIN-1122) Kylin support detail data query from fact table

2016-01-12 Thread hongbin ma (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095517#comment-15095517
 ] 

hongbin ma commented on KYLIN-1122:
---

I think this will be a critical function for Kylin; I'm going to speed up merging it.

> Kylin support detail data query from fact table
> ---
>
> Key: KYLIN-1122
> URL: https://issues.apache.org/jira/browse/KYLIN-1122
> Project: Kylin
>  Issue Type: New Feature
>  Components: Query Engine
>Affects Versions: v1.2
>Reporter: Xiaoyu Wang
>Assignee: liyang
> Fix For: v2.0, v1.3
>
> Attachments: 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(2.x-staging).patch, 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(update-v2-1.x-staging).patch,
>  
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact-new-impl-under-refactoring-2.x-staging.patch
>
>
> Kylin currently cannot return correct detail rows from the fact table for a query like:
> select column1,column2,column3 from fact_table
> KYLIN-1075 added the "SUM" function on the measure column, if one is defined,
> but only numeric column types are supported.
> I changed some code to address this:
> Add a "VALUE" measure function whose output has the same value and datatype 
> as its input.
> To query detail data from the fact table:
> *Requirements*:
> 1. Configure each non-dimension column as a "VALUE" or "SUM" measure. (A 
> column with no measure function configured returns NULL.)
> 2. The source table must have a unique-value column, configured as a 
> dimension.
> If you have a better solution, please comment here.





[jira] [Created] (KYLIN-1310) Auto Merge Time Ranges are filled with default options when edit a cube without auto merge

2016-01-12 Thread Dong Li (JIRA)
Dong Li created KYLIN-1310:
--

 Summary: Auto Merge Time Ranges are filled with default options 
when edit a cube without auto merge
 Key: KYLIN-1310
 URL: https://issues.apache.org/jira/browse/KYLIN-1310
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v2.0
Reporter: Dong Li
Assignee: Zhong,Jason


1. Create a cube without auto merge.
2. Edit the cube and switch to the "Refresh Setting" tab.
3. The auto merge time ranges are filled with default options.





[jira] [Commented] (KYLIN-1309) cordova-plugin-inappbrowser v1.1.1 android PDF rendering

2016-01-12 Thread Wade Cantrell (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15094322#comment-15094322
 ] 

Wade Cantrell commented on KYLIN-1309:
--

I built the app for iOS and Android using Cordova. On iOS it works well and 
shows the PDF file, but on Android it shows only a white page.
Do you have any experience with that?


> cordova-plugin-inappbrowser v1.1.1 android PDF rendering
> 
>
> Key: KYLIN-1309
> URL: https://issues.apache.org/jira/browse/KYLIN-1309
> Project: Kylin
>  Issue Type: Bug
>  Components: Web 
>Reporter: Wade Cantrell
>Assignee: Zhong,Jason
>






[jira] [Created] (KYLIN-1309) cordova-plugin-inappbrowser v1.1.1 android PDF rendering

2016-01-12 Thread Wade Cantrell (JIRA)
Wade Cantrell created KYLIN-1309:


 Summary: cordova-plugin-inappbrowser v1.1.1 android PDF rendering
 Key: KYLIN-1309
 URL: https://issues.apache.org/jira/browse/KYLIN-1309
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Reporter: Wade Cantrell
Assignee: Zhong,Jason








[jira] [Updated] (KYLIN-1304) Incorrect count estimate upon HyperLogLogPlusCounter.merge operation.

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-1304:
--
Assignee: liyang  (was: hongbin ma)

> Incorrect count estimate upon HyperLogLogPlusCounter.merge operation.
> -
>
> Key: KYLIN-1304
> URL: https://issues.apache.org/jira/browse/KYLIN-1304
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v1.1.1
>Reporter: Prashant Prakash
>Assignee: liyang
> Attachments: registers.txt
>
>
> We are using hllc12 to estimate uniques for a measure in a table 
> partitioned by date.
> The unique estimates for the individual dates 2016-01-07, 2016-01-08 and 
> 2016-01-09 are 93,728,324, 90,982,364 and 45,485,278 respectively. 
> But the uniques across days, which are calculated through the 
> HyperLogLogPlusCounter.merge operation, give an incorrect value of 67,980,576. 
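The reported numbers can be checked against a simple invariant: a union of sets can never be smaller than its largest member or larger than the sum of all members. The sketch below is only an illustration of that check, not Kylin's HyperLogLogPlusCounter code. The class and method names are hypothetical, the 5% tolerance is an assumed allowance for estimation error, and the register-wise maximum shown is the textbook HyperLogLog merge rule, which may differ from what Kylin's implementation does.

```java
import java.util.Arrays;

public class HllMergeCheck {
    // Textbook HyperLogLog merge: keep the larger value of each register pair.
    // (Assumption: illustrative only, not taken from Kylin's source.)
    static byte[] merge(byte[] a, byte[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("register arrays must have equal length");
        }
        byte[] out = new byte[a.length];
        for (int i = 0; i < a.length; i++) {
            out[i] = (byte) Math.max(a[i], b[i]);
        }
        return out;
    }

    // A union can never be smaller than its largest member ...
    static long lowerBound(long... counts) {
        return Arrays.stream(counts).max().orElse(0L);
    }

    // ... nor larger than the sum of all members.
    static long upperBound(long... counts) {
        return Arrays.stream(counts).sum();
    }

    // True when the merged estimate lies within the bounds,
    // widened by a relative tolerance for estimation error.
    static boolean plausible(long merged, double tolerance, long... counts) {
        return merged >= lowerBound(counts) * (1 - tolerance)
            && merged <= upperBound(counts) * (1 + tolerance);
    }

    public static void main(String[] args) {
        // Daily estimates and merged estimate as quoted in the issue.
        long[] daily = {93_728_324L, 90_982_364L, 45_485_278L};
        long merged = 67_980_576L;
        System.out.println(plausible(merged, 0.05, daily)); // prints "false"
    }
}
```

With the figures from the issue, the merged estimate falls well below the largest daily estimate, which is consistent with the report that the merge result is wrong.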





[jira] [Assigned] (KYLIN-1304) Incorrect count estimate upon HyperLogLogPlusCounter.merge operation.

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma reassigned KYLIN-1304:
-

Assignee: hongbin ma

> Incorrect count estimate upon HyperLogLogPlusCounter.merge operation.
> -
>
> Key: KYLIN-1304
> URL: https://issues.apache.org/jira/browse/KYLIN-1304
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v1.1.1
>Reporter: Prashant Prakash
>Assignee: hongbin ma
> Attachments: registers.txt
>
>
> We are using hllc12 to estimate uniques for a measure in a table 
> partitioned by date.
> The unique estimates for the individual dates 2016-01-07, 2016-01-08 and 
> 2016-01-09 are 93,728,324, 90,982,364 and 45,485,278 respectively. 
> But the uniques across days, which are calculated through the 
> HyperLogLogPlusCounter.merge operation, give an incorrect value of 67,980,576. 





[jira] [Commented] (KYLIN-1054) Support Hive client Beeline

2016-01-12 Thread huawang (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15094015#comment-15094015
 ] 

huawang commented on KYLIN-1054:


The only change needed is to replace hive -e in AbstractJobBuilder.java with 
beeline -u -e; the Beeline Hive client will then be supported. 
GarbageCollectionStep.java also needs to be changed to use beeline -u -e.
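The difference between the two command lines can be sketched as below. This is a hedged illustration and not the code in AbstractJobBuilder.java or GarbageCollectionStep.java: the class and method names are hypothetical, the JDBC URL is a placeholder, and the quoting only handles double quotes rather than full shell escaping. The -u (connection URL) and -e (statement to execute) options are standard Beeline flags.

```java
public class HiveCommandSketch {
    // Wrap a value in double quotes, escaping embedded double quotes.
    // (Assumption: sufficient for simple statements; not full shell escaping.)
    static String quote(String s) {
        return "\"" + s.replace("\"", "\\\"") + "\"";
    }

    // Command line for the classic Hive CLI.
    static String hiveCommand(String sql) {
        return "hive -e " + quote(sql);
    }

    // Equivalent command line for Beeline, which additionally needs a JDBC URL.
    static String beelineCommand(String jdbcUrl, String sql) {
        return "beeline -u " + quote(jdbcUrl) + " -e " + quote(sql);
    }

    public static void main(String[] args) {
        // Placeholder statement and URL, chosen only for illustration.
        String sql = "DROP TABLE IF EXISTS kylin_intermediate_demo;";
        System.out.println(hiveCommand(sql));
        System.out.println(beelineCommand("jdbc:hive2://localhost:10000", sql));
    }
}
```

In a real deployment the URL would most likely come from configuration, so that switching between the hive shell and Beeline does not require a code change.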

> Support Hive client Beeline
> ---
>
> Key: KYLIN-1054
> URL: https://issues.apache.org/jira/browse/KYLIN-1054
> Project: Kylin
>  Issue Type: Wish
>Affects Versions: v1.0, v0.7.2, v0.7.1
>Reporter: Shaofeng SHI
>
> Some users have mentioned that in their environment the "hive" shell client 
> isn't available and only Beeline is allowed. If Kylin could support Beeline, 
> that would be a nice feature for them.





[jira] [Reopened] (KYLIN-742) Route unsupported queries to Hive (on Spark)

2016-01-12 Thread Shaofeng SHI (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shaofeng SHI reopened KYLIN-742:


I didn't see this in the 2.x branch, so I am reopening it.

> Route unsupported queries to Hive (on Spark)
> 
>
> Key: KYLIN-742
> URL: https://issues.apache.org/jira/browse/KYLIN-742
> Project: Kylin
>  Issue Type: New Feature
>  Components: Query Engine, SparkSQL
>Reporter: Luke Han
>Assignee: liyang
> Fix For: v1.0
>
> Attachments: 1.x-HBase1.x-KYLIN-742-update.patch, 
> 1.x-HBase1.x-KYLIN-742-with-review-comments-#1-update-21_09_2015.patch, 
> Design Documentation.pdf
>
>
> When Kylin can't serve an incoming SQL query, it would be better to route it 
> to an enabled SQL-on-Hadoop engine such as SparkSQL and execute it there, 
> then return the result to the client through the Kylin server.





[jira] [Created] (KYLIN-1308) query storage v2 enable parallel cube visiting

2016-01-12 Thread hongbin ma (JIRA)
hongbin ma created KYLIN-1308:
-

 Summary: query storage v2 enable parallel cube visiting
 Key: KYLIN-1308
 URL: https://issues.apache.org/jira/browse/KYLIN-1308
 Project: Kylin
  Issue Type: Improvement
Reporter: hongbin ma
Assignee: hongbin ma


Currently, if a cube has multiple segments, the endpoint coprocessor invocations 
for the segments are executed sequentially. We'll try to parallelize them to see 
how much that contributes to performance.
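One way to picture the proposed change is a thread pool that submits one task per segment and then merges the results in segment order. The sketch below is an assumption-laden illustration, not Kylin's storage code: visitSegment is a hypothetical stand-in for a single endpoint coprocessor invocation, and segments are represented as plain strings.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ParallelSegmentScan {
    // Hypothetical stand-in for one endpoint coprocessor invocation on a segment.
    static List<String> visitSegment(String segment) {
        return Collections.singletonList("rows-of-" + segment);
    }

    // Visit all segments concurrently and concatenate the results in segment order.
    static List<String> visitAll(List<String> segments, int threads)
            throws InterruptedException, ExecutionException {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Callable<List<String>>> tasks = new ArrayList<>();
            for (String segment : segments) {
                tasks.add(() -> visitSegment(segment));
            }
            List<String> merged = new ArrayList<>();
            // invokeAll blocks until every task is done and returns the
            // futures in the same order as the submitted tasks.
            for (Future<List<String>> future : pool.invokeAll(tasks)) {
                merged.addAll(future.get());
            }
            return merged;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        List<String> segments = new ArrayList<>();
        segments.add("seg1");
        segments.add("seg2");
        segments.add("seg3");
        System.out.println(visitAll(segments, 3));
    }
}
```

Whether this helps in practice depends on how much of the per-segment time is spent waiting on the region servers; the sketch only shows the control flow.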





[jira] [Commented] (KYLIN-1122) Kylin support detail data query from fact table

2016-01-12 Thread Xiaoyu Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093740#comment-15093740
 ] 

Xiaoyu Wang commented on KYLIN-1122:


I created a new branch "KYLIN-1122" based on 2.x-staging and submitted my patch.

> Kylin support detail data query from fact table
> ---
>
> Key: KYLIN-1122
> URL: https://issues.apache.org/jira/browse/KYLIN-1122
> Project: Kylin
>  Issue Type: New Feature
>  Components: Query Engine
>Affects Versions: v1.2
>Reporter: Xiaoyu Wang
>Assignee: liyang
> Fix For: v2.0, v1.3
>
> Attachments: 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(2.x-staging).patch, 
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact(update-v2-1.x-staging).patch,
>  
> 0001-KYLIN-1122-Kylin-support-detail-data-query-from-fact-new-impl-under-refactoring-2.x-staging.patch
>
>
> Kylin currently cannot return correct detail rows from the fact table for a query like:
> select column1,column2,column3 from fact_table
> KYLIN-1075 added the "SUM" function on the measure column, if one is defined,
> but only numeric column types are supported.
> I changed some code to address this:
> Add a "VALUE" measure function whose output has the same value and datatype 
> as its input.
> To query detail data from the fact table:
> *Requirements*:
> 1. Configure each non-dimension column as a "VALUE" or "SUM" measure. (A 
> column with no measure function configured returns NULL.)
> 2. The source table must have a unique-value column, configured as a 
> dimension.
> If you have a better solution, please comment here.





[jira] [Resolved] (KYLIN-1220) JsonMappingException: Can not deserialize instance of java.lang.String out of START_ARRAY

2016-01-12 Thread Zhong,Jason (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhong,Jason resolved KYLIN-1220.

   Resolution: Fixed
Fix Version/s: 2.0

> JsonMappingException: Can not deserialize instance of java.lang.String out of 
> START_ARRAY
> -
>
> Key: KYLIN-1220
> URL: https://issues.apache.org/jira/browse/KYLIN-1220
> Project: Kylin
>  Issue Type: Bug
>  Components: Web 
>Affects Versions: v2.0
> Environment: HDP2.3.2
>Reporter: wangxianbin
>Assignee: Zhong,Jason
> Fix For: 2.0
>
>
> A JsonMappingException occurs when I create a cube. I think it is caused by 
> the field "column" in class "DimensionDesc", which is no longer a String 
> array; however, the web UI did not follow this change.
> L4J [2015-12-10 
> 17:23:50,781][ERROR][org.apache.kylin.rest.controller.CubeController] - The 
> cube definition is not valid.
> com.fasterxml.jackson.databind.JsonMappingException: Can not deserialize 
> instance of java.lang.String out of START_ARRAY token
>  at [Source: java.io.StringReader@407d4132; line: 10, column: 22] (through 
> reference chain: 
> org.apache.kylin.cube.model.CubeDesc["dimensions"]->org.apache.kylin.cube.model.DimensionDesc["column"])
>   at 
> com.fasterxml.jackson.databind.JsonMappingException.from(JsonMappingException.java:164)
>   at 
> com.fasterxml.jackson.databind.DeserializationContext.mappingException(DeserializationContext.java:575)
>   at 
> com.fasterxml.jackson.databind.deser.std.StringDeserializer.deserialize(StringDeserializer.java:46)
>   at 
> com.fasterxml.jackson.databind.deser.std.StringDeserializer.deserialize(StringDeserializer.java:11)
>   at 
> com.fasterxml.jackson.databind.deser.SettableBeanProperty.deserialize(SettableBeanProperty.java:464)
>   at 
> com.fasterxml.jackson.databind.deser.impl.FieldProperty.deserializeAndSet(FieldProperty.java:107)
>   at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.deserializeFromObject(BeanDeserializer.java:295)
>   at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.deserialize(BeanDeserializer.java:121)
>   at 
> com.fasterxml.jackson.databind.deser.std.CollectionDeserializer.deserialize(CollectionDeserializer.java:230)
>   at 
> com.fasterxml.jackson.databind.deser.std.CollectionDeserializer.deserialize(CollectionDeserializer.java:207)
>   at 
> com.fasterxml.jackson.databind.deser.std.CollectionDeserializer.deserialize(CollectionDeserializer.java:23)
>   at 
> com.fasterxml.jackson.databind.deser.SettableBeanProperty.deserialize(SettableBeanProperty.java:464)
>   at 
> com.fasterxml.jackson.databind.deser.impl.FieldProperty.deserializeAndSet(FieldProperty.java:107)
>   at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.deserializeFromObject(BeanDeserializer.java:295)
>   at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.deserialize(BeanDeserializer.java:121)
>   at 
> com.fasterxml.jackson.databind.ObjectMapper._readMapAndClose(ObjectMapper.java:2888)
>   at 
> com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:2034)
>   at org.apache.kylin.common.util.JsonUtil.readValue(JsonUtil.java:54)
>   at 
> org.apache.kylin.rest.controller.CubeController.deserializeCubeDesc(CubeController.java:699)
>   at 
> org.apache.kylin.rest.controller.CubeController.saveCubeDesc(CubeController.java:327)





[jira] [Commented] (KYLIN-1220) JsonMappingException: Can not deserialize instance of java.lang.String out of START_ARRAY

2016-01-12 Thread Zhong,Jason (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093716#comment-15093716
 ] 

Zhong,Jason commented on KYLIN-1220:


It's resolved now.

> JsonMappingException: Can not deserialize instance of java.lang.String out of 
> START_ARRAY
> -
>
> Key: KYLIN-1220
> URL: https://issues.apache.org/jira/browse/KYLIN-1220
> Project: Kylin
>  Issue Type: Bug
>  Components: Web 
>Affects Versions: v2.0
> Environment: HDP2.3.2
>Reporter: wangxianbin
>Assignee: Zhong,Jason
> Fix For: 2.0
>
>
> A JsonMappingException occurs when I create a cube. I think it is caused by 
> the field "column" in class "DimensionDesc", which is no longer a String 
> array; however, the web UI did not follow this change.
> L4J [2015-12-10 
> 17:23:50,781][ERROR][org.apache.kylin.rest.controller.CubeController] - The 
> cube definition is not valid.
> com.fasterxml.jackson.databind.JsonMappingException: Can not deserialize 
> instance of java.lang.String out of START_ARRAY token
>  at [Source: java.io.StringReader@407d4132; line: 10, column: 22] (through 
> reference chain: 
> org.apache.kylin.cube.model.CubeDesc["dimensions"]->org.apache.kylin.cube.model.DimensionDesc["column"])
>   at 
> com.fasterxml.jackson.databind.JsonMappingException.from(JsonMappingException.java:164)
>   at 
> com.fasterxml.jackson.databind.DeserializationContext.mappingException(DeserializationContext.java:575)
>   at 
> com.fasterxml.jackson.databind.deser.std.StringDeserializer.deserialize(StringDeserializer.java:46)
>   at 
> com.fasterxml.jackson.databind.deser.std.StringDeserializer.deserialize(StringDeserializer.java:11)
>   at 
> com.fasterxml.jackson.databind.deser.SettableBeanProperty.deserialize(SettableBeanProperty.java:464)
>   at 
> com.fasterxml.jackson.databind.deser.impl.FieldProperty.deserializeAndSet(FieldProperty.java:107)
>   at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.deserializeFromObject(BeanDeserializer.java:295)
>   at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.deserialize(BeanDeserializer.java:121)
>   at 
> com.fasterxml.jackson.databind.deser.std.CollectionDeserializer.deserialize(CollectionDeserializer.java:230)
>   at 
> com.fasterxml.jackson.databind.deser.std.CollectionDeserializer.deserialize(CollectionDeserializer.java:207)
>   at 
> com.fasterxml.jackson.databind.deser.std.CollectionDeserializer.deserialize(CollectionDeserializer.java:23)
>   at 
> com.fasterxml.jackson.databind.deser.SettableBeanProperty.deserialize(SettableBeanProperty.java:464)
>   at 
> com.fasterxml.jackson.databind.deser.impl.FieldProperty.deserializeAndSet(FieldProperty.java:107)
>   at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.deserializeFromObject(BeanDeserializer.java:295)
>   at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.deserialize(BeanDeserializer.java:121)
>   at 
> com.fasterxml.jackson.databind.ObjectMapper._readMapAndClose(ObjectMapper.java:2888)
>   at 
> com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:2034)
>   at org.apache.kylin.common.util.JsonUtil.readValue(JsonUtil.java:54)
>   at 
> org.apache.kylin.rest.controller.CubeController.deserializeCubeDesc(CubeController.java:699)
>   at 
> org.apache.kylin.rest.controller.CubeController.saveCubeDesc(CubeController.java:327)





[jira] [Resolved] (KYLIN-1251) add toggle event for tree label

2016-01-12 Thread Zhong,Jason (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhong,Jason resolved KYLIN-1251.

   Resolution: Fixed
Fix Version/s: v1.3
   v2.0

> add toggle event for tree label
> ---
>
> Key: KYLIN-1251
> URL: https://issues.apache.org/jira/browse/KYLIN-1251
> Project: Kylin
>  Issue Type: Improvement
>  Components: Web 
>Affects Versions: v2.0, v1.2
>Reporter: Zhong,Jason
>Assignee: Zhong,Jason
> Fix For: v2.0, v1.3
>
>






[jira] [Commented] (KYLIN-1294) Filter of NOT IN (...) can cause incorrect result or NPE

2016-01-12 Thread huawang (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093637#comment-15093637
 ] 

huawang commented on KYLIN-1294:


Do you mean that we can continue to investigate my problem only once 
CALCITE-980 is fixed? 
I don't know whether the incorrect result data is related to CALCITE-980.

> Filter of NOT IN (...) can cause incorrect result or NPE
> 
>
> Key: KYLIN-1294
> URL: https://issues.apache.org/jira/browse/KYLIN-1294
> Project: Kylin
>  Issue Type: Bug
>Reporter: liyang
>Assignee: hongbin ma
>
> I was using version 1.0 and recently updated to 1.2. The problem is as follows:
> Case 1:
>   When I use “not in” in my SQL, the result set contains less data than 
> it should, as shown below.
>  The SQL without “not in” is:
>  SELECT "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE" AS "H_CODE",
>COUNT(1) AS "sum_Number_of_Records_ok",
>SUM("KYLIN_VIEW_TVAD_SUMMARY"."OCCU") AS "sum_OCCU_ok"
>   FROM "BD_WAREHOUSE"."KYLIN_VIEW_TVAD_SUMMARY" "KYLIN_VIEW_TVAD_SUMMARY"
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_CITY_V_KYLIN" 
> "KYLIN_TV_DIM_CITY_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."CITY" =
>"KYLIN_TV_DIM_CITY_V_KYLIN"."CITY")
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_DATE_CUBE_V_KYLIN" 
> "KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."DT" =
>"KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"."DT1")
> GROUP BY "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE"
> The result is as follows:
> (null) 206735 436180
> other169103517   480104320
> 无 25970813161884
> A   42464098 148884448
> B   31945514 121964156
> O  27878957265948
> K   37694048 127243245
> In contrast, when I use SQL with “not in”, as follows:
> SELECT "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE" AS "H_CODE",
>COUNT(1) AS "sum_Number_of_Records_ok",
>SUM("KYLIN_VIEW_TVAD_SUMMARY"."OCCU") AS "sum_OCCU_ok"
>   FROM "BD_WAREHOUSE"."KYLIN_VIEW_TVAD_SUMMARY" "KYLIN_VIEW_TVAD_SUMMARY"
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_CITY_V_KYLIN" 
> "KYLIN_TV_DIM_CITY_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."CITY" =
>"KYLIN_TV_DIM_CITY_V_KYLIN"."CITY")
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_DATE_CUBE_V_KYLIN" 
> "KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."DT" =
>"KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"."DT1")
>  WHERE ("KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE" NOT IN ('other'))
>  GROUP BY "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE"
> The result has only 3 rows, and all values are lower than the true 
> values:
> 无 323911 646111
> A   7463670  21674399
> K   15248274 54972183
> Case2:
>  Sql1: select channel_id, sum(occu) from kylin_view_tvad_summary 
> where channel_id not in (2, 3) group by channel_id.
>  Here, 2 is a value of channel_id in kylin_view_tvad_summary, while 
> 3 is not. The query fails with a null pointer exception like the 
> following:
>  java.sql.SQLException: error while executing SQL "select channel_id, 
> sum(occu) from kylin_view_tvad_summary where channel_id not in (2, 3) group 
> by channel_id
> LIMIT 5": null
>  at org.apache.calcite.avatica.Helper.createException(Helper.java:41)
>  at 
> org.apache.calcite.avatica.AvaticaStatement.executeInternal(AvaticaStatement.java:112)
>  at 
> org.apache.calcite.avatica.AvaticaStatement.executeQuery(AvaticaStatement.java:130)
>  at 
> org.apache.kylin.rest.service.QueryService.execute(QueryService.java:354)
>  at 
> org.apache.kylin.rest.service.QueryService.queryWithSqlMassage(QueryService.java:268)
>  at 
> org.apache.kylin.rest.service.QueryService.query(QueryService.java:114)
>  at 
> org.apache.kylin.rest.service.QueryService$$FastClassByCGLIB$$4957273f.invoke()
>  at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:204)
> at 
> org.springframework.aop.framework.Cglib2AopProxy$DynamicAdvisedInterceptor.intercept(Cglib2AopProxy.java:618)
>  at 
> org.apache.kylin.rest.service.QueryService$$EnhancerByCGLIB$$3a29d57a.query()
>  at 
> org.apache.kylin.rest.controller.QueryController.doQueryWithCache(QueryController.java:178)
>  at 
> org.apache.kylin.rest.controller.QueryController.query(QueryController.java:85)
>  at sun.reflect.GeneratedMethodAccessor153.invoke(Unknown Source)
>  at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  at j

[jira] [Commented] (KYLIN-1294) Filter of NOT IN (...) can cause incorrect result or NPE

2016-01-12 Thread hongbin ma (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093623#comment-15093623
 ] 

hongbin ma commented on KYLIN-1294:
---

This issue is blocked by CALCITE-980, which we hit before we could even reproduce your problem.

> Filter of NOT IN (...) can cause incorrect result or NPE
> 
>
> Key: KYLIN-1294
> URL: https://issues.apache.org/jira/browse/KYLIN-1294
> Project: Kylin
>  Issue Type: Bug
>Reporter: liyang
>Assignee: hongbin ma
>
> I was using version 1.0 and recently updated to 1.2. The problem is as follows:
> Case 1:
>   When I use “not in” in my SQL, the result set contains less data than 
> it should, as shown below.
>  The SQL without “not in” is:
>  SELECT "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE" AS "H_CODE",
>COUNT(1) AS "sum_Number_of_Records_ok",
>SUM("KYLIN_VIEW_TVAD_SUMMARY"."OCCU") AS "sum_OCCU_ok"
>   FROM "BD_WAREHOUSE"."KYLIN_VIEW_TVAD_SUMMARY" "KYLIN_VIEW_TVAD_SUMMARY"
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_CITY_V_KYLIN" 
> "KYLIN_TV_DIM_CITY_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."CITY" =
>"KYLIN_TV_DIM_CITY_V_KYLIN"."CITY")
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_DATE_CUBE_V_KYLIN" 
> "KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."DT" =
>"KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"."DT1")
> GROUP BY "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE"
> The result is as follows:
> (null) 206735 436180
> other169103517   480104320
> 无 25970813161884
> A   42464098 148884448
> B   31945514 121964156
> O  27878957265948
> K   37694048 127243245
> In contrast, when I use SQL with “not in”, as follows:
> SELECT "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE" AS "H_CODE",
>COUNT(1) AS "sum_Number_of_Records_ok",
>SUM("KYLIN_VIEW_TVAD_SUMMARY"."OCCU") AS "sum_OCCU_ok"
>   FROM "BD_WAREHOUSE"."KYLIN_VIEW_TVAD_SUMMARY" "KYLIN_VIEW_TVAD_SUMMARY"
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_CITY_V_KYLIN" 
> "KYLIN_TV_DIM_CITY_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."CITY" =
>"KYLIN_TV_DIM_CITY_V_KYLIN"."CITY")
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_DATE_CUBE_V_KYLIN" 
> "KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."DT" =
>"KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"."DT1")
>  WHERE ("KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE" NOT IN ('other'))
>  GROUP BY "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE"
> The result has only 3 rows, and all values are lower than the true 
> values:
> 无 323911 646111
> A   7463670  21674399
> K   15248274 54972183
> Case2:
>  Sql1: select channel_id, sum(occu) from kylin_view_tvad_summary 
> where channel_id not in (2, 3) group by channel_id.
>  Here, 2 is a value of channel_id in kylin_view_tvad_summary, while 
> 3 is not. The query fails with a null pointer exception like the 
> following:
>  java.sql.SQLException: error while executing SQL "select channel_id, 
> sum(occu) from kylin_view_tvad_summary where channel_id not in (2, 3) group 
> by channel_id
> LIMIT 5": null
>  at org.apache.calcite.avatica.Helper.createException(Helper.java:41)
>  at 
> org.apache.calcite.avatica.AvaticaStatement.executeInternal(AvaticaStatement.java:112)
>  at 
> org.apache.calcite.avatica.AvaticaStatement.executeQuery(AvaticaStatement.java:130)
>  at 
> org.apache.kylin.rest.service.QueryService.execute(QueryService.java:354)
>  at 
> org.apache.kylin.rest.service.QueryService.queryWithSqlMassage(QueryService.java:268)
>  at 
> org.apache.kylin.rest.service.QueryService.query(QueryService.java:114)
>  at 
> org.apache.kylin.rest.service.QueryService$$FastClassByCGLIB$$4957273f.invoke()
>  at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:204)
> at 
> org.springframework.aop.framework.Cglib2AopProxy$DynamicAdvisedInterceptor.intercept(Cglib2AopProxy.java:618)
>  at 
> org.apache.kylin.rest.service.QueryService$$EnhancerByCGLIB$$3a29d57a.query()
>  at 
> org.apache.kylin.rest.controller.QueryController.doQueryWithCache(QueryController.java:178)
>  at 
> org.apache.kylin.rest.controller.QueryController.query(QueryController.java:85)
>  at sun.reflect.GeneratedMethodAccessor153.invoke(Unknown Source)
>  at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  at java.lang.reflect.Method.invoke(Method.java:606)
>  at 
> org.springframework.web.method.support.Invoca

[jira] [Commented] (KYLIN-1294) Filter of NOT IN (...) can cause incorrect result or NPE

2016-01-12 Thread huawang (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093619#comment-15093619
 ] 

huawang commented on KYLIN-1294:


The point is that even when the SQL doesn't throw an NPE, its result set is 
not correct, as shown in the bug description.

> Filter of NOT IN (...) can cause incorrect result or NPE
> 
>
> Key: KYLIN-1294
> URL: https://issues.apache.org/jira/browse/KYLIN-1294
> Project: Kylin
>  Issue Type: Bug
>Reporter: liyang
>Assignee: hongbin ma
>
> I was using version 1.0 and recently updated to 1.2. The problem is as follows:
> Case 1:
>   When I use “not in” in my SQL, the result set contains less data than 
> it should, as shown below.
>  The SQL without “not in” is:
>  SELECT "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE" AS "H_CODE",
>COUNT(1) AS "sum_Number_of_Records_ok",
>SUM("KYLIN_VIEW_TVAD_SUMMARY"."OCCU") AS "sum_OCCU_ok"
>   FROM "BD_WAREHOUSE"."KYLIN_VIEW_TVAD_SUMMARY" "KYLIN_VIEW_TVAD_SUMMARY"
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_CITY_V_KYLIN" 
> "KYLIN_TV_DIM_CITY_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."CITY" =
>"KYLIN_TV_DIM_CITY_V_KYLIN"."CITY")
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_DATE_CUBE_V_KYLIN" 
> "KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."DT" =
>"KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"."DT1")
> GROUP BY "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE"
> The result is as follows:
> (null) 206735 436180
> other169103517   480104320
> 无 25970813161884
> A   42464098 148884448
> B   31945514 121964156
> O  27878957265948
> K   37694048 127243245
> While when I use SQL with “not in”, as follows:
> SELECT "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE" AS "H_CODE",
>COUNT(1) AS "sum_Number_of_Records_ok",
>SUM("KYLIN_VIEW_TVAD_SUMMARY"."OCCU") AS "sum_OCCU_ok"
>   FROM "BD_WAREHOUSE"."KYLIN_VIEW_TVAD_SUMMARY" "KYLIN_VIEW_TVAD_SUMMARY"
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_CITY_V_KYLIN" 
> "KYLIN_TV_DIM_CITY_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."CITY" =
>"KYLIN_TV_DIM_CITY_V_KYLIN"."CITY")
>   LEFT JOIN "BD_WAREHOUSE"."KYLIN_TV_DIM_DATE_CUBE_V_KYLIN" 
> "KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"
> ON ("KYLIN_VIEW_TVAD_SUMMARY"."DT" =
>"KYLIN_TV_DIM_DATE_CUBE_V_KYLIN"."DT1")
>  WHERE ("KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE" NOT IN ('other'))
>  GROUP BY "KYLIN_TV_DIM_CITY_V_KYLIN"."H_CODE"
> The result I got has only 3 rows, and all of the values are lower than the 
> true values:
> 无 323911 646111
> A   7463670  21674399
> K   15248274 54972183
> Case2:
>  Sql1: select channel_id, sum(occu) from kylin_view_tvad_summary 
> where channel_id not in (2, 3) group by channel_id.
>  Here, 2 is an existing channel_id value in kylin_view_tvad_summary, while 
> 3 is not. Running this query throws a null pointer exception, as follows:
>  java.sql.SQLException: error while executing SQL "select channel_id, 
> sum(occu) from kylin_view_tvad_summary where channel_id not in (2, 3) group 
> by channel_id
> LIMIT 5": null
>  at org.apache.calcite.avatica.Helper.createException(Helper.java:41)
>  at 
> org.apache.calcite.avatica.AvaticaStatement.executeInternal(AvaticaStatement.java:112)
>  at 
> org.apache.calcite.avatica.AvaticaStatement.executeQuery(AvaticaStatement.java:130)
>  at 
> org.apache.kylin.rest.service.QueryService.execute(QueryService.java:354)
>  at 
> org.apache.kylin.rest.service.QueryService.queryWithSqlMassage(QueryService.java:268)
>  at 
> org.apache.kylin.rest.service.QueryService.query(QueryService.java:114)
>  at 
> org.apache.kylin.rest.service.QueryService$$FastClassByCGLIB$$4957273f.invoke()
>  at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:204)
> at 
> org.springframework.aop.framework.Cglib2AopProxy$DynamicAdvisedInterceptor.intercept(Cglib2AopProxy.java:618)
>  at 
> org.apache.kylin.rest.service.QueryService$$EnhancerByCGLIB$$3a29d57a.query()
>  at 
> org.apache.kylin.rest.controller.QueryController.doQueryWithCache(QueryController.java:178)
>  at 
> org.apache.kylin.rest.controller.QueryController.query(QueryController.java:85)
>  at sun.reflect.GeneratedMethodAccessor153.invoke(Unknown Source)
>  at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  at java.lang.reflect.Method.invoke(Method.java:606)
>  at 
> or

[jira] [Commented] (KYLIN-1294) Filter of NOT IN (...) can cause incorrect result or NPE

2016-01-12 Thread hongbin ma (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093604#comment-15093604
 ] 

hongbin ma commented on KYLIN-1294:
---

The issue is pending on https://issues.apache.org/jira/browse/CALCITE-980, 
which is fixed in Calcite 1.6. We'll upgrade to Calcite 1.6 as soon as it is 
released.


[jira] [Commented] (KYLIN-1302) java.lang.NullPointerException

2016-01-12 Thread Shaofeng SHI (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093597#comment-15093597
 ] 

Shaofeng SHI commented on KYLIN-1302:
-

Hi gewen, please provide more info about the issue (Hadoop environment, Kylin 
version, etc.) and how to reproduce it; a stack trace alone is often not enough 
for a developer to understand the issue. If you can solve the issue and 
contribute a patch, that would be welcome.

> java.lang.NullPointerException
> --
>
> Key: KYLIN-1302
> URL: https://issues.apache.org/jira/browse/KYLIN-1302
> Project: Kylin
>  Issue Type: Bug
>  Components: Job Engine
>Affects Versions: v1.2
> Environment: CDH-5.4.8-1,hadoop-2.6.0,hive-1.1.0,hbase-1.0.0
>Reporter: gewenhui
>Assignee: ZhouQianhao
> Fix For: 1.2
>
>
> In the web UI (port 7070), the job ran up to this step:
> #2 Step Name: Extract Fact Table Distinct Columns
> 2016-01-10 07:44:35.222 - State of Hadoop job: job_1449729554133_9107:RUNNING 
> - UNDEFINED
> 2016-01-10 07:44:45.236 - State of Hadoop job: 
> job_1449729554133_9107:FINISHED - SUCCEEDED
> org.apache.hadoop.ipc.RemoteException(java.lang.NullPointerException): 
> java.lang.NullPointerException
>   at 
> org.apache.hadoop.mapreduce.v2.hs.HistoryClientService$HSClientProtocolHandler.getCounters(HistoryClientService.java:228)
>   at 
> org.apache.hadoop.mapreduce.v2.api.impl.pb.service.MRClientProtocolPBServiceImpl.getCounters(MRClientProtocolPBServiceImpl.java:159)
>   at 
> org.apache.hadoop.yarn.proto.MRClientProtocol$MRClientProtocolService$2.callBlockingMethod(MRClientProtocol.java:281)
>   at 
> org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:619)
>   at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1060)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2044)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2040)
>   at java.security.AccessController.doPrivileged(Native Method)
>   at javax.security.auth.Subject.doAs(Subject.java:415)
>   at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671)
>   at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2038)





[jira] [Commented] (KYLIN-1082) Hive dependencies should be add to tmpjars

2016-01-12 Thread wangxianbin (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093523#comment-15093523
 ] 

wangxianbin commented on KYLIN-1082:


OK, I think all we need to do is merge the new patch uploaded by fengyu, 
perhaps with a little modification.

> Hive dependencies should be add to tmpjars
> --
>
> Key: KYLIN-1082
> URL: https://issues.apache.org/jira/browse/KYLIN-1082
> Project: Kylin
>  Issue Type: Bug
>Reporter: liyang
>Assignee: Zhong Yanghong
>  Labels: newbie
>
> Currently Kylin assumes that all data nodes have Hive deployed at exactly the 
> same FS location. However, a better approach is to treat Hive as a client-side 
> app, which means we need to ship the Hive jars with the MR job every time.
> This makes deploying Kylin a lot easier in clusters that do not have Hive on 
> all data nodes.
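As a rough illustration of the idea in the description, the sketch below merges extra jar URIs into a comma-separated list of the kind Hadoop keeps under the `tmpjars` job property (the property that `-libjars` populates). It is plain Java with no Hadoop dependency and is not the attached patch; the class name, the method `mergeTmpJars` and all paths are hypothetical.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

final class TmpJarsSketch {
    /**
     * Merges extra jar URIs into an existing comma-separated "tmpjars" value,
     * keeping the original order and dropping duplicates.
     */
    static String mergeTmpJars(String existing, List<String> extraJars) {
        Set<String> jars = new LinkedHashSet<>();
        if (existing != null && !existing.trim().isEmpty()) {
            for (String jar : existing.split(",")) {
                if (!jar.trim().isEmpty()) {
                    jars.add(jar.trim());
                }
            }
        }
        jars.addAll(extraJars);
        return String.join(",", jars);
    }

    public static void main(String[] args) {
        // Hypothetical locations; a real deployment would discover these on the client side.
        String current = "file:///opt/kylin/lib/kylin-job.jar";
        List<String> hiveJars = Arrays.asList(
                "file:///usr/lib/hive/lib/hive-exec.jar",
                "file:///usr/lib/hive/lib/hive-metastore.jar");
        // In a real MR job the merged value would be written back to the job
        // configuration under the "tmpjars" key before the job is submitted.
        System.out.println(mergeTmpJars(current, hiveJars));
    }
}
```

With the jars shipped this way, the data nodes would no longer need a local Hive installation at a fixed path, which is the point of this issue.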





[jira] [Commented] (KYLIN-1082) Hive dependencies should be add to tmpjars

2016-01-12 Thread liyang (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093507#comment-15093507
 ] 

liyang commented on KYLIN-1082:
---

Yes, it's agreed that the Hive jars should be submitted automatically. That's 
why we have this JIRA.

I've invited Zhong, Yanghong to work on this item.






[jira] [Updated] (KYLIN-1082) Hive dependencies should be add to tmpjars

2016-01-12 Thread liyang (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

liyang updated KYLIN-1082:
--
Assignee: Zhong Yanghong






[jira] [Created] (KYLIN-1307) revisit growing dictionary

2016-01-12 Thread hongbin ma (JIRA)
hongbin ma created KYLIN-1307:
-

 Summary: revisit growing dictionary
 Key: KYLIN-1307
 URL: https://issues.apache.org/jira/browse/KYLIN-1307
 Project: Kylin
  Issue Type: Improvement
Reporter: hongbin ma
Assignee: hongbin ma


The optimization in KYLIN-871 will affect segment pruning based on dictionary 
values, which in turn hurts query performance. We will temporarily roll it back 
and wait until it has been thought through more thoroughly.





[jira] [Updated] (KYLIN-871) growing dictionary for streaming case

2016-01-12 Thread hongbin ma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hongbin ma updated KYLIN-871:
-
Description: Streaming cubes will generate a lot of dictionaries, and they 
may differ from each other only a little. A growing dict absorbs new entries 
and generates a bigger dict each time a new segment is built. In this way the 
entries of the dicts are kept under control.
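A minimal sketch of the "growing" idea, assuming a simple value-to-id map: each new segment's values are absorbed into one shared dictionary, values already present keep their ids, and only unseen values get new ids. This is an illustration only and not Kylin's actual dictionary implementation; the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class GrowingDictionarySketch {
    private final Map<String, Integer> idByValue = new HashMap<>();
    private final List<String> valueById = new ArrayList<>();

    /** Absorbs the values of a new segment; returns how many entries were newly added. */
    int absorb(Iterable<String> segmentValues) {
        int added = 0;
        for (String value : segmentValues) {
            if (!idByValue.containsKey(value)) {
                // New values are appended, so ids handed out earlier never change.
                idByValue.put(value, valueById.size());
                valueById.add(value);
                added++;
            }
        }
        return added;
    }

    /** Returns the id of a value, or -1 if the dictionary has never seen it. */
    int idOf(String value) {
        Integer id = idByValue.get(value);
        return id == null ? -1 : id;
    }

    String valueOf(int id) {
        return valueById.get(id);
    }

    int size() {
        return valueById.size();
    }

    public static void main(String[] args) {
        GrowingDictionarySketch dict = new GrowingDictionarySketch();
        System.out.println(dict.absorb(Arrays.asList("beijing", "shanghai")));
        // The second segment overlaps with the first; only the unseen value is added.
        System.out.println(dict.absorb(Arrays.asList("shanghai", "shenzhen")));
        System.out.println(dict.size());
    }
}
```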

> growing dictionary for streaming case
> -
>
> Key: KYLIN-871
> URL: https://issues.apache.org/jira/browse/KYLIN-871
> Project: Kylin
>  Issue Type: Improvement
>Reporter: hongbin ma
>Assignee: hongbin ma
> Fix For: v2.0
>
>
> Streaming cubes will generate a lot of dictionaries, and they may differ from 
> each other only a little. A growing dict absorbs new entries and generates a 
> bigger dict each time a new segment is built. In this way the entries of the 
> dicts are kept under control.


