[jira] [Updated] (KYLIN-4437) Should replace deprecated "mapred.job.name"

2020-03-27 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4437:

Summary: Should replace deprecated  "mapred.job.name"  (was: Should replace 
mapred.job.name, because it's deprecated)

> Should replace deprecated  "mapred.job.name"
> 
>
> Key: KYLIN-4437
> URL: https://issues.apache.org/jira/browse/KYLIN-4437
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>
> mapred.job.name is deprecated.
> see: 
> [https://hadoop.apache.org/docs/r2.10.0/hadoop-project-dist/hadoop-common/DeprecatedProperties.html]
> we should replace it to mapreduce.job.name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4437) Should replace mapred.job.name, because it's deprecated

2020-03-27 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4437:

Summary: Should replace mapred.job.name, because it's deprecated  (was: 
Should replace mapred.job.name, because it is deprecated)

> Should replace mapred.job.name, because it's deprecated
> ---
>
> Key: KYLIN-4437
> URL: https://issues.apache.org/jira/browse/KYLIN-4437
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>
> mapred.job.name is deprecated.
> see: 
> [https://hadoop.apache.org/docs/r2.10.0/hadoop-project-dist/hadoop-common/DeprecatedProperties.html]
> we should replace it to mapreduce.job.name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4437) Should replace mapred.job.name, because it is deprecated

2020-03-27 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4437:
---

 Summary: Should replace mapred.job.name, because it is deprecated
 Key: KYLIN-4437
 URL: https://issues.apache.org/jira/browse/KYLIN-4437
 Project: Kylin
  Issue Type: Improvement
Reporter: ZhouKang
Assignee: ZhouKang


mapred.job.name is deprecated.

see: 
[https://hadoop.apache.org/docs/r2.10.0/hadoop-project-dist/hadoop-common/DeprecatedProperties.html]

we should replace it to mapreduce.job.name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4413) Add canary tool to Kylin

2020-03-09 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4413:
---

 Summary: Add canary tool to Kylin
 Key: KYLIN-4413
 URL: https://issues.apache.org/jira/browse/KYLIN-4413
 Project: Kylin
  Issue Type: Improvement
Reporter: ZhouKang


Canary tool can be used to detect service availability, and also can detect 
query latency.

Many other hadoop system has canary, such as HBase and HDFS.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4398) Display Kylin Version on UI

2020-03-09 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17055532#comment-17055532
 ] 

ZhouKang commented on KYLIN-4398:
-

[~yaho] 

Sure, the url is ok. 

I usually do operation in wrong cluster, so I wonder that whether cluster name 
show in UI will be better.

> Display Kylin Version on UI
> ---
>
> Key: KYLIN-4398
> URL: https://issues.apache.org/jira/browse/KYLIN-4398
> Project: Kylin
>  Issue Type: Improvement
>  Components: Web 
>Reporter: Zhong Yanghong
>Assignee: Julian Pan
>Priority: Minor
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4412) Show cluster name in job notification email title

2020-03-09 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4412:
---

 Summary: Show cluster name in job notification email title
 Key: KYLIN-4412
 URL: https://issues.apache.org/jira/browse/KYLIN-4412
 Project: Kylin
  Issue Type: Improvement
Reporter: ZhouKang
Assignee: ZhouKang






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4398) Display Kylin Version on UI

2020-03-09 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17055506#comment-17055506
 ] 

ZhouKang commented on KYLIN-4398:
-

maybe more information should be displayed on UI, such as Kylin cluster name, 
Kylin env(DEV, TST, PROD).

It's better if could be config in kylin.properties like kylin.web.help

> Display Kylin Version on UI
> ---
>
> Key: KYLIN-4398
> URL: https://issues.apache.org/jira/browse/KYLIN-4398
> Project: Kylin
>  Issue Type: Improvement
>  Components: Web 
>Reporter: Zhong Yanghong
>Assignee: Julian Pan
>Priority: Minor
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (KYLIN-4400) Use beeline as hive client in system-cube.sh

2020-03-03 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang reassigned KYLIN-4400:
---

Assignee: ZhouKang

> Use beeline as hive client in system-cube.sh
> 
>
> Key: KYLIN-4400
> URL: https://issues.apache.org/jira/browse/KYLIN-4400
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>
> Just like sample.sh
> We should use hive-client-mode's client in kylin.properties to submit hive 
> sql.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4401) NPE when bash bin/system-cube.sh setup

2020-03-03 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4401:
---

 Summary: NPE when bash bin/system-cube.sh setup
 Key: KYLIN-4401
 URL: https://issues.apache.org/jira/browse/KYLIN-4401
 Project: Kylin
  Issue Type: Improvement
Reporter: ZhouKang
Assignee: ZhouKang


when use bin/system-cube.sh to setup project KYLIN_SYSTEM cube, there is a NPE
{code:java}
// code placeholder
2020-03-03 22:35:21,206 ERROR [main] cli.CubeSignatureRefresher:105 : 
error2020-03-03 22:35:21,206 ERROR [main] cli.CubeSignatureRefresher:105 : 
errorjava.lang.NullPointerException at 
org.apache.kylin.cube.cli.CubeSignatureRefresher.updateCubeDesc(CubeSignatureRefresher.java:98)
 at 
org.apache.kylin.cube.cli.CubeSignatureRefresher.update(CubeSignatureRefresher.java:74)
 at 
org.apache.kylin.cube.cli.CubeSignatureRefresher.main(CubeSignatureRefresher.java:117)Exception
 in thread "main" java.lang.NullPointerException at 
org.apache.kylin.cube.cli.CubeSignatureRefresher.updateCubeDesc(CubeSignatureRefresher.java:106)
 at 
org.apache.kylin.cube.cli.CubeSignatureRefresher.update(CubeSignatureRefresher.java:74)
 at 
org.apache.kylin.cube.cli.CubeSignatureRefresher.main(CubeSignatureRefresher.java:117)
{code}
It happened when refresh cube signature.  I think it's not such important, but 
will be better if fixed.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4400) Use beeline as hive client in system-cube.sh

2020-03-03 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4400:
---

 Summary: Use beeline as hive client in system-cube.sh
 Key: KYLIN-4400
 URL: https://issues.apache.org/jira/browse/KYLIN-4400
 Project: Kylin
  Issue Type: Improvement
Reporter: ZhouKang


Just like sample.sh

We should use hive-client-mode's client in kylin.properties to submit hive sql.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4231) write conflict while add model by different query server parallelly

2020-03-03 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17049997#comment-17049997
 ] 

ZhouKang commented on KYLIN-4231:
-

[~shaofengshi]

yes, you are right.  I think we should share the http session in all Kylin 
instances.

We use Nginx as load balance. By this rule, we found that the user login info 
cannot shared by different Kylin instance, which made the query request failed.

Maybe we could add a cache to spring framework.

[~xiacongling]  will add this feature later.

> write conflict while add model by different query server parallelly
> ---
>
> Key: KYLIN-4231
> URL: https://issues.apache.org/jira/browse/KYLIN-4231
> Project: Kylin
>  Issue Type: New Feature
>Reporter: ZhouKang
>Priority: Major
>
> a kylin cluster have more than 1 query server, all of them are the backend 
> server of nginx.
> when our user use RESTful API to create model in the *same* project 
> parallelly, there will be a problem.
> the server returns:
> {code:java}
> // code placeholder
> Overwriting conflict /project/learn_kylin.json, expect old TS 1572596034269, 
> but it is 1572596042929
> {code}
> BUT, the model '/model_desc/xxx.json' has already in metastore, so the next 
> time our user want
> to retry (create model), he will get:
> {code:java}
> // code placeholder
> Overwriting conflict /model_desc/test_945.json, expect old TS 0, but it is 
> 1572596193812
> {code}
> I think the problem is that:
> when server A receive PUT model, it will change the project info, but the 
> cache update broadcast cannot be processed by server B before server B 
> processing another model creation request.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4268) build job failed caused by hadoop utils update

2020-02-28 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17048140#comment-17048140
 ] 

ZhouKang commented on KYLIN-4268:
-

I think it's worked using solution 1. 

So close this issue.

> build job failed caused by hadoop utils update
> --
>
> Key: KYLIN-4268
> URL: https://issues.apache.org/jira/browse/KYLIN-4268
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> When we update hadoop package, such as hbase and hive. The job will fill 
> cause by:
>  
> {code:java}
> // code placeholder
> Exception in thread "main" java.io.FileNotFoundException: File 
> file:/home/work/app/kylin-all-2.5.2-2.5.2-HBase2-SNAPSHOT/hbase-2.2.0-3.1.0/lib/hbase-common-2.2.0-3.1.0.jar
>  does not exist
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:534)
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:747)
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:524)
>   at 
> org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:416)
>   at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:337)
>   at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:289)
>   at 
> org.apache.spark.deploy.yarn.Client.copyFileToRemote(Client.scala:406)
>   at 
> org.apache.spark.deploy.yarn.Client.org\$apache\$spark\$deploy\$yarn\$Client\$\$distribute\
> {code}
>  
> The root cause is kylin will generate the full command as soon as the job 
> summited. And the parameter '--jars' use file's absolute path.
>   
> there will be 2 ways to fix this problem.
>  # keep the old package in the same path 
>  # change the generation of  '--jars' to the stage of  job execution.
>  
> which is the better choose? 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Closed] (KYLIN-4268) build job failed caused by hadoop utils update

2020-02-28 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang closed KYLIN-4268.
---
Resolution: Abandoned

> build job failed caused by hadoop utils update
> --
>
> Key: KYLIN-4268
> URL: https://issues.apache.org/jira/browse/KYLIN-4268
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> When we update hadoop package, such as hbase and hive. The job will fill 
> cause by:
>  
> {code:java}
> // code placeholder
> Exception in thread "main" java.io.FileNotFoundException: File 
> file:/home/work/app/kylin-all-2.5.2-2.5.2-HBase2-SNAPSHOT/hbase-2.2.0-3.1.0/lib/hbase-common-2.2.0-3.1.0.jar
>  does not exist
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:534)
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:747)
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:524)
>   at 
> org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:416)
>   at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:337)
>   at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:289)
>   at 
> org.apache.spark.deploy.yarn.Client.copyFileToRemote(Client.scala:406)
>   at 
> org.apache.spark.deploy.yarn.Client.org\$apache\$spark\$deploy\$yarn\$Client\$\$distribute\
> {code}
>  
> The root cause is kylin will generate the full command as soon as the job 
> summited. And the parameter '--jars' use file's absolute path.
>   
> there will be 2 ways to fix this problem.
>  # keep the old package in the same path 
>  # change the generation of  '--jars' to the stage of  job execution.
>  
> which is the better choose? 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4379) Calculate column cardinality cannot use kylin config overwrite cause job failed

2020-02-17 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17038229#comment-17038229
 ] 

ZhouKang commented on KYLIN-4379:
-

Hi, [~nichunen] 
I have updated the details. 

If necessary, I can upload screenshots.

> Calculate column cardinality cannot use kylin config overwrite cause job 
> failed
> ---
>
> Key: KYLIN-4379
> URL: https://issues.apache.org/jira/browse/KYLIN-4379
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>
> Calculate column cardinality submit job to the default queue, which cause job 
> failed.
> It seems this job cannot use kylin config overwrite.
> When you load or reload data from hive to kylin, there will be a checkbox as 
> "Calculate column cardinality".  If this checkbox is selected, Kylin will 
> submit a MR/Spark job.
> When a MR job sumitted to yarn, it will use default queue. But in many 
> environment, it's forbidden. You should use the queue in 
> kylin.engine.mr.config-override.mapreduce.job.queuename



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4379) Calculate column cardinality cannot use kylin config overwrite cause job failed

2020-02-17 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4379:

Description: 
Calculate column cardinality submit job to the default queue, which cause job 
failed.

It seems this job cannot use kylin config overwrite.

When you load or reload data from hive to kylin, there will be a checkbox as 
"Calculate column cardinality".  If this checkbox is selected, Kylin will 
submit a MR/Spark job.
When a MR job sumitted to yarn, it will use default queue. But in many 
environment, it's forbidden. You should use the queue in 
kylin.engine.mr.config-override.mapreduce.job.queuename

  was:
Calculate column cardinality submit job to the default queue, which cause job 
failed.

It seems this job cannot use kylin config overwrite.


> Calculate column cardinality cannot use kylin config overwrite cause job 
> failed
> ---
>
> Key: KYLIN-4379
> URL: https://issues.apache.org/jira/browse/KYLIN-4379
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>
> Calculate column cardinality submit job to the default queue, which cause job 
> failed.
> It seems this job cannot use kylin config overwrite.
> When you load or reload data from hive to kylin, there will be a checkbox as 
> "Calculate column cardinality".  If this checkbox is selected, Kylin will 
> submit a MR/Spark job.
> When a MR job sumitted to yarn, it will use default queue. But in many 
> environment, it's forbidden. You should use the queue in 
> kylin.engine.mr.config-override.mapreduce.job.queuename



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4379) Calculate column cardinality cannot use kylin config overwrite cause job failed

2020-02-16 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4379:

Description: 
Calculate column cardinality submit job to the default queue, which cause job 
failed.

It seems this job cannot use kylin config overwrite.

  was:
Calculate column cardinality submit job to the root queue, which cause job 
failed.

It seems this job cannot use kylin config overwrite.


> Calculate column cardinality cannot use kylin config overwrite cause job 
> failed
> ---
>
> Key: KYLIN-4379
> URL: https://issues.apache.org/jira/browse/KYLIN-4379
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>
> Calculate column cardinality submit job to the default queue, which cause job 
> failed.
> It seems this job cannot use kylin config overwrite.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4379) Calculate column cardinality cannot use kylin config overwrite cause job failed

2020-02-16 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4379:
---

 Summary: Calculate column cardinality cannot use kylin config 
overwrite cause job failed
 Key: KYLIN-4379
 URL: https://issues.apache.org/jira/browse/KYLIN-4379
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang
Assignee: ZhouKang


Calculate column cardinality submit job to the root queue, which cause job 
failed.

It seems this job cannot use kylin config overwrite.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4377) Project's admin cannot operate Hybrid model

2020-02-11 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4377:
---

 Summary: Project's admin cannot operate Hybrid model
 Key: KYLIN-4377
 URL: https://issues.apache.org/jira/browse/KYLIN-4377
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang
Assignee: ZhouKang






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4322) Cost–benefit of compression HBase result

2020-01-30 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17027168#comment-17027168
 ] 

ZhouKang commented on KYLIN-4322:
-

Thank you  [~shaofengshi] 
I will do more work in testing and repairment

> Cost–benefit of compression HBase result
> 
>
> Key: KYLIN-4322
> URL: https://issues.apache.org/jira/browse/KYLIN-4322
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
> Fix For: v3.1.0
>
>
> kylin.storage.hbase.endpoint-compress-result is  TRUE as default.
> In our production environment, when the hbase scan result is larger than 
> 200M, it will take more than 10s to compress data.
> We can find this by hbase's log:
> ||Size||avg rate||min rate||avg time||max time||
> |<1M|0.12|0.25|0.18ms|0.7s|
> |1M ~ 10M|0.39|0.97|0.2s|0.6s|
> |10M ~ 100M|0.47|0.81|2s|6.3s|
> |>100M|0.95|0.96|15.7s|24.8s|
> Notice:
>  # rate: compressed data size / origin data size
>  # when the source data size is < 1M, compressed data may larger than the 
> source data. So the table(Row 1) only calculate then compressed data less 
> than the source data
>  # In our environment, 65% compression data (<1M) is larger than source data 
> When source data is less then 10M, the latency of data transmission is 
> acceptability. When data is larger then 100M, it will take a long time to 
> compress data.
>  
> So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
> default;
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4363) Query failed with "I failed to find one of the right cookies" error

2020-01-30 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17027166#comment-17027166
 ] 

ZhouKang commented on KYLIN-4363:
-

I will revert this commit.



In my test cases, the setting seems worked( have note upgrade coprocessor). 
Coprocessor's log is
{code:java}
// code placeholder
# kylin.storage.hbase.endpoint-compress-result=TRUE
2020-01-31,10:01:18,005 INFO 
[RpcServer.default.RWQ.Fifo.read.handler=315,queue=7,port=24600] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
start query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0 in thread 
RpcServer.default.RWQ.Fifo.read.handler=315,queue=7,port=24600
2020-01-31,10:01:18,015 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTScanRequest: pre aggregating results before 
returning
2020-01-31,10:01:18,015 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTAggregateScanner: setting IGTBypassChecker of child
2020-01-31,10:01:18,468 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTAggregateScanner: GTAggregateScanner input rows: 
399
2020-01-31,10:01:18,639 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTAggregateScanner: closing aggrCache
2020-01-31,10:01:18,639 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Total scanned 399 rows and 150930080 bytes
2020-01-31,10:01:23,903 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Size of final result = 76890905 (82646392 before compressing)

# kylin.storage.hbase.endpoint-compress-result=FALSE
2020-01-31,10:59:54,657 INFO 
[RpcServer.default.RWQ.Fifo.read.handler=406,queue=6,port=24600] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
start query 3645d35d-cb78-10c4-c327-a04acc9ff2a4 in thread 
RpcServer.default.RWQ.Fifo.read.handler=406,queue=6,port=24600
2020-01-31,10:59:54,666 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.gridtable.GTScanRequest: pre aggregating results before 
returning
2020-01-31,10:59:54,666 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.gridtable.GTAggregateScanner: setting IGTBypassChecker of child
2020-01-31,10:59:58,772 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.gridtable.GTAggregateScanner: GTAggregateScanner input rows: 
399
2020-01-31,11:00:00,064 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.gridtable.GTAggregateScanner: closing aggrCache
2020-01-31,11:00:00,064 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Total scanned 399 rows and 150930080 bytes
2020-01-31,11:00:00,103 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Size of final result = 71950057 (71950057 before compressing){code}
 

I will test it fully and repair it later.

> Query failed with "I failed to find one of the right cookies" error
> ---
>
> Key: KYLIN-4363
> URL: https://issues.apache.org/jira/browse/KYLIN-4363
> Project: Kylin
>  Issue Type: Bug
>  Components: Query Engine
>Affects Versions: v3.1.0
>Reporter: Shao Feng Shi
>Assignee: ZhouKang
>Priority: Major
> Fix For: v3.1.0
>
>
> {code:java}
> // code placeholder
> Caused by: java.lang.RuntimeException: I failed to find one of the right 
> cookies. 1701223222Caused by: java.lang.RuntimeException: I failed to find 
> one of the right cookies. 1701223222 at 
> org.roaringbitmap.buffer.ImmutableRoaringArray.(ImmutableRoaringArray.java:46)
>  at 
> org.roaringbitmap.buffer.ImmutableRoaringBitmap.(ImmutableRoaringBitmap.java:908)
>  at 
> org.apache.kylin.measure.bitmap.RoaringBitmapCounter.peekLength(RoaringBitmapCounter.java:141)
>  at 
> org.apache.kylin.measure.bitmap.BitmapSerializer.peekLength(BitmapSerializer.java:69)
>  at 
> org.apache.kylin.cube.gridtable.CubeCodeSystem.codeLength(CubeCodeSystem.java:100)
>  at org.apache.kylin.gridtable.GTRecord.loadColumns(GTRecord.java:279) at 
> org.apache.kylin.storage.gtrecord.PartitionResultIterator.next(PartitionResultIterator.java:56)
>  at 
> org.apache.kylin.storage.gtrecord.PartitionResultIterator.next(PartitionResultIterator.java:35)
>  at com.google.common.collect.Iterators$PeekingImpl.next(Iterators.java:1222) 
> at 
> org.apache.kylin.storage.gtrecord.SortMergedPartitionResultIterator.next(SortMergedPartitionResultIterator.java:93)
>  at 
> org.apache.kylin.storage.gtrecord.SortMergedPart

[jira] [Comment Edited] (KYLIN-4363) Query failed with "I failed to find one of the right cookies" error

2020-01-30 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17027166#comment-17027166
 ] 

ZhouKang edited comment on KYLIN-4363 at 1/31/20 3:39 AM:
--

I will revert this commit.

In my test cases, the setting seems worked( did not upgrade coprocessor). 
Coprocessor's log is
{code:java}
// code placeholder
# kylin.storage.hbase.endpoint-compress-result=TRUE
2020-01-31,10:01:18,005 INFO 
[RpcServer.default.RWQ.Fifo.read.handler=315,queue=7,port=24600] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
start query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0 in thread 
RpcServer.default.RWQ.Fifo.read.handler=315,queue=7,port=24600
2020-01-31,10:01:18,015 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTScanRequest: pre aggregating results before 
returning
2020-01-31,10:01:18,015 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTAggregateScanner: setting IGTBypassChecker of child
2020-01-31,10:01:18,468 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTAggregateScanner: GTAggregateScanner input rows: 
399
2020-01-31,10:01:18,639 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTAggregateScanner: closing aggrCache
2020-01-31,10:01:18,639 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Total scanned 399 rows and 150930080 bytes
2020-01-31,10:01:23,903 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Size of final result = 76890905 (82646392 before compressing)

# kylin.storage.hbase.endpoint-compress-result=FALSE
2020-01-31,10:59:54,657 INFO 
[RpcServer.default.RWQ.Fifo.read.handler=406,queue=6,port=24600] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
start query 3645d35d-cb78-10c4-c327-a04acc9ff2a4 in thread 
RpcServer.default.RWQ.Fifo.read.handler=406,queue=6,port=24600
2020-01-31,10:59:54,666 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.gridtable.GTScanRequest: pre aggregating results before 
returning
2020-01-31,10:59:54,666 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.gridtable.GTAggregateScanner: setting IGTBypassChecker of child
2020-01-31,10:59:58,772 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.gridtable.GTAggregateScanner: GTAggregateScanner input rows: 
399
2020-01-31,11:00:00,064 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.gridtable.GTAggregateScanner: closing aggrCache
2020-01-31,11:00:00,064 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Total scanned 399 rows and 150930080 bytes
2020-01-31,11:00:00,103 INFO [Query 3645d35d-cb78-10c4-c327-a04acc9ff2a4-492] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Size of final result = 71950057 (71950057 before compressing){code}
 

I will test it fully and repair it later.


was (Author: zhoukangcn):
I will revert this commit.



In my test cases, the setting seems worked( have note upgrade coprocessor). 
Coprocessor's log is
{code:java}
// code placeholder
# kylin.storage.hbase.endpoint-compress-result=TRUE
2020-01-31,10:01:18,005 INFO 
[RpcServer.default.RWQ.Fifo.read.handler=315,queue=7,port=24600] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
start query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0 in thread 
RpcServer.default.RWQ.Fifo.read.handler=315,queue=7,port=24600
2020-01-31,10:01:18,015 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTScanRequest: pre aggregating results before 
returning
2020-01-31,10:01:18,015 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTAggregateScanner: setting IGTBypassChecker of child
2020-01-31,10:01:18,468 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTAggregateScanner: GTAggregateScanner input rows: 
399
2020-01-31,10:01:18,639 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.gridtable.GTAggregateScanner: closing aggrCache
2020-01-31,10:01:18,639 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Total scanned 399 rows and 150930080 bytes
2020-01-31,10:01:23,903 INFO [Query 745eb1e9-992e-e4ac-ce15-951ca5b3eae0-401] 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService: 
Size of final result = 76890905 (82646392 before compressing)

# kylin.storage.hbase.endpoint-compress-result=FALSE
2020-01-31,10:59:54,657 INFO 
[RpcServer.default.RWQ.F

[jira] [Assigned] (KYLIN-4357) NPE in drop/cancel unexist job

2020-01-19 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang reassigned KYLIN-4357:
---

Assignee: ZhouKang

> NPE in drop/cancel unexist job
> --
>
> Key: KYLIN-4357
> URL: https://issues.apache.org/jira/browse/KYLIN-4357
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>
> REST api 
>  jobs/\{uuid}/cancel
> jobs/\{uuid}/drop
> if uuid is not exist, NPE happened.
> {code:java}
> // code placeholder
> Caused by: java.lang.NullPointerException
>  at 
> org.apache.kylin.rest.util.AclEvaluate.getProjectByJob(AclEvaluate.java:51)
>  at 
> org.apache.kylin.rest.util.AclEvaluate.checkProjectOperationPermission(AclEvaluate.java:107)
>  at org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:610)
>  at 
> org.apache.kylin.rest.controller.JobController.cancel(JobController.java:204)
>  ... 75 more\n
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4357) NPE in drop/cancel unexist job

2020-01-19 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4357:

Description: 
REST api 
 jobs/\{uuid}/cancel

jobs/\{uuid}/drop

if uuid is not exist, NPE happened.
{code:java}
// code placeholder
Caused by: java.lang.NullPointerException\n\tat 
org.apache.kylin.rest.util.AclEvaluate.getProjectByJob(AclEvaluate.java:51)\n\tat
 
org.apache.kylin.rest.util.AclEvaluate.checkProjectOperationPermission(AclEvaluate.java:107)\n\tat
 org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:610)\n\tat 
org.apache.kylin.rest.controller.JobController.cancel(JobController.java:204)\n\t...
 75 more\n
{code}

  was:
REST api 
jobs/\{uuid}/cancel

jobs/\{uuid}/drop

if uuid is not exist, NPE happened.


> NPE in drop/cancel unexist job
> --
>
> Key: KYLIN-4357
> URL: https://issues.apache.org/jira/browse/KYLIN-4357
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> REST api 
>  jobs/\{uuid}/cancel
> jobs/\{uuid}/drop
> if uuid is not exist, NPE happened.
> {code:java}
> // code placeholder
> Caused by: java.lang.NullPointerException\n\tat 
> org.apache.kylin.rest.util.AclEvaluate.getProjectByJob(AclEvaluate.java:51)\n\tat
>  
> org.apache.kylin.rest.util.AclEvaluate.checkProjectOperationPermission(AclEvaluate.java:107)\n\tat
>  
> org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:610)\n\tat 
> org.apache.kylin.rest.controller.JobController.cancel(JobController.java:204)\n\t...
>  75 more\n
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4357) NPE in drop/cancel unexist job

2020-01-19 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4357:

Description: 
REST api 
 jobs/\{uuid}/cancel

jobs/\{uuid}/drop

if uuid is not exist, NPE happened.
{code:java}
// code placeholder
Caused by: java.lang.NullPointerException
 at org.apache.kylin.rest.util.AclEvaluate.getProjectByJob(AclEvaluate.java:51)
 at 
org.apache.kylin.rest.util.AclEvaluate.checkProjectOperationPermission(AclEvaluate.java:107)
 at org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:610)
 at 
org.apache.kylin.rest.controller.JobController.cancel(JobController.java:204)
 ... 75 more\n
{code}

  was:
REST api 
 jobs/\{uuid}/cancel

jobs/\{uuid}/drop

if uuid is not exist, NPE happened.
{code:java}
// code placeholder
Caused by: java.lang.NullPointerException\n\tat 
org.apache.kylin.rest.util.AclEvaluate.getProjectByJob(AclEvaluate.java:51)
 at 
org.apache.kylin.rest.util.AclEvaluate.checkProjectOperationPermission(AclEvaluate.java:107)
 at org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:610)
 at 
org.apache.kylin.rest.controller.JobController.cancel(JobController.java:204)
 ... 75 more\n
{code}


> NPE in drop/cancel unexist job
> --
>
> Key: KYLIN-4357
> URL: https://issues.apache.org/jira/browse/KYLIN-4357
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> REST api 
>  jobs/\{uuid}/cancel
> jobs/\{uuid}/drop
> if uuid is not exist, NPE happened.
> {code:java}
> // code placeholder
> Caused by: java.lang.NullPointerException
>  at 
> org.apache.kylin.rest.util.AclEvaluate.getProjectByJob(AclEvaluate.java:51)
>  at 
> org.apache.kylin.rest.util.AclEvaluate.checkProjectOperationPermission(AclEvaluate.java:107)
>  at org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:610)
>  at 
> org.apache.kylin.rest.controller.JobController.cancel(JobController.java:204)
>  ... 75 more\n
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4357) NPE in drop/cancel unexist job

2020-01-19 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4357:

Description: 
REST api 
 jobs/\{uuid}/cancel

jobs/\{uuid}/drop

if uuid is not exist, NPE happened.
{code:java}
// code placeholder
Caused by: java.lang.NullPointerException\n\tat 
org.apache.kylin.rest.util.AclEvaluate.getProjectByJob(AclEvaluate.java:51)
 at 
org.apache.kylin.rest.util.AclEvaluate.checkProjectOperationPermission(AclEvaluate.java:107)
 at org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:610)
 at 
org.apache.kylin.rest.controller.JobController.cancel(JobController.java:204)
 ... 75 more\n
{code}

  was:
REST api 
 jobs/\{uuid}/cancel

jobs/\{uuid}/drop

if uuid is not exist, NPE happened.
{code:java}
// code placeholder
Caused by: java.lang.NullPointerException\n\tat 
org.apache.kylin.rest.util.AclEvaluate.getProjectByJob(AclEvaluate.java:51)\n\tat
 
org.apache.kylin.rest.util.AclEvaluate.checkProjectOperationPermission(AclEvaluate.java:107)\n\tat
 org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:610)\n\tat 
org.apache.kylin.rest.controller.JobController.cancel(JobController.java:204)\n\t...
 75 more\n
{code}


> NPE in drop/cancel unexist job
> --
>
> Key: KYLIN-4357
> URL: https://issues.apache.org/jira/browse/KYLIN-4357
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> REST api 
>  jobs/\{uuid}/cancel
> jobs/\{uuid}/drop
> if uuid is not exist, NPE happened.
> {code:java}
> // code placeholder
> Caused by: java.lang.NullPointerException\n\tat 
> org.apache.kylin.rest.util.AclEvaluate.getProjectByJob(AclEvaluate.java:51)
>  at 
> org.apache.kylin.rest.util.AclEvaluate.checkProjectOperationPermission(AclEvaluate.java:107)
>  at org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:610)
>  at 
> org.apache.kylin.rest.controller.JobController.cancel(JobController.java:204)
>  ... 75 more\n
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4357) NPE in drop/cancel unexist job

2020-01-19 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4357:
---

 Summary: NPE in drop/cancel unexist job
 Key: KYLIN-4357
 URL: https://issues.apache.org/jira/browse/KYLIN-4357
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang


REST api 
jobs/\{uuid}/cancel

jobs/\{uuid}/drop

if uuid is not exist, NPE happened.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (KYLIN-4356) Failed "Hive Column Cardinality calculation for table" jobs cannot be delete

2020-01-19 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang reassigned KYLIN-4356:
---

Assignee: ZhouKang

> Failed "Hive Column Cardinality calculation for table" jobs cannot be delete 
> -
>
> Key: KYLIN-4356
> URL: https://issues.apache.org/jira/browse/KYLIN-4356
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>
> In KylinHealthCheck, we found there are too many failed jobs like this:
> "name" : "Hive Column Cardinality calculation for table 'xxx.xxx'"
> the FAILED job cannot be cleanup by MetastoreCleanupJob,
> and the RESTful API  DELETE jobs/\{uuid}/drop cannot work, return exeception 
> {code:java}
> // code placeholder
> Caused by: org.apache.kylin.rest.exception.BadRequestException: Illegal job 
> type, id: 4257d976-706b-90cb-d82b-1828e0128fd1.
>   at 
> org.apache.kylin.rest.service.JobService.getSingleJobInstance(JobService.java:501)
>   at 
> org.apache.kylin.rest.service.JobService.getJobInstance(JobService.java:486)
>   at 
> org.apache.kylin.rest.controller.JobController.dropJob(JobController.java:264)
>   ... 75 more
>  
> {code}
> seems there is no api to delete failed job of "Cardinality calculation" .
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4356) Failed "Hive Column Cardinality calculation for table" jobs cannot be delete

2020-01-19 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4356:
---

 Summary: Failed "Hive Column Cardinality calculation for table" 
jobs cannot be delete 
 Key: KYLIN-4356
 URL: https://issues.apache.org/jira/browse/KYLIN-4356
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang


In KylinHealthCheck, we found there are too many failed jobs like this:

"name" : "Hive Column Cardinality calculation for table 'xxx.xxx'"

the FAILED job cannot be cleanup by MetastoreCleanupJob,

and the RESTful API  DELETE jobs/\{uuid}/drop cannot work, return exeception 
{code:java}
// code placeholder
Caused by: org.apache.kylin.rest.exception.BadRequestException: Illegal job 
type, id: 4257d976-706b-90cb-d82b-1828e0128fd1.
  at 
org.apache.kylin.rest.service.JobService.getSingleJobInstance(JobService.java:501)
  at 
org.apache.kylin.rest.service.JobService.getJobInstance(JobService.java:486)
  at 
org.apache.kylin.rest.controller.JobController.dropJob(JobController.java:264)
  ... 75 more
 
{code}

seems there is no api to delete failed job of "Cardinality calculation" .

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4322) Cost–benefit of compression HBase result

2019-12-31 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4322:

Description: 
kylin.storage.hbase.endpoint-compress-result is  TRUE as default.

In our production environment, when the hbase scan result is larger than 200M, 
it will take more than 10s to compress data.

We can find this by hbase's log:
||Size||avg rate||min rate||avg time||max time||
|<1M|0.12|0.25|0.18ms|0.7s|
|1M ~ 10M|0.39|0.97|0.2s|0.6s|
|10M ~ 100M|0.47|0.81|2s|6.3s|
|>100M|0.95|0.96|15.7s|24.8s|

Notice:
 # rate: compressed data size / origin data size
 # when the source data size is < 1M, compressed data may larger than the 
source data. So the table(Row 1) only calculate then compressed data less than 
the source data
 # In our environment, 65% compression data (<1M) is larger than source data 

When source data is less then 10M, the latency of data transmission is 
acceptability. When data is larger then 100M, it will take a long time to 
compress data.

 

So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
default;

 

  was:
kylin.storage.hbase.endpoint-compress-result is  TRUE as default.

In our production environment, when the hbase scan result is larger than 200M, 
it will take more than 10s to compress data.

We can find this by hbase's log:
||Size||avg rate||max rate||avg time||max time||
|<1M|0.12|0.25|0.18ms|0.7s|
|1M ~ 10M|0.39|0.97|0.2s|0.6s|
|10M ~ 100M|0.47|0.81|2s|6.3s|
|>100M|0.95|0.96|15.7s|24.8s|

Notice:
 # rate: compressed data size / origin data size
 # when the source data size is < 1M, compressed data may larger than the 
source data. So the table(Row 1) only calculate then compressed data less than 
the source data
 # In our environment, 65% compression data (<1M) is larger than source data 

When source data is less then 10M, the latency of data transmission is 
acceptability. When data is larger then 100M, it will take a long time to 
compress data.

 

So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
default;

 


> Cost–benefit of compression HBase result
> 
>
> Key: KYLIN-4322
> URL: https://issues.apache.org/jira/browse/KYLIN-4322
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> kylin.storage.hbase.endpoint-compress-result is  TRUE as default.
> In our production environment, when the hbase scan result is larger than 
> 200M, it will take more than 10s to compress data.
> We can find this by hbase's log:
> ||Size||avg rate||min rate||avg time||max time||
> |<1M|0.12|0.25|0.18ms|0.7s|
> |1M ~ 10M|0.39|0.97|0.2s|0.6s|
> |10M ~ 100M|0.47|0.81|2s|6.3s|
> |>100M|0.95|0.96|15.7s|24.8s|
> Notice:
>  # rate: compressed data size / origin data size
>  # when the source data size is < 1M, compressed data may larger than the 
> source data. So the table(Row 1) only calculate then compressed data less 
> than the source data
>  # In our environment, 65% compression data (<1M) is larger than source data 
> When source data is less then 10M, the latency of data transmission is 
> acceptability. When data is larger then 100M, it will take a long time to 
> compress data.
>  
> So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
> default;
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4322) Cost–benefit of compression HBase result

2019-12-31 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4322:

Description: 
kylin.storage.hbase.endpoint-compress-result is  TRUE as default.

In our production environment, when the hbase scan result is larger than 200M, 
it will take more than 10s to compress data.

We can find this by hbase's log:
||Size||avg rate||max rate||avg time||max time||
|<1M|0.12|0.25|0.18ms|0.7s|
|1M ~ 10M|0.39|0.97|0.2s|0.6s|
|10M ~ 100M|0.47|0.81|2s|6.3s|
|>100M|0.95|0.96|15.7s|24.8s|

Notice:
 # rate: compressed data size / origin data size
 # when the source data size is < 1M, compressed data may larger than the 
source data. So the table(Row 1) only calculate then compressed data less than 
the source data
 # In our environment, 65% compression data (<1M) is larger than source data 

When source data is less then 10M, the latency of data transmission is 
acceptability. When data is larger then 100M, it will take a long time to 
compress data.

 

So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
default;

 

  was:
kylin.storage.hbase.endpoint-compress-result is  TRUE as default.

In our production environment, when the hbase scan result is larger than 200M, 
it will take more than 10s to compress data.

We can find this by hbase's log:
||Size||avg rate||max rate||avg time||max time||
|<1M|0.12|0.25|0.18ms|0.7s|
|1M ~ 10M|0.39|0.97|0.2s|0.6s|
|10M ~ 100M|0.47|0.81|2s|6.3s|
|>100M|0.95|0.96|15.7s|24.8s|

rate: compressed data size / origin data size

 AND please NOTICE that,

when the source data size is less than 1M, 65% compression data is larger than 
source data.

When source data is less then 10M, the latency of data transmission is 
acceptability. When data is larger then 100M, it will take a long time to 
compress data.

 

So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
default;

 


> Cost–benefit of compression HBase result
> 
>
> Key: KYLIN-4322
> URL: https://issues.apache.org/jira/browse/KYLIN-4322
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> kylin.storage.hbase.endpoint-compress-result is  TRUE as default.
> In our production environment, when the hbase scan result is larger than 
> 200M, it will take more than 10s to compress data.
> We can find this by hbase's log:
> ||Size||avg rate||max rate||avg time||max time||
> |<1M|0.12|0.25|0.18ms|0.7s|
> |1M ~ 10M|0.39|0.97|0.2s|0.6s|
> |10M ~ 100M|0.47|0.81|2s|6.3s|
> |>100M|0.95|0.96|15.7s|24.8s|
> Notice:
>  # rate: compressed data size / origin data size
>  # when the source data size is < 1M, compressed data may larger than the 
> source data. So the table(Row 1) only calculate then compressed data less 
> than the source data
>  # In our environment, 65% compression data (<1M) is larger than source data 
> When source data is less then 10M, the latency of data transmission is 
> acceptability. When data is larger then 100M, it will take a long time to 
> compress data.
>  
> So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
> default;
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4322) Cost–benefit of compression HBase result

2019-12-30 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4322:

Summary: Cost–benefit of compression HBase result  (was: ROA of compression 
HBase result)

> Cost–benefit of compression HBase result
> 
>
> Key: KYLIN-4322
> URL: https://issues.apache.org/jira/browse/KYLIN-4322
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> kylin.storage.hbase.endpoint-compress-result is  TRUE as default.
> In our production environment, when the hbase scan result is larger than 
> 200M, it will take more than 10s to compress data.
> We can find this by hbase's log:
> ||Size||avg rate||max rate||avg time||max time||
> |<1M|0.12|0.25|0.18ms|0.7s|
> |1M ~ 10M|0.39|0.97|0.2s|0.6s|
> |10M ~ 100M|0.47|0.81|2s|6.3s|
> |>100M|0.95|0.96|15.7s|24.8s|
> rate: compressed data size / origin data size
>  AND please NOTICE that,
> when the source data size is less than 1M, 65% compression data is larger 
> than source data.
> When source data is less then 10M, the latency of data transmission is 
> acceptability. When data is larger then 100M, it will take a long time to 
> compress data.
>  
> So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
> default;
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4322) ROA of compression HBase result

2019-12-30 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4322:

Summary: ROA of compression HBase result  (was: ROC of compression HBase 
result)

> ROA of compression HBase result
> ---
>
> Key: KYLIN-4322
> URL: https://issues.apache.org/jira/browse/KYLIN-4322
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> kylin.storage.hbase.endpoint-compress-result is  TRUE as default.
> In our production environment, when the hbase scan result is larger than 
> 200M, it will take more than 10s to compress data.
> We can find this by hbase's log:
> ||Size||avg rate||max rate||avg time||max time||
> |<1M|0.12|0.25|0.18ms|0.7s|
> |1M ~ 10M|0.39|0.97|0.2s|0.6s|
> |10M ~ 100M|0.47|0.81|2s|6.3s|
> |>100M|0.95|0.96|15.7s|24.8s|
> rate: compressed data size / origin data size
>  AND please NOTICE that,
> when the source data size is less than 1M, 65% compression data is larger 
> than source data.
> When source data is less then 10M, the latency of data transmission is 
> acceptability. When data is larger then 100M, it will take a long time to 
> compress data.
>  
> So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
> default;
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4322) ROC of compression HBase result

2019-12-30 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4322:

Description: 
kylin.storage.hbase.endpoint-compress-result is  TRUE as default.

In our production environment, when the hbase scan result is larger than 200M, 
it will take more than 10s to compress data.

We can find this by hbase's log:
||Size||avg rate||max rate||avg time||max time||
|<1M|0.12|0.25|0.18ms|0.7s|
|1M ~ 10M|0.39|0.97|0.2s|0.6s|
|10M ~ 100M|0.47|0.81|2s|6.3s|
|>100M|0.95|0.96|15.7s|24.8s|

rate: compressed data size / origin data size

 AND please NOTICE that,

when the source data size is less than 1M, 65% compression data is larger than 
source data.

When source data is less then 10M, the latency of data transmission is 
acceptability. When data is larger then 100M, it will take a long time to 
compress data.

 

So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
default;

 

> ROC of compression HBase result
> ---
>
> Key: KYLIN-4322
> URL: https://issues.apache.org/jira/browse/KYLIN-4322
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> kylin.storage.hbase.endpoint-compress-result is  TRUE as default.
> In our production environment, when the hbase scan result is larger than 
> 200M, it will take more than 10s to compress data.
> We can find this by hbase's log:
> ||Size||avg rate||max rate||avg time||max time||
> |<1M|0.12|0.25|0.18ms|0.7s|
> |1M ~ 10M|0.39|0.97|0.2s|0.6s|
> |10M ~ 100M|0.47|0.81|2s|6.3s|
> |>100M|0.95|0.96|15.7s|24.8s|
> rate: compressed data size / origin data size
>  AND please NOTICE that,
> when the source data size is less than 1M, 65% compression data is larger 
> than source data.
> When source data is less then 10M, the latency of data transmission is 
> acceptability. When data is larger then 100M, it will take a long time to 
> compress data.
>  
> So, I think kylin.storage.hbase.endpoint-compress-result  should be FALSE by 
> default;
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4322) ROC of compression HBase result

2019-12-30 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4322:

Summary: ROC of compression HBase result  (was: HBase)

> ROC of compression HBase result
> ---
>
> Key: KYLIN-4322
> URL: https://issues.apache.org/jira/browse/KYLIN-4322
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4322) HBase

2019-12-30 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4322:
---

 Summary: HBase
 Key: KYLIN-4322
 URL: https://issues.apache.org/jira/browse/KYLIN-4322
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (KYLIN-4227) add segment build callback

2019-12-19 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16999837#comment-16999837
 ] 

ZhouKang edited comment on KYLIN-4227 at 12/19/19 8:58 AM:
---

HTTP callback settings screen shot. see: Attachments

HTTP callback request :
{code:java}
// code placeholder
GET /test?cube=kylin_sales_cube&date_start=132537600&date_end=157515840 
HTTP/1.1
Host: {host}:
Connection: Keep-Alive
{code}
  


was (Author: zhoukangcn):
HTTP callback settings screen shot. see: Attachments

 

HTTP callback request :
{code:java}
// code placeholder
GET /test?cube=kylin_sales_cube&date_start=132537600&date_end=157515840 
HTTP/1.1
Host: {host}:
Connection: Keep-Alive
{code}
 

 

> add segment build callback
> --
>
> Key: KYLIN-4227
> URL: https://issues.apache.org/jira/browse/KYLIN-4227
> Project: Kylin
>  Issue Type: New Feature
>Reporter: ZhouKang
>Priority: Major
> Attachments: 1.png, 2.png
>
>
> add HTTP callback for cube, some other system could use this feature to add a 
> hook when segment building done.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (KYLIN-4227) add segment build callback

2019-12-19 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16999837#comment-16999837
 ] 

ZhouKang edited comment on KYLIN-4227 at 12/19/19 8:30 AM:
---

HTTP callback settings screen shot. see: Attachments

 

HTTP callback request :
{code:java}
// code placeholder
GET /test?cube=kylin_sales_cube&date_start=132537600&date_end=157515840 
HTTP/1.1
Host: {host}:
Connection: Keep-Alive
{code}
 

 


was (Author: zhoukangcn):
HTTP callback settings and HTTP request screen shot

!1.png!

> add segment build callback
> --
>
> Key: KYLIN-4227
> URL: https://issues.apache.org/jira/browse/KYLIN-4227
> Project: Kylin
>  Issue Type: New Feature
>Reporter: ZhouKang
>Priority: Major
> Attachments: 1.png, 2.png
>
>
> add HTTP callback for cube, some other system could use this feature to add a 
> hook when segment building done.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4227) add segment build callback

2019-12-19 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16999837#comment-16999837
 ] 

ZhouKang commented on KYLIN-4227:
-

HTTP callback settings and HTTP request screen shot

!1.png!

> add segment build callback
> --
>
> Key: KYLIN-4227
> URL: https://issues.apache.org/jira/browse/KYLIN-4227
> Project: Kylin
>  Issue Type: New Feature
>Reporter: ZhouKang
>Priority: Major
> Attachments: 1.png, 2.png
>
>
> add HTTP callback for cube, some other system could use this feature to add a 
> hook when segment building done.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4227) add segment build callback

2019-12-19 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4227:

Attachment: 2.png
1.png

> add segment build callback
> --
>
> Key: KYLIN-4227
> URL: https://issues.apache.org/jira/browse/KYLIN-4227
> Project: Kylin
>  Issue Type: New Feature
>Reporter: ZhouKang
>Priority: Major
> Attachments: 1.png, 2.png
>
>
> add HTTP callback for cube, some other system could use this feature to add a 
> hook when segment building done.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4306) rollback when addModelToProject failed.

2019-12-18 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4306:
---

 Summary: rollback when addModelToProject failed.
 Key: KYLIN-4306
 URL: https://issues.apache.org/jira/browse/KYLIN-4306
 Project: Kylin
  Issue Type: Sub-task
Reporter: ZhouKang


write conflict while add model by different query server parallelly, and 
addModelToProject failed.

In KylinClusterHealthCheck, there are some error log caused by this bug:
{code:java}
// code placeholder
Error loading DataModelDesc at /model_desc/model_{xxx}.json
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4293) Backport HBASE-22887 to Kylin HFileOutputFormat3

2019-12-10 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16993121#comment-16993121
 ] 

ZhouKang commented on KYLIN-4293:
-

+1

> Backport HBASE-22887 to Kylin HFileOutputFormat3
> 
>
> Key: KYLIN-4293
> URL: https://issues.apache.org/jira/browse/KYLIN-4293
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: Shao Feng Shi
>Assignee: langdamao
>Priority: Major
> Fix For: v2.6.5, v3.1.0
>
>
> As Kylin forked HBase's HFileOutputFormat2 as HFileOutputFormat3, so this 
> bugfix need be applied in Kylin:
> https://issues.apache.org/jira/browse/HBASE-22887
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4283) Job error because of FileNotFound in Step Garbage Collection

2019-12-06 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4283:
---

 Summary: Job error because of  FileNotFound in Step Garbage 
Collection
 Key: KYLIN-4283
 URL: https://issues.apache.org/jira/browse/KYLIN-4283
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang


 

Job failed because of FileNotFound. 

I think Garbage Collection failed should not make the build job failed
{code:java}
// code placeholder
java.io.FileNotFoundException: File 
/user/s_kylin/kylin_zjyprc_bigdata_staging/kylin_zjyprc_bigdata_staging-kylin_metadata/kylin-ff3f6fc3-2211-3593-1ba4-9b5326a05258/cube
 does not exist.
at 
org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:773)
at 
org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:113)
at 
org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:831)
at 
org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:827)
at 
org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
at 
org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:838)
at 
org.apache.kylin.storage.hbase.steps.HDFSPathGarbageCollectionStep.dropHdfsPathOnCluster(HDFSPathGarbageCollectionStep.java:95)
at 
org.apache.kylin.storage.hbase.steps.HDFSPathGarbageCollectionStep.doWork(HDFSPathGarbageCollectionStep.java:65)
at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:180)
at 
org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:71)
at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:180)
at 
org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:114)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
{code}
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4272) problems of docker/build_image.sh

2019-11-26 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4272:
---

 Summary: problems of docker/build_image.sh
 Key: KYLIN-4272
 URL: https://issues.apache.org/jira/browse/KYLIN-4272
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang


this script can only be executed in sub dir "docker", if your want to execute 
as following, some error got:
{code:java}
// code placeholder
bash docker/build_image.sh{code}
And, the source code's dir name must be kylin, you cannot use other name.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (KYLIN-4178) Job scheduler support safe mode

2019-11-26 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16982364#comment-16982364
 ] 

ZhouKang edited comment on KYLIN-4178 at 11/26/19 11:12 AM:


It does have the problem. Have you submited a PR to fix it? or i can fix it 
later.


was (Author: zhoukangcn):
It does have the problem. Have you submit a PR to fix it? or i can fix it later.

> Job scheduler support safe mode 
> 
>
> Key: KYLIN-4178
> URL: https://issues.apache.org/jira/browse/KYLIN-4178
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouKang
>Priority: Major
> Fix For: v3.0.0
>
> Attachments: image-2019-11-26-18-42-33-473.png
>
>
> Job scheduler should support safe mode in case of the HBase cluster change.
> In xiaomi, we want update the HBase cluster from hbase0.98 to HBase 2.0. The 
> history data can be migrated previously, but the job has been submitted will 
> keep running and write data to the old cluster. So we need a method to ensure 
> that, job create htable in the old cluster will write data to the old 
> cluster, and the job have not create htable should not be scheduled.
> So we need  job scheduler safe mode. Open safe mode before changing cluster 
> config,  the  running jobs can run continuous, and the new job cannot be 
> scheduled. 
> After all running job finished, we can change the cluster config to the new 
> one,  and rest of job can be scheduled again.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4178) Job scheduler support safe mode

2019-11-26 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16982364#comment-16982364
 ] 

ZhouKang commented on KYLIN-4178:
-

It does have the problem. Have you submit a PR to fix it? or i can fix it later.

> Job scheduler support safe mode 
> 
>
> Key: KYLIN-4178
> URL: https://issues.apache.org/jira/browse/KYLIN-4178
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouKang
>Priority: Major
> Fix For: v3.0.0
>
> Attachments: image-2019-11-26-18-42-33-473.png
>
>
> Job scheduler should support safe mode in case of the HBase cluster change.
> In xiaomi, we want update the HBase cluster from hbase0.98 to HBase 2.0. The 
> history data can be migrated previously, but the job has been submitted will 
> keep running and write data to the old cluster. So we need a method to ensure 
> that, job create htable in the old cluster will write data to the old 
> cluster, and the job have not create htable should not be scheduled.
> So we need  job scheduler safe mode. Open safe mode before changing cluster 
> config,  the  running jobs can run continuous, and the new job cannot be 
> scheduled. 
> After all running job finished, we can change the cluster config to the new 
> one,  and rest of job can be scheduled again.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4268) build job failed caused by hadoop utils update

2019-11-25 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4268:

Summary: build job failed caused by hadoop utils update  (was: build job 
failed caused by java lib update)

> build job failed caused by hadoop utils update
> --
>
> Key: KYLIN-4268
> URL: https://issues.apache.org/jira/browse/KYLIN-4268
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> When we update hadoop package, such as hbase and hive. The job will fill 
> cause by:
>  
> {code:java}
> // code placeholder
> Exception in thread "main" java.io.FileNotFoundException: File 
> file:/home/work/app/kylin-all-2.5.2-2.5.2-HBase2-SNAPSHOT/hbase-2.2.0-3.1.0/lib/hbase-common-2.2.0-3.1.0.jar
>  does not exist
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:534)
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:747)
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:524)
>   at 
> org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:416)
>   at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:337)
>   at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:289)
>   at 
> org.apache.spark.deploy.yarn.Client.copyFileToRemote(Client.scala:406)
>   at 
> org.apache.spark.deploy.yarn.Client.org\$apache\$spark\$deploy\$yarn\$Client\$\$distribute\
> {code}
>  
> The root cause is kylin will generate the full command as soon as the job 
> summited. And the parameter '--jars' use file's absolute path.
>   
> there will be 2 ways to fix this problem.
>  # keep the old package in the same path 
>  # change the generation of  '--jars' to the stage of  job execution.
>  
> which is the better choose? 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4268) build job failed caused by java lib update

2019-11-25 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4268:
---

 Summary: build job failed caused by java lib update
 Key: KYLIN-4268
 URL: https://issues.apache.org/jira/browse/KYLIN-4268
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang


When we update hadoop package, such as hbase and hive. The job will fill cause 
by:

 
{code:java}
// code placeholder
Exception in thread "main" java.io.FileNotFoundException: File 
file:/home/work/app/kylin-all-2.5.2-2.5.2-HBase2-SNAPSHOT/hbase-2.2.0-3.1.0/lib/hbase-common-2.2.0-3.1.0.jar
 does not exist
at 
org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:534)
at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:747)
at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:524)
at 
org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:416)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:337)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:289)
at 
org.apache.spark.deploy.yarn.Client.copyFileToRemote(Client.scala:406)
at 
org.apache.spark.deploy.yarn.Client.org\$apache\$spark\$deploy\$yarn\$Client\$\$distribute\
{code}
 

The root cause is kylin will generate the full command as soon as the job 
summited. And the parameter '--jars' use file's absolute path.
 

there will be 2 ways to fix this problem.
 # keep the old package in the same path 
 # change the generation of  '--jars' to the stage of  job execution.

 

which is the better choose?

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4268) build job failed caused by java lib update

2019-11-25 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4268:

Description: 
When we update hadoop package, such as hbase and hive. The job will fill cause 
by:

 
{code:java}
// code placeholder
Exception in thread "main" java.io.FileNotFoundException: File 
file:/home/work/app/kylin-all-2.5.2-2.5.2-HBase2-SNAPSHOT/hbase-2.2.0-3.1.0/lib/hbase-common-2.2.0-3.1.0.jar
 does not exist
at 
org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:534)
at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:747)
at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:524)
at 
org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:416)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:337)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:289)
at 
org.apache.spark.deploy.yarn.Client.copyFileToRemote(Client.scala:406)
at 
org.apache.spark.deploy.yarn.Client.org\$apache\$spark\$deploy\$yarn\$Client\$\$distribute\
{code}
 

The root cause is kylin will generate the full command as soon as the job 
summited. And the parameter '--jars' use file's absolute path.
  

there will be 2 ways to fix this problem.
 # keep the old package in the same path 
 # change the generation of  '--jars' to the stage of  job execution.

 

which is the better choose? 

  was:
When we update hadoop package, such as hbase and hive. The job will fill cause 
by:

 
{code:java}
// code placeholder
Exception in thread "main" java.io.FileNotFoundException: File 
file:/home/work/app/kylin-all-2.5.2-2.5.2-HBase2-SNAPSHOT/hbase-2.2.0-3.1.0/lib/hbase-common-2.2.0-3.1.0.jar
 does not exist
at 
org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:534)
at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:747)
at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:524)
at 
org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:416)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:337)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:289)
at 
org.apache.spark.deploy.yarn.Client.copyFileToRemote(Client.scala:406)
at 
org.apache.spark.deploy.yarn.Client.org\$apache\$spark\$deploy\$yarn\$Client\$\$distribute\
{code}
 

The root cause is kylin will generate the full command as soon as the job 
summited. And the parameter '--jars' use file's absolute path.
 

there will be 2 ways to fix this problem.
 # keep the old package in the same path 
 # change the generation of  '--jars' to the stage of  job execution.

 

which is the better choose?

 

 


> build job failed caused by java lib update
> --
>
> Key: KYLIN-4268
> URL: https://issues.apache.org/jira/browse/KYLIN-4268
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> When we update hadoop package, such as hbase and hive. The job will fill 
> cause by:
>  
> {code:java}
> // code placeholder
> Exception in thread "main" java.io.FileNotFoundException: File 
> file:/home/work/app/kylin-all-2.5.2-2.5.2-HBase2-SNAPSHOT/hbase-2.2.0-3.1.0/lib/hbase-common-2.2.0-3.1.0.jar
>  does not exist
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:534)
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:747)
>   at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:524)
>   at 
> org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:416)
>   at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:337)
>   at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:289)
>   at 
> org.apache.spark.deploy.yarn.Client.copyFileToRemote(Client.scala:406)
>   at 
> org.apache.spark.deploy.yarn.Client.org\$apache\$spark\$deploy\$yarn\$Client\$\$distribute\
> {code}
>  
> The root cause is kylin will generate the full command as soon as the job 
> summited. And the parameter '--jars' use file's absolute path.
>   
> there will be 2 ways to fix this problem.
>  # keep the old package in the same path 
>  # change the generation of  '--jars' to the stage of  job execution.
>  
> which is the better choose? 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4267) Add doc for KYLIN-4178

2019-11-25 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4267:

Issue Type: Task  (was: Bug)

> Add doc for KYLIN-4178
> --
>
> Key: KYLIN-4267
> URL: https://issues.apache.org/jira/browse/KYLIN-4267
> Project: Kylin
>  Issue Type: Task
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4267) Add doc for KYLIN-4178

2019-11-24 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4267:
---

 Summary: Add doc for KYLIN-4178
 Key: KYLIN-4267
 URL: https://issues.apache.org/jira/browse/KYLIN-4267
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang
Assignee: ZhouKang






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (KYLIN-4233) simple time schedule module for kylin job

2019-11-13 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang reassigned KYLIN-4233:
---

Assignee: ZhouKang

> simple time schedule module for kylin job
> -
>
> Key: KYLIN-4233
> URL: https://issues.apache.org/jira/browse/KYLIN-4233
> Project: Kylin
>  Issue Type: New Feature
>Reporter: ZhouKang
>Assignee: ZhouKang
>Priority: Major
>
> add a simple time schedule module, so user can use kylin instead of 
> third-party schedule system



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4244) ClassNotFoundException while use org.apache.kylin.engine.mr.common.CubeStatsReader in bash

2019-11-07 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4244:

Description: 
use org.apache.kylin.engine.mr.common.CubeStatsReader to print estimated size 
for cube

 
{code:java}
// code placeholder
bash ./kylin.sh org.apache.kylin.engine.mr.common.CubeStatsReader {cube_name}
{code}
get an Exception
{code:java}
// code placeholder
Exception in thread "main" java.lang.NoClassDefFoundError: 
com/tdunning/math/stats/TDigest
at 
org.apache.kylin.measure.percentile.PercentileSerializer.current(PercentileSerializer.java:62)
at 
org.apache.kylin.measure.percentile.PercentileSerializer.getStorageBytesEstimate(PercentileSerializer.java:52)
at 
org.apache.kylin.metadata.datatype.DataType.getStorageBytesEstimate(DataType.java:256)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.estimateCuboidStorageSize(CubeStatsReader.java:251)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.getCuboidSizeMapFromRowCount(CubeStatsReader.java:211)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.getCuboidSizeMap(CubeStatsReader.java:170)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.print(CubeStatsReader.java:273)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.main(CubeStatsReader.java:435)
Caused by: java.lang.ClassNotFoundException: com.tdunning.math.stats.TDigest
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
... 8 more
{code}
 

  was:
use kylin.sh

 
{code:java}
// code placeholder
bash ./kylin.sh org.apache.kylin.engine.mr.common.CubeStatsReader 
caifeng_usage_cube
{code}
get an Exception
{code:java}
// code placeholder
Exception in thread "main" java.lang.NoClassDefFoundError: 
com/tdunning/math/stats/TDigest
at 
org.apache.kylin.measure.percentile.PercentileSerializer.current(PercentileSerializer.java:62)
at 
org.apache.kylin.measure.percentile.PercentileSerializer.getStorageBytesEstimate(PercentileSerializer.java:52)
at 
org.apache.kylin.metadata.datatype.DataType.getStorageBytesEstimate(DataType.java:256)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.estimateCuboidStorageSize(CubeStatsReader.java:251)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.getCuboidSizeMapFromRowCount(CubeStatsReader.java:211)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.getCuboidSizeMap(CubeStatsReader.java:170)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.print(CubeStatsReader.java:273)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.main(CubeStatsReader.java:435)
Caused by: java.lang.ClassNotFoundException: com.tdunning.math.stats.TDigest
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
... 8 more
{code}
 


> ClassNotFoundException while use 
> org.apache.kylin.engine.mr.common.CubeStatsReader in bash
> --
>
> Key: KYLIN-4244
> URL: https://issues.apache.org/jira/browse/KYLIN-4244
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> use org.apache.kylin.engine.mr.common.CubeStatsReader to print estimated size 
> for cube
>  
> {code:java}
> // code placeholder
> bash ./kylin.sh org.apache.kylin.engine.mr.common.CubeStatsReader {cube_name}
> {code}
> get an Exception
> {code:java}
> // code placeholder
> Exception in thread "main" java.lang.NoClassDefFoundError: 
> com/tdunning/math/stats/TDigest
>   at 
> org.apache.kylin.measure.percentile.PercentileSerializer.current(PercentileSerializer.java:62)
>   at 
> org.apache.kylin.measure.percentile.PercentileSerializer.getStorageBytesEstimate(PercentileSerializer.java:52)
>   at 
> org.apache.kylin.metadata.datatype.DataType.getStorageBytesEstimate(DataType.java:256)
>   at 
> org.apache.kylin.engine.mr.common.CubeStatsReader.estimateCuboidStorageSize(CubeStatsReader.java:251)
>   at 
> org.apache.kylin.engine.mr.common.CubeStatsReader.getCuboidSizeMapFromRowCount(CubeStatsReader.java:211)
>   at 
> org.apache.kylin.engine.mr.common.CubeStatsReader.getCuboidSizeMap(CubeStatsReader.java:170)
>   at 
> org.apache.kylin.engine.mr.common.CubeStatsReader.print(CubeStatsReader.java:273)
>   at 
> org.apache.kylin.engine.mr.common.CubeStatsReader.main(CubeStatsReader.java:435)
> Caused by: java.lang.ClassNotFoundE

[jira] [Created] (KYLIN-4244) ClassNotFoundException while use org.apache.kylin.engine.mr.common.CubeStatsReader in bash

2019-11-07 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4244:
---

 Summary: ClassNotFoundException while use 
org.apache.kylin.engine.mr.common.CubeStatsReader in bash
 Key: KYLIN-4244
 URL: https://issues.apache.org/jira/browse/KYLIN-4244
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang


use kylin.sh

 
{code:java}
// code placeholder
bash ./kylin.sh org.apache.kylin.engine.mr.common.CubeStatsReader 
caifeng_usage_cube
{code}
get an Exception
{code:java}
// code placeholder
Exception in thread "main" java.lang.NoClassDefFoundError: 
com/tdunning/math/stats/TDigest
at 
org.apache.kylin.measure.percentile.PercentileSerializer.current(PercentileSerializer.java:62)
at 
org.apache.kylin.measure.percentile.PercentileSerializer.getStorageBytesEstimate(PercentileSerializer.java:52)
at 
org.apache.kylin.metadata.datatype.DataType.getStorageBytesEstimate(DataType.java:256)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.estimateCuboidStorageSize(CubeStatsReader.java:251)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.getCuboidSizeMapFromRowCount(CubeStatsReader.java:211)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.getCuboidSizeMap(CubeStatsReader.java:170)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.print(CubeStatsReader.java:273)
at 
org.apache.kylin.engine.mr.common.CubeStatsReader.main(CubeStatsReader.java:435)
Caused by: java.lang.ClassNotFoundException: com.tdunning.math.stats.TDigest
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
... 8 more
{code}
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Issue Comment Deleted] (KYLIN-4241) Dynamically reload user's security configuration with properties or yaml config

2019-11-06 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4241:

Comment: was deleted

(was: I think KYLIN-4212 is a better way)

> Dynamically reload user's security configuration with properties or yaml 
> config
> ---
>
> Key: KYLIN-4241
> URL: https://issues.apache.org/jira/browse/KYLIN-4241
> Project: Kylin
>  Issue Type: Improvement
>  Components: Security
>Affects Versions: v2.6.0
> Environment: testing
>Reporter: xiang zhang
>Priority: Major
> Fix For: Future
>
> Attachments: kylin-4241-usage-instructions.pdf
>
>
> In the default kylin security authentication configuration, the user needs to 
> configure the user and password in kylinSecurity.xml. The ability to 
> dynamically load user permissions, authentication passwords, etc., increases 
> convenience and scalability.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4241) Dynamically reload user's security configuration with properties or yaml config

2019-11-06 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968959#comment-16968959
 ] 

ZhouKang commented on KYLIN-4241:
-

I think KYLIN-4212 is a better way

> Dynamically reload user's security configuration with properties or yaml 
> config
> ---
>
> Key: KYLIN-4241
> URL: https://issues.apache.org/jira/browse/KYLIN-4241
> Project: Kylin
>  Issue Type: Improvement
>  Components: Security
>Affects Versions: v2.6.0
> Environment: testing
>Reporter: xiang zhang
>Priority: Major
> Fix For: Future
>
>
> In the default kylin security authentication configuration, the user needs to 
> configure the user and password in kylinSecurity.xml. The ability to 
> dynamically load user permissions, authentication passwords, etc., increases 
> convenience and scalability.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4185) CubeStatsReader estimate wrong cube size

2019-11-06 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968205#comment-16968205
 ] 

ZhouKang commented on KYLIN-4185:
-

In our production environment,  the estimated size and real size is as follows:

cube_sample, segment: 2019110200_2019110300 

estimate size: 1736731MB  HTable size: 190701MB

cube_sample, segment: 2019100100_2019100800 (this is a merged segemtn)
estimate size: 49251MB   HTable size: 1153314MB

> CubeStatsReader estimate wrong cube size
> 
>
> Key: KYLIN-4185
> URL: https://issues.apache.org/jira/browse/KYLIN-4185
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouKang
>Priority: Major
>
> CubeStatsReader estimate wrong cube size, which cause a lot of problems.
> when the estimated size is much larger than the real size, the spark 
> application's executor number is small, and cube build step will take a long 
> time. sometime the step will failed due to the large dataset.
> When the estimated size is much smaller than the real size. the cuboid file 
> in HDFS is small, and there are much of cuboid file.
>  
> In our production environment, both the two situation happened.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4232) add sql function date_from_now

2019-11-05 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4232:

Description: SELECT "DATE", count(1) from table WHERE "DATE" = 
'date_from_now(-1, '-MM-dd') group by "DATE"  (was: SELECT \"DATE\", 
count(1) from table WHERE \"DATE\" = ''date_from_now(-1, '-MM-dd') group by 
\"DATE\")

> add sql  function date_from_now
> ---
>
> Key: KYLIN-4232
> URL: https://issues.apache.org/jira/browse/KYLIN-4232
> Project: Kylin
>  Issue Type: New Feature
>Reporter: ZhouKang
>Priority: Major
>
> SELECT "DATE", count(1) from table WHERE "DATE" = 'date_from_now(-1, 
> '-MM-dd') group by "DATE"



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4232) add sql function date_from_now

2019-11-05 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4232:

Description: SELECT "DATE", count(1) from table WHERE "DATE" = 
date_from_now(-1, '-MM-dd') group by "DATE"  (was: SELECT "DATE", count(1) 
from table WHERE "DATE" = 'date_from_now(-1, '-MM-dd') group by "DATE")

> add sql  function date_from_now
> ---
>
> Key: KYLIN-4232
> URL: https://issues.apache.org/jira/browse/KYLIN-4232
> Project: Kylin
>  Issue Type: New Feature
>Reporter: ZhouKang
>Priority: Major
>
> SELECT "DATE", count(1) from table WHERE "DATE" = date_from_now(-1, 
> '-MM-dd') group by "DATE"



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4234) add a project extractor for Kylin admin

2019-11-01 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4234:
---

 Summary: add a project extractor for Kylin admin
 Key: KYLIN-4234
 URL: https://issues.apache.org/jira/browse/KYLIN-4234
 Project: Kylin
  Issue Type: New Feature
Reporter: ZhouKang


the data in Kylin metastore is not relational, so when there are many projects 
or models in the system, it's hard to get all information.

sometimes we need feature such as:
 * project's email list
 * cube list and status (last modify time)

 
 Suggestions are welcomed, thanks~



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4233) simple time schedule module for kylin job

2019-11-01 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4233:
---

 Summary: simple time schedule module for kylin job
 Key: KYLIN-4233
 URL: https://issues.apache.org/jira/browse/KYLIN-4233
 Project: Kylin
  Issue Type: New Feature
Reporter: ZhouKang


add a simple time schedule module, so user can use kylin instead of third-party 
schedule system



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4232) add sql function date_from_now

2019-11-01 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4232:
---

 Summary: add sql  function date_from_now
 Key: KYLIN-4232
 URL: https://issues.apache.org/jira/browse/KYLIN-4232
 Project: Kylin
  Issue Type: New Feature
Reporter: ZhouKang


SELECT \"DATE\", count(1) from table WHERE \"DATE\" = ''date_from_now(-1, 
'-MM-dd') group by \"DATE\"



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4231) write conflict while add model by different query server parallelly

2019-11-01 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4231:
---

 Summary: write conflict while add model by different query server 
parallelly
 Key: KYLIN-4231
 URL: https://issues.apache.org/jira/browse/KYLIN-4231
 Project: Kylin
  Issue Type: New Feature
Reporter: ZhouKang


a kylin cluster have more than 1 query server, all of them are the backend 
server of nginx.

when our user use RESTful API to create model in the *same* project parallelly, 
there will be a problem.

the server returns:
{code:java}
// code placeholder
Overwriting conflict /project/learn_kylin.json, expect old TS 1572596034269, 
but it is 1572596042929
{code}
BUT, the model '/model_desc/xxx.json' has already in metastore, so the next 
time our user want

to retry (create model), he will get:
{code:java}
// code placeholder
Overwriting conflict /model_desc/test_945.json, expect old TS 0, but it is 
1572596193812
{code}
I think the problem is that:

when server A receive PUT model, it will change the project info, but the cache 
update broadcast cannot be processed by server B before server B processing 
another model creation request.

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4227) add segment build callback

2019-10-30 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4227:
---

 Summary: add segment build callback
 Key: KYLIN-4227
 URL: https://issues.apache.org/jira/browse/KYLIN-4227
 Project: Kylin
  Issue Type: New Feature
Reporter: ZhouKang


add HTTP callback for cube, some other system could use this feature to add a 
hook when segment building done.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4226) Skip current unavailable tables when updating hbase coprocessor

2019-10-29 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4226:
---

 Summary: Skip current unavailable tables when updating hbase 
coprocessor
 Key: KYLIN-4226
 URL: https://issues.apache.org/jira/browse/KYLIN-4226
 Project: Kylin
  Issue Type: Improvement
Reporter: ZhouKang


when update hbase coprocessor for htable, there are exceptions cause by 
unavailable tables.
{code:java}
// code placeholder
2019-08-22 09:12:20,445 INFO  [main] util.DeployCoprocessorCLI:181 : Commit 
Information: 2945cb67bae6242f275a038e7bf872a7a9797464;
Exception in thread "main" org.apache.hadoop.hbase.TableNotFoundException: 
kylin_zjyprc_bigdata_staging:KYLIN_UCHB8D2MVP
at 
org.apache.hadoop.hbase.client.HBaseAdmin.getTableDescriptor(HBaseAdmin.java:567)
at 
org.apache.hadoop.hbase.client.HBaseAdmin.getDescriptor(HBaseAdmin.java:365)
at 
org.apache.kylin.storage.hbase.util.DeployCoprocessorCLI.filterByGitCommit(DeployCoprocessorCLI.java:183)
at 
org.apache.kylin.storage.hbase.util.DeployCoprocessorCLI.main(DeployCoprocessorCLI.java:131)
2019-08-22 09:12:21,207 INFO  [close-hbase-conn] hbase.HBaseConnection:137 : 
Closing HBase connections...
2019-08-22 09:12:21,208 INFO  [close-hbase-conn] 
client.ConnectionImplementation:1889 : Closing master protocol: MasterService
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4225) unclosed hive session cause too many temp file

2019-10-29 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4225:

Description: 
in org.apache.kylin.metrics.lib.impl.hive.HiveProducer, when need add a new 
partition to metrics data, a SessionState will be started , and unclosed after 
use.

which cause many temp dir in HDFS (\{hadoop.tmp.dir}/tmp/hive/xxx).

I think the session should be stoped manually

  was:
in org.apache.kylin.metrics.lib.impl.hive.HiveProducer, when need add a new 
partition to metrics data, a SessionState will be started , and unclosed after 
use.

which cause many temp dir in HDFS (\{ROOT}/tmp/hive/xxx).

I think the session should be stoped manually


> unclosed hive session cause too many temp file
> --
>
> Key: KYLIN-4225
> URL: https://issues.apache.org/jira/browse/KYLIN-4225
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouKang
>Priority: Major
>
> in org.apache.kylin.metrics.lib.impl.hive.HiveProducer, when need add a new 
> partition to metrics data, a SessionState will be started , and unclosed 
> after use.
> which cause many temp dir in HDFS (\{hadoop.tmp.dir}/tmp/hive/xxx).
> I think the session should be stoped manually



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4225) unclosed hive session cause too many temp file

2019-10-29 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4225:
---

 Summary: unclosed hive session cause too many temp file
 Key: KYLIN-4225
 URL: https://issues.apache.org/jira/browse/KYLIN-4225
 Project: Kylin
  Issue Type: Improvement
Reporter: ZhouKang


in org.apache.kylin.metrics.lib.impl.hive.HiveProducer, when need add a new 
partition to metrics data, a SessionState will be started , and unclosed after 
use.

which cause many temp dir in HDFS (\{ROOT}/tmp/hive/xxx).

I think the session should be stoped manually



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4139) Compatible old user security xml config when user upgrate new kylin version

2019-10-29 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961967#comment-16961967
 ] 

ZhouKang commented on KYLIN-4139:
-

I notice that, in KYLIN-3531 (https://issues.apache.org/jira/browse/KYLIN-3531) 
save uppercase username in metadata,

and  then in KYLIN-4103 (https://issues.apache.org/jira/browse/KYLIN-4103) make 
user name in granting operation of project is case insensitive

but in this issue, username in metadata is  original. 

what is the standard rule for username?

>  Compatible old user security xml config when user upgrate new kylin version
> 
>
> Key: KYLIN-4139
> URL: https://issues.apache.org/jira/browse/KYLIN-4139
> Project: Kylin
>  Issue Type: Improvement
>Reporter: luguosheng
>Assignee: luguosheng
>Priority: Major
> Fix For: v3.0.0-beta
>
>
>          
>  Forward compatible
>  # when user config 'spring.profiles.active=testing', sync old version user 
> metadata with user config in kylinSecurity.xml
> Other improvement
>  # Remove the logic of auto trans new user name to uppercase



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (KYLIN-4139) Compatible old user security xml config when user upgrate new kylin version

2019-10-29 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961967#comment-16961967
 ] 

ZhouKang edited comment on KYLIN-4139 at 10/29/19 12:38 PM:


I notice that, in KYLIN-3531 save uppercase username in metadata,

and  then in KYLIN-4103  make user name in granting operation of project is 
case insensitive

but in this issue, username in metadata is  original. 

what is the standard rule for username?


was (Author: zhoukangcn):
I notice that, in KYLIN-3531 (https://issues.apache.org/jira/browse/KYLIN-3531) 
save uppercase username in metadata,

and  then in KYLIN-4103 (https://issues.apache.org/jira/browse/KYLIN-4103) make 
user name in granting operation of project is case insensitive

but in this issue, username in metadata is  original. 

what is the standard rule for username?

>  Compatible old user security xml config when user upgrate new kylin version
> 
>
> Key: KYLIN-4139
> URL: https://issues.apache.org/jira/browse/KYLIN-4139
> Project: Kylin
>  Issue Type: Improvement
>Reporter: luguosheng
>Assignee: luguosheng
>Priority: Major
> Fix For: v3.0.0-beta
>
>
>          
>  Forward compatible
>  # when user config 'spring.profiles.active=testing', sync old version user 
> metadata with user config in kylinSecurity.xml
> Other improvement
>  # Remove the logic of auto trans new user name to uppercase



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4207) CubeMetadataValidator exception

2019-10-27 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16960801#comment-16960801
 ] 

ZhouKang commented on KYLIN-4207:
-

see: [https://github.com/apache/kylin/pull/900]

> CubeMetadataValidator exception
> ---
>
> Key: KYLIN-4207
> URL: https://issues.apache.org/jira/browse/KYLIN-4207
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouKang
>Priority: Major
>
> {code:java}
> // code placeholder
> 2019-10-28 11:19:08,583 ERROR [http-nio-7070-exec-9] 
> validation.CubeMetadataValidator:56 : Construct cube metadata validator rule: 
> org.apache.kylin.cube.model.validation.rule.CubeInfoRule failed. Ignore this 
> rule
> java.lang.UnsupportedOperationException
> at java.util.AbstractList.add(AbstractList.java:148)
> at java.util.AbstractList.add(AbstractList.java:108)
> at 
> org.apache.kylin.cube.model.validation.CubeMetadataValidator.(CubeMetadataValidator.java:54)
> {code}
> should not use Arrays.asList() in CubeMetadataValidator.java:49, which will 
> cause UnsupportedOperationExcepiton
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KYLIN-4207) CubeMetadataValidator exception

2019-10-27 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4207:

Description: 
{code:java}
// code placeholder
2019-10-28 11:19:08,583 ERROR [http-nio-7070-exec-9] 
validation.CubeMetadataValidator:56 : Construct cube metadata validator rule: 
org.apache.kylin.cube.model.validation.rule.CubeInfoRule failed. Ignore this 
rule
java.lang.UnsupportedOperationException
at java.util.AbstractList.add(AbstractList.java:148)
at java.util.AbstractList.add(AbstractList.java:108)
at 
org.apache.kylin.cube.model.validation.CubeMetadataValidator.(CubeMetadataValidator.java:54)
{code}
should not use Arrays.asList() in CubeMetadataValidator.java:49, which will 
cause UnsupportedOperationExcepiton

 

  was:
{code:java}
// code placeholder
2019-10-28 11:19:08,583 ERROR [http-nio-7070-exec-9] 
validation.CubeMetadataValidator:56 : Construct cube metadata validator rule: 
org.apache.kylin.cube.model.validation.rule.CubeInfoRule failed. Ignore this 
rule
java.lang.UnsupportedOperationException
at java.util.AbstractList.add(AbstractList.java:148)
at java.util.AbstractList.add(AbstractList.java:108)
at 
org.apache.kylin.cube.model.validation.CubeMetadataValidator.(CubeMetadataValidator.java:54)
{code}
should not use Arrays.asList() in CubeMetadataValidator.java:49, which will 
cause UnsupportedOperationExcepiton

 

it is 


> CubeMetadataValidator exception
> ---
>
> Key: KYLIN-4207
> URL: https://issues.apache.org/jira/browse/KYLIN-4207
> Project: Kylin
>  Issue Type: Improvement
>Reporter: ZhouKang
>Priority: Major
>
> {code:java}
> // code placeholder
> 2019-10-28 11:19:08,583 ERROR [http-nio-7070-exec-9] 
> validation.CubeMetadataValidator:56 : Construct cube metadata validator rule: 
> org.apache.kylin.cube.model.validation.rule.CubeInfoRule failed. Ignore this 
> rule
> java.lang.UnsupportedOperationException
> at java.util.AbstractList.add(AbstractList.java:148)
> at java.util.AbstractList.add(AbstractList.java:108)
> at 
> org.apache.kylin.cube.model.validation.CubeMetadataValidator.(CubeMetadataValidator.java:54)
> {code}
> should not use Arrays.asList() in CubeMetadataValidator.java:49, which will 
> cause UnsupportedOperationExcepiton
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4185) CubeStatsReader estimate wrong cube size

2019-09-30 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4185:
---

 Summary: CubeStatsReader estimate wrong cube size
 Key: KYLIN-4185
 URL: https://issues.apache.org/jira/browse/KYLIN-4185
 Project: Kylin
  Issue Type: Improvement
Reporter: ZhouKang


CubeStatsReader estimate wrong cube size, which cause a lot of problems.

when the estimated size is much larger than the real size, the spark 
application's executor number is small, and cube build step will take a long 
time. sometime the step will failed due to the large dataset.

When the estimated size is much smaller than the real size. the cuboid file in 
HDFS is small, and there are much of cuboid file.

 

In our production environment, both the two situation happened.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4174) job statistics info is incorrect

2019-09-26 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939145#comment-16939145
 ] 

ZhouKang commented on KYLIN-4174:
-

[~wangrupeng] 

sorry, miss description in issue. you should choose a project first.

> job statistics info is incorrect
> 
>
> Key: KYLIN-4174
> URL: https://issues.apache.org/jira/browse/KYLIN-4174
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Assignee: luguosheng
>Priority: Major
> Attachments: image-2019-09-27-12-27-50-336.png, 
> image-2019-09-27-12-27-50-393.png, image3.png
>
>
> Kylin: master branch
> The job statistics info is incorrect. The number behind type is not equal to 
> the size of job list.
> Step 1: login kylin webui, open the Monitor
> Step 2: refresh the web page
>  
> You can see, the query params of RESTful API is wrong, projectName is "_null"
> {code:java}
> // code placeholder
> curl 
> 'http://zjy-bigdata-prc-kylin04.bj:7070/kylin/api/jobs/overview?jobSearchMode=ALL&projectName=_null&timeFilter=1'
>  
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4178) Job scheduler support safe mode

2019-09-25 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4178:
---

 Summary: Job scheduler support safe mode 
 Key: KYLIN-4178
 URL: https://issues.apache.org/jira/browse/KYLIN-4178
 Project: Kylin
  Issue Type: Improvement
Reporter: ZhouKang


Job scheduler should support safe mode in case of the HBase cluster change.

In xiaomi, we want update the HBase cluster from hbase0.98 to HBase 2.0. The 
history data can be migrated previously, but the job has been submitted will 
keep running and write data to the old cluster. So we need a method to ensure 
that, job create htable in the old cluster will write data to the old cluster, 
and the job have not create htable should not be scheduled.

So we need  job scheduler safe mode. Open safe mode before changing cluster 
config,  the  running jobs can run continuous, and the new job cannot be 
scheduled. 

After all running job finished, we can change the cluster config to the new 
one,  and rest of job can be scheduled again.

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4174) job statistics info is incorrect

2019-09-20 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4174:
---

 Summary: job statistics info is incorrect
 Key: KYLIN-4174
 URL: https://issues.apache.org/jira/browse/KYLIN-4174
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang


Kylin: master branch

The job statistics info is incorrect. The number behind type is not equal to 
the size of job list.

Step 1: login kylin webui, open the Monitor

Step 2: refresh the web page

 

You can see, the query params of RESTful API is wrong, projectName is "_null"
{code:java}
// code placeholder
curl 
'http://zjy-bigdata-prc-kylin04.bj:7070/kylin/api/jobs/overview?jobSearchMode=ALL&projectName=_null&timeFilter=1'
 
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-3973) InvalidProtocolBufferException: Protocol message was too large. May be malicious.

2019-09-19 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-3973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16933326#comment-16933326
 ] 

ZhouKang commented on KYLIN-3973:
-

glad to hear that

> InvalidProtocolBufferException: Protocol message was too large.  May be 
> malicious.
> --
>
> Key: KYLIN-3973
> URL: https://issues.apache.org/jira/browse/KYLIN-3973
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.6.1
>Reporter: Grzegorz Kołakowski
>Priority: Major
>
> For many queries I receive the following exception.
> {noformat}
> 2019-04-23 11:33:15,576 WARN  [kylin-coproc--pool6-t17] 
> client.SyncCoprocessorRpcChannel:54 : Call failed on IOException
> com.google.protobuf.InvalidProtocolBufferException: Protocol message was too 
> large.  May be malicious.  Use CodedInputStream.setSizeLimit() to increase 
> the size limit.
> at 
> com.google.protobuf.InvalidProtocolBufferException.sizeLimitExceeded(InvalidProtocolBufferException.java:110)
> at 
> com.google.protobuf.CodedInputStream.refillBuffer(CodedInputStream.java:755)
> at 
> com.google.protobuf.CodedInputStream.isAtEnd(CodedInputStream.java:701)
> at 
> com.google.protobuf.CodedInputStream.readTag(CodedInputStream.java:99)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2307)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2271)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2380)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2375)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:5101)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:4949)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:337)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> com.google.protobuf.AbstractMessageLite$Builder.mergeFrom(AbstractMessageLite.java:210)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:904)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> org.apache.hadoop.hbase.ipc.CoprocessorRpcUtils.getResponse(CoprocessorRpcUtils.java:141)
> at 
> org.apache.hadoop.hbase.client.RegionCoprocessorRpcChannel.callExecService(RegionCoprocessorRpcChannel.java:94)
> at 
> org.apache.hadoop.hbase.client.SyncCoprocessorRpcChannel.callMethod(SyncCoprocessorRpcChannel.java:52)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitService$Stub.visitCube(CubeVisitProtos.java:5616)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:246)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:242)
> at org.apache.hadoop.hbase.client.HTable$12.call(HTable.java:1012)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}
> I use lz4 compression algorithm in HBase.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-3973) InvalidProtocolBufferException: Protocol message was too large. May be malicious.

2019-09-18 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-3973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16932395#comment-16932395
 ] 

ZhouKang commented on KYLIN-3973:
-

Try to replace other package contain CubeVisitProtos. I'm not sure which file 
it used. you can use java --verbose:class to findout.
Otherwise you can try to replace the whole environment

 

> InvalidProtocolBufferException: Protocol message was too large.  May be 
> malicious.
> --
>
> Key: KYLIN-3973
> URL: https://issues.apache.org/jira/browse/KYLIN-3973
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.6.1
>Reporter: Grzegorz Kołakowski
>Priority: Major
>
> For many queries I receive the following exception.
> {noformat}
> 2019-04-23 11:33:15,576 WARN  [kylin-coproc--pool6-t17] 
> client.SyncCoprocessorRpcChannel:54 : Call failed on IOException
> com.google.protobuf.InvalidProtocolBufferException: Protocol message was too 
> large.  May be malicious.  Use CodedInputStream.setSizeLimit() to increase 
> the size limit.
> at 
> com.google.protobuf.InvalidProtocolBufferException.sizeLimitExceeded(InvalidProtocolBufferException.java:110)
> at 
> com.google.protobuf.CodedInputStream.refillBuffer(CodedInputStream.java:755)
> at 
> com.google.protobuf.CodedInputStream.isAtEnd(CodedInputStream.java:701)
> at 
> com.google.protobuf.CodedInputStream.readTag(CodedInputStream.java:99)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2307)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2271)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2380)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2375)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:5101)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:4949)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:337)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> com.google.protobuf.AbstractMessageLite$Builder.mergeFrom(AbstractMessageLite.java:210)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:904)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> org.apache.hadoop.hbase.ipc.CoprocessorRpcUtils.getResponse(CoprocessorRpcUtils.java:141)
> at 
> org.apache.hadoop.hbase.client.RegionCoprocessorRpcChannel.callExecService(RegionCoprocessorRpcChannel.java:94)
> at 
> org.apache.hadoop.hbase.client.SyncCoprocessorRpcChannel.callMethod(SyncCoprocessorRpcChannel.java:52)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitService$Stub.visitCube(CubeVisitProtos.java:5616)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:246)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:242)
> at org.apache.hadoop.hbase.client.HTable$12.call(HTable.java:1012)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}
> I use lz4 compression algorithm in HBase.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (KYLIN-4169) too many logs while DataModelManager init, cause the first RESTful API hang for a long time

2019-09-18 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16932289#comment-16932289
 ] 

ZhouKang edited comment on KYLIN-4169 at 9/18/19 10:39 AM:
---

I think the problem is not caused by un-existed models. 

The stack is here:

 
{code:java}
// code placeholder
at 
org.apache.kylin.metadata.model.DataModelManager.getModels(DataModelManager.java:190)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:92)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:87)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAt(CachedCrudAssist.java:167)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadQuietlyAt(CachedCrudAssist.java:144)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAll(CachedCrudAssist.java:127)
at 
org.apache.kylin.metadata.model.DataModelManager.init(DataModelManager.java:100)
at 
org.apache.kylin.metadata.model.DataModelManager.(DataModelManager.java:80)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at 
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
at 
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
at 
org.apache.kylin.metadata.model.DataModelManager.newInstance(DataModelManager.java:61)
{code}
 

 

and the code: CachedCrudAssist.java:125-127

 
{code:java}
// code placeholder
List paths = store.collectResourceRecursively(resRootPath, 
resPathSuffix);
for (String path : paths) {
reloadQuietlyAt(path);
}
{code}
 

everytime load a single meta record, initEntityAfterReload will scan the whole 
project. At this moment, the remain models (which is valid actually) have not 
been loaded to the cache.

If there are 100 models in a project, the logs count will be 1+2+3+...+99


was (Author: zhoukangcn):
I think the problem is not caused by un-existed models. 

The stack is here:

 
{code:java}
// code placeholder
at 
org.apache.kylin.metadata.model.DataModelManager.getModels(DataModelManager.java:190)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:92)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:87)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAt(CachedCrudAssist.java:167)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadQuietlyAt(CachedCrudAssist.java:144)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAll(CachedCrudAssist.java:127)
at 
org.apache.kylin.metadata.model.DataModelManager.init(DataModelManager.java:100)
at 
org.apache.kylin.metadata.model.DataModelManager.(DataModelManager.java:80)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at 
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
at 
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
at 
org.apache.kylin.metadata.model.DataModelManager.newInstance(DataModelManager.java:61)
{code}
 

 

and the code: CachedCrudAssist.java:125-127

 
{code:java}
// code placeholder
List paths = store.collectResourceRecursively(resRootPath, 
resPathSuffix);
for (String path : paths) {
reloadQuietlyAt(path);
}
{code}
 

everytime load a single meta record, initEntityAfterReload will scan the whole 
project. At this moment, the remain models (which is valid actually) have not 
been loaded to the cache. 

> too many logs while DataModelManager init, cause the first RESTful API hang 
> for a long time
> ---
>
> Key: KYLIN-4169
> URL: https://issues.apache.org/jira/browse/KYLIN-4169
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> Too many logs while DataModelManager init. It is will happend after startup 
> or reload metastore.
> In our production enviroment,there are 120,000+ lines.
> {code:java}
> // code placeholder
> grep 'is missing or unloaded yet' kylin.log.8|wc -l
> 121861
> {code}
>  
> which cause the first sql query's time up to 40s.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (KYLIN-4169) too many logs while DataModelManager init, cause the first RESTful API hang for a long time

2019-09-18 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16932289#comment-16932289
 ] 

ZhouKang edited comment on KYLIN-4169 at 9/18/19 10:39 AM:
---

I think the problem is not caused by un-existed models. 

The stack is here:

 
{code:java}
// code placeholder
at 
org.apache.kylin.metadata.model.DataModelManager.getModels(DataModelManager.java:190)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:92)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:87)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAt(CachedCrudAssist.java:167)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadQuietlyAt(CachedCrudAssist.java:144)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAll(CachedCrudAssist.java:127)
at 
org.apache.kylin.metadata.model.DataModelManager.init(DataModelManager.java:100)
at 
org.apache.kylin.metadata.model.DataModelManager.(DataModelManager.java:80)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at 
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
at 
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
at 
org.apache.kylin.metadata.model.DataModelManager.newInstance(DataModelManager.java:61)
{code}
 

 

and the code: CachedCrudAssist.java:125-127

 
{code:java}
// code placeholder
List paths = store.collectResourceRecursively(resRootPath, 
resPathSuffix);
for (String path : paths) {
reloadQuietlyAt(path);
}
{code}
 

everytime load a single meta record, initEntityAfterReload will scan the whole 
project. At this moment, the remain models (which is valid actually) have not 
been loaded to the cache.

If there are 100 models in a project, the logs count will be 99+98+97+...+1


was (Author: zhoukangcn):
I think the problem is not caused by un-existed models. 

The stack is here:

 
{code:java}
// code placeholder
at 
org.apache.kylin.metadata.model.DataModelManager.getModels(DataModelManager.java:190)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:92)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:87)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAt(CachedCrudAssist.java:167)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadQuietlyAt(CachedCrudAssist.java:144)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAll(CachedCrudAssist.java:127)
at 
org.apache.kylin.metadata.model.DataModelManager.init(DataModelManager.java:100)
at 
org.apache.kylin.metadata.model.DataModelManager.(DataModelManager.java:80)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at 
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
at 
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
at 
org.apache.kylin.metadata.model.DataModelManager.newInstance(DataModelManager.java:61)
{code}
 

 

and the code: CachedCrudAssist.java:125-127

 
{code:java}
// code placeholder
List paths = store.collectResourceRecursively(resRootPath, 
resPathSuffix);
for (String path : paths) {
reloadQuietlyAt(path);
}
{code}
 

everytime load a single meta record, initEntityAfterReload will scan the whole 
project. At this moment, the remain models (which is valid actually) have not 
been loaded to the cache.

If there are 100 models in a project, the logs count will be 1+2+3+...+99

> too many logs while DataModelManager init, cause the first RESTful API hang 
> for a long time
> ---
>
> Key: KYLIN-4169
> URL: https://issues.apache.org/jira/browse/KYLIN-4169
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> Too many logs while DataModelManager init. It is will happend after startup 
> or reload metastore.
> In our production enviroment,there are 120,000+ lines.
> {code:java}
> // code placeholder
> grep 'is missing or unloaded yet' kylin.log.8|wc -l
> 121861
> {code}
>  
> which cause the first sql query's time up to 40s.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4169) too many logs while DataModelManager init, cause the first RESTful API hang for a long time

2019-09-18 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16932289#comment-16932289
 ] 

ZhouKang commented on KYLIN-4169:
-

I think the problem is not caused by un-existed models. 

The stack is here:

 
{code:java}
// code placeholder
at 
org.apache.kylin.metadata.model.DataModelManager.getModels(DataModelManager.java:190)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:92)
at 
org.apache.kylin.metadata.model.DataModelManager$1.initEntityAfterReload(DataModelManager.java:87)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAt(CachedCrudAssist.java:167)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadQuietlyAt(CachedCrudAssist.java:144)
at 
org.apache.kylin.metadata.cachesync.CachedCrudAssist.reloadAll(CachedCrudAssist.java:127)
at 
org.apache.kylin.metadata.model.DataModelManager.init(DataModelManager.java:100)
at 
org.apache.kylin.metadata.model.DataModelManager.(DataModelManager.java:80)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at 
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
at 
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
at 
org.apache.kylin.metadata.model.DataModelManager.newInstance(DataModelManager.java:61)
{code}
 

 

and the code: CachedCrudAssist.java:125-127

 
{code:java}
// code placeholder
List paths = store.collectResourceRecursively(resRootPath, 
resPathSuffix);
for (String path : paths) {
reloadQuietlyAt(path);
}
{code}
 

everytime load a single meta record, initEntityAfterReload will scan the whole 
project. At this moment, the remain models (which is valid actually) have not 
been loaded to the cache. 

> too many logs while DataModelManager init, cause the first RESTful API hang 
> for a long time
> ---
>
> Key: KYLIN-4169
> URL: https://issues.apache.org/jira/browse/KYLIN-4169
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> Too many logs while DataModelManager init. It is will happend after startup 
> or reload metastore.
> In our production enviroment,there are 120,000+ lines.
> {code:java}
> // code placeholder
> grep 'is missing or unloaded yet' kylin.log.8|wc -l
> 121861
> {code}
>  
> which cause the first sql query's time up to 40s.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-3973) InvalidProtocolBufferException: Protocol message was too large. May be malicious.

2019-09-17 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-3973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16932075#comment-16932075
 ] 

ZhouKang commented on KYLIN-3973:
-

[~shaoxiaowei] If you are using HBase>2.0.0, you can fix it by 
[https://github.com/apache/kylin/pull/837]

This a quick fix.

> InvalidProtocolBufferException: Protocol message was too large.  May be 
> malicious.
> --
>
> Key: KYLIN-3973
> URL: https://issues.apache.org/jira/browse/KYLIN-3973
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.6.1
>Reporter: Grzegorz Kołakowski
>Priority: Major
>
> For many queries I receive the following exception.
> {noformat}
> 2019-04-23 11:33:15,576 WARN  [kylin-coproc--pool6-t17] 
> client.SyncCoprocessorRpcChannel:54 : Call failed on IOException
> com.google.protobuf.InvalidProtocolBufferException: Protocol message was too 
> large.  May be malicious.  Use CodedInputStream.setSizeLimit() to increase 
> the size limit.
> at 
> com.google.protobuf.InvalidProtocolBufferException.sizeLimitExceeded(InvalidProtocolBufferException.java:110)
> at 
> com.google.protobuf.CodedInputStream.refillBuffer(CodedInputStream.java:755)
> at 
> com.google.protobuf.CodedInputStream.isAtEnd(CodedInputStream.java:701)
> at 
> com.google.protobuf.CodedInputStream.readTag(CodedInputStream.java:99)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2307)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2271)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2380)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2375)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:5101)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:4949)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:337)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> com.google.protobuf.AbstractMessageLite$Builder.mergeFrom(AbstractMessageLite.java:210)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:904)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> org.apache.hadoop.hbase.ipc.CoprocessorRpcUtils.getResponse(CoprocessorRpcUtils.java:141)
> at 
> org.apache.hadoop.hbase.client.RegionCoprocessorRpcChannel.callExecService(RegionCoprocessorRpcChannel.java:94)
> at 
> org.apache.hadoop.hbase.client.SyncCoprocessorRpcChannel.callMethod(SyncCoprocessorRpcChannel.java:52)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitService$Stub.visitCube(CubeVisitProtos.java:5616)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:246)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:242)
> at org.apache.hadoop.hbase.client.HTable$12.call(HTable.java:1012)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}
> I use lz4 compression algorithm in HBase.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4169) too many logs while DataModelManager init, cause the first RESTful API hang for a long time

2019-09-17 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4169:
---

 Summary: too many logs while DataModelManager init, cause the 
first RESTful API hang for a long time
 Key: KYLIN-4169
 URL: https://issues.apache.org/jira/browse/KYLIN-4169
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang


Too many logs while DataModelManager init. It is will happend after startup or 
reload metastore.

In our production enviroment,there are 120,000+ lines.
{code:java}
// code placeholder
grep 'is missing or unloaded yet' kylin.log.8|wc -l
121861
{code}
 

which cause the first sql query's time up to 40s.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KYLIN-4161) exception in update metrics when the response is null

2019-09-08 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16925296#comment-16925296
 ] 

ZhouKang commented on KYLIN-4161:
-

Maybe this bug is fixed in Kylin:master. 

I found that, QueryService use buildSqlResponse()

 

> exception in update metrics when the response is null 
> --
>
> Key: KYLIN-4161
> URL: https://issues.apache.org/jira/browse/KYLIN-4161
> Project: Kylin
>  Issue Type: Bug
>Reporter: ZhouKang
>Priority: Major
>
> when something error in sql query, the response‘s cube and results is null, 
> which cause an exception.
> {code:java}
> // code placeholder
> service.QueryService:432 : Write metric error.java.lang.NullPointerException  
>   at 
> org.apache.kylin.rest.metrics.QueryMetricsFacade.updateMetricsToReservoir(QueryMetricsFacade.java:158)
>     at 
> org.apache.kylin.rest.metrics.QueryMetricsFacade.updateMetrics(QueryMetricsFacade.java:77)
> {code}
> {code:java}
> // code placeholder
> service.QueryService:432 : Write metric error.java.lang.NullPointerException  
>   at 
> org.apache.kylin.rest.metrics.QueryMetricsFacade.updateMetricsToReservoir(QueryMetricsFacade.java:89)
>   
> {code}



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Created] (KYLIN-4161) exception in update metrics when the response is null

2019-09-06 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4161:
---

 Summary: exception in update metrics when the response is null 
 Key: KYLIN-4161
 URL: https://issues.apache.org/jira/browse/KYLIN-4161
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang


when something error in sql query, the response‘s cube and results is null, 
which cause an exception.
{code:java}
// code placeholder
service.QueryService:432 : Write metric error.java.lang.NullPointerException
    at 
org.apache.kylin.rest.metrics.QueryMetricsFacade.updateMetricsToReservoir(QueryMetricsFacade.java:158)
    at 
org.apache.kylin.rest.metrics.QueryMetricsFacade.updateMetrics(QueryMetricsFacade.java:77)
{code}
{code:java}
// code placeholder
service.QueryService:432 : Write metric error.java.lang.NullPointerException
    at 
org.apache.kylin.rest.metrics.QueryMetricsFacade.updateMetricsToReservoir(QueryMetricsFacade.java:89)
  
{code}



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Issue Comment Deleted] (KYLIN-3973) InvalidProtocolBufferException: Protocol message was too large. May be malicious.

2019-09-04 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-3973:

Comment: was deleted

(was: I think CodedInputStream's sizelimit should be seted in CubeVisitProtos. )

> InvalidProtocolBufferException: Protocol message was too large.  May be 
> malicious.
> --
>
> Key: KYLIN-3973
> URL: https://issues.apache.org/jira/browse/KYLIN-3973
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.6.1
>Reporter: Grzegorz Kołakowski
>Priority: Major
>
> For many queries I receive the following exception.
> {noformat}
> 2019-04-23 11:33:15,576 WARN  [kylin-coproc--pool6-t17] 
> client.SyncCoprocessorRpcChannel:54 : Call failed on IOException
> com.google.protobuf.InvalidProtocolBufferException: Protocol message was too 
> large.  May be malicious.  Use CodedInputStream.setSizeLimit() to increase 
> the size limit.
> at 
> com.google.protobuf.InvalidProtocolBufferException.sizeLimitExceeded(InvalidProtocolBufferException.java:110)
> at 
> com.google.protobuf.CodedInputStream.refillBuffer(CodedInputStream.java:755)
> at 
> com.google.protobuf.CodedInputStream.isAtEnd(CodedInputStream.java:701)
> at 
> com.google.protobuf.CodedInputStream.readTag(CodedInputStream.java:99)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2307)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2271)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2380)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2375)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:5101)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:4949)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:337)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> com.google.protobuf.AbstractMessageLite$Builder.mergeFrom(AbstractMessageLite.java:210)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:904)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> org.apache.hadoop.hbase.ipc.CoprocessorRpcUtils.getResponse(CoprocessorRpcUtils.java:141)
> at 
> org.apache.hadoop.hbase.client.RegionCoprocessorRpcChannel.callExecService(RegionCoprocessorRpcChannel.java:94)
> at 
> org.apache.hadoop.hbase.client.SyncCoprocessorRpcChannel.callMethod(SyncCoprocessorRpcChannel.java:52)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitService$Stub.visitCube(CubeVisitProtos.java:5616)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:246)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:242)
> at org.apache.hadoop.hbase.client.HTable$12.call(HTable.java:1012)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}
> I use lz4 compression algorithm in HBase.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Commented] (KYLIN-3973) InvalidProtocolBufferException: Protocol message was too large. May be malicious.

2019-09-03 Thread ZhouKang (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-3973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16921879#comment-16921879
 ] 

ZhouKang commented on KYLIN-3973:
-

I think CodedInputStream's sizelimit should be seted in CubeVisitProtos. 

> InvalidProtocolBufferException: Protocol message was too large.  May be 
> malicious.
> --
>
> Key: KYLIN-3973
> URL: https://issues.apache.org/jira/browse/KYLIN-3973
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.6.1
>Reporter: Grzegorz Kołakowski
>Priority: Major
>
> For many queries I receive the following exception.
> {noformat}
> 2019-04-23 11:33:15,576 WARN  [kylin-coproc--pool6-t17] 
> client.SyncCoprocessorRpcChannel:54 : Call failed on IOException
> com.google.protobuf.InvalidProtocolBufferException: Protocol message was too 
> large.  May be malicious.  Use CodedInputStream.setSizeLimit() to increase 
> the size limit.
> at 
> com.google.protobuf.InvalidProtocolBufferException.sizeLimitExceeded(InvalidProtocolBufferException.java:110)
> at 
> com.google.protobuf.CodedInputStream.refillBuffer(CodedInputStream.java:755)
> at 
> com.google.protobuf.CodedInputStream.isAtEnd(CodedInputStream.java:701)
> at 
> com.google.protobuf.CodedInputStream.readTag(CodedInputStream.java:99)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2307)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse.(CubeVisitProtos.java:2271)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2380)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$1.parsePartialFrom(CubeVisitProtos.java:2375)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:5101)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitResponse$Builder.mergeFrom(CubeVisitProtos.java:4949)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:337)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> com.google.protobuf.AbstractMessageLite$Builder.mergeFrom(AbstractMessageLite.java:210)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:904)
> at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:267)
> at 
> org.apache.hadoop.hbase.ipc.CoprocessorRpcUtils.getResponse(CoprocessorRpcUtils.java:141)
> at 
> org.apache.hadoop.hbase.client.RegionCoprocessorRpcChannel.callExecService(RegionCoprocessorRpcChannel.java:94)
> at 
> org.apache.hadoop.hbase.client.SyncCoprocessorRpcChannel.callMethod(SyncCoprocessorRpcChannel.java:52)
> at 
> org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitService$Stub.visitCube(CubeVisitProtos.java:5616)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:246)
> at 
> org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC$1$1.call(CubeHBaseEndpointRPC.java:242)
> at org.apache.hadoop.hbase.client.HTable$12.call(HTable.java:1012)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> at java.lang.Thread.run(Thread.java:745)
> {noformat}
> I use lz4 compression algorithm in HBase.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Updated] (KYLIN-4143) truncate spark executable job output

2019-09-02 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4143:

Attachment: KYLIN-4143.master.001.patch

> truncate spark executable job output 
> -
>
> Key: KYLIN-4143
> URL: https://issues.apache.org/jira/browse/KYLIN-4143
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.5.2
>Reporter: ZhouKang
>Priority: Major
> Attachments: KYLIN-4143.master.001.patch
>
>
>  
> truncate spark job's output when the job exec ret is not equal 0, which made 
> the execute output content too large.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Updated] (KYLIN-4143) truncate spark executable job output

2019-09-02 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4143:

Description: 
 

truncate spark job's output when the job exec ret is not equal 0, which made 
the execute output content too large.

  was:
 

truncate spark job's output when the job exec ret is not equal 0, which made 
the execute output is too large.


> truncate spark executable job output 
> -
>
> Key: KYLIN-4143
> URL: https://issues.apache.org/jira/browse/KYLIN-4143
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.5.2
>Reporter: ZhouKang
>Priority: Major
>
>  
> truncate spark job's output when the job exec ret is not equal 0, which made 
> the execute output content too large.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Updated] (KYLIN-4143) truncate spark executable job output

2019-09-02 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4143:

Description: 
 

truncate spark job's output when the job exec ret is not equal 0, which made 
the execute output is too large.

  was:
Kylin: 2.5.2

job engine: spark

 

After long time running, job executor will record plenty of content in job 
output file, which make ExecutableManager get output fail.  And the job will be 
marked as "error".
{code:java}
// code placeholder
2019-08-15 11:58:34,442 ERROR [Scheduler 1278051386 Job 
db9b01d0-fd8c-d168-8cfa-2e11d807972c-128] spark.SparkExecutable:359 : error run 
spark job:
 java.lang.RuntimeException: 
org.apache.kylin.job.exception.PersistentException: 
com.fasterxml.jackson.databind.JsonMappingException: Unexpected end-of-input in 
VALUE_STRING
 at [Source: (DataInputStream); line: 5, column: 10728969]
 at [Source: (DataInputStream); line: 5, column: 15] (through reference chain: 
org.apache.kylin.job.dao.ExecutableOutputPO["content"])
 at 
org.apache.kylin.job.execution.ExecutableManager.getOutput(ExecutableManager.java:164)
 at 
org.apache.kylin.job.execution.AbstractExecutable.getOutput(AbstractExecutable.java:414)
 at 
org.apache.kylin.job.execution.AbstractExecutable.isDiscarded(AbstractExecutable.java:525)
 at 
org.apache.kylin.engine.spark.SparkExecutable.doWork(SparkExecutable.java:304)
 at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:178)
 at 
org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:71)
 at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:178)
 at 
org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:114)
 at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
 at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
 at java.lang.Thread.run(Thread.java:748)
 Caused by: org.apache.kylin.job.exception.PersistentException: 
com.fasterxml.jackson.databind.JsonMappingException: Unexpected end-of-input in 
VALUE_STRING
 at [Source: (DataInputStream); line: 5, column: 10728969]
 at [Source: (DataInputStream); line: 5, column: 15] (through reference chain: 
org.apache.kylin.job.dao.ExecutableOutputPO["content"])
 at org.apache.kylin.job.dao.ExecutableDao.getJobOutput(ExecutableDao.java:356)
 at 
org.apache.kylin.job.execution.ExecutableManager.getOutput(ExecutableManager.java:159)

 

... 10 more
 Caused by: com.fasterxml.jackson.databind.JsonMappingException: Unexpected 
end-of-input in VALUE_STRING
 at [Source: (DataInputStream); line: 5, column: 10728969]
 at [Source: (DataInputStream); line: 5, column: 15] (through reference chain: 
org.apache.kylin.job.dao.ExecutableOutputPO["content"])
 at 
com.fasterxml.jackson.databind.JsonMappingException.wrapWithPath(JsonMappingException.java:391)
 at 
com.fasterxml.jackson.databind.JsonMappingException.wrapWithPath(JsonMappingException.java:351)
 at 
com.fasterxml.jackson.databind.deser.BeanDeserializerBase.wrapAndThrow(BeanDeserializerBase.java:1704)
 at 
com.fasterxml.jackson.databind.deser.BeanDeserializer.vanillaDeserialize(BeanDeserializer.java:290)
 at 
com.fasterxml.jackson.databind.deser.BeanDeserializer.deserialize(BeanDeserializer.java:151)
 at 
com.fasterxml.jackson.databind.ObjectMapper._readMapAndClose(ObjectMapper.java:4001)
 at 
com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:3058)
 at org.apache.kylin.common.util.JsonUtil.readValue(JsonUtil.java:69)
 at 
org.apache.kylin.common.persistence.JsonSerializer.deserialize(JsonSerializer.java:46)
 at 
org.apache.kylin.common.persistence.ResourceStore.getResource(ResourceStore.java:182)
{code}
 


> truncate spark executable job output 
> -
>
> Key: KYLIN-4143
> URL: https://issues.apache.org/jira/browse/KYLIN-4143
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.5.2
>Reporter: ZhouKang
>Priority: Major
>
>  
> truncate spark job's output when the job exec ret is not equal 0, which made 
> the execute output is too large.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Updated] (KYLIN-4143) truncate spark executable job output

2019-09-02 Thread ZhouKang (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ZhouKang updated KYLIN-4143:

Summary: truncate spark executable job output   (was: Get job status fail 
when too much content in job output)

> truncate spark executable job output 
> -
>
> Key: KYLIN-4143
> URL: https://issues.apache.org/jira/browse/KYLIN-4143
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.5.2
>Reporter: ZhouKang
>Priority: Major
>
> Kylin: 2.5.2
> job engine: spark
>  
> After long time running, job executor will record plenty of content in job 
> output file, which make ExecutableManager get output fail.  And the job will 
> be marked as "error".
> {code:java}
> // code placeholder
> 2019-08-15 11:58:34,442 ERROR [Scheduler 1278051386 Job 
> db9b01d0-fd8c-d168-8cfa-2e11d807972c-128] spark.SparkExecutable:359 : error 
> run spark job:
>  java.lang.RuntimeException: 
> org.apache.kylin.job.exception.PersistentException: 
> com.fasterxml.jackson.databind.JsonMappingException: Unexpected end-of-input 
> in VALUE_STRING
>  at [Source: (DataInputStream); line: 5, column: 10728969]
>  at [Source: (DataInputStream); line: 5, column: 15] (through reference 
> chain: org.apache.kylin.job.dao.ExecutableOutputPO["content"])
>  at 
> org.apache.kylin.job.execution.ExecutableManager.getOutput(ExecutableManager.java:164)
>  at 
> org.apache.kylin.job.execution.AbstractExecutable.getOutput(AbstractExecutable.java:414)
>  at 
> org.apache.kylin.job.execution.AbstractExecutable.isDiscarded(AbstractExecutable.java:525)
>  at 
> org.apache.kylin.engine.spark.SparkExecutable.doWork(SparkExecutable.java:304)
>  at 
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:178)
>  at 
> org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:71)
>  at 
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:178)
>  at 
> org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:114)
>  at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  at java.lang.Thread.run(Thread.java:748)
>  Caused by: org.apache.kylin.job.exception.PersistentException: 
> com.fasterxml.jackson.databind.JsonMappingException: Unexpected end-of-input 
> in VALUE_STRING
>  at [Source: (DataInputStream); line: 5, column: 10728969]
>  at [Source: (DataInputStream); line: 5, column: 15] (through reference 
> chain: org.apache.kylin.job.dao.ExecutableOutputPO["content"])
>  at 
> org.apache.kylin.job.dao.ExecutableDao.getJobOutput(ExecutableDao.java:356)
>  at 
> org.apache.kylin.job.execution.ExecutableManager.getOutput(ExecutableManager.java:159)
>  
> ... 10 more
>  Caused by: com.fasterxml.jackson.databind.JsonMappingException: Unexpected 
> end-of-input in VALUE_STRING
>  at [Source: (DataInputStream); line: 5, column: 10728969]
>  at [Source: (DataInputStream); line: 5, column: 15] (through reference 
> chain: org.apache.kylin.job.dao.ExecutableOutputPO["content"])
>  at 
> com.fasterxml.jackson.databind.JsonMappingException.wrapWithPath(JsonMappingException.java:391)
>  at 
> com.fasterxml.jackson.databind.JsonMappingException.wrapWithPath(JsonMappingException.java:351)
>  at 
> com.fasterxml.jackson.databind.deser.BeanDeserializerBase.wrapAndThrow(BeanDeserializerBase.java:1704)
>  at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.vanillaDeserialize(BeanDeserializer.java:290)
>  at 
> com.fasterxml.jackson.databind.deser.BeanDeserializer.deserialize(BeanDeserializer.java:151)
>  at 
> com.fasterxml.jackson.databind.ObjectMapper._readMapAndClose(ObjectMapper.java:4001)
>  at 
> com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:3058)
>  at org.apache.kylin.common.util.JsonUtil.readValue(JsonUtil.java:69)
>  at 
> org.apache.kylin.common.persistence.JsonSerializer.deserialize(JsonSerializer.java:46)
>  at 
> org.apache.kylin.common.persistence.ResourceStore.getResource(ResourceStore.java:182)
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Created] (KYLIN-4144) Cannot update user's info because of case-sensitive process

2019-08-20 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4144:
---

 Summary: Cannot update user's info because of case-sensitive 
process
 Key: KYLIN-4144
 URL: https://issues.apache.org/jira/browse/KYLIN-4144
 Project: Kylin
  Issue Type: Bug
Reporter: ZhouKang
 Fix For: v2.5.3


In https://issues.apache.org/jira/browse/KYLIN-3531 , uppercase of usernames 
will be saved in HBase, which make our case faild.

 

when users was added useing kylin:v2.4.0 (maybe older), the rawkey in hbase is 
lowercase, when update user's info in Kylin:v2.5.3, there is en error.
{code:java}
// code placeholder
Overwriting conflict /user/, expect old TS xxx, but it is 0, the expected 
new TS: {code}



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Created] (KYLIN-4143) Get job status fail when too much content in job output

2019-08-20 Thread ZhouKang (Jira)
ZhouKang created KYLIN-4143:
---

 Summary: Get job status fail when too much content in job output
 Key: KYLIN-4143
 URL: https://issues.apache.org/jira/browse/KYLIN-4143
 Project: Kylin
  Issue Type: Bug
Affects Versions: v2.5.2
Reporter: ZhouKang


Kylin: 2.5.2

job engine: spark

 

After long time running, job executor will record plenty of content in job 
output file, which make ExecutableManager get output fail.  And the job will be 
marked as "error".
{code:java}
// code placeholder
2019-08-15 11:58:34,442 ERROR [Scheduler 1278051386 Job 
db9b01d0-fd8c-d168-8cfa-2e11d807972c-128] spark.SparkExecutable:359 : error run 
spark job:
 java.lang.RuntimeException: 
org.apache.kylin.job.exception.PersistentException: 
com.fasterxml.jackson.databind.JsonMappingException: Unexpected end-of-input in 
VALUE_STRING
 at [Source: (DataInputStream); line: 5, column: 10728969]
 at [Source: (DataInputStream); line: 5, column: 15] (through reference chain: 
org.apache.kylin.job.dao.ExecutableOutputPO["content"])
 at 
org.apache.kylin.job.execution.ExecutableManager.getOutput(ExecutableManager.java:164)
 at 
org.apache.kylin.job.execution.AbstractExecutable.getOutput(AbstractExecutable.java:414)
 at 
org.apache.kylin.job.execution.AbstractExecutable.isDiscarded(AbstractExecutable.java:525)
 at 
org.apache.kylin.engine.spark.SparkExecutable.doWork(SparkExecutable.java:304)
 at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:178)
 at 
org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:71)
 at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:178)
 at 
org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:114)
 at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
 at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
 at java.lang.Thread.run(Thread.java:748)
 Caused by: org.apache.kylin.job.exception.PersistentException: 
com.fasterxml.jackson.databind.JsonMappingException: Unexpected end-of-input in 
VALUE_STRING
 at [Source: (DataInputStream); line: 5, column: 10728969]
 at [Source: (DataInputStream); line: 5, column: 15] (through reference chain: 
org.apache.kylin.job.dao.ExecutableOutputPO["content"])
 at org.apache.kylin.job.dao.ExecutableDao.getJobOutput(ExecutableDao.java:356)
 at 
org.apache.kylin.job.execution.ExecutableManager.getOutput(ExecutableManager.java:159)

 

... 10 more
 Caused by: com.fasterxml.jackson.databind.JsonMappingException: Unexpected 
end-of-input in VALUE_STRING
 at [Source: (DataInputStream); line: 5, column: 10728969]
 at [Source: (DataInputStream); line: 5, column: 15] (through reference chain: 
org.apache.kylin.job.dao.ExecutableOutputPO["content"])
 at 
com.fasterxml.jackson.databind.JsonMappingException.wrapWithPath(JsonMappingException.java:391)
 at 
com.fasterxml.jackson.databind.JsonMappingException.wrapWithPath(JsonMappingException.java:351)
 at 
com.fasterxml.jackson.databind.deser.BeanDeserializerBase.wrapAndThrow(BeanDeserializerBase.java:1704)
 at 
com.fasterxml.jackson.databind.deser.BeanDeserializer.vanillaDeserialize(BeanDeserializer.java:290)
 at 
com.fasterxml.jackson.databind.deser.BeanDeserializer.deserialize(BeanDeserializer.java:151)
 at 
com.fasterxml.jackson.databind.ObjectMapper._readMapAndClose(ObjectMapper.java:4001)
 at 
com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:3058)
 at org.apache.kylin.common.util.JsonUtil.readValue(JsonUtil.java:69)
 at 
org.apache.kylin.common.persistence.JsonSerializer.deserialize(JsonSerializer.java:46)
 at 
org.apache.kylin.common.persistence.ResourceStore.getResource(ResourceStore.java:182)
{code}
 



--
This message was sent by Atlassian Jira
(v8.3.2#803003)