[jira] [Created] (KYLIN-3828) ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty data in its first dimension

2019-02-25 Thread nichunen (JIRA)
nichunen created KYLIN-3828:
---

 Summary: ArrayIndexOutOfBoundsException thrown when build a 
streaming cube with empty data in its first dimension
 Key: KYLIN-3828
 URL: https://issues.apache.org/jira/browse/KYLIN-3828
 Project: Kylin
  Issue Type: Bug
  Components: NRT Streaming
Affects Versions: v2.6.0
Reporter: nichunen
Assignee: nichunen
 Fix For: Future


The root cause is 
in org.apache.kylin.source.kafka.hadoop.KafkaFlatTableMapper#doMap
{code:java}
public void doMap(LongWritable key, BytesWritable value, Context context) 
throws IOException, InterruptedException {
ByteBuffer buffer = ByteBuffer.wrap(value.getBytes(), 0, 
value.getLength());
StreamingMessageRow row = streamingParser.parse(buffer).get(0);
if (row == null) {
throw new IllegalArgumentException("");
}

data = StringUtil.join(row.getData(), delimiter);
// output this row to value
outValue.set(Bytes.toBytes(data));
context.write(outKey, outValue);
}
{code}
Method _StringUtil.join_ is used to join a collection of string to a single 
string with delimiter. But _org.apache.kylin.common.util.StringUtil#join_ will 
ignore the leading empty strings in the colletion, source code is:
{code:java}
public static String join(Iterable parts, String separator) {
StringBuilder buf = new StringBuilder();
for (String p : parts) {
if (buf.length() > 0)
buf.append(separator);
buf.append(p);
}
return buf.toString();
}
{code}





--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] nichunen opened a new pull request #482: KYLIN-3828 Fix heading empty string be ignored in StringUtil.join

2019-02-25 Thread GitBox
nichunen opened a new pull request #482: KYLIN-3828 Fix heading empty string be 
ignored in StringUtil.join
URL: https://github.com/apache/kylin/pull/482
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] asfgit commented on issue #482: KYLIN-3828 Fix heading empty string be ignored in StringUtil.join

2019-02-25 Thread GitBox
asfgit commented on issue #482: KYLIN-3828 Fix heading empty string be ignored 
in StringUtil.join
URL: https://github.com/apache/kylin/pull/482#issuecomment-466910470
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (KYLIN-3828) ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty data in its first dimension

2019-02-25 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776620#comment-16776620
 ] 

ASF GitHub Bot commented on KYLIN-3828:
---

nichunen commented on pull request #482: KYLIN-3828 Fix heading empty string be 
ignored in StringUtil.join
URL: https://github.com/apache/kylin/pull/482
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty 
> data in its first dimension
> 
>
> Key: KYLIN-3828
> URL: https://issues.apache.org/jira/browse/KYLIN-3828
> Project: Kylin
>  Issue Type: Bug
>  Components: NRT Streaming
>Affects Versions: v2.6.0
>Reporter: nichunen
>Assignee: nichunen
>Priority: Major
> Fix For: Future
>
>
> The root cause is 
> in org.apache.kylin.source.kafka.hadoop.KafkaFlatTableMapper#doMap
> {code:java}
> public void doMap(LongWritable key, BytesWritable value, Context context) 
> throws IOException, InterruptedException {
> ByteBuffer buffer = ByteBuffer.wrap(value.getBytes(), 0, 
> value.getLength());
> StreamingMessageRow row = streamingParser.parse(buffer).get(0);
> if (row == null) {
> throw new IllegalArgumentException("");
> }
> data = StringUtil.join(row.getData(), delimiter);
> // output this row to value
> outValue.set(Bytes.toBytes(data));
> context.write(outKey, outValue);
> }
> {code}
> Method _StringUtil.join_ is used to join a collection of string to a single 
> string with delimiter. But _org.apache.kylin.common.util.StringUtil#join_ 
> will ignore the leading empty strings in the colletion, source code is:
> {code:java}
> public static String join(Iterable parts, String separator) {
> StringBuilder buf = new StringBuilder();
> for (String p : parts) {
> if (buf.length() > 0)
> buf.append(separator);
> buf.append(p);
> }
> return buf.toString();
> }
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (KYLIN-3826) MergeCuboidJob only uploads necessary segment's dictionary

2019-02-25 Thread Shaofeng SHI (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shaofeng SHI reassigned KYLIN-3826:
---

Assignee: Shaofeng SHI

> MergeCuboidJob only uploads necessary segment's dictionary
> --
>
> Key: KYLIN-3826
> URL: https://issues.apache.org/jira/browse/KYLIN-3826
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: Shaofeng SHI
>Assignee: Shaofeng SHI
>Priority: Major
> Fix For: v2.6.1
>
>
> On yesterday's Kylin meetup, Zhang Wei mentioned that the "MergeCuboidJob" 
> will upload all segment's metadata, which will take extra long time when the 
> segment number is large. While this is unnecessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (KYLIN-3826) MergeCuboidJob only uploads necessary segment's dictionary

2019-02-25 Thread Shaofeng SHI (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shaofeng SHI resolved KYLIN-3826.
-
Resolution: Fixed

> MergeCuboidJob only uploads necessary segment's dictionary
> --
>
> Key: KYLIN-3826
> URL: https://issues.apache.org/jira/browse/KYLIN-3826
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: Shaofeng SHI
>Assignee: Shaofeng SHI
>Priority: Major
> Fix For: v2.6.1
>
>
> On yesterday's Kylin meetup, Zhang Wei mentioned that the "MergeCuboidJob" 
> will upload all segment's metadata, which will take extra long time when the 
> segment number is large. While this is unnecessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] codecov-io commented on issue #482: KYLIN-3828 Fix heading empty string be ignored in StringUtil.join

2019-02-25 Thread GitBox
codecov-io commented on issue #482: KYLIN-3828 Fix heading empty string be 
ignored in StringUtil.join
URL: https://github.com/apache/kylin/pull/482#issuecomment-466919471
 
 
   # [Codecov](https://codecov.io/gh/apache/kylin/pull/482?src=pr&el=h1) Report
   > :exclamation: No coverage uploaded for pull request base 
(`master@500f102`). [Click here to learn what that 
means](https://docs.codecov.io/docs/error-reference#section-missing-base-commit).
   > The diff coverage is `54.54%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/kylin/pull/482/graphs/tree.svg?width=650&token=JawVgbgsVo&height=150&src=pr)](https://codecov.io/gh/apache/kylin/pull/482?src=pr&el=tree)
   
   ```diff
   @@Coverage Diff@@
   ## master #482   +/-   ##
   =
 Coverage  ?   24.48%   
 Complexity? 4964   
   =
 Files ? 1144   
 Lines ?69428   
 Branches  ? 9898   
   =
 Hits  ?16996   
 Misses?50714   
 Partials  ? 1718
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/kylin/pull/482?src=pr&el=tree) | Coverage Δ 
| Complexity Δ | |
   |---|---|---|---|
   | 
[.../java/org/apache/kylin/common/util/StringUtil.java](https://codecov.io/gh/apache/kylin/pull/482/diff?src=pr&el=tree#diff-Y29yZS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2t5bGluL2NvbW1vbi91dGlsL1N0cmluZ1V0aWwuamF2YQ==)
 | `15.05% <54.54%> (ø)` | `5 <0> (?)` | |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/kylin/pull/482?src=pr&el=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/kylin/pull/482?src=pr&el=footer). Last 
update 
[500f102...5bd3e9c](https://codecov.io/gh/apache/kylin/pull/482?src=pr&el=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] coveralls commented on issue #482: KYLIN-3828 Fix heading empty string be ignored in StringUtil.join

2019-02-25 Thread GitBox
coveralls commented on issue #482: KYLIN-3828 Fix heading empty string be 
ignored in StringUtil.join
URL: https://github.com/apache/kylin/pull/482#issuecomment-466920942
 
 
   ## Pull Request Test Coverage Report for [Build 
4162](https://coveralls.io/builds/21825581)
   
   * **10** of **11**   **(90.91%)**  changed or added relevant lines in **1** 
file are covered.
   * **3** unchanged lines in **1** file lost coverage.
   * Overall coverage increased (+**0.02%**) to **26.96%**
   
   ---
   
   |  Changes Missing Coverage | Covered Lines | Changed/Added Lines | % |
   | :-|--||---: |
   | 
[core-common/src/main/java/org/apache/kylin/common/util/StringUtil.java](https://coveralls.io/builds/21825581/source?filename=core-common%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fcommon%2Futil%2FStringUtil.java#L81)
 | 10 | 11 | 90.91%
   
   
   |  Files with Coverage Reduction | New Missed Lines | % |
   | :-|--|--: |
   | 
[core-cube/src/main/java/org/apache/kylin/cube/inmemcubing/MemDiskStore.java](https://coveralls.io/builds/21825581/source?filename=core-cube%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fcube%2Finmemcubing%2FMemDiskStore.java#L449)
 | 3 | 78.42% |
   
   
   |  Totals | [![Coverage 
Status](https://coveralls.io/builds/21825581/badge)](https://coveralls.io/builds/21825581)
 |
   | :-- | --: |
   | Change from base [Build 4161](https://coveralls.io/builds/21821945): |  
0.02% |
   | Covered Lines: | 18718 |
   | Relevant Lines: | 69428 |
   
   ---
   # 💛  - [Coveralls](https://coveralls.io)
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (KYLIN-3816) Current CI doesn't cover the case of streaming table join lookup table

2019-02-25 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776645#comment-16776645
 ] 

ASF GitHub Bot commented on KYLIN-3816:
---

Wayne1c commented on pull request #483: KYLIN-3816 Make CI cover streaming 
table join case
URL: https://github.com/apache/kylin/pull/483
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Current CI doesn't cover the case of streaming table join lookup table 
> ---
>
> Key: KYLIN-3816
> URL: https://issues.apache.org/jira/browse/KYLIN-3816
> Project: Kylin
>  Issue Type: Improvement
>  Components: Tools, Build and Test
>Reporter: Chao Long
>Assignee: Chao Long
>Priority: Major
> Fix For: v2.6.1
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] Wayne1c opened a new pull request #483: KYLIN-3816 Make CI cover streaming table join case

2019-02-25 Thread GitBox
Wayne1c opened a new pull request #483: KYLIN-3816 Make CI cover streaming 
table join case
URL: https://github.com/apache/kylin/pull/483
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] asfgit commented on issue #483: KYLIN-3816 Make CI cover streaming table join case

2019-02-25 Thread GitBox
asfgit commented on issue #483: KYLIN-3816 Make CI cover streaming table join 
case
URL: https://github.com/apache/kylin/pull/483#issuecomment-466924299
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] coveralls commented on issue #483: KYLIN-3816 Make CI cover streaming table join case

2019-02-25 Thread GitBox
coveralls commented on issue #483: KYLIN-3816 Make CI cover streaming table 
join case
URL: https://github.com/apache/kylin/pull/483#issuecomment-466934743
 
 
   ## Pull Request Test Coverage Report for [Build 
4163](https://coveralls.io/builds/21826094)
   
   * **0** of **1**   **(0.0%)**  changed or added relevant line in **1** file 
are covered.
   * **3** unchanged lines in **1** file lost coverage.
   * Overall coverage increased (+**0.006%**) to **26.945%**
   
   ---
   
   |  Changes Missing Coverage | Covered Lines | Changed/Added Lines | % |
   | :-|--||---: |
   | 
[source-kafka/src/main/java/org/apache/kylin/source/kafka/KafkaSource.java](https://coveralls.io/builds/21826094/source?filename=source-kafka%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fsource%2Fkafka%2FKafkaSource.java#L255)
 | 0 | 1 | 0.0%
   
   
   |  Files with Coverage Reduction | New Missed Lines | % |
   | :-|--|--: |
   | 
[core-cube/src/main/java/org/apache/kylin/cube/inmemcubing/MemDiskStore.java](https://coveralls.io/builds/21826094/source?filename=core-cube%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fcube%2Finmemcubing%2FMemDiskStore.java#L449)
 | 3 | 78.42% |
   
   
   |  Totals | [![Coverage 
Status](https://coveralls.io/builds/21826094/badge)](https://coveralls.io/builds/21826094)
 |
   | :-- | --: |
   | Change from base [Build 4161](https://coveralls.io/builds/21821945): |  
0.006% |
   | Covered Lines: | 18705 |
   | Relevant Lines: | 69420 |
   
   ---
   # 💛  - [Coveralls](https://coveralls.io)
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] codecov-io commented on issue #483: KYLIN-3816 Make CI cover streaming table join case

2019-02-25 Thread GitBox
codecov-io commented on issue #483: KYLIN-3816 Make CI cover streaming table 
join case
URL: https://github.com/apache/kylin/pull/483#issuecomment-466935858
 
 
   # [Codecov](https://codecov.io/gh/apache/kylin/pull/483?src=pr&el=h1) Report
   > :exclamation: No coverage uploaded for pull request base 
(`master@500f102`). [Click here to learn what that 
means](https://docs.codecov.io/docs/error-reference#section-missing-base-commit).
   > The diff coverage is `0%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/kylin/pull/483/graphs/tree.svg?width=650&token=JawVgbgsVo&height=150&src=pr)](https://codecov.io/gh/apache/kylin/pull/483?src=pr&el=tree)
   
   ```diff
   @@Coverage Diff@@
   ## master #483   +/-   ##
   =
 Coverage  ?   24.46%   
 Complexity? 4960   
   =
 Files ? 1144   
 Lines ?69420   
 Branches  ? 9895   
   =
 Hits  ?16986   
 Misses?50719   
 Partials  ? 1715
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/kylin/pull/483?src=pr&el=tree) | Coverage Δ 
| Complexity Δ | |
   |---|---|---|---|
   | 
[...ava/org/apache/kylin/source/kafka/KafkaSource.java](https://codecov.io/gh/apache/kylin/pull/483/diff?src=pr&el=tree#diff-c291cmNlLWthZmthL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9reWxpbi9zb3VyY2Uva2Fma2EvS2Fma2FTb3VyY2UuamF2YQ==)
 | `0% <0%> (ø)` | `0 <0> (?)` | |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/kylin/pull/483?src=pr&el=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/kylin/pull/483?src=pr&el=footer). Last 
update 
[500f102...27875dc](https://codecov.io/gh/apache/kylin/pull/483?src=pr&el=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (KYLIN-3322) TopN requires a SUM to work

2019-02-25 Thread KANG-SEN LU (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776848#comment-16776848
 ] 

KANG-SEN LU commented on KYLIN-3322:


Hi, Shaofeng:

 

Thanks for your response. I have two points to add.
 # What if I already put SUM(X) in a separated cube, why do I have to add 
SUM(X) into second cube while I am defining TOPN(X) in the second cube. If it 
is just redundant metadata data, I will not complain about the extra human 
effort. I am worried if the kylin may not be able to find the right cube to 
compute SUM(X), because now there are two cubes both are, supposed, equally 
qualified to answer the query. It will create more challenge to the cost 
evaluation function to kylin.
 #  My experiment seems to suggest that when SUM(X) not group by B was issued, 
the cost evaluation function sent the query to the cube containing both 
TOPN(SUM(X)) and SUM(X) and, more importantly, it goes after TOPN(SUM(X)), then 
perform SUM(X), that takes more than 20 seconds in my test case. If it goes 
after SUM(X) directly, it took less than 0.2 second. I think how kylin try to 
accomplish SUM(X) in a cube containing both TOPN(SUM(X)) and SUM(X) may not be 
correct. That is the main reason I am against this decision that in a cube 
containing TOPN(SUM(X)), one must also configure SUM(X)
 # 

 

> TopN requires a SUM to work
> ---
>
> Key: KYLIN-3322
> URL: https://issues.apache.org/jira/browse/KYLIN-3322
> Project: Kylin
>  Issue Type: Bug
>  Components: Measure - TopN
>Reporter: liyang
>Assignee: Na Zhai
>Priority: Major
>
> Currently if user creates a measure of TopN seller by sum of price, it is 
> required that user also creates a measure of SUM(price). Otherwise, NPE will 
> be thrown at query time.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] asfgit commented on issue #484: KYLIN-3820 Add a curator-based scheduler

2019-02-25 Thread GitBox
asfgit commented on issue #484: KYLIN-3820 Add a curator-based scheduler
URL: https://github.com/apache/kylin/pull/484#issuecomment-467015219
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] Wayne1c opened a new pull request #484: KYLIN-3820 Add a curator-based scheduler

2019-02-25 Thread GitBox
Wayne1c opened a new pull request #484: KYLIN-3820 Add a curator-based scheduler
URL: https://github.com/apache/kylin/pull/484
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (KYLIN-3820) Add a curator-based scheduler

2019-02-25 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776894#comment-16776894
 ] 

ASF GitHub Bot commented on KYLIN-3820:
---

Wayne1c commented on pull request #484: KYLIN-3820 Add a curator-based scheduler
URL: https://github.com/apache/kylin/pull/484
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Add a curator-based scheduler
> -
>
> Key: KYLIN-3820
> URL: https://issues.apache.org/jira/browse/KYLIN-3820
> Project: Kylin
>  Issue Type: Improvement
>Reporter: Chao Long
>Assignee: Chao Long
>Priority: Major
> Fix For: v3.0.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3827) kylin 某一字段必须使用trim才可以查出来

2019-02-25 Thread Shaofeng SHI (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776910#comment-16776910
 ] 

Shaofeng SHI commented on KYLIN-3827:
-

Seems be related with KYLIN-3705. But I need the detailed kylin.log when 
executing the queries. Please provide the log, or upgrade to v2.5.2 and then 
rebuild the segments.

> kylin 某一字段必须使用trim才可以查出来
> 
>
> Key: KYLIN-3827
> URL: https://issues.apache.org/jira/browse/KYLIN-3827
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v2.5.1
>Reporter: litewood
>Priority: Major
>
> SELECT pt ,pageid from st.ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1 where pt = 
> '20190114' and pageid='用户标签文档'  GROUP BY pt,pageid;
> 当前sql 使用hive查询可以查出来结果。
> 但是使用kylin接口查询,除去一个近视去重选用全局字段的cube能查询出来,其他cube 必须trim(pageid) 或者使用like 
> 关键字才能查出来。下面帖出三个cube信息。
>  
> {color:#FF}可以查出来的cube info (近视去重){color}
> {
>   "uuid": "ad8cbc65-dd6c-8c12-7d18-2f625cedf7ea",
>   "last_modified": 1550719049143,
>   "version": "2.5.1.20500",
>   "name": "c_pageview",
>   "is_draft": false,
>   "model_name": "m_bury_point_analysis",
>   "description": "c_pageview",
>   "null_string": null,
>   "dimensions": [
> \{
>   "name": "APPTYPE",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "APPTYPE",
>   "derived": null
> },
> \{
>   "name": "PLATFORM",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "PLATFORM",
>   "derived": null
> },
> \{
>   "name": "UTM_SOURCE",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "UTM_SOURCE",
>   "derived": null
> },
> \{
>   "name": "ADCODE",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "ADCODE",
>   "derived": null
> },
> \{
>   "name": "ADDITIONTYPE",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "ADDITIONTYPE",
>   "derived": null
> },
> \{
>   "name": "ADDITIONVALUE",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "ADDITIONVALUE",
>   "derived": null
> },
> \{
>   "name": "APPVERSION",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "APPVERSION",
>   "derived": null
> },
> \{
>   "name": "CATEGORYID",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "CATEGORYID",
>   "derived": null
> },
> \{
>   "name": "CITYCODE",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "CITYCODE",
>   "derived": null
> },
> \{
>   "name": "FLAGVALUE",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "FLAGVALUE",
>   "derived": null
> },
> \{
>   "name": "PAGEID",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "PAGEID",
>   "derived": null
> },
> \{
>   "name": "H_DATE",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "H_DATE",
>   "derived": null
> },
> \{
>   "name": "PT",
>   "table": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1",
>   "column": "PT",
>   "derived": null
> },
> \{
>   "name": "WEEKYTH_CNT_CN",
>   "table": "DIM_PUB_DATE",
>   "column": null,
>   "derived": [
> "WEEKYTH_CNT_CN"
>   ]
> },
> \{
>   "name": "WEEKYTH_START_DATA_CN",
>   "table": "DIM_PUB_DATE",
>   "column": null,
>   "derived": [
> "WEEKYTH_START_DATA_CN"
>   ]
> },
> \{
>   "name": "WEEKYTH_END_DATA_CN",
>   "table": "DIM_PUB_DATE",
>   "column": null,
>   "derived": [
> "WEEKYTH_END_DATA_CN"
>   ]
> }
>   ],
>   "measures": [
> \{
>   "name": "PV",
>   "function": {
> "expression": "COUNT",
> "parameter": {
>   "type": "constant",
>   "value": "1"
> },
> "returntype": "bigint"
>   }
> },
> \{
>   "name": "UV",
>   "function": {
> "expression": "COUNT_DISTINCT",
> "parameter": {
>   "type": "column",
>   "value": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1.USERGUID"
> },
> "returntype": "hllc(16)"
>   }
> },
> \{
>   "name": "DUV",
>   "function": {
> "expression": "COUNT_DISTINCT",
> "parameter": {
>   "type": "column",
>   "value": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1.SSID"
> },
> "returntype": "hllc(16)"
>   }
> }
>   ],
>   "dictionaries": [],
>   "rowkey": \{
> "rowkey_columns": [
>   {
> "column": "ST_T_BIKE_YUKON_PRO_PAGEVIEW_CLEAN1.PT",
> "encoding": "dict",
> "encoding_version": 1,
> "isShardBy": false
>   },
>   \{
> "column": "S

[jira] [Commented] (KYLIN-3828) ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty data in its first dimension

2019-02-25 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776916#comment-16776916
 ] 

ASF subversion and git services commented on KYLIN-3828:


Commit fb34122243372cb229b43eee29ca59f933d0ada3 in kylin's branch 
refs/heads/master from nichunen
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=fb34122 ]

KYLIN-3828 Fix heading empty string be ignored in StringUtil.join


> ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty 
> data in its first dimension
> 
>
> Key: KYLIN-3828
> URL: https://issues.apache.org/jira/browse/KYLIN-3828
> Project: Kylin
>  Issue Type: Bug
>  Components: NRT Streaming
>Affects Versions: v2.6.0
>Reporter: nichunen
>Assignee: nichunen
>Priority: Major
> Fix For: Future
>
>
> The root cause is 
> in org.apache.kylin.source.kafka.hadoop.KafkaFlatTableMapper#doMap
> {code:java}
> public void doMap(LongWritable key, BytesWritable value, Context context) 
> throws IOException, InterruptedException {
> ByteBuffer buffer = ByteBuffer.wrap(value.getBytes(), 0, 
> value.getLength());
> StreamingMessageRow row = streamingParser.parse(buffer).get(0);
> if (row == null) {
> throw new IllegalArgumentException("");
> }
> data = StringUtil.join(row.getData(), delimiter);
> // output this row to value
> outValue.set(Bytes.toBytes(data));
> context.write(outKey, outValue);
> }
> {code}
> Method _StringUtil.join_ is used to join a collection of string to a single 
> string with delimiter. But _org.apache.kylin.common.util.StringUtil#join_ 
> will ignore the leading empty strings in the colletion, source code is:
> {code:java}
> public static String join(Iterable parts, String separator) {
> StringBuilder buf = new StringBuilder();
> for (String p : parts) {
> if (buf.length() > 0)
> buf.append(separator);
> buf.append(p);
> }
> return buf.toString();
> }
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] shaofengshi merged pull request #482: KYLIN-3828 Fix heading empty string be ignored in StringUtil.join

2019-02-25 Thread GitBox
shaofengshi merged pull request #482: KYLIN-3828 Fix heading empty string be 
ignored in StringUtil.join
URL: https://github.com/apache/kylin/pull/482
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (KYLIN-3828) ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty data in its first dimension

2019-02-25 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776915#comment-16776915
 ] 

ASF GitHub Bot commented on KYLIN-3828:
---

shaofengshi commented on pull request #482: KYLIN-3828 Fix heading empty string 
be ignored in StringUtil.join
URL: https://github.com/apache/kylin/pull/482
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty 
> data in its first dimension
> 
>
> Key: KYLIN-3828
> URL: https://issues.apache.org/jira/browse/KYLIN-3828
> Project: Kylin
>  Issue Type: Bug
>  Components: NRT Streaming
>Affects Versions: v2.6.0
>Reporter: nichunen
>Assignee: nichunen
>Priority: Major
> Fix For: Future
>
>
> The root cause is 
> in org.apache.kylin.source.kafka.hadoop.KafkaFlatTableMapper#doMap
> {code:java}
> public void doMap(LongWritable key, BytesWritable value, Context context) 
> throws IOException, InterruptedException {
> ByteBuffer buffer = ByteBuffer.wrap(value.getBytes(), 0, 
> value.getLength());
> StreamingMessageRow row = streamingParser.parse(buffer).get(0);
> if (row == null) {
> throw new IllegalArgumentException("");
> }
> data = StringUtil.join(row.getData(), delimiter);
> // output this row to value
> outValue.set(Bytes.toBytes(data));
> context.write(outKey, outValue);
> }
> {code}
> Method _StringUtil.join_ is used to join a collection of string to a single 
> string with delimiter. But _org.apache.kylin.common.util.StringUtil#join_ 
> will ignore the leading empty strings in the colletion, source code is:
> {code:java}
> public static String join(Iterable parts, String separator) {
> StringBuilder buf = new StringBuilder();
> for (String p : parts) {
> if (buf.length() > 0)
> buf.append(separator);
> buf.append(p);
> }
> return buf.toString();
> }
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (KYLIN-3828) ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty data in its first dimension

2019-02-25 Thread Shaofeng SHI (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shaofeng SHI updated KYLIN-3828:

Fix Version/s: (was: Future)
   v2.6.1

> ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty 
> data in its first dimension
> 
>
> Key: KYLIN-3828
> URL: https://issues.apache.org/jira/browse/KYLIN-3828
> Project: Kylin
>  Issue Type: Bug
>  Components: NRT Streaming
>Affects Versions: v2.6.0
>Reporter: nichunen
>Assignee: nichunen
>Priority: Major
> Fix For: v2.6.1
>
>
> The root cause is 
> in org.apache.kylin.source.kafka.hadoop.KafkaFlatTableMapper#doMap
> {code:java}
> public void doMap(LongWritable key, BytesWritable value, Context context) 
> throws IOException, InterruptedException {
> ByteBuffer buffer = ByteBuffer.wrap(value.getBytes(), 0, 
> value.getLength());
> StreamingMessageRow row = streamingParser.parse(buffer).get(0);
> if (row == null) {
> throw new IllegalArgumentException("");
> }
> data = StringUtil.join(row.getData(), delimiter);
> // output this row to value
> outValue.set(Bytes.toBytes(data));
> context.write(outKey, outValue);
> }
> {code}
> Method _StringUtil.join_ is used to join a collection of string to a single 
> string with delimiter. But _org.apache.kylin.common.util.StringUtil#join_ 
> will ignore the leading empty strings in the colletion, source code is:
> {code:java}
> public static String join(Iterable parts, String separator) {
> StringBuilder buf = new StringBuilder();
> for (String p : parts) {
> if (buf.length() > 0)
> buf.append(separator);
> buf.append(p);
> }
> return buf.toString();
> }
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] coveralls commented on issue #484: KYLIN-3820 Add a curator-based scheduler

2019-02-25 Thread GitBox
coveralls commented on issue #484: KYLIN-3820 Add a curator-based scheduler
URL: https://github.com/apache/kylin/pull/484#issuecomment-467030756
 
 
   ## Pull Request Test Coverage Report for [Build 
4164](https://coveralls.io/builds/21831279)
   
   * **114** of **177**   **(64.41%)**  changed or added relevant lines in 
**4** files are covered.
   * **3** unchanged lines in **1** file lost coverage.
   * Overall coverage increased (+**0.1%**) to **27.044%**
   
   ---
   
   |  Changes Missing Coverage | Covered Lines | Changed/Added Lines | % |
   | :-|--||---: |
   | 
[core-common/src/main/java/org/apache/kylin/common/KylinConfigBase.java](https://coveralls.io/builds/21831279/source?filename=core-common%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fcommon%2FKylinConfigBase.java#L366)
 | 0 | 4 | 0.0%
   | 
[core-job/src/main/java/org/apache/kylin/job/impl/curator/CuratorLeaderSelector.java](https://coveralls.io/builds/21831279/source?filename=core-job%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fjob%2Fimpl%2Fcurator%2FCuratorLeaderSelector.java#L56)
 | 30 | 43 | 69.77%
   | 
[core-common/src/main/java/org/apache/kylin/common/ServerMode.java](https://coveralls.io/builds/21831279/source?filename=core-common%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fcommon%2FServerMode.java#L21)
 | 0 | 17 | 0.0%
   | 
[core-job/src/main/java/org/apache/kylin/job/impl/curator/CuratorScheduler.java](https://coveralls.io/builds/21831279/source?filename=core-job%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fjob%2Fimpl%2Fcurator%2FCuratorScheduler.java#L87)
 | 84 | 113 | 74.34%
   
   
   |  Files with Coverage Reduction | New Missed Lines | % |
   | :-|--|--: |
   | 
[core-cube/src/main/java/org/apache/kylin/cube/inmemcubing/MemDiskStore.java](https://coveralls.io/builds/21831279/source?filename=core-cube%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fcube%2Finmemcubing%2FMemDiskStore.java#L449)
 | 3 | 78.42% |
   
   
   |  Totals | [![Coverage 
Status](https://coveralls.io/builds/21831279/badge)](https://coveralls.io/builds/21831279)
 |
   | :-- | --: |
   | Change from base [Build 4161](https://coveralls.io/builds/21821945): |  
0.1% |
   | Covered Lines: | 18822 |
   | Relevant Lines: | 69597 |
   
   ---
   # 💛  - [Coveralls](https://coveralls.io)
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (KYLIN-3782) ZookeeperDistributedLock can't acquir lock on windows because wrong Path

2019-02-25 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776943#comment-16776943
 ] 

ASF subversion and git services commented on KYLIN-3782:


Commit 8e717518642b36bea5b35a45d7f1995c0f662c73 in kylin's branch 
refs/heads/master from hailin huang
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=8e71751 ]

KYLIN-3782 ZookeeperDistributedLock can't acquir lock on windows


> ZookeeperDistributedLock can't acquir lock on windows because wrong Path
> 
>
> Key: KYLIN-3782
> URL: https://issues.apache.org/jira/browse/KYLIN-3782
> Project: Kylin
>  Issue Type: Bug
>  Components: Storage - HBase
>Affects Versions: v2.6.0
>Reporter: hailin.huang
>Assignee: hailin.huang
>Priority: Major
> Fix For: v2.6.1
>
>
> In my windows env, when I run kylin, Kylin service can't start. In class 
> ZookeeperDistributedLock, I found  kylin use File(path).getCanonicalPath() to 
> norm windows path, however this will get a path like ' 
> C:\kylin\kylin_metadata1\job_engine\global_job_engine_lock', for 
> zookeeper,Path must start with / character.
> {color:red}return new File(path).toURI().getPath(){color}  might be better!!!
> Below is stackTrace:
> Caused by: java.lang.IllegalStateException: Error while 
> 832820@UC-20180829ZLJB trying to lock 
> C:\kylin\kylin_metadata1\job_engine\global_job_engine_lock
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:153)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:170)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lockJobEngine(ZookeeperDistributedLock.java:318)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperJobLock.lockJobEngine(ZookeeperJobLock.java:81)
>   at 
> org.apache.kylin.job.impl.threadpool.DefaultScheduler.init(DefaultScheduler.java:152)
>   at 
> org.apache.kylin.rest.service.JobService.afterPropertiesSet(JobService.java:126)
>   at 
> org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.invokeInitMethods(AbstractAutowireCapableBeanFactory.java:1687)
>   at 
> org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.initializeBean(AbstractAutowireCapableBeanFactory.java:1624)
>   ... 60 more
> caused by: java.lang.IllegalArgumentException: Path must start with / 
> character
>   at org.apache.curator.utils.PathUtils.validatePath(PathUtils.java:54)
>   at org.apache.curator.utils.PathUtils.validatePath(PathUtils.java:37)
>   at org.apache.curator.utils.ZKPaths.fixForNamespace(ZKPaths.java:63)
>   at 
> org.apache.curator.framework.imps.NamespaceImpl.fixForNamespace(NamespaceImpl.java:82)
>   at 
> org.apache.curator.framework.imps.CuratorFrameworkImpl.fixForNamespace(CuratorFrameworkImpl.java:579)
>   at 
> org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:434)
>   at 
> org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:44)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:149)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3782) ZookeeperDistributedLock can't acquir lock on windows because wrong Path

2019-02-25 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776949#comment-16776949
 ] 

ASF subversion and git services commented on KYLIN-3782:


Commit 23c8e985cb4a005f563c68e4a33c8a74a20cdfed in kylin's branch 
refs/heads/2.6.x from hailin huang
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=23c8e98 ]

KYLIN-3782 ZookeeperDistributedLock can't acquir lock on windows


> ZookeeperDistributedLock can't acquir lock on windows because wrong Path
> 
>
> Key: KYLIN-3782
> URL: https://issues.apache.org/jira/browse/KYLIN-3782
> Project: Kylin
>  Issue Type: Bug
>  Components: Storage - HBase
>Affects Versions: v2.6.0
>Reporter: hailin.huang
>Assignee: hailin.huang
>Priority: Major
> Fix For: v2.6.1
>
>
> In my windows env, when I run kylin, Kylin service can't start. In class 
> ZookeeperDistributedLock, I found  kylin use File(path).getCanonicalPath() to 
> norm windows path, however this will get a path like ' 
> C:\kylin\kylin_metadata1\job_engine\global_job_engine_lock', for 
> zookeeper,Path must start with / character.
> {color:red}return new File(path).toURI().getPath(){color}  might be better!!!
> Below is stackTrace:
> Caused by: java.lang.IllegalStateException: Error while 
> 832820@UC-20180829ZLJB trying to lock 
> C:\kylin\kylin_metadata1\job_engine\global_job_engine_lock
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:153)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:170)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lockJobEngine(ZookeeperDistributedLock.java:318)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperJobLock.lockJobEngine(ZookeeperJobLock.java:81)
>   at 
> org.apache.kylin.job.impl.threadpool.DefaultScheduler.init(DefaultScheduler.java:152)
>   at 
> org.apache.kylin.rest.service.JobService.afterPropertiesSet(JobService.java:126)
>   at 
> org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.invokeInitMethods(AbstractAutowireCapableBeanFactory.java:1687)
>   at 
> org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.initializeBean(AbstractAutowireCapableBeanFactory.java:1624)
>   ... 60 more
> caused by: java.lang.IllegalArgumentException: Path must start with / 
> character
>   at org.apache.curator.utils.PathUtils.validatePath(PathUtils.java:54)
>   at org.apache.curator.utils.PathUtils.validatePath(PathUtils.java:37)
>   at org.apache.curator.utils.ZKPaths.fixForNamespace(ZKPaths.java:63)
>   at 
> org.apache.curator.framework.imps.NamespaceImpl.fixForNamespace(NamespaceImpl.java:82)
>   at 
> org.apache.curator.framework.imps.CuratorFrameworkImpl.fixForNamespace(CuratorFrameworkImpl.java:579)
>   at 
> org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:434)
>   at 
> org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:44)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:149)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3826) MergeCuboidJob only uploads necessary segment's dictionary

2019-02-25 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776947#comment-16776947
 ] 

ASF subversion and git services commented on KYLIN-3826:


Commit 4b3a20be3cefd540a2a300bf7f01f213d6a591b7 in kylin's branch 
refs/heads/2.6.x from Shao Feng Shi
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=4b3a20b ]

KYLIN-3826 MergeCuboidJob only uploads necessary segment's dictionary


> MergeCuboidJob only uploads necessary segment's dictionary
> --
>
> Key: KYLIN-3826
> URL: https://issues.apache.org/jira/browse/KYLIN-3826
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: Shaofeng SHI
>Assignee: Shaofeng SHI
>Priority: Major
> Fix For: v2.6.1
>
>
> On yesterday's Kylin meetup, Zhang Wei mentioned that the "MergeCuboidJob" 
> will upload all segment's metadata, which will take extra long time when the 
> segment number is large. While this is unnecessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3828) ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty data in its first dimension

2019-02-25 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776956#comment-16776956
 ] 

ASF subversion and git services commented on KYLIN-3828:


Commit d6819b3e70033888b42ab0e5585c191898f8afe9 in kylin's branch 
refs/heads/2.5.x from nichunen
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=d6819b3 ]

KYLIN-3828 Fix heading empty string be ignored in StringUtil.join


> ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty 
> data in its first dimension
> 
>
> Key: KYLIN-3828
> URL: https://issues.apache.org/jira/browse/KYLIN-3828
> Project: Kylin
>  Issue Type: Bug
>  Components: NRT Streaming
>Affects Versions: v2.6.0
>Reporter: nichunen
>Assignee: nichunen
>Priority: Major
> Fix For: v2.6.1
>
>
> The root cause is 
> in org.apache.kylin.source.kafka.hadoop.KafkaFlatTableMapper#doMap
> {code:java}
> public void doMap(LongWritable key, BytesWritable value, Context context) 
> throws IOException, InterruptedException {
> ByteBuffer buffer = ByteBuffer.wrap(value.getBytes(), 0, 
> value.getLength());
> StreamingMessageRow row = streamingParser.parse(buffer).get(0);
> if (row == null) {
> throw new IllegalArgumentException("");
> }
> data = StringUtil.join(row.getData(), delimiter);
> // output this row to value
> outValue.set(Bytes.toBytes(data));
> context.write(outKey, outValue);
> }
> {code}
> Method _StringUtil.join_ is used to join a collection of string to a single 
> string with delimiter. But _org.apache.kylin.common.util.StringUtil#join_ 
> will ignore the leading empty strings in the colletion, source code is:
> {code:java}
> public static String join(Iterable parts, String separator) {
> StringBuilder buf = new StringBuilder();
> for (String p : parts) {
> if (buf.length() > 0)
> buf.append(separator);
> buf.append(p);
> }
> return buf.toString();
> }
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3826) MergeCuboidJob only uploads necessary segment's dictionary

2019-02-25 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776967#comment-16776967
 ] 

ASF subversion and git services commented on KYLIN-3826:


Commit 7a6bf9871d422289cb642bb1caaefef85c9c1604 in kylin's branch 
refs/heads/2.4.x from Shao Feng Shi
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=7a6bf98 ]

KYLIN-3826 MergeCuboidJob only uploads necessary segment's dictionary


> MergeCuboidJob only uploads necessary segment's dictionary
> --
>
> Key: KYLIN-3826
> URL: https://issues.apache.org/jira/browse/KYLIN-3826
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: Shaofeng SHI
>Assignee: Shaofeng SHI
>Priority: Major
> Fix For: v2.6.1
>
>
> On yesterday's Kylin meetup, Zhang Wei mentioned that the "MergeCuboidJob" 
> will upload all segment's metadata, which will take extra long time when the 
> segment number is large. While this is unnecessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (KYLIN-3782) ZookeeperDistributedLock can't acquir lock on windows because wrong Path

2019-02-25 Thread Shaofeng SHI (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shaofeng SHI resolved KYLIN-3782.
-
Resolution: Fixed

> ZookeeperDistributedLock can't acquir lock on windows because wrong Path
> 
>
> Key: KYLIN-3782
> URL: https://issues.apache.org/jira/browse/KYLIN-3782
> Project: Kylin
>  Issue Type: Bug
>  Components: Storage - HBase
>Affects Versions: v2.6.0
>Reporter: hailin.huang
>Assignee: hailin.huang
>Priority: Major
> Fix For: v2.6.1
>
>
> In my windows env, when I run kylin, Kylin service can't start. In class 
> ZookeeperDistributedLock, I found  kylin use File(path).getCanonicalPath() to 
> norm windows path, however this will get a path like ' 
> C:\kylin\kylin_metadata1\job_engine\global_job_engine_lock', for 
> zookeeper,Path must start with / character.
> {color:red}return new File(path).toURI().getPath(){color}  might be better!!!
> Below is stackTrace:
> Caused by: java.lang.IllegalStateException: Error while 
> 832820@UC-20180829ZLJB trying to lock 
> C:\kylin\kylin_metadata1\job_engine\global_job_engine_lock
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:153)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:170)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lockJobEngine(ZookeeperDistributedLock.java:318)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperJobLock.lockJobEngine(ZookeeperJobLock.java:81)
>   at 
> org.apache.kylin.job.impl.threadpool.DefaultScheduler.init(DefaultScheduler.java:152)
>   at 
> org.apache.kylin.rest.service.JobService.afterPropertiesSet(JobService.java:126)
>   at 
> org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.invokeInitMethods(AbstractAutowireCapableBeanFactory.java:1687)
>   at 
> org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.initializeBean(AbstractAutowireCapableBeanFactory.java:1624)
>   ... 60 more
> caused by: java.lang.IllegalArgumentException: Path must start with / 
> character
>   at org.apache.curator.utils.PathUtils.validatePath(PathUtils.java:54)
>   at org.apache.curator.utils.PathUtils.validatePath(PathUtils.java:37)
>   at org.apache.curator.utils.ZKPaths.fixForNamespace(ZKPaths.java:63)
>   at 
> org.apache.curator.framework.imps.NamespaceImpl.fixForNamespace(NamespaceImpl.java:82)
>   at 
> org.apache.curator.framework.imps.CuratorFrameworkImpl.fixForNamespace(CuratorFrameworkImpl.java:579)
>   at 
> org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:434)
>   at 
> org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:44)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:149)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] shaofengshi commented on issue #477: bug fixed for KYLIN-3782

2019-02-25 Thread GitBox
shaofengshi commented on issue #477: bug fixed for KYLIN-3782
URL: https://github.com/apache/kylin/pull/477#issuecomment-467040007
 
 
   Merged with patch, close the PR.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] shaofengshi closed pull request #477: bug fixed for KYLIN-3782

2019-02-25 Thread GitBox
shaofengshi closed pull request #477: bug fixed for KYLIN-3782
URL: https://github.com/apache/kylin/pull/477
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (KYLIN-3782) ZookeeperDistributedLock can't acquir lock on windows because wrong Path

2019-02-25 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776968#comment-16776968
 ] 

ASF GitHub Bot commented on KYLIN-3782:
---

shaofengshi commented on pull request #477: bug fixed for KYLIN-3782
URL: https://github.com/apache/kylin/pull/477
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> ZookeeperDistributedLock can't acquir lock on windows because wrong Path
> 
>
> Key: KYLIN-3782
> URL: https://issues.apache.org/jira/browse/KYLIN-3782
> Project: Kylin
>  Issue Type: Bug
>  Components: Storage - HBase
>Affects Versions: v2.6.0
>Reporter: hailin.huang
>Assignee: hailin.huang
>Priority: Major
> Fix For: v2.6.1
>
>
> In my windows env, when I run kylin, Kylin service can't start. In class 
> ZookeeperDistributedLock, I found  kylin use File(path).getCanonicalPath() to 
> norm windows path, however this will get a path like ' 
> C:\kylin\kylin_metadata1\job_engine\global_job_engine_lock', for 
> zookeeper,Path must start with / character.
> {color:red}return new File(path).toURI().getPath(){color}  might be better!!!
> Below is stackTrace:
> Caused by: java.lang.IllegalStateException: Error while 
> 832820@UC-20180829ZLJB trying to lock 
> C:\kylin\kylin_metadata1\job_engine\global_job_engine_lock
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:153)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:170)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lockJobEngine(ZookeeperDistributedLock.java:318)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperJobLock.lockJobEngine(ZookeeperJobLock.java:81)
>   at 
> org.apache.kylin.job.impl.threadpool.DefaultScheduler.init(DefaultScheduler.java:152)
>   at 
> org.apache.kylin.rest.service.JobService.afterPropertiesSet(JobService.java:126)
>   at 
> org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.invokeInitMethods(AbstractAutowireCapableBeanFactory.java:1687)
>   at 
> org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.initializeBean(AbstractAutowireCapableBeanFactory.java:1624)
>   ... 60 more
> caused by: java.lang.IllegalArgumentException: Path must start with / 
> character
>   at org.apache.curator.utils.PathUtils.validatePath(PathUtils.java:54)
>   at org.apache.curator.utils.PathUtils.validatePath(PathUtils.java:37)
>   at org.apache.curator.utils.ZKPaths.fixForNamespace(ZKPaths.java:63)
>   at 
> org.apache.curator.framework.imps.NamespaceImpl.fixForNamespace(NamespaceImpl.java:82)
>   at 
> org.apache.curator.framework.imps.CuratorFrameworkImpl.fixForNamespace(CuratorFrameworkImpl.java:579)
>   at 
> org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:434)
>   at 
> org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:44)
>   at 
> org.apache.kylin.storage.hbase.util.ZookeeperDistributedLock.lock(ZookeeperDistributedLock.java:149)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3828) ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty data in its first dimension

2019-02-25 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776948#comment-16776948
 ] 

ASF subversion and git services commented on KYLIN-3828:


Commit 9a3ec734d1ccaf81a9f6df819961d7b9f541b7df in kylin's branch 
refs/heads/2.6.x from nichunen
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=9a3ec73 ]

KYLIN-3828 Fix heading empty string be ignored in StringUtil.join


> ArrayIndexOutOfBoundsException thrown when build a streaming cube with empty 
> data in its first dimension
> 
>
> Key: KYLIN-3828
> URL: https://issues.apache.org/jira/browse/KYLIN-3828
> Project: Kylin
>  Issue Type: Bug
>  Components: NRT Streaming
>Affects Versions: v2.6.0
>Reporter: nichunen
>Assignee: nichunen
>Priority: Major
> Fix For: v2.6.1
>
>
> The root cause is 
> in org.apache.kylin.source.kafka.hadoop.KafkaFlatTableMapper#doMap
> {code:java}
> public void doMap(LongWritable key, BytesWritable value, Context context) 
> throws IOException, InterruptedException {
> ByteBuffer buffer = ByteBuffer.wrap(value.getBytes(), 0, 
> value.getLength());
> StreamingMessageRow row = streamingParser.parse(buffer).get(0);
> if (row == null) {
> throw new IllegalArgumentException("");
> }
> data = StringUtil.join(row.getData(), delimiter);
> // output this row to value
> outValue.set(Bytes.toBytes(data));
> context.write(outKey, outValue);
> }
> {code}
> Method _StringUtil.join_ is used to join a collection of string to a single 
> string with delimiter. But _org.apache.kylin.common.util.StringUtil#join_ 
> will ignore the leading empty strings in the colletion, source code is:
> {code:java}
> public static String join(Iterable parts, String separator) {
> StringBuilder buf = new StringBuilder();
> for (String p : parts) {
> if (buf.length() > 0)
> buf.append(separator);
> buf.append(p);
> }
> return buf.toString();
> }
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3826) MergeCuboidJob only uploads necessary segment's dictionary

2019-02-25 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16776955#comment-16776955
 ] 

ASF subversion and git services commented on KYLIN-3826:


Commit 7a731c8d8832e7292cc9b79de4dbda98fd52c0ff in kylin's branch 
refs/heads/2.5.x from Shao Feng Shi
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=7a731c8 ]

KYLIN-3826 MergeCuboidJob only uploads necessary segment's dictionary


> MergeCuboidJob only uploads necessary segment's dictionary
> --
>
> Key: KYLIN-3826
> URL: https://issues.apache.org/jira/browse/KYLIN-3826
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: Shaofeng SHI
>Assignee: Shaofeng SHI
>Priority: Major
> Fix For: v2.6.1
>
>
> On yesterday's Kylin meetup, Zhang Wei mentioned that the "MergeCuboidJob" 
> will upload all segment's metadata, which will take extra long time when the 
> segment number is large. While this is unnecessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3829) Chinese document on the kylin official site about sample cube got the sql wrong

2019-02-25 Thread JIRA
王汝鹏 created KYLIN-3829:
--

 Summary: Chinese document on the kylin official site about sample 
cube got the sql wrong
 Key: KYLIN-3829
 URL: https://issues.apache.org/jira/browse/KYLIN-3829
 Project: Kylin
  Issue Type: Improvement
  Components: Documentation, Website
Affects Versions: v2.6.0
Reporter: 王汝鹏


As we can see, the sql in the [Chinese document about the quick start with 
sample 
cube|[http://kylin.apache.org/cn/docs/tutorial/kylin_sample.html]|http://kylin.apache.org/cn/docs/tutorial/kylin_sample.html],]
 misses a piece which should be "select part_dt, sum(price) as total_selled, 
count(distinct seller_id) as sellers from kylin_sales group by part_dt order by 
part_dt". But the English version is correct.  With the helping of the front 
end code checking tool, you can see the right sql, so it's just a front end 
display issue and I think it's easy to fix. :)

I've tested by chrome, firefox and edge and got the same problem, so it's not 
the problem of brower .

It could be confusing for the beginner though it's easy to resolve.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (KYLIN-3814) Add pause interval for job retry

2019-02-25 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/KYLIN-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

王汝鹏 updated KYLIN-3814:
---
Attachment: (was: kylin.jpg)

> Add pause interval for job retry
> 
>
> Key: KYLIN-3814
> URL: https://issues.apache.org/jira/browse/KYLIN-3814
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: PENG Zhengshuai
>Assignee: PENG Zhengshuai
>Priority: Blocker
>
> When there is an exception occurred while the job was running, like network 
> shake or ZK service shake.  The job will be retried. The configuration 
> "kylin.job.retry" will handle this. 
> But sometimes we need to pause a little time before the retry job start. The 
> purpose is to wait for the network shake or ZK shake recovered, otherwise the 
> retry make no sense.
> The design is to add a configuration "kylin.job.retry-interval", the default 
> is 3 milliseconds. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (KYLIN-3814) Add pause interval for job retry

2019-02-25 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/KYLIN-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

王汝鹏 updated KYLIN-3814:
---
Attachment: kylin.jpg

> Add pause interval for job retry
> 
>
> Key: KYLIN-3814
> URL: https://issues.apache.org/jira/browse/KYLIN-3814
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: PENG Zhengshuai
>Assignee: PENG Zhengshuai
>Priority: Blocker
> Attachments: kylin.jpg
>
>
> When there is an exception occurred while the job was running, like network 
> shake or ZK service shake.  The job will be retried. The configuration 
> "kylin.job.retry" will handle this. 
> But sometimes we need to pause a little time before the retry job start. The 
> purpose is to wait for the network shake or ZK shake recovered, otherwise the 
> retry make no sense.
> The design is to add a configuration "kylin.job.retry-interval", the default 
> is 3 milliseconds. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (KYLIN-3814) Add pause interval for job retry

2019-02-25 Thread Shaofeng SHI (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shaofeng SHI updated KYLIN-3814:

 Priority: Major  (was: Blocker)
Fix Version/s: v3.0.0

> Add pause interval for job retry
> 
>
> Key: KYLIN-3814
> URL: https://issues.apache.org/jira/browse/KYLIN-3814
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: PENG Zhengshuai
>Assignee: PENG Zhengshuai
>Priority: Major
> Fix For: v3.0.0
>
>
> When there is an exception occurred while the job was running, like network 
> shake or ZK service shake.  The job will be retried. The configuration 
> "kylin.job.retry" will handle this. 
> But sometimes we need to pause a little time before the retry job start. The 
> purpose is to wait for the network shake or ZK shake recovered, otherwise the 
> retry make no sense.
> The design is to add a configuration "kylin.job.retry-interval", the default 
> is 3 milliseconds. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3814) Add pause interval for job retry

2019-02-25 Thread JIRA


[ 
https://issues.apache.org/jira/browse/KYLIN-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16777467#comment-16777467
 ] 

王汝鹏 commented on KYLIN-3814:


I'm so sorry for my wrong operation about deleting the link. Here is the  link 
[This issue links to "GitHub Pull Request #470 (Web Link)" [ 146226 
]|[https://github.com/apache/kylin/pull/470]] 

> Add pause interval for job retry
> 
>
> Key: KYLIN-3814
> URL: https://issues.apache.org/jira/browse/KYLIN-3814
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: PENG Zhengshuai
>Assignee: PENG Zhengshuai
>Priority: Major
> Fix For: v3.0.0
>
>
> When there is an exception occurred while the job was running, like network 
> shake or ZK service shake.  The job will be retried. The configuration 
> "kylin.job.retry" will handle this. 
> But sometimes we need to pause a little time before the retry job start. The 
> purpose is to wait for the network shake or ZK shake recovered, otherwise the 
> retry make no sense.
> The design is to add a configuration "kylin.job.retry-interval", the default 
> is 3 milliseconds. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (KYLIN-3829) Chinese document on the kylin official site about sample cube got the sql wrong

2019-02-25 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/KYLIN-3829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

王汝鹏 updated KYLIN-3829:
---
Attachment: kylin.jpg

> Chinese document on the kylin official site about sample cube got the sql 
> wrong
> ---
>
> Key: KYLIN-3829
> URL: https://issues.apache.org/jira/browse/KYLIN-3829
> Project: Kylin
>  Issue Type: Improvement
>  Components: Documentation, Website
>Affects Versions: v2.6.0
>Reporter: 王汝鹏
>Priority: Trivial
>  Labels: beginner, easyfix
> Attachments: kylin.jpg
>
>
> As we can see, the sql in the [Chinese document about the quick start with 
> sample 
> cube|[http://kylin.apache.org/cn/docs/tutorial/kylin_sample.html]|http://kylin.apache.org/cn/docs/tutorial/kylin_sample.html],]
>  misses a piece which should be "select part_dt, sum(price) as total_selled, 
> count(distinct seller_id) as sellers from kylin_sales group by part_dt order 
> by part_dt". But the English version is correct.  With the helping of the 
> front end code checking tool, you can see the right sql, so it's just a front 
> end display issue and I think it's easy to fix. :)
> I've tested by chrome, firefox and edge and got the same problem, so it's not 
> the problem of brower .
> It could be confusing for the beginner though it's easy to resolve.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (KYLIN-3814) Add pause interval for job retry

2019-02-25 Thread Shaofeng SHI (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shaofeng SHI resolved KYLIN-3814.
-
Resolution: Fixed

> Add pause interval for job retry
> 
>
> Key: KYLIN-3814
> URL: https://issues.apache.org/jira/browse/KYLIN-3814
> Project: Kylin
>  Issue Type: Improvement
>  Components: Job Engine
>Reporter: PENG Zhengshuai
>Assignee: PENG Zhengshuai
>Priority: Major
> Fix For: v3.0.0
>
>
> When there is an exception occurred while the job was running, like network 
> shake or ZK service shake.  The job will be retried. The configuration 
> "kylin.job.retry" will handle this. 
> But sometimes we need to pause a little time before the retry job start. The 
> purpose is to wait for the network shake or ZK shake recovered, otherwise the 
> retry make no sense.
> The design is to add a configuration "kylin.job.retry-interval", the default 
> is 3 milliseconds. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (KYLIN-3829) Chinese document on the kylin official site about sample cube got the sql wrong

2019-02-25 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/KYLIN-3829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

王汝鹏 updated KYLIN-3829:
---
Description: 
As we can see, the sql in the [Chinese document about the quick start with 
sample cube|[http://kylin.apache.org/cn/docs/tutorial/kylin_sample.html]] 
misses a piece which should be "select part_dt, sum(price) as total_selled, 
count(distinct seller_id) as sellers from kylin_sales group by part_dt order by 
part_dt". But the English version is correct.  With the helping of the front 
end code checking tool, you can see the right sql, so it's just a front end 
display issue and I think it's easy to fix. :)

I've tested by chrome, firefox and edge and got the same problem, so it's not 
the problem of brower .

It could be confusing for the beginner though it's easy to resolve.

  was:
As we can see, the sql in the [Chinese document about the quick start with 
sample 
cube|[http://kylin.apache.org/cn/docs/tutorial/kylin_sample.html]|http://kylin.apache.org/cn/docs/tutorial/kylin_sample.html],]
 misses a piece which should be "select part_dt, sum(price) as total_selled, 
count(distinct seller_id) as sellers from kylin_sales group by part_dt order by 
part_dt". But the English version is correct.  With the helping of the front 
end code checking tool, you can see the right sql, so it's just a front end 
display issue and I think it's easy to fix. :)

I've tested by chrome, firefox and edge and got the same problem, so it's not 
the problem of brower .

It could be confusing for the beginner though it's easy to resolve.


> Chinese document on the kylin official site about sample cube got the sql 
> wrong
> ---
>
> Key: KYLIN-3829
> URL: https://issues.apache.org/jira/browse/KYLIN-3829
> Project: Kylin
>  Issue Type: Improvement
>  Components: Documentation, Website
>Affects Versions: v2.6.0
>Reporter: 王汝鹏
>Priority: Trivial
>  Labels: beginner, easyfix
> Attachments: kylin.jpg
>
>
> As we can see, the sql in the [Chinese document about the quick start with 
> sample cube|[http://kylin.apache.org/cn/docs/tutorial/kylin_sample.html]] 
> misses a piece which should be "select part_dt, sum(price) as total_selled, 
> count(distinct seller_id) as sellers from kylin_sales group by part_dt order 
> by part_dt". But the English version is correct.  With the helping of the 
> front end code checking tool, you can see the right sql, so it's just a front 
> end display issue and I think it's easy to fix. :)
> I've tested by chrome, firefox and edge and got the same problem, so it's not 
> the problem of brower .
> It could be confusing for the beginner though it's easy to resolve.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3810) Kylin service nodes discovery with Zookeeper

2019-02-25 Thread Chao Long (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16777515#comment-16777515
 ] 

Chao Long commented on KYLIN-3810:
--

Recently, I added a curator-based scheduler in kylin and I found some work 
duplicate with yours. In my work, I use the curator-x-discovery component to 
implement kylin service auto discovery and failover and implement the automatic 
election and monitoring of the job node leader through zk. So, I want to 
discuss with you guys which realization is better.:D KYLIN-3820

> Kylin service nodes discovery with Zookeeper 
> -
>
> Key: KYLIN-3810
> URL: https://issues.apache.org/jira/browse/KYLIN-3810
> Project: Kylin
>  Issue Type: Improvement
>  Components: Metadata, Others
>Affects Versions: v2.3.1
>Reporter: zhangwei
>Assignee: zhangwei
>Priority: Minor
> Attachments: zookeeperCacheSync.patch, 同步阻塞.png, 引入zookeeper.png
>
>
> 由于Kylin的元数据同步使用了Boardcast类中的线程池,所以一旦负载均衡查询节点中一台机器岩机,Kylin的同步请求线程会堵塞在岩机的查询节点,最后将线程资源全部占满,如图同步到1,2节点的请求堵塞在线程队列中导致查询节点获取不到最新元数据从而无法查询到实时构建好的数据
> 我们的方案:
> 引进了服务发现组建zookeeper



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3809) Support Zookeeper based rest server discovery

2019-02-25 Thread Chao Long (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16777514#comment-16777514
 ] 

Chao Long commented on KYLIN-3809:
--

Recently, I added a curator-based scheduler in kylin and I found some work 
duplicate with yours. In my work, I use the curator-x-discovery component to 
implement kylin service auto discovery and failover and implement the automatic 
election and monitoring of the job node leader through zk. So, I want to 
discuss with you guys which realization is better.:D 
[KYLIN-3820|https://issues.apache.org/jira/browse/KYLIN-3820]

> Support Zookeeper based rest server discovery
> -
>
> Key: KYLIN-3809
> URL: https://issues.apache.org/jira/browse/KYLIN-3809
> Project: Kylin
>  Issue Type: New Feature
>Reporter: Liu Shaohui
>Assignee: Liu Shaohui
>Priority: Minor
>
> Currently to broadcast config or meta changes, all kylin servers must be set 
> in kylin.properties. It's not convenient when adding or removing kylin server 
> especially in k8s env.
>  
> So we can register the endpoint to zk and make the rest server discovery  
> automatically.
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3830) return wrong result when 'SELECT SUM(dim1)' without set a relative metric of dim1.

2019-02-25 Thread Yuzhang QIU (JIRA)
Yuzhang QIU created KYLIN-3830:
--

 Summary: return wrong result when 'SELECT SUM(dim1)' without set a 
relative metric of dim1.
 Key: KYLIN-3830
 URL: https://issues.apache.org/jira/browse/KYLIN-3830
 Project: Kylin
  Issue Type: Bug
Affects Versions: v2.5.2
Reporter: Yuzhang QIU


Hi, dear team:
  I design an cube1 based on table table1 with dim1, dim2, dim3 and only 
one metric count(1), and 'SELECT SUM(dim1) FROM table1 group by dim2', Kylin 
process this SQL and return some result1. It seems ok. But as we know, Kylin 
don't store the detail data, the dimensions' members have been encoded and 
stored in Hbase as rowkey(cause I don't set any metric with an column). So, is 
the result1 right?
  Then, I clone cube1 to cube2, and set a metric SUM(dim1). the same SQL has 
been passed to kylin and got result2. It's different from result1 at the 
aggregation field. I also pass same SQL to hive and got result3, it's same with 
result2.
  Yes, I turn off the pushdown.
  I think there are some problems.
  I can't upload some picture of results for secret policy, sorry for that.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3811) Support the standalone HBase cluster using HDFS HA

2019-02-25 Thread zhangwei (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16777689#comment-16777689
 ] 

zhangwei commented on KYLIN-3811:
-

because LoadIncrementalHfile is hbase class and we don't want to change class 
of hbase so we hack this class in kylin side

this is main change left is kylinloadincrementalHFile right is 
LoadIncrementalHfile

 !screenshot-1.png! 

> Support the standalone HBase cluster using HDFS HA
> --
>
> Key: KYLIN-3811
> URL: https://issues.apache.org/jira/browse/KYLIN-3811
> Project: Kylin
>  Issue Type: Improvement
>  Components: Storage - HBase
>Affects Versions: v2.3.1
>Reporter: zhangwei
>Assignee: zhangwei
>Priority: Minor
> Attachments: hdfs_ha.patch, hdfs_ha_2.patch, screenshot-1.png
>
>
> 独立hbase 集群的 hdfs namenode主备切换导致Kylin无法正常工作的问题
> 痛点:
> ylin.hbase.cluster.fs=hdfs://namenode:54310,在hbase的hdfs namenode切换之后会,和hbase 
> hdfs相关的步骤会出现失败
> 我们的方案:(所有的修改都基于kylin的代码,不需要hbase 做修改) 
> kylin.properties 
> kylin.hbase.cluster.fs=hdfs://nshbase
> kylin.storage.hbase.cluster-hdfs-config-file=hbase-hdfs-site.xml
> hbase-hdfs-site.xml配置 hbase hdfs的HA连接方式,其中dfs.nameservices= nshbase



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (KYLIN-3811) Support the standalone HBase cluster using HDFS HA

2019-02-25 Thread zhangwei (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

zhangwei updated KYLIN-3811:

Attachment: screenshot-1.png

> Support the standalone HBase cluster using HDFS HA
> --
>
> Key: KYLIN-3811
> URL: https://issues.apache.org/jira/browse/KYLIN-3811
> Project: Kylin
>  Issue Type: Improvement
>  Components: Storage - HBase
>Affects Versions: v2.3.1
>Reporter: zhangwei
>Assignee: zhangwei
>Priority: Minor
> Attachments: hdfs_ha.patch, hdfs_ha_2.patch, screenshot-1.png
>
>
> 独立hbase 集群的 hdfs namenode主备切换导致Kylin无法正常工作的问题
> 痛点:
> ylin.hbase.cluster.fs=hdfs://namenode:54310,在hbase的hdfs namenode切换之后会,和hbase 
> hdfs相关的步骤会出现失败
> 我们的方案:(所有的修改都基于kylin的代码,不需要hbase 做修改) 
> kylin.properties 
> kylin.hbase.cluster.fs=hdfs://nshbase
> kylin.storage.hbase.cluster-hdfs-config-file=hbase-hdfs-site.xml
> hbase-hdfs-site.xml配置 hbase hdfs的HA连接方式,其中dfs.nameservices= nshbase



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3831) 唯独超过62生成cuboid 错误

2019-02-25 Thread zhangwei (JIRA)
zhangwei created KYLIN-3831:
---

 Summary: 唯独超过62生成cuboid 错误
 Key: KYLIN-3831
 URL: https://issues.apache.org/jira/browse/KYLIN-3831
 Project: Kylin
  Issue Type: Wish
  Components: Others
Affects Versions: v2.3.1
Reporter: zhangwei






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (KYLIN-3831) 唯独超过62生成cuboid 错误

2019-02-25 Thread zhangwei (JIRA)


[ 
https://issues.apache.org/jira/browse/KYLIN-3831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16777694#comment-16777694
 ] 

zhangwei commented on KYLIN-3831:
-

kylin.cube.rowkey.max-size 这个参数其实不能设置超过默认的63 
因为生成cuboid的算法对64位的long做位移,一旦维度过多就会算出cuboid = -1 导致无法生成cube

测试代码
public class TestBitIndex { public static void main(String[] args) { long 
fullMask = 0L; for (int i = 62; i >=0 ; i--) { fullMask |= 1L << i; } 
System.out.println(fullMask); } }

9223372036854775807

public class TestBitIndex { public static void main(String[] args) { long 
fullMask = 0L; for (int i = 65; i >=0 ; i--) { fullMask |= 1L << i; } 
System.out.println(fullMask); } }

-1

DefaultCuboidScheduler
@Override
public List getSpanningCuboid(long cuboid) {
if (cuboid > max || cuboid < 0) {
throw new IllegalArgumentException("Cuboid " + cuboid + " is out of 
scope 0-" + max);
}

List spanning = parent2child.get(cuboid);
if (spanning == null) {
return Collections.EMPTY_LIST;
}
return spanning;
}

> 唯独超过62生成cuboid 错误
> -
>
> Key: KYLIN-3831
> URL: https://issues.apache.org/jira/browse/KYLIN-3831
> Project: Kylin
>  Issue Type: Wish
>  Components: Others
>Affects Versions: v2.3.1
>Reporter: zhangwei
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (KYLIN-3831) 唯独超过62生成cuboid 错误

2019-02-25 Thread zhangwei (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

zhangwei reassigned KYLIN-3831:
---

Assignee: zhangwei

> 唯独超过62生成cuboid 错误
> -
>
> Key: KYLIN-3831
> URL: https://issues.apache.org/jira/browse/KYLIN-3831
> Project: Kylin
>  Issue Type: Wish
>  Components: Others
>Affects Versions: v2.3.1
>Reporter: zhangwei
>Assignee: zhangwei
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)