[jira] [Updated] (KYLIN-4015) Kylin build cube error at the "Build UHC Dictionary" step

2019-05-28 Thread Shaofeng SHI (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shaofeng SHI updated KYLIN-4015:

Fix Version/s: v2.6.3

> Kylin build cube error at the "Build UHC Dictionary" step
> -
>
> Key: KYLIN-4015
> URL: https://issues.apache.org/jira/browse/KYLIN-4015
> Project: Kylin
> Issue Type: Bug
> Components: Job Engine
> Affects Versions: v2.5.2
> Environment: Fusion Insight
> Reporter: zhao jintao
> Assignee: zhao jintao
> Priority: Major
> Labels: easyfix
> Fix For: v2.6.3
>
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> Hi all,
> By default, Kylin builds dimension dictionaries in the Kylin job client. If a
> cube has ultra-high-cardinality (UHC) dimensions, this costs much more CPU and
> memory. Kylin can instead build the UHC dictionaries with the MR engine, which
> reduces the resource consumption of the build engine.
> However, the "Build UHC Dictionary" step, which runs on the MR engine, fails.
> This is the error reported by YARN:
> org.apache.hadoop.mapred.YarnChild: Exception running child :
> java.io.IOException:
> hdfs://hacluster/xxx.../xxx/fact_distinct_columns/xxx/FIELD_NAME.dic-r-1
> not a SequenceFile.
>  at org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:)
>  at org.apache.hadoop.io.SequenceFile$Reader.initialize(SequenceFile.java:)
>  at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:)
> The cause is that the "Extract Fact Table Distinct" step outputs two types of
> files, ".dci" and ".rldict". The ".dci" files are not sequence files, so the
> "Build UHC Dictionary" step should filter them out when it runs on the MR
> engine.
> I have resolved this problem and will submit my code.
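
The filtering idea in the description can be sketched as follows. This is only an illustration of the rule "skip `.dci`, keep `.rldict`", not the patch attached to the issue: the class name `UhcInputFilter`, its methods, and the sample file names are all hypothetical, and the sketch assumes that reducer output names may carry a part suffix such as `-r-00000`, so it matches the marker anywhere in the name. In a real MR job the same predicate would more likely live in a Hadoop `PathFilter`.

```java
import java.util.List;
import java.util.stream.Collectors;

public class UhcInputFilter {
    // Marker taken from the issue text; the constant name is hypothetical.
    static final String COL_INFO_MARKER = ".dci";

    /**
     * Returns true for files the dictionary step may read as SequenceFiles.
     * Assumption: output names can carry a part suffix (e.g. "-r-00000"),
     * so the marker is matched anywhere in the name, not only at the end.
     */
    static boolean accept(String fileName) {
        return !fileName.contains(COL_INFO_MARKER);
    }

    /** Keeps only the accepted file names, preserving their order. */
    static List<String> keepDictionaryFiles(List<String> fileNames) {
        return fileNames.stream()
                .filter(UhcInputFilter::accept)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Hypothetical listing of a fact_distinct_columns output directory.
        List<String> listing = List.of(
                "FIELD_NAME.rldict-r-00000",
                "FIELD_NAME.dci-r-00000");
        System.out.println(keepDictionaryFiles(listing));
    }
}
```

Filtering by name keeps the check cheap, because no file has to be opened before the job decides to skip it.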



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (KYLIN-4015) Kylin build cube error at the "Build UHC Dictionary" step

2019-05-26 Thread zhao jintao (JIRA)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

zhao jintao updated KYLIN-4015:
---
Component/s: Job Engine (was: Metadata)



