[ https://issues.apache.org/jira/browse/KYLIN-2135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15829672#comment-15829672 ]
Shaofeng SHI commented on KYLIN-2135: ------------------------------------- I discussed this with xiefan, when multiple reducers are used for 1 column, the dict build will be happend in the job engine (not reducer). Kylin will automatically handle that, so there should be no conflict. or maybe I didn't get the question, if true please elaborate. thanks! > Enlarge FactDistinctColumns reducer number > ------------------------------------------ > > Key: KYLIN-2135 > URL: https://issues.apache.org/jira/browse/KYLIN-2135 > Project: Kylin > Issue Type: Improvement > Components: Job Engine > Affects Versions: v1.5.4.1 > Reporter: kangkaisen > Assignee: kangkaisen > Fix For: v2.0.0 > > Attachments: KYLIN-2135.patch, new.png, old.png > > > When the hive table has billions of rows and use global dictionary for > precise count distinct measures, the {{Extract Fact Table Distinct Columns}} > job will run o long time. > So we could use more reducer to deal with the one column. -- This message was sent by Atlassian JIRA (v6.3.4#6332)