[ https://issues.apache.org/jira/browse/KYLIN-2135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
kangkaisen updated KYLIN-2135: ------------------------------ Attachment: KYLIN-2135.patch Update the patch. This patch is based on master branch. 1 Enlarge the reducer number for the UHC.(ShardByColumns and GlobalDictionaryColumns) 2 update the close() method and next() method in {{DFSFileTableReader}}. 3 move the max reducer count verdict to {{FactDistinctColumnsJob}} Shaofeng, please you review the patch again. Thanks you. > Enlarge FactDistinctColumns reducer number > ------------------------------------------ > > Key: KYLIN-2135 > URL: https://issues.apache.org/jira/browse/KYLIN-2135 > Project: Kylin > Issue Type: Improvement > Components: Job Engine > Affects Versions: v1.5.4.1 > Reporter: kangkaisen > Assignee: kangkaisen > Attachments: KYLIN-2135.patch, KYLIN-2135.patch, new.png, old.png > > > When the hive table has billions of rows and use global dictionary for > precise count distinct measures, the {{Extract Fact Table Distinct Columns}} > job will run o long time. > So we could use more reducer to deal with the one column. -- This message was sent by Atlassian JIRA (v6.3.4#6332)