wangxiaojing created KYLIN-4342:
-----------------------------------

             Summary: Build Global Dict by MR/Hive New Version
                 Key: KYLIN-4342
                 URL: https://issues.apache.org/jira/browse/KYLIN-4342
             Project: Kylin
          Issue Type: Improvement
    Affects Versions: Future
            Reporter: wangxiaojing
            Assignee: wangxiaojing


At present, there are two limitations in the implementation of global 
dictionary through MR/Hive:
1. Limited by Hive order by global sorting on the shuffle stage, the memory and 
build time becomes uncontrollable with data volume reaching billion level. We 
have tested the base of 800 million level to configure 15g memory, and the 
build time of build dictionary needs more than 10 hours;
2. Multi global dictionary columns is calculated serially.

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to