Hyunsik Choi created TAJO-574:
---------------------------------

             Summary: Add a sort-based physical executor for column partition 
store
                 Key: TAJO-574
                 URL: https://issues.apache.org/jira/browse/TAJO-574
             Project: Tajo
          Issue Type: New Feature
          Components: physical operator
            Reporter: Hyunsik Choi
            Assignee: Hyunsik Choi
             Fix For: 0.8-incubating


ColumnPartitionStoreExec keeps numerous open files while it is storing all 
data. In addition, it's random write gives burden to HDFS namenode.

To solve this problem, I would like to propose a sort-based physical executor 
for column partition store. It assumes that input tuples are sorted in an 
ascending or descending order of partition keys. It means that it needs extra 
sort operation. But, it opens only one file simultaneously. It writes all data 
sequentially. In many cases, it would be the best choice for column partition 
store.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to