Hyunsik Choi created TAJO-574:
---------------------------------
Summary: Add a sort-based physical executor for column partition store
Key: TAJO-574
URL: https://issues.apache.org/jira/browse/TAJO-574
Project: Tajo
Issue Type: New Feature
Components: physical operator
Reporter: Hyunsik Choi
Assignee: Hyunsik Choi
Fix For: 0.8-incubating
ColumnPartitionStoreExec keeps numerous files open while it is storing all
data. In addition, its random writes put a burden on the HDFS NameNode.
To solve this problem, I would like to propose a sort-based physical executor
for column partition store. It assumes that input tuples are sorted in
ascending or descending order of the partition keys, which means that it needs
an extra sort operation. However, it keeps only one file open at a time and
writes all data sequentially. In many cases, it would be the best choice for
column partition store.
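
The idea can be illustrated with a minimal sketch. This is not Tajo code: it is
written in Python for brevity, and the names store_sorted_partitions, demo, the
"key=<value>" directory layout, and the "part-0" file name are all hypothetical.
It only shows that, once the input is sorted by partition key, each partition
can be written sequentially with a single open file at any moment.

```python
import itertools
import os
import tempfile


def store_sorted_partitions(rows, out_dir):
    """Write (partition_key, line) pairs to one file per partition key.

    Assumes `rows` is already sorted (ascending or descending) by
    partition_key, so all rows of a partition are adjacent and only one
    output file has to be open at a time.
    """
    paths = {}
    for key, group in itertools.groupby(rows, key=lambda r: r[0]):
        if key in paths:
            # A key that reappears means the sortedness assumption is violated.
            raise ValueError(f"input is not sorted: key {key!r} seen again")
        part_dir = os.path.join(out_dir, f"key={key}")  # hypothetical layout
        os.makedirs(part_dir)
        path = os.path.join(part_dir, "part-0")  # hypothetical file name
        # The previous partition's file is closed before this one is opened.
        with open(path, "w") as f:
            for _, line in group:
                f.write(line + "\n")
        paths[key] = path
    return paths


def demo():
    rows = [("2013", "a"), ("2014", "b"), ("2013", "c"), ("2014", "d")]
    rows.sort(key=lambda r: r[0])  # the extra sort operation
    with tempfile.TemporaryDirectory() as out_dir:
        paths = store_sorted_partitions(rows, out_dir)
        contents = {}
        for key, path in paths.items():
            with open(path) as f:
                contents[key] = f.read()
        return contents
```

The trade-off is the one described above: the sort costs extra work up front,
but the writer no longer holds one open file per partition.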
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)