[
https://issues.apache.org/jira/browse/HIVE-5102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13763726#comment-13763726
]
Owen O'Malley commented on HIVE-5102:
-------------------------------------
I've uploaded a new patch to phabricator, but it got a new revision number.
Sorry about that.
> ORC getSplits should create splits based the stripes
> -----------------------------------------------------
>
> Key: HIVE-5102
> URL: https://issues.apache.org/jira/browse/HIVE-5102
> Project: Hive
> Issue Type: Bug
> Components: File Formats
> Reporter: Owen O'Malley
> Assignee: Owen O'Malley
> Attachments: HIVE-5102.D12579.1.patch, HIVE-5102.D12579.2.patch,
> HIVE-5102.D12849.1.patch
>
>
> Currently ORC inherits getSplits from FileFormat, which basically makes a
> split per an HDFS block. This can create too little parallelism and would be
> better done by having getSplits look at the file footer and create splits
> based on the stripes.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira