[
https://issues.apache.org/jira/browse/ORC-162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16146321#comment-16146321
]
ASF GitHub Bot commented on ORC-162:
------------------------------------
Github user omalley commented on the issue:
https://github.com/apache/orc/pull/163
@prasanthj There are customers out there with millions of zero byte ORC
files in their Hive warehouses. We need to have the reader not throw when they
read them with Spark, etc. Rather than patch each context where Readers may be
created, I'd rather fix the core Reader.
> Handle 0 byte files as empty ORC files
> --------------------------------------
>
> Key: ORC-162
> URL: https://issues.apache.org/jira/browse/ORC-162
> Project: ORC
> Issue Type: Bug
> Reporter: Owen O'Malley
> Assignee: Owen O'Malley
>
> Hive often creates empty files for empty buckets, which can introduce
> significant load on the HDFS cluster. Therefore, they made the Hive
> OrcOutputFormat and OrcInputFormat use 0 byte ORC files as a special case.
> We need to make the other readers treat them reasonably.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)