[
https://issues.apache.org/jira/browse/SPARK-13728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15184742#comment-15184742
]
Hyukjin Kwon edited comment on SPARK-13728 at 3/8/16 10:17 AM:
---------------------------------------------------------------
I see. I found some clues. It looks https://github.com/apache/spark/pull/11509
PR makes the output only single ORC file.
It was 10 files but this PR writes only single file. So, this could not skip
stripes in ORC.
Whether it is right to write this to single file or multiple files, I think I
should explicitly repartition the data. Can I maybe submit a PR for this?
was (Author: hyukjin.kwon):
I see. I found some clues. It looks https://github.com/apache/spark/pull/11509
PR make the output only single ORC file.
It was 10 files but this PR writes only single file. So, this could not skip
stripes in ORC.
Whether it is right to write this to single file or multiple files, I think I
should explicitly repartition the data. Can I maybe submit a PR for this?
> Fix ORC PPD
> -----------
>
> Key: SPARK-13728
> URL: https://issues.apache.org/jira/browse/SPARK-13728
> Project: Spark
> Issue Type: Sub-task
> Components: SQL
> Reporter: Michael Armbrust
>
> Fix the ignored test "Enable ORC PPD" in OrcQuerySuite.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]