[ 
https://issues.apache.org/jira/browse/SPARK-13728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15184742#comment-15184742
 ] 

Hyukjin Kwon edited comment on SPARK-13728 at 3/8/16 10:17 AM:
---------------------------------------------------------------

I see. I found some clues. It looks like the PR 
https://github.com/apache/spark/pull/11509 makes the output a single ORC file.

The output used to be 10 files, but with this PR only a single file is written, 
so no stripes in ORC could be skipped.

Whether it is right to write a single file or multiple files, I think I 
should explicitly repartition the data. Can I submit a PR for this?
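
The reasoning above (one output file leaves nothing to skip, ten files do) can be illustrated with a minimal sketch. This is a simplified model of min/max statistics-based skipping, not ORC's actual reader and not the code in OrcQuerySuite. The helper names, the 100-row dataset, and the predicate `value < 5` are all assumptions made for illustration.

```python
from typing import List, Tuple


def split_into_chunks(values: List[int], num_chunks: int) -> List[List[int]]:
    """Split values into contiguous chunks (hypothetical stand-ins for files/stripes)."""
    size = -(-len(values) // num_chunks)  # ceiling division
    return [values[i:i + size] for i in range(0, len(values), size)]


def chunk_stats(chunks: List[List[int]]) -> List[Tuple[int, int]]:
    """Record (min, max) per chunk, as a statistics-carrying format would."""
    return [(min(c), max(c)) for c in chunks]


def chunks_to_read(stats: List[Tuple[int, int]], upper_bound: int) -> List[int]:
    """Indices of chunks that may contain a row with value < upper_bound."""
    return [i for i, (lo, _hi) in enumerate(stats) if lo < upper_bound]


values = list(range(100))  # assumed test data: 0..99

single = chunk_stats(split_into_chunks(values, 1))   # one chunk covering 0..99
ten = chunk_stats(split_into_chunks(values, 10))     # ten chunks of 10 rows each

single_read = chunks_to_read(single, 5)  # the only chunk must be read
ten_read = chunks_to_read(ten, 5)        # only the first chunk can match

print(f"single chunk: read {len(single_read)} of {len(single)}")
print(f"ten chunks:   read {len(ten_read)} of {len(ten)}")
```

In this model the single chunk spans the whole value range, so the predicate can never rule it out, whereas with ten chunks nine of them can be excluded from their statistics alone. Explicitly repartitioning the data before writing, as proposed above, is one way to keep the output split across several files regardless of how the writer behaves by default.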


> Fix ORC PPD
> -----------
>
>                 Key: SPARK-13728
>                 URL: https://issues.apache.org/jira/browse/SPARK-13728
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Michael Armbrust
>
> Fix the ignored test "Enable ORC PPD" in OrcQuerySuite.


