[ 
https://issues.apache.org/jira/browse/ARROW-7906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

HAOFENG DENG updated ARROW-7906:
--------------------------------
    Description: 
Just like parquet format, ORC have a big group fans in Bigdata area, it have 
better performance that parquet in some use case.

But there has a problem in python is which is does not have the standard write 
function.

 

Seem the ORC team itself maintain the standard C++ 
code([ORC-C++|[https://github.com/apache/orc/tree/master/c%2B%2B]])+ , ++ so I 
think will not take too much effort to integrate into Arrow(C+) and build the 
hook for python.

 

  was:
Just like parquet format, ORC have a big group fans in Bigdata area, it have 
better performance that parquet in some use case.

But there has a problem in python is which is does not have the standard write 
function.

 

Seem the ORC team itself maintain the standard C++ 
code([ORC-C++|[https://github.com/apache/orc/tree/master/c%2B%2B]]), so I think 
will not take too much effort to integrate into Arrow(C++) and build the hook 
for python.

 


> Full functionality for ORC format
> ---------------------------------
>
>                 Key: ARROW-7906
>                 URL: https://issues.apache.org/jira/browse/ARROW-7906
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++, Python
>            Reporter: HAOFENG DENG
>            Priority: Major
>
> Just like parquet format, ORC have a big group fans in Bigdata area, it have 
> better performance that parquet in some use case.
> But there has a problem in python is which is does not have the standard 
> write function.
>  
> Seem the ORC team itself maintain the standard C++ 
> code([ORC-C++|[https://github.com/apache/orc/tree/master/c%2B%2B]])+ , ++ so 
> I think will not take too much effort to integrate into Arrow(C+) and build 
> the hook for python.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to