[GitHub] spark issue #19943: [SPARK-16060][SQL] Support Vectorized ORC Reader

cloud-fan Mon, 08 Jan 2018 00:22:52 -0800

Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/19943
  
    @henrify I took a look at how the ORC batch stores string/binary types. The data is 
stored in a `byte[][]`, which is not a contiguous byte array, so we can't do a 
single copy. For better performance, I think we need to use the low-level ORC 
reader API; we can consider this in the future.
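
To make the copy cost concrete, here is a minimal sketch. It assumes a layout like ORC's `BytesColumnVector`, where row `i` lives in `vector[i]` from `start[i]` for `length[i]` bytes. `BytesBatchCopy` and `flatten` are hypothetical names for illustration, not Spark or ORC API. Because each `vector[i]` can be a separate array object, packing the batch into one buffer needs one `System.arraycopy` per row.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of Spark or ORC.
final class BytesBatchCopy {

    /**
     * Packs `size` row values into one contiguous buffer.
     * `offsets` must hold at least size + 1 entries; on return,
     * offsets[i] .. offsets[i + 1] delimit row i in the result.
     */
    static byte[] flatten(byte[][] vector, int[] start, int[] length,
                          int size, int[] offsets) {
        // First pass: compute where each row lands in the packed buffer.
        int total = 0;
        for (int i = 0; i < size; i++) {
            offsets[i] = total;
            total += length[i];
        }
        offsets[size] = total;

        // Second pass: one copy per row, since the source arrays are
        // independent objects and cannot be moved with a single bulk copy.
        byte[] out = new byte[total];
        for (int i = 0; i < size; i++) {
            System.arraycopy(vector[i], start[i], out, offsets[i], length[i]);
        }
        return out;
    }

    public static void main(String[] args) {
        // Rows 0 and 2 share a backing array; row 1 has its own.
        byte[] shared = "xxhelloyy".getBytes(StandardCharsets.UTF_8);
        byte[][] vector = { shared, "orc".getBytes(StandardCharsets.UTF_8), shared };
        int[] start = { 2, 0, 7 };
        int[] length = { 5, 3, 2 };
        int[] offsets = new int[4];

        byte[] flat = flatten(vector, start, length, 3, offsets);
        System.out.println(new String(flat, StandardCharsets.UTF_8));
    }
}
```

A reader that decoded string data directly into one buffer would avoid the per-row loop, which is the motivation for looking at a lower-level API.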



