Re: Interpreting ORC Java Reference

2020-07-22 Thread Owen O'Malley
On Mon, Jul 20, 2020 at 7:01 PM Ryan Schachte wrote:
> Hi team,
> apologies for the last email, believe I sent too early. I'm interested in
> better understanding the ORC reference guide in the docs and wanted to
> clarify some things to see if I'm understanding correctly.
>
> I realize for the *

Re: Interpreting ORC Java Reference

2020-07-21 Thread Ryan Schachte
Hi Owen,
Thanks for the quick reply. I accidentally sent that prior question too soon and resent a follow-up with my actual question in a new thread.
Thank you!
Ryan

On Mon, Jul 20, 2020 at 6:57 PM Owen O'Malley wrote:
> A good place to start would be:
>
>- The core java documentation -
>

Interpreting ORC Java Reference

2020-07-20 Thread Ryan Schachte
Hi team, apologies for the last email, believe I sent too early. I'm interested in better understanding the ORC reference guide in the docs and wanted to clarify some things to see if I'm understanding correctly. I realize for the *VectorizedRowBatch* approach, we write in chunks of 1024 rows and

Re: Interpreting ORC Java Reference

2020-07-20 Thread Owen O'Malley
A good place to start would be:
- The core java documentation - https://orc.apache.org/docs/core-java.html
- The orc to json tool - https://github.com/apache/orc/blob/master/java/tools/src/java/org/apache/orc/tools/PrintData.java
Feel free to ask questions here on the dev list too.

Interpreting ORC Java Reference

2020-07-20 Thread Ryan Schachte
Hi team, I'm new to ORC and am interested in getting confirmation on how I interpret the structure of the vectorized row/column vectors.