Re: (java) Producing an in-memory Arrow buffer from a file

2020-01-24 Thread Andrew Melo
Hi Micah, On Fri, Jan 24, 2020 at 6:17 AM Micah Kornfield wrote: > Hi Andrew, > It might help to provide a little more detail on where you are starting > from and what you want to do once you have the data in arrow format. > Of course! Like I mentioned, particle physics data is processed in

Re: (java) Producing an in-memory Arrow buffer from a file

2020-01-23 Thread Micah Kornfield
Hi Andrew, It might help to provide a little more detail on where you are starting from and what you want to do once you have the data in arrow format. If you have the data already available in some sort of off-heap datastructure you can potentially avoid copies wrap with the existing ArrowBuf

Re: (java) Producing an in-memory Arrow buffer from a file

2020-01-23 Thread Sebastien Binet
hi Andrew, slightly related but probably also slightly off-topic: (for inspiration) you may want to look at how this is done in groot/rarrow where tools are exported to - expose a ROOT "schema" as an Arrow Schema - expose a ROOT Tree as an Arrow Table groot/rarrow isn't working on zero-copy of

(java) Producing an in-memory Arrow buffer from a file

2020-01-23 Thread Andrew Melo
Hello all, I work in particle physics, which has standardized on the ROOT ( http://root.cern) file format to store/process our data. The format itself is quite complicated, but the relevant part here is that after parsing/decompression, we end up with value and offset buffers holding our data.