Re: (java) Producing an in-memory Arrow buffer from a file

2020-01-23 Thread Micah Kornfield
Hi Andrew, It might help to provide a little more detail on where you are starting from and what you want to do once you have the data in Arrow format. If you already have the data available in some sort of off-heap data structure, you can potentially avoid copies by wrapping it with the existing ArrowBuf

Re: Pyarrow build/install from source in ubuntu not working

2020-01-23 Thread Sutou Kouhei
Hi, Changing RUN python3 -c 'import pyarrow' to RUN LD_LIBRARY_PATH=/usr/local/lib python3 -c 'import pyarrow' works in my environment. Another solution: add ENV LD_LIBRARY_PATH=/usr/local/lib before RUN python3 -c 'import pyarrow'. Dockerfile: ... RUN bash install_arrow.sh

Pyarrow build/install from source in ubuntu not working

2020-01-23 Thread Anna Waldron
Hi, I am trying to build and install pyarrow from source in an Ubuntu 18.04 Docker image and am getting the following error when attempting to import the module: Traceback (most recent call last): > File "", line 1, in > File >

Re: (java) Producing an in-memory Arrow buffer from a file

2020-01-23 Thread Sebastien Binet
Hi Andrew, slightly related but probably also slightly off-topic: (for inspiration) you may want to look at how this is done in groot/rarrow, where tools are exported to:
- expose a ROOT "schema" as an Arrow Schema
- expose a ROOT Tree as an Arrow Table
groot/rarrow isn't working on zero-copy of

(java) Producing an in-memory Arrow buffer from a file

2020-01-23 Thread Andrew Melo
Hello all, I work in particle physics, which has standardized on the ROOT (http://root.cern) file format to store/process our data. The format itself is quite complicated, but the relevant part here is that after parsing/decompression, we end up with value and offset buffers holding our data.
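
For readers unfamiliar with the value/offset layout mentioned above, here is a minimal sketch in plain Python. It assumes the Arrow-style convention in which the offsets buffer has one more entry than the number of rows and row i spans values[offsets[i]:offsets[i+1]]; the message does not say whether ROOT's buffers follow exactly this convention. The function name and sample data are hypothetical.

```python
from array import array


def decode_lists(values, offsets):
    """Split a flat values buffer into per-row lists.

    Assumption: offsets holds n + 1 monotonically non-decreasing entries
    for n rows, and row i is values[offsets[i]:offsets[i + 1]].
    """
    return [
        list(values[offsets[i]:offsets[i + 1]])
        for i in range(len(offsets) - 1)
    ]


# Hypothetical sample: three rows of lengths 2, 0 and 3.
values = array("d", [1.0, 2.0, 3.0, 4.0, 5.0])
offsets = array("i", [0, 2, 2, 5])
rows = decode_lists(values, offsets)
```

Because the two buffers already describe the data completely, a consumer only needs to reference them rather than rebuild row objects, which is why wrapping existing buffers without copying is attractive.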