Hello Rares,

Is there a complete reproducer that we may try out?

Regards

Antoine.


Le 14/12/2020 à 06:52, Rares Vernica a écrit :
> Hello,
> 
> As part of a test, I'm reading a record batch from an Arrow file,
> re-batching the data in smaller batches, and writing back the result to the
> same file. I'm getting an unexpected Bad address error and I wonder what am
> I doing wrong?
> 
> reader = pyarrow.open_stream(fn)
> tbl = reader.read_all()
> 
> writer = pyarrow.ipc.RecordBatchStreamWriter(fn, tbl.schema)
> batches = tbl.to_batches(max_chunksize=200)
> writer.write_table(pyarrow.Table.from_batches(batches))
> writer.close()
> 
> Traceback (most recent call last):
>   File "tests/foo.py", line 10, in <module>
>     writer.write_table(pyarrow.Table.from_batches(batches))
>   File "pyarrow/ipc.pxi", line 237, in
> pyarrow.lib._CRecordBatchWriter.write_table
>   File "pyarrow/error.pxi", line 97, in pyarrow.lib.check_status
> OSError: [Errno 14] Error writing bytes to file. Detail: [errno 14] Bad
> address
> 
> Do I need to "close" the reader or open the writer differently?
> 
> I'm using PyArrow 0.16.0 and Python 3.8.2.
> 
> Thank you!
> Rares
> 

Reply via email to