Juan David Agudelo created ARROW-7996:
-----------------------------------------

             Summary: Error serializing empty pandas DataFrame with pyarrow
                 Key: ARROW-7996
                 URL: https://issues.apache.org/jira/browse/ARROW-7996
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: Juan David Agudelo


The following code does not work:

 
{code:python}
import pandas
import pyarrow
df = pandas.DataFrame({"timestamp": [], "value_123": [], "context_123": []})
data = [df]
context = pyarrow.default_serialization_context()  
serialized_data = context.serialize(data)  
file_path = "file.txt"
with open(file_path, "wb") as f:  
    serialized_data.write_to(f)
with open(file_path, "rb") as f:  
    context = pyarrow.default_serialization_context()  
    decoded_data = context.deserialize(f.read())
{code}
Throws the following error:
{code:java}
ArrowInvalid: strides must not involve buffer over run{code}
I am using Python 3.6.9 in Ubuntu 18.04 and the version of pyarrow is 0.16.0.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to