Mitar created ARROW-2273:
----------------------------
Summary: Cannot deserialize pandas SparseDataFrame
Key: ARROW-2273
URL: https://issues.apache.org/jira/browse/ARROW-2273
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 0.9.0
Reporter: Mitar
>>> import pyarrow
>>> import pandas
>>> a = pandas.SparseDataFrame({'A': [1, 2, 3], 'B': [4, 5, 6], 'C': [7, 8, 9]})
>>> pyarrow.deserialize(pyarrow.serialize(a).to_buffer())
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "serialization.pxi", line 441, in pyarrow.lib.deserialize
  File "serialization.pxi", line 404, in pyarrow.lib.deserialize_from
  File "serialization.pxi", line 257, in pyarrow.lib.SerializedPyObject.deserialize
  File "serialization.pxi", line 174, in pyarrow.lib.SerializationContext._deserialize_callback
  File ".../.virtualenv/arrow/lib/python3.6/site-packages/pyarrow/serialization.py", line 77, in _deserialize_pandas_dataframe
    return pdcompat.serialized_dict_to_dataframe(data)
  File ".../.virtualenv/arrow/lib/python3.6/site-packages/pyarrow/pandas_compat.py", line 450, in serialized_dict_to_dataframe
    for block in data['blocks']]
  File ".../.virtualenv/arrow/lib/python3.6/site-packages/pyarrow/pandas_compat.py", line 450, in <listcomp>
    for block in data['blocks']]
  File ".../.virtualenv/arrow/lib/python3.6/site-packages/pyarrow/pandas_compat.py", line 478, in _reconstruct_block
    block = _int.make_block(block_arr, placement=placement)
  File ".../.virtualenv/arrow/lib/python3.6/site-packages/pandas/core/internals.py", line 2957, in make_block
    return klass(values, ndim=ndim, fastpath=fastpath, placement=placement)
  File ".../.virtualenv/arrow/lib/python3.6/site-packages/pandas/core/internals.py", line 120, in __init__
    len(self.mgr_locs)))
ValueError: Wrong number of items passed 3, placement implies 1