[
https://issues.apache.org/jira/browse/ARROW-2598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17659624#comment-17659624
]
Rok Mihevc commented on ARROW-2598:
-----------------------------------
This issue has been migrated to [issue
#18993|https://github.com/apache/arrow/issues/18993] on GitHub. Please see the
[migration documentation|https://github.com/apache/arrow/issues/14542] for
further details.
> [Python] table.to_pandas segfault
> ----------------------------------
>
> Key: ARROW-2598
> URL: https://issues.apache.org/jira/browse/ARROW-2598
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Reporter: jacques
> Priority: Major
> Labels: parquet
> Fix For: 0.13.0
>
>
> The following snippet writes a column containing only empty lists to Parquet; calling to_pandas() on the table read back from that file segfaults:
> {noformat}
> In [1]: import pyarrow as pa
> In [2]: import pyarrow.parquet as pq
> In [3]: pa_ar = pa.array([[], []])
> In [4]: pq.write_table(
> ...: table=pa.Table.from_arrays([pa_ar],["test"]),
> ...: where="test5.parquet",
> ...: compression="snappy",
> ...: flavor="spark"
> ...: )
> In [5]: pq.read_table("test5.parquet")
> Out[5]:
> pyarrow.Table
> test: list<item: null>
> child 0, item: null
> In [6]: pq.read_table("test5.parquet").to_pydict()
> Out[6]: OrderedDict([(u'test', [None, None])])
> In [7]: pq.read_table("test5.parquet").to_pandas()
> Segmentation fault
> {noformat}
> Thank you in advance for fixing this.
> Best,
> Jacques