[jira] [Commented] (ARROW-2598) [Python] table.to_pandas segfault

Rok Mihevc (Jira) Tue, 10 Jan 2023 23:29:09 -0800


    [ 
https://issues.apache.org/jira/browse/ARROW-2598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17659624#comment-17659624
 ]


Rok Mihevc commented on ARROW-2598:
-----------------------------------

This issue has been migrated to [issue 
#18993|https://github.com/apache/arrow/issues/18993] on GitHub. Please see the 
[migration documentation|https://github.com/apache/arrow/issues/14542] for 
further details.

> [Python]  table.to_pandas segfault
> ----------------------------------
>
>                 Key: ARROW-2598
>                 URL: https://issues.apache.org/jira/browse/ARROW-2598
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>            Reporter: jacques
>            Priority: Major
>              Labels: parquet
>             Fix For: 0.13.0
>
>
> Here is a small snippet which produces a segfault:
> {noformat}
> In [1]: import pyarrow as pa
> In [2]: import pyarrow.parquet as pq
> In [3]: pa_ar = pa.array([[], []])
> In [4]: pq.write_table(
>    ...:     table=pa.Table.from_arrays([pa_ar],["test"]),
>    ...:     where="test5.parquet",
>    ...:     compression="snappy",
>    ...:     flavor="spark"
>    ...: )
> In [5]: pq.read_table("test5.parquet")
> Out[5]: 
> pyarrow.Table
> test: list<item: null>
>   child 0, item: null
> In [6]: pq.read_table("test5.parquet").to_pydict()
> Out[6]: OrderedDict([(u'test', [None, None])])
> In [7]: pq.read_table("test5.parquet").to_pandas()
> Segmentation fault
> {noformat}
> I thank you in advance for having this fixed.
> Best, 
> Jacques



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (ARROW-2598) [Python] table.to_pandas segfault

Reply via email to