Joris Van den Bossche created ARROW-16652:
---------------------------------------------

             Summary: [Python][C++] Cast compute kernel segfaults when called 
with a Table
                 Key: ARROW-16652
                 URL: https://issues.apache.org/jira/browse/ARROW-16652
             Project: Apache Arrow
          Issue Type: Bug
          Components: C++, Python
            Reporter: Joris Van den Bossche


Passing a Table to {{{pyarrow.compute.cast}} with a scalar type gives a 
segfault:

{code}
In [1]: table = pa.table({'a': [1, 2]})

In [2]: import pyarrow.compute as pc

In [3]: pc.cast(table, pa.int64())
Segmentation fault (core dumped)
{code}

Backtrace with gdb gives:

{code}
Thread 1 "python" received signal SIGSEGV, Segmentation fault.
0x00007fba01685ada in arrow::DataType::id (this=0x0) at ../src/arrow/type.h:172
172       Type::type id() const { return id_; }
(gdb) bt
#0  0x00007fba01685ada in arrow::DataType::id (this=0x0) at 
../src/arrow/type.h:172
#1  0x00007fba019e150e in arrow::TypeEquals (left=..., right=..., 
check_metadata=false) at ../src/arrow/compare.cc:1304
#2  0x00007fba01b3484a in arrow::DataType::Equals (this=0x0, other=..., 
check_metadata=false) at ../src/arrow/type.cc:374
#3  0x00007fba01f31678 in arrow::compute::internal::(anonymous 
namespace)::CastMetaFunction::ExecuteImpl (this=0x55b6ebe63860, args=..., 
options=0x55b6ec377080, ctx=0x7ffcd8cd43a0)
    at ../src/arrow/compute/cast.cc:116
#4  0x00007fba020d9f39 in arrow::compute::MetaFunction::Execute 
(this=0x55b6ebe63860, args=..., options=0x55b6ec377080, ctx=0x7ffcd8cd43a0) at 
../src/arrow/compute/function.cc:388
#5  0x00007fb9ba95c8d9 in __pyx_pf_7pyarrow_8_compute_8Function_6call 
(__pyx_v_self=0x7fb9b7c19af0, __pyx_v_args=[<pyarrow.lib.Table at remote 
0x7fb9b7c19c70>], __pyx_v_options=0x7fb9b7c1c310, 
    __pyx_v_memory_pool=0x55b6ea466d60 <_Py_NoneStruct>) at 
/home/joris/scipy/repos/arrow/python/build/temp.linux-x86_64-3.8/_compute.cpp:11292
#6  0x00007fb9ba95c3d5 in __pyx_pw_7pyarrow_8_compute_8Function_7call 
(__pyx_v_self=<pyarrow._compute.MetaFunction at remote 0x7fb9b7c19af0>, 
__pyx_args=([<pyarrow.lib.Table at remote 0x7fb9b7c19c70>],), 
    __pyx_kwds={'options': <CastOptions at remote 0x7fb9b7c1c310>, 
'memory_pool': None}) at 
/home/joris/scipy/repos/arrow/python/build/temp.linux-x86_64-3.8/_compute.cpp:11165
#7  0x000055b6ea1fb814 in cfunction_call_varargs (kwargs=<optimized out>, 
args=<optimized out>, func=<built-in method call of 
pyarrow._compute.MetaFunction object at remote 0x7fb9b7c19af0>)
    at 
/home/conda/feedstock_root/build_artifacts/python-split_1606502903469/work/Objects/call.c:772
#8  PyCFunction_Call (func=<built-in method call of 
pyarrow._compute.MetaFunction object at remote 0x7fb9b7c19af0>, args=<optimized 
out>, kwargs=<optimized out>)
    at 
/home/conda/feedstock_root/build_artifacts/python-split_1606502903469/work/Objects/call.c:772
#9  0x00007fb9ba9e84e2 in __Pyx_PyObject_Call (func=<built-in method call of 
pyarrow._compute.MetaFunction object at remote 0x7fb9b7c19af0>, 
arg=([<pyarrow.lib.Table at remote 0x7fb9b7c19c70>],), 
    kw={'options': <CastOptions at remote 0x7fb9b7c1c310>, 'memory_pool': 
None}) at 
/home/joris/scipy/repos/arrow/python/build/temp.linux-x86_64-3.8/_compute.cpp:57961
#10 0x00007fb9ba961add in __pyx_pf_7pyarrow_8_compute_6call_function 
(__pyx_self=0x0, __pyx_v_name='cast', __pyx_v_args=[<pyarrow.lib.Table at 
remote 0x7fb9b7c19c70>], 
    __pyx_v_options=<CastOptions at remote 0x7fb9b7c1c310>, 
__pyx_v_memory_pool=None) at 
/home/joris/scipy/repos/arrow/python/build/temp.linux-x86_64-3.8/_compute.cpp:13408
#11 0x00007fb9ba961676 in __pyx_pw_7pyarrow_8_compute_7call_function 
(__pyx_self=0x0, __pyx_args=('cast', [<pyarrow.lib.Table at remote 
0x7fb9b7c19c70>], <CastOptions at remote 0x7fb9b7c1c310>), __pyx_kwds=0x0)
...

{code}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to