Rob DiCiuccio created ARROW-7855:
------------------------------------
Summary: TypeError on mixed array values
Key: ARROW-7855
URL: https://issues.apache.org/jira/browse/ARROW-7855
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 0.15.1, 0.16.0
Reporter: Rob DiCiuccio
The following data structure passed to `pa.array` raises a generic `TypeError`:
{code:java}
import pyarrow as pa
pa.array([{'TestKey': [123456, 'foo']}])
{code}
{code:java}
Traceback (most recent call last):
File "pyarrow_list_test.py", line 30, in <module>
pa_array = pa.array([\{'TestKey': [123456, 'foo']}])
File "pyarrow/array.pxi", line 269, in pyarrow.lib.array
File "pyarrow/array.pxi", line 38, in pyarrow.lib._sequence_to_array
TypeError: an integer is required (got type str)
{code}
I understand there may be a way to overcome this by setting the `type` value as
an argument to `pa.array`, but the use case here is storing results of a SQL
query where the structure/type of the column is unknown.
If Arrow is ultimately unable to handle this data structure without a
predefined `type` passed to `pa.array`, can the exception at least us the
PyArrow namespace (e.g. `pa.lib.ArrowTypeError` or
`pa.lib.ArrowNotImplementedError).
Any other workaround suggestions welcome.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)