[ https://issues.apache.org/jira/browse/ARROW-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Alessandro Molina updated ARROW-5287: ------------------------------------- Fix Version/s: 8.0.0 (was: 7.0.0) > [Python] automatic type inference for arrays of tuples > ------------------------------------------------------ > > Key: ARROW-5287 > URL: https://issues.apache.org/jira/browse/ARROW-5287 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Reporter: Joris Van den Bossche > Priority: Major > Fix For: 8.0.0 > > > Arrays of tuples are support to be converted to either ListArray or > StructArray, if you specify the type explicitly: > {code} > In [6]: pa.array([(1, 2), (3, 4, 5)], type=pa.list_(pa.int64())) > Out[6]: > <pyarrow.lib.ListArray object at 0x7f1b01a4d408> > [ > [ > 1, > 2 > ], > [ > 3, > 4, > 5 > ] > ] > In [7]: pa.array([(1, 2), (3, 4)], type=pa.struct([('a', pa.int64()), ('b', > pa.int64())])) > Out[7]: > <pyarrow.lib.StructArray object at 0x7f1b01a51b88> > -- is_valid: all not null > -- child 0 type: int64 > [ > 1, > 3 > ] > -- child 1 type: int64 > [ > 2, > 4 > ] > {code} > But not when no type is specified: > {code} > In [8]: pa.array([(1, 2), (3, 4)]) > > > --------------------------------------------------------------------------- > ArrowInvalid Traceback (most recent call last) > <ipython-input-8-ab2d80c7486d> in <module> > ----> 1 pa.array([(1, 2), (3, 4)]) > ~/scipy/repos/arrow/python/pyarrow/array.pxi in pyarrow.lib.array() > ~/scipy/repos/arrow/python/pyarrow/array.pxi in > pyarrow.lib._sequence_to_array() > ~/scipy/repos/arrow/python/pyarrow/error.pxi in pyarrow.lib.check_status() > ArrowInvalid: Could not convert (1, 2) with type tuple: did not recognize > Python value type when inferring an Arrow data type > {code} > Do we want to do automatic type inference for tuples as well? (defaulting to > the ListArray case, just as arrays of python lists are supported) > Or was there a specific reason to not support this by default? -- This message was sent by Atlassian Jira (v8.20.1#820001)