[ https://issues.apache.org/jira/browse/ARROW-8004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17055776#comment-17055776 ]
Joris Van den Bossche commented on ARROW-8004: ---------------------------------------------- For a more limited use case than general objects, i.e. array-like objects, we could also think about checking for {{__array__}} and then convert to ListArray. For example, now we can infer a list of lists or a list of numpy arrays, but will error on a list of pandas Series > [Python] Define API for user-defined conversions of array cell values in > pyarrow.array > -------------------------------------------------------------------------------------- > > Key: ARROW-8004 > URL: https://issues.apache.org/jira/browse/ARROW-8004 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Reporter: Wes McKinney > Priority: Major > > Consider the statement > {code} > pyarrow.array([v0, v1, v2, v3]) > {code} > or correspondingly > {code} > pyarrow.array(pd.Series([v0, v1, v2, v3], dtype=object)) > {code} > where {{v0, ..., v4}} are instances of types with no built-in > conversion-to-Arrow support in pyarrow. An API could be provided to allow > user-defined unboxing to a data type that the library _does_ understand (like > a NumPy array). One complexity is that if the unboxing is costly, we may need > to "keep around" the unboxed value when doing multiple passes over the data > (e.g. initially for type inference and then for conversion) -- This message was sent by Atlassian Jira (v8.3.4#803005)