[ https://issues.apache.org/jira/browse/ARROW-2101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16439912#comment-16439912 ]
ASF GitHub Bot commented on ARROW-2101: --------------------------------------- BryanCutler commented on issue #1886: ARROW-2101: [Python/C++] Correctly convert numpy arrays of bytes to arrow arrays of strings when user specifies arrow type of string URL: https://github.com/apache/arrow/pull/1886#issuecomment-381719872 It looks like you need to be given rights to have issues assigned, and I guess I'm not able to do that. @pitrou or @xhochy , would you mind doing this? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org > [Python] from_pandas reads 'str' type as binary Arrow data with Python 2 > ------------------------------------------------------------------------ > > Key: ARROW-2101 > URL: https://issues.apache.org/jira/browse/ARROW-2101 > Project: Apache Arrow > Issue Type: Bug > Components: Python > Affects Versions: 0.8.0 > Reporter: Bryan Cutler > Priority: Major > Labels: pull-request-available > Fix For: 0.10.0 > > > Using Python 2, converting Pandas with 'str' data to Arrow results in Arrow > data of binary type, even if the user supplies type information. conversion > of 'unicode' type works to create Arrow data of string types. For example > {code} > In [25]: pa.Array.from_pandas(pd.Series(['a'])).type > Out[25]: DataType(binary) > In [26]: pa.Array.from_pandas(pd.Series(['a']), type=pa.string()).type > Out[26]: DataType(binary) > In [27]: pa.Array.from_pandas(pd.Series([u'a'])).type > Out[27]: DataType(string) > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)