[ 
https://issues.apache.org/jira/browse/ARROW-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16174086#comment-16174086
 ] 

Tom Augspurger edited comment on ARROW-1585 at 9/21/17 1:11 AM:
----------------------------------------------------------------

Sorry, yes, I meant for the original data to be {{ pd.DataFrame({0: [1, 2]}) }} 
(an int, not a string).

Agreed that restricting field names to strings is best. Being able to 
reconstruct the original from the metadata is sufficient.


was (Author: tomaugspurger):
Sorry, yes, I meant for the original data to be {{ pd.DataFrame({0: [1, 
2]}))).columns }} (an int, not a string).

Agreed that restricting field names to strings is best. Being able to 
reconstruct the original from the metadata is sufficient.

> serialize_pandas round trip fails on integer columns
> ----------------------------------------------------
>
>                 Key: ARROW-1585
>                 URL: https://issues.apache.org/jira/browse/ARROW-1585
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 0.7.0
>            Reporter: Tom Augspurger
>            Priority: Minor
>             Fix For: 0.8.0
>
>
> This roundtrip fails, since the Integer column isn't converted to a string 
> after deserializing
> {code:python}
> In [1]: import pandas as pd
> im
> In [2]: import pyarrow as pa
> In [3]: pa.deserialize_pandas(pa.serialize_pandas(pd.DataFrame({"0": [1, 
> 2]}))).columns
> Out[3]: Index(['0'], dtype='object')
> {code}
> That should be an {{ Int64Index([0]) }} for the columns.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to