[ 
https://issues.apache.org/jira/browse/ARROW-376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15989991#comment-15989991
 ] 

Wes McKinney commented on ARROW-376:
------------------------------------

This is in progress in ARROW-881 https://github.com/apache/arrow/pull/612

> Python: Convert non-range Pandas indices (optionally) to Arrow
> --------------------------------------------------------------
>
>                 Key: ARROW-376
>                 URL: https://issues.apache.org/jira/browse/ARROW-376
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Uwe L. Korn
>            Assignee: Phillip Cloud
>            Priority: Minor
>              Labels: newbie
>             Fix For: 0.3.0
>
>
> Currently the indices of a Pandas DataFrame are totally ignored on the Pandas 
> to Arrow conversion. We should add an option to also convert the index to an 
> Arrow column if they are not a simple range index.
> The condition for a simple index should be {{isinstance(df.index, 
> pd.RangeIndex) && (df.index._start == 0) && (df.index._stop == len(df.index)) 
> && (df.index._step == 1)}}. In this case, we can always skip the index 
> conversion. Otherwise, a new column in the Arrow table shall be created using 
> the index' name as the name of the column. Additionally there should be some 
> metadata annotation of that column that it is derived of an Pandas Index, so 
> that for roundtrips, we'll use it again as the index of a DataFrame.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to