[ https://issues.apache.org/jira/browse/ARROW-376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15990372#comment-15990372 ]
Wes McKinney edited comment on ARROW-376 at 4/30/17 8:27 PM: ------------------------------------------------------------- Removing this from release blocker. We can make 0.3.0.post artifacts if we want to get this out there before the 0.4 release was (Author: wesmckinn): Removing this from release blocker. We can release 0.3.0.post if we want to get this out there before the 0.4 release > Python: Convert non-range Pandas indices (optionally) to Arrow > -------------------------------------------------------------- > > Key: ARROW-376 > URL: https://issues.apache.org/jira/browse/ARROW-376 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Affects Versions: 0.2.0 > Reporter: Uwe L. Korn > Assignee: Phillip Cloud > Priority: Minor > Fix For: 0.4.0 > > > Currently the indices of a Pandas DataFrame are totally ignored on the Pandas > to Arrow conversion. We should add an option to also convert the index to an > Arrow column if they are not a simple range index. > The condition for a simple index should be {{isinstance(df.index, > pd.RangeIndex) && (df.index._start == 0) && (df.index._stop == len(df.index)) > && (df.index._step == 1)}}. In this case, we can always skip the index > conversion. Otherwise, a new column in the Arrow table shall be created using > the index' name as the name of the column. Additionally there should be some > metadata annotation of that column that it is derived of an Pandas Index, so > that for roundtrips, we'll use it again as the index of a DataFrame. -- This message was sent by Atlassian JIRA (v6.3.15#6346)