[ https://issues.apache.org/jira/browse/ARROW-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16624698#comment-16624698 ]
Uwe L. Korn commented on ARROW-1993: ------------------------------------ We need to delay to 0.12 / 0.13. This needs a lot more work to avoid costly operations. > [Python] Add function for determining implied Arrow schema from > pandas.DataFrame > -------------------------------------------------------------------------------- > > Key: ARROW-1993 > URL: https://issues.apache.org/jira/browse/ARROW-1993 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Reporter: Wes McKinney > Assignee: Uwe L. Korn > Priority: Major > Labels: beginner, pull-request-available > Fix For: 0.12.0 > > Time Spent: 40m > Remaining Estimate: 0h > > Currently the only option is to use {{Table/Array.from_pandas}} which does > significant unnecessary work and allocates memory. If only the schema is of > interest, then we could do less work and not allocate memory. > We should provide the user a function {{pyarrow.Schema.from_pandas}} which > takes a DataFrame as an input and returns the respective Arrow schema. The > functionality for determing the schema is already available in the Python > code, it is at moment just very tightly bound to the conversion > infrastructure. -- This message was sent by Atlassian JIRA (v7.6.3#76005)