[ https://issues.apache.org/jira/browse/ARROW-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wes McKinney updated ARROW-1993:
--------------------------------
    Fix Version/s:     (was: 0.13.0)
                   0.12.0

> [Python] Add function for determining implied Arrow schema from pandas.DataFrame
> --------------------------------------------------------------------------------
>
>                 Key: ARROW-1993
>                 URL: https://issues.apache.org/jira/browse/ARROW-1993
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Wes McKinney
>            Assignee: Uwe L. Korn
>            Priority: Major
>              Labels: beginner, pull-request-available
>             Fix For: 0.12.0
>
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> Currently the only option is to use {{Table/Array.from_pandas}}, which does
> significant unnecessary work and allocates memory. If only the schema is of
> interest, we could do less work and avoid allocating memory.
> We should provide the user a function {{pyarrow.Schema.from_pandas}} that
> takes a DataFrame as input and returns the corresponding Arrow schema. The
> functionality for determining the schema is already available in the Python
> code; at the moment it is just very tightly bound to the conversion
> infrastructure.