[ https://issues.apache.org/jira/browse/ARROW-2728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wes McKinney updated ARROW-2728: -------------------------------- Labels: parquet (was: newbie) > [Python] Support partitioned Parquet datasets using glob-style file paths > ------------------------------------------------------------------------- > > Key: ARROW-2728 > URL: https://issues.apache.org/jira/browse/ARROW-2728 > Project: Apache Arrow > Issue Type: Bug > Components: Python > Affects Versions: 0.9.0 > Environment: pyarrow : 0.9.0.post1 > dask : 0.17.1 > Mac OS > Reporter: pranav kohli > Priority: Minor > Labels: parquet > > I am saving a dask dataframe to parquet with two partition columns using the > pyarrow engine. The problem arises in scanning the partition columns. When I > scan using the directory path, I get the partition columns in the output > dataframe, whereas if I scan using the glob path, I dont get these columns > > https://github.com/apache/arrow/issues/2147 -- This message was sent by Atlassian JIRA (v7.6.3#76005)