[ https://issues.apache.org/jira/browse/ARROW-2882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17082959#comment-17082959 ]
Joris Van den Bossche commented on ARROW-2882:
----------------------------------------------

With ARROW-8039, this is now also exposed in the existing {{pq.ParquetDataset}}, when using {{use_legacy_dataset=False}}.

> [C++][Python] Support AWS Firehose partition_scheme implementation for Parquet datasets
> ---------------------------------------------------------------------------------------
>
>                 Key: ARROW-2882
>                 URL: https://issues.apache.org/jira/browse/ARROW-2882
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++, Python
>            Reporter: Pablo Javier Takara
>            Priority: Major
>              Labels: dataset, dataset-parquet-read, parquet
>             Fix For: 2.0.0
>
>
> I'd like to be able to read a ParquetDataset generated by AWS Firehose.
> The only implementation at the time of writing was the partition scheme created by Hive (year=2018/month=01/day=11).
> The AWS Firehose partition scheme is a little bit different (2018/01/11).
>
> Thanks

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
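
For illustration, the difference between the two layouts named in the issue description (Hive-style year=2018/month=01/day=11 versus Firehose-style 2018/01/11) can be sketched in plain Python. This is only a minimal sketch and not the Arrow implementation: the helper names parse_hive_path and parse_directory_path, the field names (year, month, day) and the file name part-0.parquet are all assumptions made for the example. The point it shows is that Hive-style paths carry their own field names, while Firehose-style paths only carry values, so the reader has to supply the names by position.

```python
from pathlib import PurePosixPath

# Hypothetical field names for the Firehose-style layout (an assumption:
# the directory names themselves do not say what each level means).
FIELD_NAMES = ("year", "month", "day")


def parse_hive_path(path):
    """Read key=value directory segments, e.g. year=2018/month=01/day=11."""
    fields = {}
    for segment in PurePosixPath(path).parent.parts:
        if "=" in segment:
            key, value = segment.split("=", 1)
            fields[key] = value
    return fields


def parse_directory_path(path, field_names=FIELD_NAMES):
    """Read value-only directory segments, e.g. 2018/01/11, by position."""
    segments = PurePosixPath(path).parent.parts
    if len(segments) < len(field_names):
        raise ValueError(f"expected {len(field_names)} directory levels in {path!r}")
    # Use the innermost directories so that a leading prefix is ignored.
    return dict(zip(field_names, segments[-len(field_names):]))


hive_fields = parse_hive_path("year=2018/month=01/day=11/part-0.parquet")
firehose_fields = parse_directory_path("2018/01/11/part-0.parquet")
```

Both calls recover the same three partition values as strings; only the second one depends on field names supplied by the caller.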