[ https://issues.apache.org/jira/browse/ARROW-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17133662#comment-17133662 ]
Neal Richardson commented on ARROW-2801: ---------------------------------------- [~jorisvandenbossche] can you close this if/when you're satisfied that the feature is done and documented? > [Python][C++][Dataset] Implement split_row_groups for ParquetDataset > -------------------------------------------------------------------- > > Key: ARROW-2801 > URL: https://issues.apache.org/jira/browse/ARROW-2801 > Project: Apache Arrow > Issue Type: New Feature > Components: Python > Reporter: Robbie Gruener > Assignee: Joris Van den Bossche > Priority: Minor > Labels: dataset, dataset-parquet-read, parquet, > pull-request-available > Fix For: 1.0.0 > > Time Spent: 1h 50m > Remaining Estimate: 0h > > Currently the split_row_groups argument in ParquetDataset yields a not > implemented error. An easy and efficient way to implement this is by using > the summary metadata file instead of opening every footer file -- This message was sent by Atlassian Jira (v8.3.4#803005)