jorisvandenbossche edited a comment on pull request #7096: URL: https://github.com/apache/arrow/pull/7096#issuecomment-623635439
So the question comes up if we actually should also not revert the behaviour in case of `use_legacy_dataset=False` (the `_ParquetDatasetV2` shim). For me, that depends a bit on what we want to do long term with `ParquetDataset`. If we want to keep it as "the" ParquetDataset (maybe becoming a subclass of the actual Dataset class then), then I think it should have the "correct" behaviour. If we only see it as a temporary vehicle to get people try it out / have poeple eventually use the pyarrow.dataset API, then it is less important Joris Van den Bossche: Yep, will copy this comment over there ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org