Fokko opened a new pull request, #5949: URL: https://github.com/apache/iceberg/pull/5949
This is required for manually specifying datasets: https://arrow.apache.org/docs/python/dataset.html#manual-specification-of-the-dataset From PyArrow: The `dataset()` function allows easy creation of a Dataset viewing a directory, crawling all subdirectories for files, and partitioning information. However sometimes discovery is not required and the dataset’s files and partitions are already known (for example, when this information is stored in metadata). In this case, it is possible to create a Dataset explicitly without any automatic discovery or inference. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
