Ben Kietzman created ARROW-8137:
-----------------------------------

             Summary: [C++][Dataset] Investigate multithreaded discovery
                 Key: ARROW-8137
                 URL: https://issues.apache.org/jira/browse/ARROW-8137
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++, C++ - Dataset
    Affects Versions: 0.16.0
            Reporter: Ben Kietzman
             Fix For: 1.0.0

Currently, FileSystemDatasetFactory inspects all files serially. For slow file systems, or file systems that support batched reads, discovery could be accelerated by inspecting files in parallel.
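
To illustrate the idea, here is a minimal sketch of parallel inspection using only the C++ standard library. It is not Arrow code: `InspectionResult`, `InspectFile`, and `InspectAllParallel` are hypothetical names, and the sleep merely stands in for a slow file system round trip. The sketch launches one task per file and collects the results in input order.

```cpp
#include <chrono>
#include <cstddef>
#include <future>
#include <string>
#include <thread>
#include <vector>

// Hypothetical result of inspecting one file (not an Arrow type).
struct InspectionResult {
  std::string path;
  std::size_t num_fields;  // placeholder for whatever inspection yields
};

// Hypothetical stand-in for the per-file work done during discovery,
// e.g. opening the file and reading its schema.
InspectionResult InspectFile(const std::string& path) {
  // Simulate the latency of a slow file system.
  std::this_thread::sleep_for(std::chrono::milliseconds(20));
  // Dummy value so the result depends on the input.
  return InspectionResult{path, path.size()};
}

// Inspect all paths concurrently and return results in input order.
std::vector<InspectionResult> InspectAllParallel(
    const std::vector<std::string>& paths) {
  std::vector<std::future<InspectionResult>> futures;
  futures.reserve(paths.size());
  for (const auto& path : paths) {
    // std::launch::async forces each inspection onto its own thread.
    futures.push_back(std::async(std::launch::async, InspectFile, path));
  }

  std::vector<InspectionResult> results;
  results.reserve(paths.size());
  for (auto& future : futures) {
    results.push_back(future.get());  // blocks until that file is done
  }
  return results;
}
```

With N files, the serial approach waits roughly N times the per-file latency, while this sketch waits roughly one latency period if the file system can serve the requests concurrently. One thread per file is only reasonable for small file counts; an actual implementation would presumably bound concurrency with a thread pool.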