[ https://issues.apache.org/jira/browse/ARROW-7677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17055814#comment-17055814 ]
Joris Van den Bossche commented on ARROW-7677: ---------------------------------------------- It came up in a partitioned parquet dataset test: https://github.com/apache/arrow/blob/aec37d768abbee3ea3ce4c32002462ff0e1c3674/python/pyarrow/tests/test_dataset.py#L627-L662 Quoting myself from the PR: > From the discussion I think we thought the issue is coming from passing a > base path path with backslashes to GetTargetStats, which is then leading to > backslashes being used in the crawled file paths. > partition discovery is not working with windows path, presumably because the > path splitting doesn't work > [C++] Handle Windows file paths with backslashes in GetTargetStats > ------------------------------------------------------------------ > > Key: ARROW-7677 > URL: https://issues.apache.org/jira/browse/ARROW-7677 > Project: Apache Arrow > Issue Type: Bug > Components: C++ > Reporter: Joris Van den Bossche > Priority: Major > Fix For: 0.17.0 > > > Currently, if the base path passed to {{GetTargetStats}} has backslashes, > the produces FileStats also include them, resulting in some other > functionality (such as splitting the path) not working. -- This message was sent by Atlassian Jira (v8.3.4#803005)