[ https://issues.apache.org/jira/browse/ARROW-17633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17600861#comment-17600861 ]
Antoine Pitrou commented on ARROW-17633: ---------------------------------------- Definitely! > [Python][CI] test_write_dataset_max_rows_per_file is flaky > ---------------------------------------------------------- > > Key: ARROW-17633 > URL: https://issues.apache.org/jira/browse/ARROW-17633 > Project: Apache Arrow > Issue Type: Bug > Components: Continuous Integration, Python > Reporter: Antoine Pitrou > Priority: Major > > I am starting to see intermittent but frequent CI failures in > {{test_write_dataset_max_rows_per_file}}. > Is {{write_dataset}} supposed to create the base directory? > {code} > =================================== FAILURES > =================================== > _____________________ test_write_dataset_max_rows_per_file > _____________________ > tempdir = > PosixPath('/tmp/pytest-of-root/pytest-0/test_write_dataset_max_rows_pe0') > @pytest.mark.parquet > def test_write_dataset_max_rows_per_file(tempdir): > directory = tempdir / 'ds' > max_rows_per_file = 10 > max_rows_per_group = 10 > num_of_columns = 2 > num_of_records = 35 > > record_batch = _generate_data_and_columns(num_of_columns, > num_of_records) > > > ds.write_dataset(record_batch, directory, format="parquet", > max_rows_per_file=max_rows_per_file, > max_rows_per_group=max_rows_per_group) > usr/local/lib/python3.9/site-packages/pyarrow/tests/test_dataset.py:3919: > _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ > _ > usr/local/lib/python3.9/site-packages/pyarrow/dataset.py:988: in write_dataset > _filesystemdataset_write( > pyarrow/_dataset.pyx:2811: in pyarrow._dataset._filesystemdataset_write > ??? > _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ > _ > > ??? > E FileNotFoundError: [Errno 2] Failed to open local file > '/tmp/pytest-of-root/pytest-0/test_write_dataset_max_rows_pe0/ds/part-1.parquet'. > Detail: [errno 2] No such file or directory > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010)