[ 
https://issues.apache.org/jira/browse/ARROW-18313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17633805#comment-17633805
 ] 

Miles Granger commented on ARROW-18313:
---------------------------------------

Okay, I suspect the OS isn't allowing it as (probably) there are open files and 
when attempting to swap out temp files the system says no. :) From the pyarrow 
side and checking {{lsof +D ./}} on the directory, I don't see any open file 
handles. But I'll try to come back to this, or reference someone more familiar 
with the R API and if file handles are expected to be open there when using 
dataset.

> [R] Issues with open_dataset()
> ------------------------------
>
>                 Key: ARROW-18313
>                 URL: https://issues.apache.org/jira/browse/ARROW-18313
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: R
>            Reporter: N Gautam Animesh
>            Priority: Major
>         Attachments: image-2022-11-11-09-19-16-065.png, 
> image-2022-11-14-10-51-09-893.png
>
>
> On using open_dataset, it creates a connection due to which the files in the 
> directory get blocked and we cannot perform other operations on the file like 
> replace!
> Actual issue:
>  # We are running an atomic operation on a bunch of files, which replaces the 
> temp file names to the target file names.
>  # But while this is happening, if we try to run open_dataset() on that 
> particular directory, the atomic operation is failing and there are both 
> target files and temp files in the directory.
>  # It is blocking the files that have been read through open_dataset().
>  # Please, provide me with more about how we can handle such problems.
>  # Snapshot: !image-2022-11-11-09-19-16-065.png!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to