Nico Kruber created FLINK-6380:
----------------------------------

             Summary: BlobService concurrency issues between delete and put/get 
methods
                 Key: FLINK-6380
                 URL: https://issues.apache.org/jira/browse/FLINK-6380
             Project: Flink
          Issue Type: Bug
          Components: Network
    Affects Versions: 1.3.0
            Reporter: Nico Kruber


{{BlobCache#deleteAll(JobID)}} deletes the job directory which is only created 
at the start of {{BlobCache#getURL(BlobKey)}} which then relies on the 
directory being present.

This is not restricted to the {{BlobCache}}, though, but also affects the 
{{BlobServer}} in two ways:
1) its own local storage and
2) its backing {{BlobStore}}

For the latter, i.e. in {{FileSystemBlobStore}}, there is no guarantee that a 
directory will not be deleted concurrently (from a {{delete}} method) between 
its creation and writing a file (in a {{get}} method):

* the {{delete}} method for name-addressable blobs always deletes the 
job-specific storage directory if there is no further blob for this job
* the content-addressable blobs do that similarly but are shared among jobs and 
thus only delete directories if there is no other blob.

Since name-addressable blobs have not been used so far and the latter case 
typically does not occur concurrently with get/put requests, this has not been 
a problem so far but is more relevant after applying FLINK-6046.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to