qyw created FLINK-35150:
---------------------------

             Summary: The specified upload does not exist. The upload ID may be 
invalid
                 Key: FLINK-35150
                 URL: https://issues.apache.org/jira/browse/FLINK-35150
             Project: Flink
          Issue Type: Bug
          Components: Connectors / FileSystem
    Affects Versions: 1.15.0
            Reporter: qyw
         Attachments: image-2024-04-18-10-51-05-071.png, 
image-2024-04-18-11-03-08-998.png, image-2024-04-18-11-07-15-555.png

Flink S3 hadoop, write S3 in csv mode, I used this patch  
[FLINK-28513|https://issues.apache.org/jira/browse/FLINK-28513] .   But I don't 
understand why S3RecoverableFsDataOutputStream "sync" method of this class to 
be "completeMultipartUpload" operation, if "completeMultipartUpload" here, 
Calling close later to upload the rest of the stream will inevitably result in 
an error.   The part corresponding to uploadID has been merged.
Therefore, when the message in csv is larger than "S3_MULTIPART_MIN_PART_SIZE", 
the uploadPart will be started when switching files, then when BulkPartWriter 
performs closeForCommit, Due to the sync S3RecoverableFsDataOutputStream method 
call completeMultipartUpload, So S3RecoverableFsDataOutputStream 
"closeForCommit" method due to the uploadPart, at this time will lead to errors.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to