[jira] [Updated] (FLINK-30049) CsvBulkWriter is unsupported for S3 FileSystem in streaming sink

2022-11-16 Thread Jingsong Lee (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-30049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jingsong Lee updated FLINK-30049:
-
Summary: CsvBulkWriter is unsupported for S3 FileSystem in streaming sink  
(was: CsvBulkWriter is unsupported for S3 FileSystem)

> CsvBulkWriter is unsupported for S3 FileSystem in streaming sink
> 
>
> Key: FLINK-30049
> URL: https://issues.apache.org/jira/browse/FLINK-30049
> Project: Flink
>  Issue Type: Bug
>  Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
>Affects Versions: 1.16.0, 1.15.2
>Reporter: Jingsong Lee
>Priority: Major
>
> {code:java}
> Caused by: org.apache.flink.util.SerializedThrowable: Cannot sync state to 
> system like S3. Use persist() to create a persistent recoverable intermediate 
> point.
>   at 
> org.apache.flink.core.fs.RefCountedBufferingFileStream.sync(RefCountedBufferingFileStream.java:111)
>  
>   at 
> org.apache.flink.fs.s3.common.writer.S3RecoverableFsDataOutputStream.sync
>   at 
> org.apache.flink.formats.csv.CsvBulkWriter.finish(CsvBulkWriter.java:106) 
>   at 
> org.apache.flink.connector.file.table.FileSystemTableSink$ProjectionBulkFactory$1.finish(FileSystemTableSink.java:653)
>  
>   at 
> org.apache.flink.streaming.api.functions.sink.filesystem.BulkPartWriter.closeForCommit(BulkPartWriter.java:64)
>  
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (FLINK-30049) CsvBulkWriter is unsupported for S3 FileSystem in streaming sink

2022-11-16 Thread Jingsong Lee (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-30049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jingsong Lee updated FLINK-30049:
-
Description: 
{code:java}
Caused by: org.apache.flink.util.SerializedThrowable: Cannot sync state to 
system like S3. Use persist() to create a persistent recoverable intermediate 
point.
at 
org.apache.flink.core.fs.RefCountedBufferingFileStream.sync(RefCountedBufferingFileStream.java:111)
 
at 
org.apache.flink.fs.s3.common.writer.S3RecoverableFsDataOutputStream.sync
at 
org.apache.flink.formats.csv.CsvBulkWriter.finish(CsvBulkWriter.java:106) 
at 
org.apache.flink.connector.file.table.FileSystemTableSink$ProjectionBulkFactory$1.finish(FileSystemTableSink.java:653)
 
at 
org.apache.flink.streaming.api.functions.sink.filesystem.BulkPartWriter.closeForCommit(BulkPartWriter.java:64)
 
{code}

It looks like we should not call `sync` in CsvBulkWriter, we should just use 
`flush`.


  was:
{code:java}
Caused by: org.apache.flink.util.SerializedThrowable: Cannot sync state to 
system like S3. Use persist() to create a persistent recoverable intermediate 
point.
at 
org.apache.flink.core.fs.RefCountedBufferingFileStream.sync(RefCountedBufferingFileStream.java:111)
 
at 
org.apache.flink.fs.s3.common.writer.S3RecoverableFsDataOutputStream.sync
at 
org.apache.flink.formats.csv.CsvBulkWriter.finish(CsvBulkWriter.java:106) 
at 
org.apache.flink.connector.file.table.FileSystemTableSink$ProjectionBulkFactory$1.finish(FileSystemTableSink.java:653)
 
at 
org.apache.flink.streaming.api.functions.sink.filesystem.BulkPartWriter.closeForCommit(BulkPartWriter.java:64)
 
{code}



> CsvBulkWriter is unsupported for S3 FileSystem in streaming sink
> 
>
> Key: FLINK-30049
> URL: https://issues.apache.org/jira/browse/FLINK-30049
> Project: Flink
>  Issue Type: Bug
>  Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
>Affects Versions: 1.16.0, 1.15.2
>Reporter: Jingsong Lee
>Priority: Major
>
> {code:java}
> Caused by: org.apache.flink.util.SerializedThrowable: Cannot sync state to 
> system like S3. Use persist() to create a persistent recoverable intermediate 
> point.
>   at 
> org.apache.flink.core.fs.RefCountedBufferingFileStream.sync(RefCountedBufferingFileStream.java:111)
>  
>   at 
> org.apache.flink.fs.s3.common.writer.S3RecoverableFsDataOutputStream.sync
>   at 
> org.apache.flink.formats.csv.CsvBulkWriter.finish(CsvBulkWriter.java:106) 
>   at 
> org.apache.flink.connector.file.table.FileSystemTableSink$ProjectionBulkFactory$1.finish(FileSystemTableSink.java:653)
>  
>   at 
> org.apache.flink.streaming.api.functions.sink.filesystem.BulkPartWriter.closeForCommit(BulkPartWriter.java:64)
>  
> {code}
> It looks like we should not call `sync` in CsvBulkWriter, we should just use 
> `flush`.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)