[ https://issues.apache.org/jira/browse/FLINK-6306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Seth Wiesman updated FLINK-6306: -------------------------------- Description: Currently Flink provides the BucketingSink as an exactly once method for writing out to a file system. It provides these guarantees by moving files through several stages and deleting or truncating files that get into a bad state. While this is a powerful abstraction, it causes issues with eventually consistent file systems such as Amazon's S3 where most operations (ie rename, delete, truncate) are not guaranteed to become consistent within a reasonable amount of time. Flink should provide a sink that provides exactly once writes to a file system where only PUT operations are considered consistent. (was: Currently Flink provides the BucketingSink as an exactly once method for writing out to a file system. It provides these guarantees by moving files through several stages and deleting or truncating files that get into a bad state. While this is a powerful abstraction, it causes issues with eventually consistent file systems such as Amazon's S3 where must operations (ie rename, delete, truncate) are not guaranteed to become consistent within a reasonable amount of time. Flink should provide a sink that provides exactly once writes to a file system where only PUT operations are considered consistent. ) > Sink for eventually consistent file systems > ------------------------------------------- > > Key: FLINK-6306 > URL: https://issues.apache.org/jira/browse/FLINK-6306 > Project: Flink > Issue Type: New Feature > Components: filesystem-connector > Reporter: Seth Wiesman > Assignee: Seth Wiesman > > Currently Flink provides the BucketingSink as an exactly once method for > writing out to a file system. It provides these guarantees by moving files > through several stages and deleting or truncating files that get into a bad > state. While this is a powerful abstraction, it causes issues with eventually > consistent file systems such as Amazon's S3 where most operations (ie rename, > delete, truncate) are not guaranteed to become consistent within a reasonable > amount of time. Flink should provide a sink that provides exactly once writes > to a file system where only PUT operations are considered consistent. -- This message was sent by Atlassian JIRA (v6.3.15#6346)