[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-07-13 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 @StephanEwen Thanks! Looking forward~ ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-07-13 Thread StephanEwen
Github user StephanEwen commented on the issue: https://github.com/apache/flink/pull/5982 @sihuazhou I got caught up in some other tasks - will try to get back to this here soon, I would like to have this feature in as a base for "search for completed checkpoint". ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-06-19 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 Could anybody have a look at this? ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-29 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 @StephanEwen I've addressed your comments, could you please have a look again? ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-28 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 Yes, @StephanEwen, thanks for the continued suggestions; I will follow them. ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-28 Thread StephanEwen
Github user StephanEwen commented on the issue: https://github.com/apache/flink/pull/5982 I think we need to have a special output stream type (`AtomicCreatingFsDataOutputStream` or similar) as the return type of `FileSystem.createAtomic()`. Otherwise, how can a user actually create a

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-16 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 @StephanEwen I guess this PR is ready for another look now... ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-15 Thread StephanEwen
Github user StephanEwen commented on the issue: https://github.com/apache/flink/pull/5982 My gut feeling is that we don't need `WriteMode.OVERWRITE` in cases where one wants such an atomic file creation... ---
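
A minimal sketch of what creation without overwrite could look like, written against plain `java.nio.file` rather than Flink's `FileSystem` API. The class `NoOverwritePublish` and the method `createIfAbsent` are hypothetical names used only for illustration. Note the assumption: a plain `Files.move` refuses to replace an existing target, but the existence check and the rename are not guaranteed to be a single atomic step under concurrent writers.

```java
import java.io.IOException;
import java.nio.file.FileAlreadyExistsException;
import java.nio.file.Files;
import java.nio.file.Path;

/** Hypothetical helper, not part of Flink: publish a fully written file without overwriting. */
final class NoOverwritePublish {
    private NoOverwritePublish() {}

    /**
     * Writes data to a temp file next to the target, then moves it into place.
     * Returns false and leaves the existing file untouched if the target already exists.
     */
    static boolean createIfAbsent(Path target, byte[] data) throws IOException {
        // Same directory as the target, so the move stays on one file system.
        Path temp = Files.createTempFile(
                target.toAbsolutePath().getParent(), ".inprogress-", ".tmp");
        try {
            Files.write(temp, data);
            // No REPLACE_EXISTING option: the move fails if the target exists.
            Files.move(temp, target);
            return true;
        } catch (FileAlreadyExistsException e) {
            return false;
        } finally {
            // No-op after a successful move; removes the temp file otherwise.
            Files.deleteIfExists(temp);
        }
    }
}
```

Dropping the overwrite case keeps the contract simple: readers either see no file or a complete one, and a second writer never replaces what the first one published.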

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-15 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 @StephanEwen Thanks for your good suggestions! I will update the PR accordingly. Regarding the problem with `WriteMode.OVERWRITE`: would you object if we don't support it in `createAtomically()`? ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-15 Thread StephanEwen
Github user StephanEwen commented on the issue: https://github.com/apache/flink/pull/5982 Good point about the renaming on `close()` in case close is called for cleanup, rather than success. We could follow the same semantics as in [CheckpointStateOutputStream](https://github
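
The distinction between closing for cleanup and closing on success can be sketched as follows. This is an illustration on top of `java.nio.file`, not the PR's code and not Flink's `FileSystem` API; the class name `AtomicFileOutputStream` and the method `closeAndPublish()` are hypothetical. The assumption is that the temp file lives in the target's directory, so `ATOMIC_MOVE` is supported by the underlying file system.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

/** Hypothetical sketch: bytes go to a temp file; the target appears only on success. */
class AtomicFileOutputStream extends OutputStream {
    private final Path target;
    private final Path temp;
    private final OutputStream out;
    private boolean closed;

    AtomicFileOutputStream(Path target) throws IOException {
        this.target = target;
        // Same directory as the target, so the final move stays on one file system.
        this.temp = Files.createTempFile(
                target.toAbsolutePath().getParent(), ".inprogress-", ".tmp");
        this.out = Files.newOutputStream(temp);
    }

    @Override
    public void write(int b) throws IOException {
        out.write(b);
    }

    @Override
    public void write(byte[] b, int off, int len) throws IOException {
        out.write(b, off, len);
    }

    /** Success path: close the temp file, then publish it under the final name. */
    public void closeAndPublish() throws IOException {
        if (closed) {
            throw new IOException("stream already closed");
        }
        closed = true;
        try {
            out.close();
            Files.move(temp, target, StandardCopyOption.ATOMIC_MOVE);
        } finally {
            // No-op after a successful move; removes the temp file on failure.
            Files.deleteIfExists(temp);
        }
    }

    /** Cleanup path: discards the temp file unless the stream was already published. */
    @Override
    public void close() throws IOException {
        if (!closed) {
            closed = true;
            out.close();
            Files.deleteIfExists(temp);
        }
    }
}
```

With this shape, a try-with-resources block that exits through an exception only runs `close()` and leaves no partial file behind, while the success path has to call `closeAndPublish()` explicitly before the block ends.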

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-15 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 Hi, I ran into a problem here: for the Hadoop FileSystem with `schema='hdfs'`, we can't rename a file onto an existing file. This means that it is hard (I'm not sure whether it's even possible) to support the `

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-14 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 Hi @StephanEwen, I added more tests for the `FileSystem#createAtomically()`, concerning the `TwoPhaseFsDataoutputStream`, can we introduce a `commit_on_close` option for it to make it easier to use

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-14 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 @StephanEwen After thinking about your comments again, I think I misunderstood the `Also, we need this method to be implemented in all FileSystem subclasses.` 😭, I will address that. But the questio

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-14 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 Hi @StephanEwen Thank you very much for your reply, I'm not sure whether just overriding `close()` to do `super.close()` + `rename()` is enough, for example. ``` try (outputStream = new Two

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-14 Thread StephanEwen
Github user StephanEwen commented on the issue: https://github.com/apache/flink/pull/5982 Thanks for preparing this. I looked at the `TwoPhraseFSDatautputStream` - maybe we can make this simpler. Do we need the distinction between phases? Is it not enough to behave as a regul

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-13 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 Hi @StephanEwen I have updated the PR according to the above comments; it's ready for another review. ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-11 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 Hi @StephanEwen, - I prefer to introduce the `create(Path, WriteMode)`, because I feel this is more extensible (it could allow us to introduce other WriteMode's in the future). - I would ch

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-11 Thread StephanEwen
Github user StephanEwen commented on the issue: https://github.com/apache/flink/pull/5982 How about adding the method `createAtomically` or so, with otherwise the same signature as the `create(Path, WriteMode)` method? ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-11 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 @StephanEwen Thank you very much for your great suggestion! I will address this that way. ---

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-11 Thread StephanEwen
Github user StephanEwen commented on the issue: https://github.com/apache/flink/pull/5982 I think this fix here might not work for S3, because a rename() with the S3 file systems will actually trigger a copy (or even a download and upload), so it is not a cheap operation. The

[GitHub] flink issue #5982: [FLINK-9325][checkpoint]generate the meta file for checkp...

2018-05-10 Thread sihuazhou
Github user sihuazhou commented on the issue: https://github.com/apache/flink/pull/5982 Hi @StephanEwen Could you please have a look at this? ---