I think you can look at this comment, thanks.

* <p>Part files can be in one of three states: {@code in-progress},
{@code pending} or {@code finished}.
* The reason for this is how the sink works together with the
checkpointing mechanism to provide exactly-once
* semantics and fault-tolerance. The part file that is currently being
written to is {@code in-progress}. Once
* a part file is closed for writing it becomes {@code pending}. When a
checkpoint is successful the currently
* pending files will be moved to {@code finished}.






2018-06-05 17:14 GMT+08:00 miki haiat <miko5...@gmail.com>:

> Im trying to write some data to Hadoop by using this code
>
> The state backend is set without time
>
> StateBackend sb = new 
> FsStateBackend("hdfs://***:9000/flink/my_city/checkpoints");
> env.setStateBackend(sb);
>
> BucketingSink<Tuple2<IntWritable, Text>> sink =
>         new BucketingSink<>("hdfs://****:9000/mycity/raw");
> sink.setBucketer(new DateTimeBucketer("yyyy-MM-dd--HHmm"));
> sink.setInactiveBucketCheckInterval(120000);
> sink.setInactiveBucketThreshold(120000);
>
> the result is that all the files are stuck in* in.programs  *status and
> not closed.
> is it related to the state backend configuration.
>
> thanks,
>
> Miki
>
>

Reply via email to