Hi,

I'm running a pipeline with the Flink backend, bleeding-edge Beam, Oracle
Java 1.8, Maven 3.3.3, on 64-bit Linux.

The pipeline is run with --parallelism=6.

Adding .withoutSharding() causes a TextIO sink to write only one of the
shards.

Example use:
data.apply(TextIO.Write.named("write-debug-csv").to("/tmp/some-stats"));
vs.
data.apply(TextIO.Write.named("write-debug-csv").to("/tmp/some-stats")
    .withoutSharding());

Result:
Only part of the data is written to the file. Compared with the sharded
output, it appears to contain the contents of just one of the shard files.
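
One way to reproduce the comparison is to count lines in the sharded
output and in the single file. The following is only a rough sketch, not
part of the pipeline: the class name ShardLineCount and the glob pattern
passed on the command line are hypothetical, and the actual shard file
names depend on the runner, so the pattern has to be adjusted to whatever
the sharded run produced.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.stream.Stream;

// Hypothetical helper for comparing sharded and unsharded output sizes.
public class ShardLineCount {

    // Counts the lines in a single text file (read as UTF-8).
    public static long countLines(Path file) throws IOException {
        try (Stream<String> lines = Files.lines(file)) {
            return lines.count();
        }
    }

    // Sums the line counts of all regular files in dir whose names match glob.
    public static long countLinesMatching(Path dir, String glob) throws IOException {
        long total = 0;
        try (DirectoryStream<Path> shards = Files.newDirectoryStream(dir, glob)) {
            for (Path shard : shards) {
                if (Files.isRegularFile(shard)) {
                    total += countLines(shard);
                }
            }
        }
        return total;
    }

    // Usage: java ShardLineCount <shard-dir> <shard-glob> <unsharded-file>
    public static void main(String[] args) throws IOException {
        if (args.length != 3) {
            System.err.println("usage: ShardLineCount <shard-dir> <shard-glob> <unsharded-file>");
            System.exit(1);
        }
        long sharded = countLinesMatching(Paths.get(args[0]), args[1]);
        long unsharded = countLines(Paths.get(args[2]));
        System.out.println("sharded total:  " + sharded);
        System.out.println("unsharded file: " + unsharded);
    }
}
```

If the two totals differ, and the unsharded count equals the count of one
individual shard, that matches the behaviour described above.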

Cheers,
Pawel
