Hi,
I'm running a pipeline with Flink backend, Beam bleeding edge, Oracle Java
1.8, maven 3.3.3, linux64.
The pipeline is run with --parallelism=6.
Adding .withoutSharding() causes a TextIO sink to write only one of the
shards.
Example use:
data.apply(TextIO.Write.named("write-debug-csv").to("/tmp/some-stats"));
vs.
data.apply(TextIO.Write.named("write-debug-csv").to("/tmp/some-stats")
*.withoutSharding()*);
Result:
Only part of data is written to file. After comparing to sharded output, it
seems to be just one of shard files.
Cheers,
Pawel