This will be a runner-specific issue. It would be the best to file a JIRA
issue for this.

On Tue, May 31, 2016 at 9:46 AM, Jean-Baptiste Onofré <[email protected]>
wrote:

> Hi Pawel,
>
> does it happen only with the Flink runner ? I bet it happens with any
> runner.
>
> Let me take a look.
>
> Regards
> JB
>
> On 05/30/2016 01:38 AM, Pawel Szczur wrote:
>
>> Hi,
>>
>> I'm running a pipeline with Flink backend, Beam bleeding edge, Oracle
>> Java 1.8, maven 3.3.3, linux64.
>>
>> The pipeline is run with --parallelism=6.
>>
>> Adding .withoutSharding()causes a TextIO sink to write only one of the
>> shards.
>>
>> Example use:
>> data.apply(TextIO.Write.named("write-debug-csv").to("/tmp/some-stats"));
>> vs.
>>
>> data.apply(TextIO.Write.named("write-debug-csv").to("/tmp/some-stats")*.withoutSharding()*);
>>
>> Result:
>> Only part of data is written to file. After comparing to sharded output,
>> it seems to be just one of shard files.
>>
>> Cheers,
>> Pawel
>>
>
> --
> Jean-Baptiste Onofré
> [email protected]
> http://blog.nanthrax.net
> Talend - http://www.talend.com
>

Reply via email to