[ https://issues.apache.org/jira/browse/BEAM-3863?focusedWorklogId=91005&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-91005 ]
ASF GitHub Bot logged work on BEAM-3863:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 13/Apr/18 22:55
            Start Date: 13/Apr/18 22:55
    Worklog Time Spent: 10m
      Work Description: robertwb commented on issue #4875: BEAM-3863: AfterProcessingTime trigger firing at delayedUntil time
URL: https://github.com/apache/beam/pull/4875#issuecomment-381279458

   Should we close this PR if it was fixed on the Flink side?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Issue Time Tracking
-------------------

    Worklog Id:     (was: 91005)
    Time Spent: 1h  (was: 50m)

> AfterProcessingTime trigger doesn't fire reliably
> -------------------------------------------------
>
>                 Key: BEAM-3863
>                 URL: https://issues.apache.org/jira/browse/BEAM-3863
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-java-core
>    Affects Versions: 2.1.0, 2.2.0, 2.3.0
>            Reporter: Pawel Bartoszek
>            Assignee: Kenneth Knowles
>            Priority: Major
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> *Issue*
> Beam's AfterProcessingTime trigger doesn't always fire reliably after the
> configured delay.
> The triggers in the following job should fire after the watermark passes the
> end of the window, then every 5 seconds for late data, and finally at the end
> of the allowed lateness.
> *Expected behaviour*
> The late-firing AfterProcessingTime trigger should fire 5 seconds after the
> first late record arrives in the pane.
> *Actual behaviour*
> In my testing, the late trigger works for some keys but not for others;
> which keys are affected appears random. The DummySource generates 15
> distinct keys AA, BB, ..., PP. For each key it sends 5 on-time records and one
> late record.
> If a late firing is missed, the trigger won't fire again until the allowed
> lateness period expires.
> *Job code*
> {code:java}
> String[] runnerArgs = {"--runner=FlinkRunner", "--parallelism=8"};
> FlinkPipelineOptions options =
>     PipelineOptionsFactory.fromArgs(runnerArgs).as(FlinkPipelineOptions.class);
> Pipeline pipeline = Pipeline.create(options);
>
> // 10-second fixed windows: one firing when the watermark passes the end of
> // the window, then late firings 5 seconds (processing time) after the first
> // late element in each pane, for up to 2 minutes of allowed lateness.
> PCollection<String> apply = pipeline.apply(Read.from(new DummySource()))
>     .apply(Window.<String>into(FixedWindows.of(Duration.standardSeconds(10)))
>         .triggering(AfterWatermark.pastEndOfWindow()
>             .withLateFirings(
>                 AfterProcessingTime
>                     .pastFirstElementInPane()
>                     .plusDelayOf(Duration.standardSeconds(5))))
>         .accumulatingFiredPanes()
>         .withAllowedLateness(Duration.standardMinutes(2),
>             Window.ClosingBehavior.FIRE_IF_NON_EMPTY));
>
> // Log the per-element count for every pane that fires.
> apply.apply(Count.perElement())
>     .apply(ParDo.of(new DoFn<KV<String, Long>, Long>() {
>       @ProcessElement
>       public void process(ProcessContext context, BoundedWindow window) {
>         LOG.info("Count: {}. For window {}, Pane {}",
>             context.element(), window, context.pane());
>       }
>     }));
> pipeline.run().waitUntilFinish();
> {code}
>
> *How can you replicate the issue?*
> I've created a GitHub repo
> [https://github.com/pbartoszek/BEAM-3863_late_trigger] with the code shown
> above. Please check out the README file for details on how to replicate the
> issue.
> *What is causing the issue?*
> I explained the cause in the PR.
>

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)