Re: [DISCUSS] Solving timer ordering on immutable bundles

Reuven Lax Thu, 27 Jun 2019 16:35:26 -0700

The watermark holds (which is how the timer holds up the watermark today,
as there is no timer watermark) is per key. Usually the input watermark
making a "hop" is not a problem, in fact it's the normal state of affairs.


On Fri, Jun 28, 2019 at 1:08 AM Lukasz Cwik <lc...@google.com> wrote:

> Thanks Reuven and Jan.
>
> Since timers are per key, wouldn't it be that the timer watermark should
> also be per key for a StatefulDoFn and hence we would still be able to fire
> multiple timers (at most one per key) and still have good performance even
> when the input watermark makes a "hop"?
>
>
> On Thu, Jun 27, 2019 at 3:43 PM Jan Lukavský <je...@seznam.cz> wrote:
>
>> It would be possible to have "timer watermark", between input and output
>> watermark, so that input watermark >= timer watermark >= output watermark,
>> but it turns out, that doing so implies that we fire timers only for single
>> instant (because until the timer is fired and processed, the "timer
>> watermark" is on hold).
>> On 6/28/19 12:40 AM, Jan Lukavský wrote:
>>
>> At least the implementation in DirectRunner fires timers according to
>> input watemark. Holding the timer up to output watermark causes deadlocks,
>> because timers fired at time T might clear watermark hold for the same time.
>> On 6/27/19 11:55 PM, Reuven Lax wrote:
>>
>> I believe that timers correspond to watermark holds, which hold up the
>> output watermark, not the input watermark.
>>
>> On Thu, Jun 27, 2019 at 11:21 PM Lukasz Cwik <lc...@google.com> wrote:
>>
>>> I'm confused as to why it is valid to advance the watermark to T3 in the
>>> original scenario.
>>>
>>> T1 and T2 should be treated as inputs to the function and hold the input
>>> watermark hence T1 should fire and if it doesn't produce any new timers
>>> before T2, then T2 should fire since the watermark will now advance to T2.
>>> The only time you would have multiple watermark timers fire as part of the
>>> same bundle is if they were distinct timers both set to the same time.
>>>
>>> I have some examples[1] documented in the modelling, scheduling, and
>>> executing timers doc.
>>>
>>> 1:
>>> https://docs.google.com/document/d/1GRL88rKLHbMR0zJnBHYwM4xtj66VYlB112EWVUFcGB0/edit#heading=h.fzptl5h0vi9k
>>>
>>>
>>> On Wed, Jun 26, 2019 at 6:40 AM Reuven Lax <re...@google.com> wrote:
>>>
>>>> Earlier than the input watermark only applies to event time timers, but
>>>> the above problem holds for processing time timers as well.
>>>>
>>>> On Wed, Jun 26, 2019, 1:50 PM Robert Bradshaw <rober...@google.com>
>>>> wrote:
>>>>
>>>>> Yeah, it wouldn't be optimal performance-wise, but I think it's good
>>>>> to keep the bar for a correct SDK low. Might still be better than
>>>>> sending one timer per bundle, and you only pay the performance if
>>>>> timers are set earlier than the input watermark (and there was a timer
>>>>> firing in this range). (How often this happens probably varies a lot
>>>>> in practice.)
>>>>>
>>>>> On Wed, Jun 26, 2019 at 2:33 PM Reuven Lax <re...@google.com> wrote:
>>>>> >
>>>>> > This would have a lot of performance problems (especially since
>>>>> there is user code that caches within a bundle, and invalidates the cache
>>>>> at the end of every bundle). However this would be a valid "lazy"
>>>>> implementation.
>>>>> >
>>>>> > On Wed, Jun 26, 2019 at 2:29 PM Robert Bradshaw <rober...@google.com>
>>>>> wrote:
>>>>> >>
>>>>> >> Note also that a "lazy" SDK implementation would be to simply return
>>>>> >> all the timers (as if they were new timers) to runner once a timer
>>>>> set
>>>>> >> (before or at the last requested timer in the bundle) is
>>>>> encountered.
>>>>> >> E.g. Suppose we had timers T1, T3, T5 in the bundle. On firing T1,
>>>>> we
>>>>> >> set T2 and delete T3. The SDK could then claim that a timers were
>>>>> >> (again) set at T3, T5, then set one at at T2 and deleted at T3 and
>>>>> >> then be done with the bundle (not actually process T3 and T5). (One
>>>>> >> way to think about this is that timers are actually bundle splits
>>>>> into
>>>>> >> a bundle of "done" and "future" work.) A more intelligent SDK could,
>>>>> >> of course, process the whole bundle by tracking modifications to the
>>>>> >> to-be-fired timers itself rather than requiring a trip through the
>>>>> >> runner.
>>>>> >>
>>>>> >> On Wed, Jun 26, 2019 at 1:51 PM Reuven Lax <re...@google.com>
>>>>> wrote:
>>>>> >> >
>>>>> >> > I like this option the best. It might be trickier to implement,
>>>>> but seems like it would be the most consistent solution.
>>>>> >> >
>>>>> >> > Another problem it would solve is the following: let's say a
>>>>> bundle arrives containing timers T1 and T2, and while processing T1 the
>>>>> user code deletes T2 (or resets it to a time in the far future). I'm
>>>>> actually not sure what we do today, but I'm a bit afraid that we will go
>>>>> ahead and fire T2 since it's already in the bundle, which is clearly
>>>>> incorrect. The SDK needs to keep track of this and skip T2 in order to
>>>>> solve this, which is the same sort of work needed to implement Robert's
>>>>> suggestion.
>>>>> >> >
>>>>> >> > Reuven
>>>>> >> >
>>>>> >> > On Wed, Jun 26, 2019 at 12:28 PM Robert Bradshaw <
>>>>> rober...@google.com> wrote:
>>>>> >> >>
>>>>> >> >> Another option, that is nice from an API perspective but places a
>>>>> >> >> burden on SDK implementers (and possibly runners), is to
>>>>> maintain the
>>>>> >> >> ordering of timers by requiring timers to be fired in order, and
>>>>> if
>>>>> >> >> any timers are set to fire them immediately before processing
>>>>> later
>>>>> >> >> timers. In other words, if T1 sets T2 and modifies T3, these
>>>>> would
>>>>> >> >> take effect (locally, the runner may not even know about T2)
>>>>> before T3
>>>>> >> >> was processed.
>>>>> >> >>
>>>>> >> >> On Wed, Jun 26, 2019 at 11:13 AM Jan Lukavský <je...@seznam.cz>
>>>>> wrote:
>>>>> >> >> >
>>>>> >> >> > Hi,
>>>>> >> >> >
>>>>> >> >> > I have mentioned an issue I have come across [1] on several
>>>>> other
>>>>> >> >> > threads, but it probably didn't attract the attention that it
>>>>> would desire.
>>>>> >> >> >
>>>>> >> >> > I will try to restate the problem here for clarity:
>>>>> >> >> >
>>>>> >> >> >   - on runners that use concept of bundles (the original issue
>>>>> mentions
>>>>> >> >> > DirectRunner, but it will probably apply for other runners,
>>>>> which use
>>>>> >> >> > bundles, as well), the workflow is as follows:
>>>>> >> >> >
>>>>> >> >> >    a) process elements in bundle
>>>>> >> >> >
>>>>> >> >> >    b) advance watermark
>>>>> >> >> >
>>>>> >> >> >    c) process timers
>>>>> >> >> >
>>>>> >> >> >    d) continue to next bundle
>>>>> >> >> >
>>>>> >> >> >   - the issue with this is that when we are initially at time
>>>>> T0, set
>>>>> >> >> > two timers for T1 and T3, then advance watermark to T3 (or
>>>>> beyond), the
>>>>> >> >> > timers will fire (correctly) in order T1, T3, but if timer at
>>>>> T1 sets
>>>>> >> >> > another timer for T2, then this timer will be fired in next
>>>>> bundle (and
>>>>> >> >> > therefore after T3)
>>>>> >> >> >
>>>>> >> >> >   - this causes issues mostly with race conditions in window
>>>>> GC timers
>>>>> >> >> > and user timers (and users don't have any way to solve that!)
>>>>> >> >> >
>>>>> >> >> >   - note that the same applies when one timer tries to reset
>>>>> timer that
>>>>> >> >> > is already in the current bundle
>>>>> >> >> >
>>>>> >> >> > I have investigated a way of solving this by running timers
>>>>> only for
>>>>> >> >> > single timestamp (instant) at each bundle, but as Reuven
>>>>> pointed out,
>>>>> >> >> > that could regress performance (mostly by delaying firing of
>>>>> timers,
>>>>> >> >> > that could have fired). Options I see:
>>>>> >> >> >
>>>>> >> >> >   1) either set the OnTimerContext#timestamp() to current input
>>>>> >> >> > watermark (not the time that user actually set the timer), or
>>>>> >> >> >
>>>>> >> >> >   2) add OnTimerContext#getCurrentInputWatermark() and
>>>>> disallow setting
>>>>> >> >> > (or resetting) timers for time between
>>>>> OnProcessContext#timestamp and
>>>>> >> >> > OnProcessContext#getCurrentInputWatermark(), by throwing an
>>>>> exception
>>>>> >> >> >
>>>>> >> >> >   3) any other option?
>>>>> >> >> >
>>>>> >> >> > Option 1) seems to be broken by design, as it can result in
>>>>> corrupt data
>>>>> >> >> > (emitted with wrong timestamp, which is even somewhat
>>>>> arbitrary), I'm
>>>>> >> >> > including it just for completeness. Option 2) is breaking
>>>>> change, that
>>>>> >> >> > can result in PIpeline failures (although the failures will
>>>>> happen on
>>>>> >> >> > Pipelines, that are probably already broken).
>>>>> >> >> >
>>>>> >> >> > Although I have come with a workaround in the work where I
>>>>> originally
>>>>> >> >> > come across this issue, I think that this is generally serious
>>>>> and
>>>>> >> >> > should be dealt with. Mostly because when using user-facing
>>>>> APIs, there
>>>>> >> >> > are no workarounds possible, today.
>>>>> >> >> >
>>>>> >> >> > Thanks for discussion!
>>>>> >> >> >
>>>>> >> >> > Jan
>>>>> >> >> >
>>>>> >> >> > [1] https://issues.apache.org/jira/browse/BEAM-7520
>>>>> >> >> >
>>>>>
>>>>

Re: [DISCUSS] Solving timer ordering on immutable bundles

Reply via email to