Hi Kenn,

Thanks a lot for the info. I will follow the discussion.

Shen

On Fri, Jun 23, 2017 at 10:38 AM, Kenneth Knowles <k...@google.com.invalid>
wrote:

> Hi Shen,
>
> In order for this to work well with watermark tracking, we have some
> initial ideas on https://issues.apache.org/jira/browse/BEAM-644
>
> Kenn
>
> On Wed, Jun 14, 2017 at 1:34 PM, Shen Li <cs.she...@gmail.com> wrote:
>
> > Hi,
> >
> > I saw the DoFn#getAllowedTimestampSkew has been marked as deprecated.
> What
> > if a user does want to rewind back the timestamp without violating the
> > watermark?
> >
> > Consider the case where there is a GroupByKey followed by a ParDo. The
> > GroupByKey transform groups tuples into one-hour windows. Say, each value
> > of the output iterable of the GroupByKey remembers the timestamp of when
> it
> > is created. The ParDo finds the max value in the iterable and wants to
> use
> > its timestamp as the output timestamp. For example, the timestamp of the
> > GroupByKey output might be 11 AM, but the timestamp of the max value
> might
> > be 10:30 AM. Is it possible for the user-defined ParDo to rewind back the
> > timestamp to 10:30 AM?
> >
> > As the runner knows the current watermark, should there be any API for
> the
> > runner to notify the app of the allowedTimestampSkew?
> >
> > Thanks,
> >
> > Shen
> >
>

Reply via email to