Re: Questions on Table to Datastream conversion and onTimer method in KeyedProcessFunction

fuyao . li Fri, 20 Nov 2020 11:46:44 -0800

Hi Timo,

One more question, the blog also mentioned a jira task to solve thisissue. https://issues.apache.org/jira/browse/FLINK-10886. Will thisfeature be available in 1.12? Thanks!


Best,

Fuyao

On 11/20/20 11:37, fuyao...@oracle.com wrote:


Hi Timo,

Thanks for your reply! I think your suggestions is really helpful! Thegood news is that I had managed to figure out it something by myselffew days ago.


1. Thanks for the update about the table parallelism issue!

2. After trying out the idleness setting. It prevents some idlesubtasks from blocking the pipeline's overall watermark and it worksfor me. Based on my observation and reading the source code, I havesummarized some notes. Please correct me if I am wrong.


 1. (1)Watermark is independent within each subtask for an Flink operator.
 2. (2)The watermark of the multi-parallelism table operator is always
    dominated by least watermark of the current*ACTIVE*subtasks.
 3. (3)With withIdleness() configured. A subtask will be mark as idle
    if it hasn’t receive message for configured period of time. It
    will NOT execute onPeriodEmit() and emit watermark after reaching
    the idle state. Between [the start of the application/receive a
    new message]  and [reaching into the idle state], the
    onPeriodEmit() will still emit watermark and dominate the overall
    context watermark if it holds the smallest watermark among the
    subtasks.
 4. (4)Once an idle subtask receive a new message, it will switch its
    status from idle to active and start to influence the overall
    context watermark.

3. In order to route the correct information to the subtask in thejoin step, I have added the keyed() logic in the source based on thejoin key in the join step. It seems to work correctly and could routethe message to a current place.

4. For the interval join, I think I can't use it directly since I needto use full outer join to not lose any information from any upstreamdatastream. I think interval join is a inner join it can't do thistask. I guess my only option is to do full outer join with queryconfiguration.

5. One more question about the data replay issue. I read the ververicablog(https://www.ververica.com/blog/replayable-process-functions-time-ordering-and-timers)and I think with replay use case, we will face some similar issues. Ithink the suggested approach mentioned


  (1). Puts each incoming track record in a map keyed by its timestamp

(2). creates an event timer to process that record once thewatermark hits that point.

I kind of understand the idea here. Buffer all the data(maybe deletesome of the old track if processed) in a track ordered by timestampand trigger the event timer sequentially with this buffered track.

Based on my understanding, this buffered design is only suitable for*offline* data processing, right? (It is a waste of resource to bufferthis in real time. )

Also, from the article, I think they are using periodic watermarkstrategy[1]. how can they process the last piece of data records withperiodic watermark strategy since there is no more incoming data toadvance the watermark? So the last piece of data will never beprocessed here? Is there a way to gracefully handle this? My use casedoesn't allow me to lose any information.



[1]https://ci.apache.org/projects/flink/flink-docs-stable/dev/event_timestamps_watermarks.html#writing-a-periodic-watermarkgenerator

Best,

Fuyao


On 11/20/20 08:55, Timo Walther wrote:

Hi Fuyao,

sorry for not replying earlier.

You posted a lot of questions. I scanned the thread quickly, let metry to answer some of them and feel free to ask further questionsafterwards.

"is it possible to configure the parallelism for Table operation atoperator level"

No this is not possible at the moment. The reason is 1) we don't knowhow to expose such a functionality in a nice way. Maybe we will useSQL hints in the future [1]. 2) Sometime the planner sets theparalellism of operators explicitly to 1. All other operators willuse the globally defined parallelism for the pipeline (also to notmess up retraction messages internally). You will be able to set theparallelism of the sink operation in Flink 1.12.

"BoundedOutOfOrderness Watermark Generator is NOT making the eventtime to advance"

Have you checked if you can use an interval join instead of a fulljoin with state retention? Table/SQL pipelines that don't preserve atime attribute in the end might also erase the underlying watermarks.Thus, event time triggers will not work after your join.


"Why can't I update the watermarks for all 8 parallelisms?"

You could play around with idleness for your source [2]. Or you setthe source parallelism to 1 (while keeping the rest of the pipelineglobally set to 8), would that be an option?


"Some type cast behavior of retracted streams I can't explain."

toAppendStream/toRetractStream still need an update to the new typesystem. This is explained in FLIP-136 which will be part of Flink1.13 [3].


I hope I could help a bit.

Regards,
Timo

[1]https://urldefense.com/v3/__https://cwiki.apache.org/confluence/display/FLINK/FLIP-113*3A*Supports*Dynamic*Table*Options*for*Flink*SQL__;JSsrKysrKys!!GqivPVa7Brio!ItHlGfYT1dLQeAolQoFNfXPN876842lnF4hOE7cxmmTJY4tJkXUmkz7J6qWrWNk$[2]https://urldefense.com/v3/__https://ci.apache.org/projects/flink/flink-docs-stable/dev/event_timestamps_watermarks.html*dealing-with-idle-sources__;Iw!!GqivPVa7Brio!ItHlGfYT1dLQeAolQoFNfXPN876842lnF4hOE7cxmmTJY4tJkXUmkz7JMW06Who$[3]https://urldefense.com/v3/__https://cwiki.apache.org/confluence/display/FLINK/FLIP-136*3A**AImprove*interoperability*between*DataStream*and*Table*API__;JSsrKysrKysr!!GqivPVa7Brio!ItHlGfYT1dLQeAolQoFNfXPN876842lnF4hOE7cxmmTJY4tJkXUmkz7JfAjyGyQ$

On 13.11.20 21:39, Fuyao Li wrote:

Hi Matthias,

Just to provide more context on this problem. I only have 1partition per each Kafka Topic at the beginning before the joinoperation. After reading the doc:https://urldefense.com/v3/__https://ci.apache.org/projects/flink/flink-docs-stable/dev/connectors/kafka.html*kafka-consumers-and-timestamp-extractionwatermark-emission__;Iw!!GqivPVa7Brio!ItHlGfYT1dLQeAolQoFNfXPN876842lnF4hOE7cxmmTJY4tJkXUmkz7JnAwo_lc$<https://urldefense.com/v3/__https://ci.apache.org/projects/flink/flink-docs-stable/dev/connectors/kafka.html*kafka-consumers-and-timestamp-extractionwatermark-emission__;Iw!!GqivPVa7Brio!ItHlGfYT1dLQeAolQoFNfXPN876842lnF4hOE7cxmmTJY4tJkXUmkz7JnAwo_lc$>

Maybe that is the root cause of my problem here, with less than 8partitions (only 1 partition in my case), using the defaultparallelism of 8 will cause this wrong behavior. This is my guess,it takes a while to test it out... What's your opinion on this? Thanks!


Best,

Fuyao

On Fri, Nov 13, 2020 at 11:57 AM Fuyao Li <fuyaoli2...@gmail.com<mailto:fuyaoli2...@gmail.com>> wrote:


    Hi Matthias,

    One more question regarding Flink table parallelism, is it possible
    to configure the parallelism for Table operation at operator level,
    it seems we don't have such API available, right? Thanks!

    Best,
    Fuyao

    On Fri, Nov 13, 2020 at 11:48 AM Fuyao Li <fuyaoli2...@gmail.com
<mailto:fuyaoli2...@gmail.com>> wrote:

        Hi Matthias,

        Thanks for your information. I have managed to figure out the
        first issue you mentioned. Regarding the second issue. I have
        got some progress on it.

        I have sent another email with the title 'BoundedOutOfOrderness
        Watermark Generator is NOT making the event time to advance'
        using another email of mine, fuyao...@oracle.com
<mailto:fuyao...@oracle.com>. That email contains some more
        context on my issue. Please take a look. I have made some
        progress after sending that new email.

        Previously, I had managed to make timelag watermark strategy
        working in my code, but my bound out of orderness strategy or
        punctuated watermark strategy doesn't work well. It produces 8
        watermarks each time. Two cycles are shown below.

        I managed to figure out the root cause is that Flink stream
        execution environment has a default parallelism as 8.*I didn't
        notice in the doc, could the Community add this explicitly into
        the official doc to avoid some confusion? Thanks.*

         From my understanding, the watermark advances based on the

lowest watermark among the 8, so I can not advance the boundout

        of orderness watermark since I am only advancing 1 of the 8
        parallelisms. If I set the entire stream execution environment
        to be of parallelism 1, it will reflect the watermark in the
        context correctly. One more thing is that this behavior is not
        reflected in the Flink Cluster web UI interface. I can see the
        watermark is advancing, but it is not in reality. *That's

causing the inconsistency problem I mentioned in the otheremail

        I mentioned above. Will this be considered as a bug in the UI?*

        My current question is, since I have full outer join operation

before the KeyedProcessFunction here. How can I let thebound of

        orderness watermark / punctuated watermark strategy work if the
        parallelism > 1? It can only update one of the 8 parallelisms
        for the watermark for this onTimer operator. Is this related to
        my Table full outer join operation before this step? According
        to the doc,

https://urldefense.com/v3/__https://ci.apache.org/projects/flink/flink-docs-release-1.11/dev/table/config.html*table-exec-resource-default-parallelism__;Iw!!GqivPVa7Brio!ItHlGfYT1dLQeAolQoFNfXPN876842lnF4hOE7cxmmTJY4tJkXUmkz7J4wxLjc0$<https://urldefense.com/v3/__https://ci.apache.org/projects/flink/flink-docs-release-1.11/dev/table/config.html*table-exec-resource-default-parallelism__;Iw!!GqivPVa7Brio!ItHlGfYT1dLQeAolQoFNfXPN876842lnF4hOE7cxmmTJY4tJkXUmkz7J4wxLjc0$>


        Default parallelism should be the same like the stream
        environment. Why can't I update the watermarks for all 8
        parallelisms? What should I do to enable this function with
        Parallelism larger than 1? Thanks.

        First round: (Note the first column of each log row is the
        timelag strategy, it is getting updated correctly for all 8
        parallelism, but the other two strategies I mentioned above
        can't do that..)

        14:28:01,199 INFO
org.myorg.quickstart.operator.PeriodicTableOutputWatermarkGenerator

- Emit Watermark: watermark based on system time:1605047266198,

        periodicEmitWatermarkTime: 0, currentMaxTimestamp: 15000
        14:28:01,199 INFO
org.myorg.quickstart.operator.PeriodicTableOutputWatermarkGenerator

- Emit Watermark: watermark based on system time:1605047266199,

        periodicEmitWatermarkTime: 1605047172881, currentMaxTimestamp:
        1605047187881 (only one of the 8 parallelism for bound out of
        orderness is getting my new watermark)
        14:28:01,199 INFO
org.myorg.quickstart.operator.PeriodicTableOutputWatermarkGenerator

- Emit Watermark: watermark based on system time:1605047266199,

        periodicEmitWatermarkTime: 0, currentMaxTimestamp: 15000
        14:28:01,199 INFO
org.myorg.quickstart.operator.PeriodicTableOutputWatermarkGenerator

- Emit Watermark: watermark based on system time:1605047266198,

        periodicEmitWatermarkTime: 0, currentMaxTimestamp: 15000
        14:28:01,199 INFO
org.myorg.quickstart.operator.PeriodicTableOutputWatermarkGenerator

- Emit Watermark: watermark based on system time:1605047266198,

        periodicEmitWatermarkTime: 0, currentMaxTimestamp: 15000
        14:28:01,199 INFO
org.myorg.quickstart.operator.PeriodicTableOutputWatermarkGenerator

- Emit Watermark: watermark based on system time:1605047266198,

        periodicEmitWatermarkTime: 0, currentMaxTimestamp: 15000
        14:28:01,199 INFO
org.myorg.quickstart.operator.PeriodicTableOutputWatermarkGenerator

- Emit Watermark: watermark based on system time:1605047266198,

        periodicEmitWatermarkTime: 0, currentMaxTimestamp: 15000
        14:28:01,199 INFO
org.myorg.quickstart.operator.PeriodicTableOutputWatermarkGenerator

- Emit Watermark: watermark based on system time:1605047266198,

        periodicEmitWatermarkTime: 0, currentMaxTimestamp: 15000

Second round: (I set the autoWatermark interval to be 5seconds)

        14:28:06,200 INFO
org.myorg.quickstart.operator.PeriodicTableOutputWatermarkGenerator