This is probably caused by an empty (no-data) batch. Such a batch is run
when the query has stateful operator(s) and the previous batch advanced the
watermark; it has no input, so the watermark does not move.
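
To make that concrete, here is a minimal sketch in plain Python of how the
two log lines can follow each other. It is a toy model, not Spark's actual
WatermarkTracker: the class and method names are hypothetical, it handles
a single watermark operator only, and the 10-second delay is an assumption
inferred from the quoted log (max event time 15000, watermark 5000).

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class EventTimeStats:
    """Event-time statistics of one batch, in milliseconds."""
    max: int
    min: int
    avg: float
    count: int


@dataclass
class ToyWatermarkTracker:
    """Hypothetical, simplified single-operator watermark tracker."""
    delay_ms: int
    watermark_ms: int = 0
    operator_watermark_ms: Optional[int] = None
    log: List[str] = field(default_factory=list)

    def update(self, stats: Optional[EventTimeStats]) -> None:
        if stats is not None and stats.count > 0:
            # A batch with data proposes max event time minus the delay.
            candidate = stats.max - self.delay_ms
            if (self.operator_watermark_ms is None
                    or candidate > self.operator_watermark_ms):
                self.operator_watermark_ms = candidate
        elif self.operator_watermark_ms is None:
            # No data seen yet at all: start from 0.
            self.operator_watermark_ms = 0

        chosen = self.operator_watermark_ms
        if chosen > self.watermark_ms:
            self.log.append(
                f"Updating event-time watermark from {self.watermark_ms} "
                f"to {chosen} ms")
            self.watermark_ms = chosen
        else:
            # An empty batch ends up here: nothing new was observed.
            self.log.append(
                f"Event time watermark didn't move: {chosen} < "
                f"{self.watermark_ms}")


tracker = ToyWatermarkTracker(delay_ms=10_000)  # assumed 10 s delay
tracker.update(EventTimeStats(max=15000, min=1000, avg=8000.0, count=2))
tracker.update(None)  # the follow-up empty batch has no event-time stats
for line in tracker.log:
    print(line)
```

In this model the first call corresponds to the batch with two rows and the
second call to the empty batch that follows it, which would explain why
"didn't move: 5000 < 5000" shows up right after the update.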

Did you turn on DEBUG only for WatermarkTracker? If you turn on DEBUG for
MicroBatchExecution as well, it logs "Completed batch <batchId>", so unless
I'm missing something, that line should appear between "Updating event-time
watermark" and "Event time watermark didn't move". You can also attach a
streaming query listener to get more information about the batches.
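
As a rough sketch of what to look for in that per-batch information: the
helper below walks over progress reports represented as plain dicts and
flags batches without input. The helper name and the sample values are
hypothetical; the field names (batchId, numInputRows, eventTime.watermark)
are the ones I expect in the streaming query progress JSON, so please
verify them against your Spark version.

```python
from typing import Any, Dict, List


def summarize_batches(progress_reports: List[Dict[str, Any]]) -> List[str]:
    """Return one summary line per progress report, flagging empty batches."""
    lines = []
    for progress in progress_reports:
        batch_id = progress["batchId"]
        rows = progress.get("numInputRows", 0)
        watermark = progress.get("eventTime", {}).get("watermark", "n/a")
        kind = "empty batch" if rows == 0 else "data batch"
        lines.append(
            f"batch {batch_id}: {kind}, numInputRows={rows}, "
            f"watermark={watermark}")
    return lines


# Hypothetical sample reports, for illustration only (not real output).
sample_reports = [
    {"batchId": 0, "numInputRows": 2,
     "eventTime": {"watermark": "1970-01-01T00:00:00.000Z"}},
    {"batchId": 1, "numInputRows": 0,
     "eventTime": {"watermark": "1970-01-01T00:00:05.000Z"}},
]

for line in summarize_batches(sample_reports):
    print(line)
```

A batch with numInputRows=0 right after a batch that advanced the watermark
would match the empty-batch explanation above.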

Thanks,
Jungtaek Lim (HeartSaVioR)

On Tue, Oct 8, 2019 at 6:12 PM Jacek Laskowski <ja...@japila.pl> wrote:

> Hi,
>
> I haven't spent much time on it, but the following DEBUG message
> from WatermarkTracker sparked my interest :)
>
> I ran a streaming aggregation in Append mode and got the messages:
>
> 19/10/08 10:48:56 DEBUG WatermarkTracker: Observed event time stats 0:
> EventTimeStats(15000,1000,8000.0,2)
> 19/10/08 10:48:56 INFO WatermarkTracker: Updating event-time watermark
> from 0 to 5000 ms
> 19/10/08 10:48:56 DEBUG WatermarkTracker: Event time watermark didn't
> move: 5000 < 5000
>
> I think the DEBUG message "Event time watermark didn't move" seems
> incorrect given that the query has just started and the "Observed event
> time stats" message was logged. The message would be right if the
> event-time watermark had been 5000 before, but it was not, since the query
> has just started from scratch (no checkpointed state).
>
> Can anyone shed some light on this? I'll be digging deeper in a bit, but
> am hoping to get some more info first. Thanks!
>
> Regards,
> Jacek Laskowski
> ----
> https://about.me/JacekLaskowski
> The Internals of Spark SQL https://bit.ly/spark-sql-internals
> The Internals of Spark Structured Streaming
> https://bit.ly/spark-structured-streaming
> The Internals of Apache Kafka https://bit.ly/apache-kafka-internals
> Follow me at https://twitter.com/jaceklaskowski
>
>
