Haopu Wang created SPARK-25171:
----------------------------------

             Summary: After restart, StreamingContext is replaying the last successful micro-batch right before the stop
                 Key: SPARK-25171
                 URL: https://issues.apache.org/jira/browse/SPARK-25171
             Project: Spark
          Issue Type: Bug
          Components: DStreams
    Affects Versions: 2.3.1
            Reporter: Haopu Wang

Please look at this line:
[https://github.com/apache/spark/blob/8bde4678166f5f01837919d4f8d742b89f5e76b8/streaming/src/main/scala/org/apache/spark/streaming/scheduler/JobGenerator.scala#L216]

"checkpointTime" represents a micro-batch that has already completed successfully. Why is it still treated as "pending" after a restart? I think this is a bug: it causes that batch to be processed a second time.
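
To make the concern concrete, here is a minimal Python model of the behaviour described in this report. It is not Spark code. It assumes the restart logic builds the list of batches to reschedule as a range that starts at "checkpointTime" inclusive, and that "checkpointTime" is the last batch that finished before the stop, as this report argues. The function name batch_times and all the numbers are hypothetical.

```python
def batch_times(start_ms, end_ms, batch_ms, include_start=True):
    """Return batch times from start_ms up to (not including) end_ms.

    include_start=True models a range that begins at the checkpoint time itself;
    include_start=False models a range that begins one batch interval later.
    """
    first = start_ms if include_start else start_ms + batch_ms
    return list(range(first, end_ms, batch_ms))


# Hypothetical values, in milliseconds.
checkpoint_time = 3000  # assumed: last batch that completed before the stop
restart_time = 6000     # assumed: first batch boundary after the restart
batch_interval = 1000

# Inclusive start: the already-completed batch at 3000 is scheduled again.
replayed = batch_times(checkpoint_time, restart_time, batch_interval, include_start=True)

# Exclusive start: only the batches missed while the application was down.
missed_only = batch_times(checkpoint_time, restart_time, batch_interval, include_start=False)

print(replayed)
print(missed_only)
```

Under these assumptions, the inclusive variant reschedules the batch at "checkpointTime" along with the batches missed during downtime, which is the duplicate processing reported here. Whether an exclusive start would be safe depends on whether a checkpoint is only ever written after its batch has fully completed, which this sketch does not establish.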