[jira] [Updated] (SPARK-25214) Kafka v2 source may return duplicated records when `failOnDataLoss` is `false`

2018-08-24 Thread Shixiong Zhu (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-25214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shixiong Zhu updated SPARK-25214:
-
Affects Version/s: (was: 2.3.1)
   (was: 2.3.0)
   2.4.0

> Kafka v2 source may return duplicated records when `failOnDataLoss` is `false`
> --
>
> Key: SPARK-25214
> URL: https://issues.apache.org/jira/browse/SPARK-25214
> Project: Spark
>  Issue Type: Bug
>  Components: Structured Streaming
>Affects Versions: 2.4.0
>Reporter: Shixiong Zhu
>Assignee: Shixiong Zhu
>Priority: Blocker
>  Labels: correctness
> Fix For: 2.4.0
>
>
> When there are missing offsets, Kafka v2 source may return duplicated records 
> when failOnDataLoss=false because it doesn't skip missing offsets.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-25214) Kafka v2 source may return duplicated records when `failOnDataLoss` is `false`

2018-08-23 Thread Shixiong Zhu (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-25214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shixiong Zhu updated SPARK-25214:
-
Description: 
When there are missing offsets, Kafka v2 source may return duplicated records 
when failOnDataLoss=false because it doesn't skip missing offsets.



> Kafka v2 source may return duplicated records when `failOnDataLoss` is `false`
> --
>
> Key: SPARK-25214
> URL: https://issues.apache.org/jira/browse/SPARK-25214
> Project: Spark
>  Issue Type: Bug
>  Components: Structured Streaming
>Affects Versions: 2.3.0, 2.3.1
>Reporter: Shixiong Zhu
>Assignee: Shixiong Zhu
>Priority: Blocker
>  Labels: correctness
>
> When there are missing offsets, Kafka v2 source may return duplicated records 
> when failOnDataLoss=false because it doesn't skip missing offsets.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org