[
https://issues.apache.org/jira/browse/SPARK-35880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368965#comment-17368965
]
Apache Spark commented on SPARK-35880:
--------------------------------------
User 'vkorukanti' has created a pull request for this issue:
https://github.com/apache/spark/pull/33065
> [SS] Track the number of duplicates dropped in streaming dedupe operator
> ------------------------------------------------------------------------
>
> Key: SPARK-35880
> URL: https://issues.apache.org/jira/browse/SPARK-35880
> Project: Spark
> Issue Type: Improvement
> Components: Structured Streaming
> Affects Versions: 3.1.2
> Reporter: Venki Korukanti
> Priority: Minor
>
> Currently there is no way to find how many duplicates in the input are
> dropped. Having this metric will help track down incorrect results issues.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]