Jason Tokayer created SPARK-20296:
-------------------------------------

             Summary: UnsupportedOperationChecker text on distinct aggregations 
differs from docs
                 Key: SPARK-20296
                 URL: https://issues.apache.org/jira/browse/SPARK-20296
             Project: Spark
          Issue Type: Documentation
          Components: Structured Streaming
    Affects Versions: 2.1.0
            Reporter: Jason Tokayer
            Priority: Minor


In the unsupported operations section in the docs 
https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html
 it states that "Distinct operations on streaming Datasets are not supported.". 
However, in 
```org.apache.spark.sql.catalyst.analysis.UnsupportedOperationChecker.scala```, 
the error message is ```Distinct aggregations are not supported on streaming 
DataFrames/Datasets, unless it is on aggregated DataFrame/Dataset in Complete 
output mode. Consider using approximate distinct aggregation```.

It seems that the error message is incorrect.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to