Jason Tokayer created SPARK-20296: ------------------------------------- Summary: UnsupportedOperationChecker text on distinct aggregations differs from docs Key: SPARK-20296 URL: https://issues.apache.org/jira/browse/SPARK-20296 Project: Spark Issue Type: Documentation Components: Structured Streaming Affects Versions: 2.1.0 Reporter: Jason Tokayer Priority: Minor
In the unsupported operations section in the docs https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html it states that "Distinct operations on streaming Datasets are not supported.". However, in ```org.apache.spark.sql.catalyst.analysis.UnsupportedOperationChecker.scala```, the error message is ```Distinct aggregations are not supported on streaming DataFrames/Datasets, unless it is on aggregated DataFrame/Dataset in Complete output mode. Consider using approximate distinct aggregation```. It seems that the error message is incorrect. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org