Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/21228#discussion_r185783283 --- Diff: python/pyspark/sql/functions.py --- @@ -151,13 +151,15 @@ def _(): _collect_list_doc = """ Aggregate function: returns a list of objects with duplicates. + The function is non-deterministic because its result depends on order of rows. --- End diff -- I'd use `... note:`
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org