Github user icexelloss commented on a diff in the pull request: https://github.com/apache/spark/pull/19872#discussion_r162980691 --- Diff: python/pyspark/sql/functions.py --- @@ -2221,6 +2223,35 @@ def pandas_udf(f=None, returnType=None, functionType=None): .. seealso:: :meth:`pyspark.sql.GroupedData.apply` + 3. GROUP_AGG + + A group aggregate UDF defines a transformation: One or more `pandas.Series` -> A scalar + The `returnType` should be a primitive data type, e.g, :class:`DoubleType`. --- End diff -- Fixed. Thanks!
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org