Github user icexelloss commented on a diff in the pull request: https://github.com/apache/spark/pull/18732#discussion_r141950131 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/RelationalGroupedDataset.scala --- @@ -47,8 +47,8 @@ import org.apache.spark.sql.types.StructType */ @InterfaceStability.Stable class RelationalGroupedDataset protected[sql]( - df: DataFrame, - groupingExprs: Seq[Expression], + val df: DataFrame, + val groupingExprs: Seq[Expression], --- End diff -- `val df` is used for accessing the jdf object from python: https://github.com/icexelloss/spark/blob/groupby-apply-SPARK-20396/python/pyspark/sql/group.py#L204 Alternatively, I can restore a reference to the python DataFrame in python GroupedData object. It doesn't seem to be much different though.
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org