Github user liancheng commented on a diff in the pull request:

    https://github.com/apache/spark/pull/12836#discussion_r66691737
  
    --- Diff: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/object.scala
 ---
    @@ -243,6 +243,55 @@ case class MapGroups(
         outputObjAttr: Attribute,
         child: LogicalPlan) extends UnaryNode with ObjectProducer
     
    +/** Factory for constructing new `FlatMapGroupsInR` nodes. */
    +object FlatMapGroupsInR {
    +  def apply(
    +      func: Array[Byte],
    +      packageNames: Array[Byte],
    +      broadcastVars: Array[Broadcast[Object]],
    +      schema: StructType,
    +      encoder: Expression,
    +      keyEncoder: Expression,
    +      rowEncoder: ExpressionEncoder[Row],
    --- End diff --
    
    Names of the above 3 parameters are quite confusing. The first two are 
actually deserializer expressions rather than encoders, so I guess 
`dataDeserializer` and `keyDeserializer` can be better names. The 3rd one is an 
encoder, but seems that what we actually care about is only its schema.
    
    Basically in most cases you can simply treat encoder as a pair of 
serializer and deserializer expressions.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to