GitHub user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21193#discussion_r186627099
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala ---
    @@ -623,8 +624,14 @@ case class Cast(child: Expression, dataType: DataType, timeZoneId: Option[String
       override def doGenCode(ctx: CodegenContext, ev: ExprCode): ExprCode = {
         val eval = child.genCode(ctx)
         val nullSafeCast = nullSafeCastFunction(child.dataType, dataType, ctx)
    +
    +    // Below the code comment including `eval.value` and `eval.isNull` is a trick. It makes the two
    +    // expr values are referred by this code block.
         ev.copy(code = eval.code +
    -      castCode(ctx, eval.value, eval.isNull, ev.value, ev.isNull, dataType, nullSafeCast))
    +      code"""
    +        // Cast from ${eval.value}, ${eval.isNull}
    --- End diff --
    
    I feel it's a little fragile to depend on the `StringContext` to collect
    references. Two proposals:
    1. Ask the code builder to specify the references explicitly, like
       `JavaCode.block(code = xxx, ref1, ref2, ...)`.
    2. Ban `String` in the code block string context, and create a special
       class for inserting a `String` literal into a code block, so that we
       won't mistakenly pass code as a plain string and lose references.
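
A rough sketch of what the two proposals could look like. This is a hypothetical illustration, not Spark's actual `JavaCode` API: `CodePart`, `Inline`, `Block`, `block`, and the simplified `ExprValue` below are assumed names chosen only for this example.

```scala
// Hypothetical sketch; none of these definitions are Spark's real classes.
object CodeBlockSketch {
  // Anything that is allowed to appear inside a code block interpolation.
  sealed trait CodePart

  // A reference to a generated Java variable (simplified stand-in).
  final case class ExprValue(name: String) extends CodePart

  // Proposal 2: the only way to splice plain text into a code block.
  final case class Inline(text: String) extends CodePart

  // Generated code plus the expression values it refers to.
  final case class Block(code: String, refs: List[ExprValue])

  // Proposal 1: the caller lists the references explicitly.
  def block(code: String, refs: ExprValue*): Block = Block(code, refs.toList)

  // Proposal 2: an interpolator whose arguments must be CodePart, so passing
  // a bare String is a compile-time type error instead of a lost reference.
  implicit class CodeInterpolator(sc: StringContext) {
    def code(args: CodePart*): Block = {
      val sb = new StringBuilder(sc.parts.head)
      // parts(i + 1) follows args(i), so append the argument, then the part.
      sc.parts.tail.zip(args).foreach { case (part, arg) =>
        arg match {
          case ExprValue(name) => sb.append(name)
          case Inline(text)    => sb.append(text)
        }
        sb.append(part)
      }
      Block(sb.toString, args.collect { case v: ExprValue => v }.toList)
    }
  }
}
```

With these definitions, `code"// Cast from $value, $isNull"` records both `ExprValue`s as references, while `code"${"some string"}"` does not compile. Explicitly wrapping text in `Inline(...)` makes the loss of reference tracking a visible, deliberate choice.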

