[ 
https://issues.apache.org/jira/browse/SPARK-31312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17073264#comment-17073264
 ] 

Jungtaek Lim commented on SPARK-31312:
--------------------------------------

No, it wasn't triggered by SPARK-26560 and should be long-lived bug same as 
SPARK-26560, although I haven't encountered the bug in practice so I can't say 
which version is the oldest.

Btw, while I always appreciate your details and diligence on leaving 
information, it's a different story if that is required to others. That would 
become a burden to do actual work, spending more time on "boilerplate" than the 
origin work. There should be a "balance", and the balance should be decided by 
consensus of community.

Let's discuss more in dev@ mailing list.

> Transforming Hive simple UDF (using JAR) expression may incur CNFE in later 
> evaluation
> --------------------------------------------------------------------------------------
>
>                 Key: SPARK-31312
>                 URL: https://issues.apache.org/jira/browse/SPARK-31312
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.4.5, 3.0.0
>            Reporter: Jungtaek Lim
>            Assignee: Jungtaek Lim
>            Priority: Major
>             Fix For: 3.0.0, 2.4.6
>
>
> In SPARK-26560, we ensured that Hive UDF using JAR is executed regardless of 
> current thread context classloader.
> [~cloud_fan] pointed out another potential issue in post-review of 
> SPARK-26560 - quoting the comment:
> {quote}
> Found a potential problem: here we call HiveSimpleUDF.dateType (which is a 
> lazy val), to force to load the class with the corrected class loader.
> However, if the expression gets transformed later, which copies 
> HiveSimpleUDF, then calling HiveSimpleUDF.dataType will re-trigger the class 
> loading, and at that time there is no guarantee that the corrected 
> classloader is used.
> I think we should materialize the loaded class in HiveSimpleUDF.
> {quote}
> This JIRA issue is to track the effort of verifying the potential issue and 
> fixing the issue.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to