Mridul Muralidharan created SPARK-9339:
------------------------------------------

             Summary: Use of Class.forName(String) should be replaced with version taking classloader
                 Key: SPARK-9339
                 URL: https://issues.apache.org/jira/browse/SPARK-9339
             Project: Spark
          Issue Type: Bug
            Reporter: Mridul Muralidharan



In Spark, several places accept an external class name as input, for 
example listeners.
Other than in specific cases (like SparkEnv), the code typically calls 
Class.forName(clazzName).

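This works when the class is part of Spark, but when the referenced class 
comes from an external (user-provided) jar, it tends to fail: the 
one-argument form resolves the name through the class loader of the calling 
class, which may not see user jars.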

For example, in 1.3 we get this when using a custom listener:

ERROR ApplicationMaster: User class threw exception: Exception when registering SparkListener
org.apache.spark.SparkException: Exception when registering SparkListener
        at org.apache.spark.SparkContext.setupAndStartListenerBus(SparkContext.scala:1726)
        at org.apache.spark.SparkContext.<init>(SparkContext.scala:429)
        at org.apache.spark.SparkContext.<init>(SparkContext.scala:134)
        at com.yahoo.corp.yst.webmap.spark.PageRankDataGenerator$.main(PageRankDataGenerator.scala:170)
        at com.yahoo.corp.yst.webmap.spark.PageRankDataGenerator.main(PageRankDataGenerator.scala)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at org.apache.spark.deploy.yarn.ApplicationMaster$$anon$2.run(ApplicationMaster.scala:480)
Caused by: java.lang.ClassNotFoundException: <MySparkListener>
        at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:190)
        at org.apache.spark.SparkContext$$anonfun$setupAndStartListenerBus$1.apply(SparkContext.scala:1694)
        at org.apache.spark.SparkContext$$anonfun$setupAndStartListenerBus$1.apply(SparkContext.scala:1691)
        at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
        at scala.collection.mutable.WrappedArray.foreach(WrappedArray.scala:34)
        at org.apache.spark.SparkContext.setupAndStartListenerBus(SparkContext.scala:1691)
        ... 9 more
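
The trace shows the lookup going through sun.misc.Launcher$AppClassLoader, 
that is, the loader of the calling class, not the thread's context class 
loader. The difference between the two forms of Class.forName can be seen in 
a small standalone sketch. RecordingLoader and LoaderDemo are hypothetical 
names made up for illustration; they are not part of Spark.

```scala
import scala.collection.mutable.ArrayBuffer

// Hypothetical loader for illustration: records each class name it is asked
// to load, then delegates to its parent as usual.
class RecordingLoader(parent: ClassLoader) extends ClassLoader(parent) {
  val requested: ArrayBuffer[String] = ArrayBuffer.empty[String]

  override protected def loadClass(name: String, resolve: Boolean): Class[_] = {
    requested.synchronized { requested += name }
    super.loadClass(name, resolve)
  }
}

object LoaderDemo {
  // Returns (seenByOneArg, seenByThreeArg): whether the context class loader
  // was consulted by each form of Class.forName.
  def run(className: String): (Boolean, Boolean) = {
    val thread = Thread.currentThread()
    val original = thread.getContextClassLoader
    val recorder = new RecordingLoader(getClass.getClassLoader)
    thread.setContextClassLoader(recorder)
    try {
      // One-argument form: resolved through the loader of the calling class.
      Class.forName(className)
      val seenByOneArg = recorder.requested.contains(className)

      // Three-argument form: resolved through the loader passed in.
      Class.forName(className, true, thread.getContextClassLoader)
      val seenByThreeArg = recorder.requested.contains(className)

      (seenByOneArg, seenByThreeArg)
    } finally {
      // Restore the thread's original context class loader.
      thread.setContextClassLoader(original)
    }
  }
}
```

Calling LoaderDemo.run("java.util.ArrayList") should report that only the 
three-argument form consulted the context class loader. A class that is 
visible only to the context loader would therefore be missed by the 
one-argument form.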



Instead of
"val listenerClass = Class.forName(className)"
in SparkContext.setupAndStartListenerBus, we should use
"val listenerClass = Class.forName(className, true, Thread.currentThread().getContextClassLoader)"

Note: Class.forName(className) is a common pattern in Spark, so the same fix 
might be relevant elsewhere too.
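
Since the pattern recurs, one option is to route every lookup through a 
single helper. The following is only a sketch of that idea: ClassLoading, 
contextOrDefaultLoader and classForName are hypothetical names chosen here, 
and the fallback to the helper's own loader when no context loader is set is 
an assumption, not something taken from the Spark code base.

```scala
object ClassLoading {
  // Prefer the thread's context class loader; fall back to the loader that
  // loaded this object when no context loader is set (assumed policy).
  def contextOrDefaultLoader: ClassLoader =
    Option(Thread.currentThread().getContextClassLoader)
      .getOrElse(getClass.getClassLoader)

  // Intended as a replacement for Class.forName(className) at call sites
  // that may receive user-provided class names.
  def classForName(className: String): Class[_] =
    Class.forName(className, true, contextOrDefaultLoader)
}
```

With such a helper, the call in setupAndStartListenerBus would read 
"val listenerClass = ClassLoading.classForName(className)", and other call 
sites could be switched the same way.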


