Kevin W Monroe created ZEPPELIN-2458:
----------------------------------------

             Summary: tutorial requires sqlContext.implicits._?
                 Key: ZEPPELIN-2458
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-2458
             Project: Zeppelin
          Issue Type: Bug
    Affects Versions: 0.7.0
         Environment: Ubuntu 16.04 with zeppelin installed from Apache Bigtop 
repository.
            Reporter: Kevin W Monroe
            Priority: Minor


I'm unable to run the Basic Features (Spark) notebook included in Zeppelin 
0.7.0:

{noformat}
DEBUG [2017-04-26 16:07:34,967] ({Exec Stream Pumper} 
RemoteInterpreterManagedProcess.java[processLine]:189) - defined class Bank
DEBUG [2017-04-26 16:07:35,312] ({Exec Stream Pumper} 
RemoteInterpreterManagedProcess.java[processLine]:189) - <console>:32: error: 
value toDF is not a member of org.apache.spark.rdd.RDD[Bank]
DEBUG [2017-04-26 16:07:35,313] ({Exec Stream Pumper} 
RemoteInterpreterManagedProcess.java[processLine]:189) - possible cause: maybe 
a semicolon is missing before `value toDF'?
DEBUG [2017-04-26 16:07:35,313] ({Exec Stream Pumper} 
RemoteInterpreterManagedProcess.java[processLine]:189) -        ).toDF()
DEBUG [2017-04-26 16:07:35,313] ({Exec Stream Pumper} 
RemoteInterpreterManagedProcess.java[processLine]:189) -          ^
 WARN [2017-04-26 16:07:35,359] ({pool-2-thread-1} 
NotebookServer.java[afterStatusChange]:2026) - Job 20150210-015259_1403135953 
is finished, status: ERROR, exception: null, result:
{noformat}

This was mentioned way back in ZEPPELIN-90, but the resolution was to move to 
spark >= 1.3.  I'm using Spark 2.1.0 (verfied with {{sc.version}} in a zeppelin 
paragraph).  The spark interpreter master setting is {{local\[*]}} with 
spark.home set to {{/var/lib/spark}}, which is where spark debs are installed.  
I have the following package versions:

{noformat}
$ dpkg -l | grep -E 'spark|zeppelin'
ii  spark-core                       2.1.0-1                                    
all          Lightning-Fast Cluster Computing
ii  spark-datanucleus                2.1.0-1                                    
all          DataNucleus libraries for Apache Spark
ii  spark-python                     2.1.0-1                                    
all          Python client for Spark
ii  zeppelin                         0.7.0-1                                    
all          Web-based notebook for data analysts
{noformat}

The workaround for me is to import the sqlcontext.implicits in the affected 
paragraph:

{noformat}
import sqlContext.implicits._
{noformat} 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to