Divya Goel created ZEPPELIN-4973:
------------------------------------
Summary: Zeppelin spark jobs are getting hung and return with
different errors each time.
Key: ZEPPELIN-4973
URL: https://issues.apache.org/jira/browse/ZEPPELIN-4973
Project: Zeppelin
Issue Type: Bug
Components: Interpreters, spark
Affects Versions: 0.8.2
Environment: Hi,
I've been encountering this issue since 1 month and every time I've to restart
the spark interpreter which at first makes things more vulnerable then after
re-login again I run the job and again back to the job's hanging.
Could you please assist me as I've to use zeppelin for my data visualization
which is on stake due to this issue.
With much regards,
Divya
Reporter: Divya Goel
Fix For: 0.8.2
Attachments: zeppelin_error.PNG, zeppelin_sparkjob.PNG
Hi,Hi,
I've kerberized cluster and my kerberos ticket is renewed each day providing me
the valid key. When I run spark job from my zeppelin IDE, it first gets stuck
for 2.5-3 hours and after that I get an error mentioned below.
GSSException: No valid credentials provided (Mechanism level: Failed to find
any Kerberos tgt) at
sun.security.jgss.krb5.Krb5InitCredential.getInstance(Krb5InitCredential.java:147)
at
sun.security.jgss.krb5.Krb5MechFactory.getCredentialElement(Krb5MechFactory.java:122)
at
sun.security.jgss.krb5.Krb5MechFactory.getMechanismContext(Krb5MechFactory.java:187)
at
sun.security.jgss.GSSManagerImpl.getMechanismContext(GSSManagerImpl.java:224)
at sun.security.jgss.GSSContextImpl.initSecContext(GSSContextImpl.java:212) at
sun.security.jgss.GSSContextImpl.initSecContext(GSSContextImpl.java:179) at
com.sun.security.sasl.gsskerb.GssKrb5Client.evaluateChallenge(GssKrb5Client.java:192)
at
org.apache.hadoop.security.SaslRpcClient.saslConnect(SaslRpcClient.java:413) at
org.apache.hadoop.ipc.Client$Connection.setupSaslConnection(Client.java:594) at
org.apache.hadoop.ipc.Client$Connection.access$2000(Client.java:396) at
org.apache.hadoop.ipc.Client$Connection$2.run(Client.java:761) at
org.apache.hadoop.ipc.Client$Connection$2.run(Client.java:757) at
java.security.AccessController.doPrivileged(Native Method) at
javax.security.auth.Subject.doAs(Subject.java:422) at
I've enabled the user impersonation in zeppelin that's why zeppelin keytab and
principals are being submitted to spark interpreter by properties:
zeppelin.spark.keytab and zeppelin.spark.principal.
It's strange that this is the persistent error but sometimes out of the blue I
get error mentioned below after 2.5 to 3 hours:
java.lang.NullPointerException at
org.apache.thrift.transport.TSocket.open(TSocket.java:170) at
org.apache.zeppelin.interpreter.remote.ClientFactory.create(ClientFactory.java:51)
at
org.apache.zeppelin.interpreter.remote.ClientFactory.create(ClientFactory.java:37)
at
org.apache.commons.pool2.BasePooledObjectFactory.makeObject(BasePooledObjectFactory.java:60)
at
org.apache.commons.pool2.impl.GenericObjectPool.create(GenericObjectPool.java:861)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)