[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16073653#comment-16073653 ]

Apache Spark commented on SPARK-21101:
--------------------------------------

User 'wangyum' has created a pull request for this issue:
https://github.com/apache/spark/pull/18527

> Error running Hive temporary UDTF on latest Spark 2.2
> -----------------------------------------------------
>
>                 Key: SPARK-21101
>                 URL: https://issues.apache.org/jira/browse/SPARK-21101
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.2.1
>            Reporter: Dayou Zhou
>
> I'm using temporary UDTFs on Spark 2.2, e.g.
> CREATE TEMPORARY FUNCTION myudtf AS 'com.foo.MyUdtf' USING JAR 'hdfs:///path/to/udf.jar';
> But when I try to invoke it, I get the following error:
> {noformat}
> 17/06/14 19:43:50 ERROR SparkExecuteStatementOperation: Error running hive query:
> org.apache.hive.service.cli.HiveSQLException: org.apache.spark.sql.AnalysisException: No handler for Hive UDF 'com.foo.MyUdtf': java.lang.NullPointerException; line 1 pos 7
>     at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation.org$apache$spark$sql$hive$thriftserver$SparkExecuteStatementOperation$$execute(SparkExecuteStatementOperation.scala:266)
>     at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation$$anon$1$$anon$2.run(SparkExecuteStatementOperation.scala:174)
>     at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation$$anon$1$$anon$2.run(SparkExecuteStatementOperation.scala:171)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:422)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1698)
>     at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation$$anon$1.run(SparkExecuteStatementOperation.scala:184)
>     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>     at java.lang.Thread.run(Thread.java:745)
> {noformat}
> Any help appreciated, thanks.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16053647#comment-16053647 ]

Liang-Chi Hsieh commented on SPARK-21101:
-----------------------------------------

May I ask what Hive version your UDTF is based on?
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16053501#comment-16053501 ]

Dayou Zhou commented on SPARK-21101:
------------------------------------

Hi [~q79969786], thank you kindly for your response. I was not aware of this 'other' version of the initialize() method, and will try your suggestion tomorrow.
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16053145#comment-16053145 ]

Yuming Wang commented on SPARK-21101:
-------------------------------------

[~dyzhou], Can you try to override
https://github.com/apache/hive/blob/release-2.0.0/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDTF.java#L70

It works for me:
{code:sql}
add jar hdfs://nameservice1/tmp/wym/hive-exec-1.1.0-cdh5.4.3.jar;
CREATE TEMPORARY FUNCTION spark_21101 AS 'org.apache.hadoop.hive.ql.udf.generic.GenericUDTFStack';
select spark_21101(2,'A',10,date '2015-01-01','B',20,date '2016-01-01');
{code}

Ref:
https://github.com/apache/hive/blob/release-2.0.0/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDTFStack.java
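Editor's note on the override suggestion above: the thread does not spell out which initialize() overload is meant, which one Spark calls, or why the reporter saw a NullPointerException. The sketch below only illustrates, in plain Java with no Hive dependency, how a class with two initialize() overloads can behave differently depending on which one a subclass overrides. Every name in it (BaseUdtf, StructOnlyUdtf, ArrayUdtf, UdtfInitDemo) is hypothetical, and the default behaviour of BaseUdtf (one overload forwards to the other, which throws unless overridden) is an assumption modelled loosely on GenericUDTF, not a copy of it.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for a UDTF base class with two initialize() overloads. */
abstract class BaseUdtf {
    // Struct-style entry point: by default it forwards to the array-style overload.
    List<String> initialize(List<String> structFields) {
        return initialize(structFields.toArray(new String[0]));
    }

    // Array-style entry point: by default it refuses to run (assumed behaviour).
    List<String> initialize(String[] argTypes) {
        throw new IllegalStateException("Should not be called directly");
    }
}

/** Overrides only the struct-style overload. */
class StructOnlyUdtf extends BaseUdtf {
    @Override
    List<String> initialize(List<String> structFields) {
        return Arrays.asList("col1");
    }
}

/** Overrides the array-style overload; the inherited struct-style one forwards to it. */
class ArrayUdtf extends BaseUdtf {
    @Override
    List<String> initialize(String[] argTypes) {
        return Arrays.asList("col1");
    }
}

public class UdtfInitDemo {
    /** Mimics a caller that only ever invokes the array-style overload. */
    static String callArrayOverload(BaseUdtf udtf) {
        try {
            return "ok: " + udtf.initialize(new String[] {"int", "string"});
        } catch (RuntimeException e) {
            return e.getClass().getSimpleName() + ": " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        // The struct-only subclass never gets a chance to run its own code.
        System.out.println(callArrayOverload(new StructOnlyUdtf()));
        // The array-style subclass works through either entry point.
        System.out.println(callArrayOverload(new ArrayUdtf()));
    }
}
```

Under these assumptions, a UDTF that overrides the array-style overload is reachable from both entry points, which may be why overriding that "other" initialize() was worth trying.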
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16051017#comment-16051017 ]

Dayou Zhou commented on SPARK-21101:
------------------------------------

Hi [~srowen], thanks for the helpful and constructive comment.

So yes, I have also tried starting STS using the --jars option, i.e.

./start-thriftserver.sh --jars /path/to/udf.jar

and have also verified that by doing this, I no longer need to specify USING JAR when creating my udf, i.e.

CREATE TEMPORARY FUNCTION myudtf AS 'com.foo.MyUdtf'

However, the bad news is that when I invoke the udf, I get exactly the same error as before, i.e.

>> No handler for Hive UDF 'com.foo.MyUdtf': java.lang.NullPointerException; line 1 pos 7

So I have reported what I wanted to report and I will leave the authorities to decide whether this is a bug or a 'question' (even though I do have an opinion on which).

Thanks for your help.
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050774#comment-16050774 ]

Sean Owen commented on SPARK-21101:
-----------------------------------

[~dyzhou] I see your reply. The thrift server should just be another job that is spark-submit-ted, so I think you can in fact use {{--jars}} to add JARs to its classpath. That is what the guidance is getting at here. It is therefore more of a question than a JIRA issue. I see why you're not convinced of that, but the way forward is to try the idea in that link next.

Generally: I would only treat comments from committers or regular contributors as authoritative.
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050762#comment-16050762 ]

Dayou Zhou commented on SPARK-21101:
------------------------------------

Hi [~zhangzr1026], I'm still waiting for someone (anyone) to explain to me why this is not a bug, but whatever. If this is how you treat people who love Spark, use Spark, and are trying to help make it better, then fine.
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050111#comment-16050111 ]

zhangzr commented on SPARK-21101:
---------------------------------

Hi Dayou Zhou, JIRA is a place to report an almost certain bug, not a place to ask questions. For now, the question you ask cannot be seen as a bug.
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050048#comment-16050048 ]

Dayou Zhou commented on SPARK-21101:
------------------------------------

Hi [~sowen],

>> did you read the link he posted?

Yes I did, but did you read my response? I'm not using spark shell, I'm using spark thrift server, with USING JAR syntax:

CREATE TEMPORARY FUNCTION myudtf AS 'com.foo.MyUdtf' USING JAR 'hdfs:///path/to/udf.jar';
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050041#comment-16050041 ]

Sean Owen commented on SPARK-21101:
-----------------------------------

[~dyzhou] did you read the link he posted? This does not seem like a bug if you're not even passing your jar to the app. I would also close it.
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16050031#comment-16050031 ]

Dayou Zhou commented on SPARK-21101:
------------------------------------

Hi [~maropu],

>> I'll close this because this seems to be a bug.

This sounds bizarre, maybe you meant it wasn't a bug, but anyway, I did not start by asking a question, I started by reporting an error which is probably a bug. What is your justification that it is NOT a bug and what is your justification of closing it as 'not a problem' when you don't even seem to understand it?
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049977#comment-16049977 ]

Takeshi Yamamuro commented on SPARK-21101:
------------------------------------------

Since JIRA is not a place for questions, you'd better ask on spark-user.
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049915#comment-16049915 ]

Dayou Zhou commented on SPARK-21101:

Hi [~maropu], yes, I saw this one, but in my case I'm invoking the UDTF through the JDBC Thrift server, not spark-shell. So is there a way to pass my JAR to the Thrift server?
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049892#comment-16049892 ]

Takeshi Yamamuro commented on SPARK-21101:

See https://www.mail-archive.com/user@spark.apache.org/msg61009.html
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049891#comment-16049891 ]

Dayou Zhou commented on SPARK-21101:

Hi [~maropu]

>>You just don't pass your uber-jar into spark?
Sorry, I'm not sure what you meant. Could you clarify your question?

>>Or, you mean the query above worked well on previous spark?
I did not try it with earlier versions, but I think the behavior is likely the same.
[jira] [Commented] (SPARK-21101) Error running Hive temporary UDTF on latest Spark 2.2
[ https://issues.apache.org/jira/browse/SPARK-21101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049883#comment-16049883 ]

Takeshi Yamamuro commented on SPARK-21101:

You just don't pass your uber-jar into spark?
Or, you mean the query above worked well on previous spark?