Re: Query regarding Proleptic Gregorian Calendar Spark3

2022-09-20 Thread Sachit Murarka
Reposting once. Kind Regards, Sachit Murarka On Tue, Sep 20, 2022 at 6:56 PM Sachit Murarka wrote: > Hi All, > > I am getting below error , I read the document and understood that we need > to set 2 properties > spark.conf.set("spark.sql.parquet.int96RebaseModeInRead","CORRECTED") >

Query regarding Proleptic Gregorian Calendar Spark3

2022-09-20 Thread Sachit Murarka
Hi All, I am getting below error , I read the document and understood that we need to set 2 properties spark.conf.set("spark.sql.parquet.int96RebaseModeInRead","CORRECTED") spark.conf.set("spark.sql.parquet.int96RebaseModeInWrite","CORRECTED") is this the only way or is there any other way to

Error - Spark STREAMING

2022-09-20 Thread Akash Vellukai
Hello, py4j.protocol.Py4JJavaError: An error occurred while calling o80.load. : java.lang.NoClassDefFoundError: org/apache/spark/sql/internal/connector/SimpleTableProvider May anyone help Me to solve this issue. Thanks and regards Akash

Re: Issue with SparkContext

2022-09-20 Thread javacaoyu
Is you using the pyspark? If pyspark, you can try to set env about PYSPARK_PYTHON SPARK_HOME Example: import os os.environ['PYSPARK_PYTHON'] = “python path” os.environ[’SPARK_HOME’] = “SPARK path” you can try this code…may it can resolved this. 在 2022年9月20日 17:34,Bjørn Jørgensen 写道: Hi,

Re: Issue with SparkContext

2022-09-20 Thread Bjørn Jørgensen
Hi, we have a user group at user@spark.apache.org You must install a java JRE If you are on ubuntu you can type apt-get install openjdk-17-jre-headless tir. 20. sep. 2022 kl. 06:15 skrev yogita bhardwaj < yogita.bhard...@iktara.ai>: > > > I am getting the py4j.protocol.Py4JJavaError while

Re: Re: [how to]RDD using JDBC data source in PySpark

2022-09-20 Thread Bjørn Jørgensen
There is a PR for this now. [SPARK-40491][SQL] Expose a jdbcRDD function in SparkContext man. 19. sep. 2022 kl. 12:47 skrev javaca...@163.com : > Thank you Bjorn Jorgensen and also thank to Sean Owen. > > DataFrame and .format("jdbc") is good way to

Re: NoClassDefError and SparkSession should only be created and accessed on the driver.

2022-09-20 Thread Paul Rogalinski
Hi Rajat, I have been facing similar problem recently and could solve it by moving the UDF implementation into a dedicated class instead having it implemented in the driver class/object. Regards, Paul. On Tuesday 20 September 2022 10:11:31 (+02:00), rajat kumar wrote: Hi Alton, it's in

Re: NoClassDefError and SparkSession should only be created and accessed on the driver.

2022-09-20 Thread rajat kumar
Hi Alton, it's in same scala class only. Is there any change in spark3 to serialize separately? Regards Rajat On Tue, Sep 20, 2022, 13:35 Xiao, Alton wrote: > Can you show us your code? > > your udf wasn’t serialized by spark, In my opinion, were they out of the > spark running code? > > > >

答复: NoClassDefError and SparkSession should only be created and accessed on the driver.

2022-09-20 Thread Xiao, Alton
Can you show us your code? your udf wasn’t serialized by spark, In my opinion, were they out of the spark running code? 发件人: rajat kumar 日期: 星期二, 2022年9月20日 15:58 收件人: user @spark 主题: NoClassDefError and SparkSession should only be created and accessed on the driver. Hello , I am using

NoClassDefError and SparkSession should only be created and accessed on the driver.

2022-09-20 Thread rajat kumar
Hello , I am using Spark3 where there are some UDFs along . I am using Dataframe APIs to write parquet using spark. I am getting NoClassDefError along with below error. If I comment out all UDFs , it is working fine. Could someone suggest what could be wrong. It was working fine in Spark2.4