[ANNOUNCE] Apache Kyuubi released 1.7.0

2023-03-07 Thread Cheng Pan
Hi all, The Apache Kyuubi community is pleased to announce that Apache Kyuubi 1.7.0 has been released! Apache Kyuubi is a distributed multi-tenant Lakehouse gateway for large-scale data processing and analytics, built on top of Apache Spark, Apache Flink, Trino and also supports other computing

Re: Online classes for spark topics

2023-03-07 Thread Mich Talebzadeh
Hi, This might be a worthwhile exercise on the assumption that the contributors will find the time and bandwidth to chip in so to speak. I am sure there are many but on top of my head I can think of Holden Karau for k8s, and Sean Owen for data science stuff. They are both very experienced.

Online classes for spark topics

2023-03-07 Thread ashok34...@yahoo.com.INVALID
Hello gurus, Does Spark arranges online webinars for special topics like Spark on K8s, data science and Spark Structured Streaming? I would be most grateful if experts can share their experience with learners with intermediate knowledge like myself. Hopefully we will find the practical

Re: [Spark Structured Streaming] Could we apply new options of readStream/writeStream without stopping spark application (zero downtime)?

2023-03-07 Thread Mich Talebzadeh
hm interesting proposition. I guess you mean altering one of following parameters in flight streamingDataFrame = self.spark \ .readStream \ .format("kafka") \ .option("kafka.bootstrap.servers", config['MDVariables']['bootstrapServers'],)

Re: 回复:Re: Build SPARK from source with SBT failed

2023-03-07 Thread Tufan Rakshit
I use m1 apple silicon , use java11 from Zulu , and runs SBT based Build Jobs in Kubernetes Best Tufan On Tue, 7 Mar 2023 at 16:11, Sean Owen wrote: > No, it's that JAVA_HOME wasn't set to .../Home. It is simply not finding > javac, in the error. Zulu supports M1. > > On Tue, Mar 7, 2023 at

Re: 回复:Re: Build SPARK from source with SBT failed

2023-03-07 Thread Sean Owen
No, it's that JAVA_HOME wasn't set to .../Home. It is simply not finding javac, in the error. Zulu supports M1. On Tue, Mar 7, 2023 at 9:05 AM Artemis User wrote: > Looks like Maven build did find the javac, just can't run it. So it's not > a path problem but a compatibility problem. Are you

Re: 回复:Re: Build SPARK from source with SBT failed

2023-03-07 Thread Artemis User
Looks like Maven build did find the javac, just can't run it.  So it's not a path problem but a compatibility problem.  Are you doing this on a Mac with M1/M2?  I don't think that Zulu JDK supports Apple silicon.   Your best option would be to use homebrew to install the dev tools (including

回复:Re: Build SPARK from source with SBT failed

2023-03-07 Thread ckgppl_yan
No. I haven't installed Apple Developer Tools. I have installed Zulu OpenJDK 11.0.17 manually.So I need to install Apple Developer Tools?- 原始邮件 - 发件人:Sean Owen 收件人:ckgppl_...@sina.cn 抄送人:user 主题:Re: Build SPARK from source with SBT failed 日期:2023年03月07日 20点58分 This says you don't have

Re: Pandas UDFs vs Inbuilt pyspark functions

2023-03-07 Thread Sean Owen
It's hard to evaluate without knowing what you're doing. Generally, using a built-in function will be fastest. pandas UDFs can be faster than normal UDFs if you can take advantage of processing multiple rows at once. On Tue, Mar 7, 2023 at 6:47 AM neha garde wrote: > Hello All, > > I need help

Re: Build SPARK from source with SBT failed

2023-03-07 Thread Sean Owen
This says you don't have the java compiler installed. Did you install the Apple Developer Tools package? On Tue, Mar 7, 2023 at 1:42 AM wrote: > Hello, > > I have tried to build SPARK source codes with SBT in my local dev > environment (MacOS 13.2.1). But it reported following error: > [error]

Pandas UDFs vs Inbuilt pyspark functions

2023-03-07 Thread neha garde
Hello All, I need help deciding on what is better, pandas udfs or inbuilt functions I have to perform a transformation where I managed to compare the two for a few thousand records and pandas_udf infact performed better. Given the complexity of the transformation, I also found pandas_udf makes it