Re: Spark 2.4.4 with Hadoop 3.2.0

2019-11-25 Thread nihed mbarek
Hi, Spark 2.x is already part of Cloudera CDH6, which is based on Hadoop 3.x, so they officially support Spark 2 + Hadoop 3. So for sure, there are tests and development done on that side. On the other hand, I don't know the status for Hadoop 3.2. Regards, On Tue, Nov 26, 2019 at 1:46 AM Alfredo Marquez

Re: Spark 2.4.4 with Hadoop 3.2.0

2019-11-25 Thread Alfredo Marquez
Thank you Ismael! That's what I was looking for. I can take this to our platform team. Alfredo On Mon, Nov 25, 2019, 3:32 PM Ismaël Mejía wrote: > Not officially. Apache Spark only announced support for Hadoop 3.x > starting with the upcoming Spark 3. > There is a preview release of Spark 3

Re: Spark 2.4.4 with Hadoop 3.2.0

2019-11-25 Thread Ismaël Mejía
Not officially. Apache Spark only announced support for Hadoop 3.x starting with the upcoming Spark 3. There is a preview release of Spark 3 with support for Hadoop 3.2 that you can try now: https://archive.apache.org/dist/spark/spark-3.0.0-preview/spark-3.0.0-preview-bin-hadoop3.2.tgz Enjoy!

Re: GraphX performance feedback

2019-11-25 Thread mahzad kalantari
Thanks for your answer. My use case is friend recommendation for 200 million profiles. On Mon, Nov 25, 2019 at 14:10, Jörn Franke wrote: > I think it depends on what you want to do. Interactive big data graph analytics > are probably better off in JanusGraph or similar. > Batch processing (once-off)

Re: GraphX performance feedback

2019-11-25 Thread Jörn Franke
I think it depends on what you want to do. Interactive big data graph analytics are probably better off in JanusGraph or similar. Batch processing (once-off) can still be fine in GraphX, though you have to design the process carefully. > On 25.11.2019 at 20:04, mahzad kalantari wrote: > > Hi

GraphX performance feedback

2019-11-25 Thread mahzad kalantari
Hi all, My question is about GraphX: I'm looking for user feedback on its performance. I read this paper written by the Facebook team that says GraphX has very poor performance. https://engineering.fb.com/core-data/a-comparison-of-state-of-the-art-graph-processing-systems/ Has anyone already

Status of Spark testing on ARM64

2019-11-25 Thread Tianhua huang
Hi all, I will give you some information about the ARM CI of Spark: Our team and the community are working on building and testing Spark master on ARM64 servers. After finding and fixing some issues[1], we have integrated two ARM testing jobs[2] into the community CI (AMPLab Jenkins); they run as daily jobs and have been

Re: how spark structrued stream write to kudu

2019-11-25 Thread lk_spark
I found that _sqlContext is null. How can I resolve it? 2019-11-25 lk_spark From: "lk_spark" Sent: 2019-11-25 16:00 Subject: how spark structrued stream write to kudu To: "user.spark" Cc: Hi all, I'm using Spark 2.4.4 to read streaming data from Kafka and want to write it to Kudu 1.7.0. My code looks like

how spark structrued stream write to kudu

2019-11-25 Thread lk_spark
Hi all, I'm using Spark 2.4.4 to read streaming data from Kafka and want to write it to Kudu 1.7.0. My code looks like this: val kuduContext = new KuduContext("master:7051", spark.sparkContext) val console = cnew.select("*").as[CstoreNew] .writeStream