'ExecutorTaskSummary' alternative in Spark 2.3 onwards

2019-12-30 Thread Ninja Coder
I have a spark streaming application (currently spark 2.2) which is using `org.apache.spark.ui.exec.ExecutorTaskSummary` to grab executor details like duration, tasks failed, tasks completed, GC time, etc after each batch is completed. These metrics are then loaded to prometheus and pagerduty

[SPARK ML] feature stopWordsRemover

2019-12-30 Thread franziska.kuehn
Would it be possible to add a parameter 'language' in 'StopWordsRemover' for using default languages provided by spark itself? This is not available in spark 3 yet. If one needs additional language they still could work with current stopWords-array-parameter. Would make it more generic and

unsubscribe

2019-12-30 Thread Jaebin Lee
unsubscribe

Issues with Apache Spark tgz file

2019-12-30 Thread rsinghania
Hi, I'm trying to open the file https://www.apache.org/dyn/closer.lua/spark/spark-2.4.4/spark-2.4.4-bin-hadoop2.7.tgz downloaded from https://spark.apache.org/downloads.html using wget, and getting the following messages: gzip: stdin: not in gzip format tar: Child returned status 1 tar: Error is

Re: Issues with Apache Spark tgz file

2019-12-30 Thread Marcelo Vanzin
That first URL is not the file. It's a web page with links to the file in different mirrors. I just looked at the actual file in one of the mirrors and it looks fine. On Mon, Dec 30, 2019 at 1:34 PM rsinghania wrote: > > Hi, > > I'm trying to open the file >

Re: Issues with Apache Spark tgz file

2019-12-30 Thread rsinghania
Ah ok - yes, that worked for me as well. Thank you! Rajat -- Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/ - To unsubscribe e-mail: user-unsubscr...@spark.apache.org