Re: why is spark + scala code so slow, compared to python?

2014-12-12 Thread rzykov
Try this https://github.com/RetailRocket/SparkMultiTool https://github.com/RetailRocket/SparkMultiTool This loader solved slow reading of a big data set of small files in hdfs. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/why-is-spark-scala-code-so

why is spark + scala code so slow, compared to python?

2014-12-11 Thread ll
performance today? or is spark + scala just not the right tool for small to medium datasets? when would you use spark + scala vs. python? thanks! -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/why-is-spark-scala-code-so-slow-compared-to-python-tp20636.html

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Natu Lauchande
? thanks! -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/why-is-spark-scala-code-so-slow-compared-to-python-tp20636.html Sent from the Apache Spark User List mailing list archive at Nabble.com

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Duy Huynh
this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/why-is-spark-scala-code-so-slow-compared-to-python-tp20636.html Sent from the Apache Spark User List mailing list archive at Nabble.com

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Duy Huynh
://apache-spark-user-list.1001560.n3.nabble.com/why-is-spark-scala-code-so-slow-compared-to-python-tp20636.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Sean Owen
-code-so-slow-compared-to-python-tp20636.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Andy Wagner
-user-list.1001560.n3.nabble.com/why-is-spark-scala-code-so-slow-compared-to-python-tp20636.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org