It is claimed that spark is 10x or 100x times faster than mapreduce and hive but since I started using it I haven't seen any faster performance. it is taking 2 minutes to run map and join tasks over just 2GB data. Instead hive was taking just a few seconds to join 2 tables over the same data. And, I haven't gotten any answers to my questions. I don't understand the purpose of this group and there is no enough documentations of spark and its usage.
-- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Spark-is-slow-tp4539.html Sent from the Apache Spark User List mailing list archive at Nabble.com.