Are join/groupBy operations with wide Java Beans using Dataset API much slower than using RDD API?

2016-08-04 Thread dueckm
://apache-spark-user-list.1001560.n3.nabble.com/file/n27473/Job_RDD_Details.png> <http://apache-spark-user-list.1001560.n3.nabble.com/file/n27473/Job_Dataset_Details.png> -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Are-join-groupBy-operations-with

Are join/groupBy operations with wide Java Beans using Dataset API much slower than using RDD API?

2016-08-02 Thread dueckm
park-user-list.1001560.n3.nabble.com/file/n27459/Job_RDD_Details.png> <http://apache-spark-user-list.1001560.n3.nabble.com/file/n27459/Job_Dataset_Details.png> -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Are-join-groupBy-operations-with-wide

Are join/groupBy operations with wide Java Beans using Dataset API much slower than using RDD API? [*]

2016-08-02 Thread dueckm
er-list.1001560.n3.nabble.com/attachment/27449/2/2D310440.gif> JoinGroupByTest.zip (5K) <http://apache-spark-user-list.1001560.n3.nabble.com/attachment/27449/3/JoinGroupByTest.zip> -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Are-join-groupBy-operati

Are join/groupBy operations with wide Java Beans using Dataset API much slower than using RDD API?

2016-08-02 Thread dueckm
xt: http://apache-spark-user-list.1001560.n3.nabble.com/Are-join-groupBy-operations-with-wide-Java-Beans-using-Dataset-API-much-slower-than-using-RDD-API-tp27448.html Sent from the Apache Spark User List mailing list archive at Nabble.com. ---