[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-05-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15269367#comment-15269367 ] Nick Pentreath commented on SPARK-15027: Will take a look at repartitioning. 2.1 seems ok, I

[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-05-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15269218#comment-15269218 ] Xiangrui Meng commented on SPARK-15027: Ah, I see the problems now. We do need the hash

[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-05-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15268662#comment-15268662 ] Nick Pentreath commented on SPARK-15027: I've managed to get it working for the following

[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265238#comment-15265238 ] Xiangrui Meng commented on SPARK-15027: It might be tricky to use Dataset due to encoders and

[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265232#comment-15265232 ] Nick Pentreath commented on SPARK-15027: Ok - it would make sense to have it in 2.0 if possible

[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265229#comment-15265229 ] Xiangrui Meng commented on SPARK-15027: No, just API change. I guess there are still gaps to use

[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265228#comment-15265228 ] Nick Pentreath commented on SPARK-15027: [~mengxr] are you intending this to be a more