[jira] [Commented] (SPARK-12662) Add document to randomSplit to explain the sampling depends on the ordering of the rows in a partition

2016-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15085007#comment-15085007 ] Reynold Xin commented on SPARK-12662: - Yea [~yhuai] and I talked offline and thought just adding a

[jira] [Commented] (SPARK-12662) Add document to randomSplit to explain the sampling depends on the ordering of the rows in a partition

2016-01-05 Thread Brian Pasley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15085005#comment-15085005 ] Brian Pasley commented on SPARK-12662: -- Users' expectation for randomSplit probably doesn't realize

[jira] [Commented] (SPARK-12662) Add document to randomSplit to explain the sampling depends on the ordering of the rows in a partition

2016-01-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15084109#comment-15084109 ] Reynold Xin commented on SPARK-12662: - Seems like that should be the user 's choice? We can improve

[jira] [Commented] (SPARK-12662) Add document to randomSplit to explain the sampling depends on the ordering of the rows in a partition

2016-01-05 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15084106#comment-15084106 ] Yin Huai commented on SPARK-12662: -- Another option is to always add local sort operator to make sure the

[jira] [Commented] (SPARK-12662) Add document to randomSplit to explain the sampling depends on the ordering of the rows in a partition

2016-01-05 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15084429#comment-15084429 ] Yin Huai commented on SPARK-12662: -- OK. Let's use this jira to track the work of adding document. > Add