[jira] [Updated] (SPARK-15947) Make pipeline components backward compatible with old vector columns in Scala/Java

2016-06-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15947: -- Description: After SPARK-15945, we should make ALL pipeline components accept old vector

[jira] [Commented] (SPARK-15944) Make spark.ml package backward compatible with spark.mllib vectors

2016-06-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15329675#comment-15329675 ] Xiangrui Meng commented on SPARK-15944: --- We won't deprecate those utils before we deprecate the

[jira] [Updated] (SPARK-15948) Make pipeline components backward compatible with old vector columns in Python

2016-06-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15948: -- Description: Same as SPARK-15947 but for Python. (was: Same as SPARK-15974 but for Python.)

[jira] [Created] (SPARK-15948) Make pipeline components backward compatible with old vector columns in Python

2016-06-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15948: - Summary: Make pipeline components backward compatible with old vector columns in Python Key: SPARK-15948 URL: https://issues.apache.org/jira/browse/SPARK-15948

[jira] [Updated] (SPARK-15948) Make pipeline components backward compatible with old vector columns in Python

2016-06-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15948: -- Description: Same as SPARK-15974 but for Python. > Make pipeline components backward

[jira] [Created] (SPARK-15947) Make pipeline components backward compatible with old vector columns in Scala/Java

2016-06-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15947: - Summary: Make pipeline components backward compatible with old vector columns in Scala/Java Key: SPARK-15947 URL: https://issues.apache.org/jira/browse/SPARK-15947

[jira] [Created] (SPARK-15946) Wrap the conversion utils in Python

2016-06-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15946: - Summary: Wrap the conversion utils in Python Key: SPARK-15946 URL: https://issues.apache.org/jira/browse/SPARK-15946 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-15946) Wrap the conversion utils in Python

2016-06-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15946: -- Description: This is to wrap SPARK-15945 in Python. So Python users can use it to convert

[jira] [Created] (SPARK-15945) Implement conversion utils in Scala/Java

2016-06-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15945: - Summary: Implement conversion utils in Scala/Java Key: SPARK-15945 URL: https://issues.apache.org/jira/browse/SPARK-15945 Project: Spark Issue Type:

[jira] [Created] (SPARK-15944) Make spark.ml package backward compatible with spark.mllib vectors

2016-06-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15944: - Summary: Make spark.ml package backward compatible with spark.mllib vectors Key: SPARK-15944 URL: https://issues.apache.org/jira/browse/SPARK-15944 Project: Spark

[jira] [Updated] (SPARK-15364) Implement Python picklers for ml.Vector and ml.Matrix under spark.ml.python

2016-06-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15364: -- Assignee: Liang-Chi Hsieh > Implement Python picklers for ml.Vector and ml.Matrix under

[jira] [Updated] (SPARK-15364) Implement Python picklers for ml.Vector and ml.Matrix under spark.ml.python

2016-06-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15364: -- Target Version/s: 2.0.0 (was: 2.1.0) > Implement Python picklers for ml.Vector and ml.Matrix

[jira] [Resolved] (SPARK-15364) Implement Python picklers for ml.Vector and ml.Matrix under spark.ml.python

2016-06-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15364. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13219

[jira] [Updated] (SPARK-15799) Release SparkR on CRAN

2016-06-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15799: -- Target Version/s: 2.1.0 > Release SparkR on CRAN > -- > >

[jira] [Updated] (SPARK-15581) MLlib 2.1 Roadmap

2016-06-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15581: -- Description: This is a master list for MLlib improvements we are working on for the next

[jira] [Created] (SPARK-15799) Release SparkR on CRAN

2016-06-07 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15799: - Summary: Release SparkR on CRAN Key: SPARK-15799 URL: https://issues.apache.org/jira/browse/SPARK-15799 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314816#comment-15314816 ] Xiangrui Meng commented on SPARK-15740: --- The proposal looks good to me. Please also try to measure

[jira] [Commented] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313198#comment-15313198 ] Xiangrui Meng commented on SPARK-15740: --- [~tmnd91] Could you run the test and estimate how much ram

[jira] [Comment Edited] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313198#comment-15313198 ] Xiangrui Meng edited comment on SPARK-15740 at 6/2/16 10:24 PM: [~tmnd91]

[jira] [Updated] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15740: -- Description: [~andrewor14] noticed some OOM errors caused by "test big model load / save" in

[jira] [Created] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15740: - Summary: Word2VecSuite "big model load / save" caused OOM in maven jenkins builds Key: SPARK-15740 URL: https://issues.apache.org/jira/browse/SPARK-15740 Project:

[jira] [Resolved] (SPARK-13944) Separate out local linear algebra as a standalone module without Spark dependency

2016-06-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13944. --- Resolution: Fixed Fix Version/s: 2.0.0 > Separate out local linear algebra as a

[jira] [Closed] (SPARK-14529) Consolidate mllib and mllib-local into one mllib folder

2016-06-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-14529. - Resolution: Won't Fix Marked the issue as won't fix. The main reason is that mllib-local might

[jira] [Closed] (SPARK-15043) Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr

2016-05-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-15043. - Resolution: Fixed Fixed as part of SPARK-15030. > Fix and re-enable flaky test:

[jira] [Updated] (SPARK-15043) Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr

2016-05-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15043: -- Fix Version/s: 2.0.0 > Fix and re-enable flaky test: mllib.stat.JavaStatisticsSuite.testCorr >

[jira] [Commented] (SPARK-14529) Consolidate mllib and mllib-local into one mllib folder

2016-05-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15299511#comment-15299511 ] Xiangrui Meng commented on SPARK-14529: --- We should decide whether we want to make this change in

[jira] [Updated] (SPARK-15447) Performance test for ALS in Spark 2.0

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15447: -- Labels: QA (was: ) > Performance test for ALS in Spark 2.0 >

[jira] [Created] (SPARK-15447) Performance test for ALS in Spark 2.0

2016-05-20 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15447: - Summary: Performance test for ALS in Spark 2.0 Key: SPARK-15447 URL: https://issues.apache.org/jira/browse/SPARK-15447 Project: Spark Issue Type: Task

[jira] [Resolved] (SPARK-15222) SparkR ML examples update in 2.0

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15222. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13000

[jira] [Updated] (SPARK-15222) SparkR ML examples update in 2.0

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15222: -- Assignee: Yanbo Liang > SparkR ML examples update in 2.0 > >

[jira] [Updated] (SPARK-15153) SparkR spark.naiveBayes throws error when label is numeric type

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15153: -- Shepherd: Xiangrui Meng > SparkR spark.naiveBayes throws error when label is numeric type >

[jira] [Updated] (SPARK-15153) SparkR spark.naiveBayes throws error when label is numeric type

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15153: -- Assignee: Yanbo Liang > SparkR spark.naiveBayes throws error when label is numeric type >

[jira] [Updated] (SPARK-15339) ML 2.0 QA: Scala APIs and code audit for regression

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15339: -- Assignee: Yanbo Liang > ML 2.0 QA: Scala APIs and code audit for regression >

[jira] [Resolved] (SPARK-15339) ML 2.0 QA: Scala APIs and code audit for regression

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15339. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13129

[jira] [Updated] (SPARK-15394) ML user guide typos and grammar audit

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15394: -- Fix Version/s: 2.0.0 > ML user guide typos and grammar audit >

[jira] [Updated] (SPARK-15394) ML user guide typos and grammar audit

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15394: -- Assignee: Seth Hendrickson > ML user guide typos and grammar audit >

[jira] [Resolved] (SPARK-15398) Update the warning message to recommend ML usage

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15398. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13190

[jira] [Updated] (SPARK-15398) Update the warning message to recommend ML usage

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15398: -- Assignee: zhengruifeng > Update the warning message to recommend ML usage >

[jira] [Updated] (SPARK-15363) Example code shouldn't use VectorImplicits._, asML/fromML

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15363: -- Assignee: Miao Wang > Example code shouldn't use VectorImplicits._, asML/fromML >

[jira] [Resolved] (SPARK-15363) Example code shouldn't use VectorImplicits._, asML/fromML

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15363. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13213

[jira] [Updated] (SPARK-15172) Warning message should explicitly tell user initial coefficients is ignored if its size doesn't match expected size in LogisticRegression

2016-05-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15172: -- Fix Version/s: (was: 2.1.0) 2.0.0 > Warning message should explicitly

[jira] [Resolved] (SPARK-15296) Refactor All Java Tests that use SparkSession

2016-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15296. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13101

[jira] [Updated] (SPARK-15341) Add documentation for `model.write` to clarify `summary` was not saved

2016-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15341: -- Assignee: Yanbo Liang > Add documentation for `model.write` to clarify `summary` was not saved

[jira] [Updated] (SPARK-15341) Add documentation for `model.write` to clarify `summary` was not saved

2016-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15341: -- Fix Version/s: 2.0.0 > Add documentation for `model.write` to clarify `summary` was not saved

[jira] [Updated] (SPARK-15414) Make the mllib,ml linalg type conversion APIs public

2016-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15414: -- Assignee: Sandeep Singh > Make the mllib,ml linalg type conversion APIs public >

[jira] [Resolved] (SPARK-15414) Make the mllib,ml linalg type conversion APIs public

2016-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15414. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13202

[jira] [Commented] (SPARK-15363) Example code shouldn't use VectorImplicits._, asML/fromML

2016-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15292403#comment-15292403 ] Xiangrui Meng commented on SPARK-15363: --- No. I think we need to make the converters between new and

[jira] [Resolved] (SPARK-14615) Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-05-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14615. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12627

[jira] [Updated] (SPARK-14615) Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-05-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14615: -- Priority: Blocker (was: Major) > Use the new ML Vector and Matrix in the ML pipeline based

[jira] [Created] (SPARK-15364) Implement Python picklers for ml.Vector and ml.Matrix under spark.ml.python

2016-05-17 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15364: - Summary: Implement Python picklers for ml.Vector and ml.Matrix under spark.ml.python Key: SPARK-15364 URL: https://issues.apache.org/jira/browse/SPARK-15364

[jira] [Updated] (SPARK-15363) Example code shouldn't use VectorImplicits._

2016-05-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15363: -- Description: In SPARK-14615, we use VectorImplicits._ and asML in example code to minimize

[jira] [Updated] (SPARK-15363) Example code shouldn't use VectorImplicits._, asML/fromML

2016-05-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15363: -- Summary: Example code shouldn't use VectorImplicits._, asML/fromML (was: Example code

[jira] [Updated] (SPARK-15363) Example code shouldn't use VectorImplicits._

2016-05-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15363: -- Description: In SPARK-14615, we use VectorImplicits._ in example code to minimize the changes

[jira] [Created] (SPARK-15363) Example code shouldn't use VectorImplicits._

2016-05-17 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15363: - Summary: Example code shouldn't use VectorImplicits._ Key: SPARK-15363 URL: https://issues.apache.org/jira/browse/SPARK-15363 Project: Spark Issue Type:

[jira] [Updated] (SPARK-14906) Copy pyspark.mllib.linalg to pyspark.ml.linalg

2016-05-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14906: -- Summary: Copy pyspark.mllib.linalg to pyspark.ml.linalg (was: Move VectorUDT and MatrixUDT in

[jira] [Updated] (SPARK-14906) Move VectorUDT and MatrixUDT in PySpark to new ML package

2016-05-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14906: -- Assignee: Liang-Chi Hsieh > Move VectorUDT and MatrixUDT in PySpark to new ML package >

[jira] [Resolved] (SPARK-14906) Move VectorUDT and MatrixUDT in PySpark to new ML package

2016-05-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14906. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13099

[jira] [Resolved] (SPARK-15268) Make JavaTypeInference work with UDTRegistration

2016-05-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15268. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13046

[jira] [Updated] (SPARK-15268) Make JavaTypeInference work with UDTRegistration

2016-05-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15268: -- Assignee: Liang-Chi Hsieh > Make JavaTypeInference work with UDTRegistration >

[jira] [Updated] (SPARK-14050) Add multiple languages support for Stop Words Remover

2016-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14050: -- Assignee: Burak KÖSE > Add multiple languages support for Stop Words Remover >

[jira] [Resolved] (SPARK-14050) Add multiple languages support for Stop Words Remover

2016-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14050. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12843

[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-05-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15269218#comment-15269218 ] Xiangrui Meng commented on SPARK-15027: --- Ah, I see the problems now. We do need the hash

[jira] [Resolved] (SPARK-6717) Clear shuffle files after checkpointing in ALS

2016-05-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6717. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11919

[jira] [Created] (SPARK-15064) Locale support in StopWordsRemover

2016-05-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15064: - Summary: Locale support in StopWordsRemover Key: SPARK-15064 URL: https://issues.apache.org/jira/browse/SPARK-15064 Project: Spark Issue Type: New Feature

[jira] [Resolved] (SPARK-15030) Support formula in spark.kmeans in SparkR

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-15030. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12813

[jira] [Resolved] (SPARK-14653) Remove NumericParser and jackson dependency from mllib-local

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14653. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12802

[jira] [Updated] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15027: -- Target Version/s: (was: 2.0.0) > ALS.train should use DataFrame instead of RDD >

[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265238#comment-15265238 ] Xiangrui Meng commented on SPARK-15027: --- It might be tricky to use Dataset due to encoders and

[jira] [Updated] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15027: -- Assignee: (was: Xiangrui Meng) > ALS.train should use DataFrame instead of RDD >

[jira] [Comment Edited] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265229#comment-15265229 ] Xiangrui Meng edited comment on SPARK-15027 at 4/30/16 7:50 AM: Just API

[jira] [Commented] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265229#comment-15265229 ] Xiangrui Meng commented on SPARK-15027: --- No, just API change. I guess there are still gaps to use

[jira] [Updated] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15027: -- Description: We should also update `ALS.train` to use `Dataset/DataFrame` instead of `RDD` to

[jira] [Updated] (SPARK-15027) ALS.train should use DataFrame instead of RDD

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15027: -- Summary: ALS.train should use DataFrame instead of RDD (was: ml.ALS params and ALS.train

[jira] [Commented] (SPARK-14906) Move VectorUDT and MatrixUDT in PySpark to new ML package

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265216#comment-15265216 ] Xiangrui Meng commented on SPARK-14906: --- [~viirya] To confirm the scope of this JIRA, does it cover

[jira] [Created] (SPARK-15030) Support formula in spark.kmeans in SparkR

2016-04-30 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15030: - Summary: Support formula in spark.kmeans in SparkR Key: SPARK-15030 URL: https://issues.apache.org/jira/browse/SPARK-15030 Project: Spark Issue Type: New

[jira] [Resolved] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14831. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12789

[jira] [Resolved] (SPARK-14850) VectorUDT/MatrixUDT should take primitive arrays without boxing

2016-04-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14850. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12640

[jira] [Created] (SPARK-15027) ml.ALS params and ALS.train should not depend on RDD

2016-04-29 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15027: - Summary: ml.ALS params and ALS.train should not depend on RDD Key: SPARK-15027 URL: https://issues.apache.org/jira/browse/SPARK-15027 Project: Spark Issue

[jira] [Updated] (SPARK-14412) spark.ml ALS prefered storage level Params

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14412: -- Assignee: Nick Pentreath > spark.ml ALS prefered storage level Params >

[jira] [Resolved] (SPARK-14412) spark.ml ALS prefered storage level Params

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14412. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12660

[jira] [Assigned] (SPARK-14653) Remove NumericParser and jackson dependency from mllib-local

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-14653: - Assignee: Xiangrui Meng > Remove NumericParser and jackson dependency from mllib-local

[jira] [Updated] (SPARK-14311) Model persistence in SparkR 2.0

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14311: -- Target Version/s: 2.0.0 Fix Version/s: 2.0.0 > Model persistence in SparkR 2.0 >

[jira] [Updated] (SPARK-14311) Model persistence in SparkR 2.0

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14311: -- Summary: Model persistence in SparkR 2.0 (was: Model persistence in SparkR) > Model

[jira] [Resolved] (SPARK-14311) Model persistence in SparkR 2.0

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14311. --- Resolution: Fixed > Model persistence in SparkR 2.0 > --- > >

[jira] [Updated] (SPARK-13786) Pyspark ml.tuning support export/import

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13786: -- Fix Version/s: (was: 2.0.0) > Pyspark ml.tuning support export/import >

[jira] [Reopened] (SPARK-13786) Pyspark ml.tuning support export/import

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-13786: --- Re-open the issue since we reverted the change. > Pyspark ml.tuning support export/import >

[jira] [Resolved] (SPARK-13786) Pyspark ml.tuning support export/import

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13786. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12782

[jira] [Updated] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14059: -- Assignee: Yanbo Liang > Define R wrappers under org.apache.spark.ml.r >

[jira] [Resolved] (SPARK-14059) Define R wrappers under org.apache.spark.ml.r

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14059. --- Resolution: Fixed > Define R wrappers under org.apache.spark.ml.r >

[jira] [Created] (SPARK-15010) Lots of error messages about accumulator in Spark shell when a task takes some time to run

2016-04-29 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15010: - Summary: Lots of error messages about accumulator in Spark shell when a task takes some time to run Key: SPARK-15010 URL: https://issues.apache.org/jira/browse/SPARK-15010

[jira] [Created] (SPARK-15006) Generated JavaDoc should hide package private objects

2016-04-29 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15006: - Summary: Generated JavaDoc should hide package private objects Key: SPARK-15006 URL: https://issues.apache.org/jira/browse/SPARK-15006 Project: Spark

[jira] [Commented] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15264328#comment-15264328 ] Xiangrui Meng commented on SPARK-14831: --- Talked to [~timhunter] offline and he will submit a PR

[jira] [Resolved] (SPARK-14314) K-means model persistence in SparkR

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14314. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12778

[jira] [Resolved] (SPARK-14315) GLMs model persistence in SparkR

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14315. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12778

[jira] [Updated] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14831: -- Assignee: Timothy Hunter (was: Xiangrui Meng) > Make ML APIs in SparkR consistent >

[jira] [Resolved] (SPARK-7264) SparkR API for parallel functions

2016-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7264. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12426

[jira] [Resolved] (SPARK-14487) User Defined Type registration without SQLUserDefinedType annotation

2016-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14487. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12259

[jira] [Updated] (SPARK-14850) VectorUDT/MatrixUDT should take primitive arrays without boxing

2016-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14850: -- Assignee: Wenchen Fan > VectorUDT/MatrixUDT should take primitive arrays without boxing >
