[jira] [Updated] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14831: -- Assignee: Timothy Hunter (was: Xiangrui Meng) > Make ML APIs in SparkR consistent >

[jira] [Resolved] (SPARK-7264) SparkR API for parallel functions

2016-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7264. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12426

[jira] [Resolved] (SPARK-14487) User Defined Type registration without SQLUserDefinedType annotation

2016-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14487. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12259

[jira] [Updated] (SPARK-14850) VectorUDT/MatrixUDT should take primitive arrays without boxing

2016-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14850: -- Assignee: Wenchen Fan > VectorUDT/MatrixUDT should take primitive arrays without boxing >

[jira] [Commented] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-27 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15260981#comment-15260981 ] Xiangrui Meng commented on SPARK-14831: --- +1 on `read.ml` and `write.ml`, which are consistent with

[jira] [Updated] (SPARK-14315) GLMs model persistence in SparkR

2016-04-27 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14315: -- Assignee: Gayathri Murali > GLMs model persistence in SparkR >

[jira] [Updated] (SPARK-14314) K-means model persistence in SparkR

2016-04-27 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14314: -- Shepherd: Yanbo Liang Assignee: Gayathri Murali Target Version/s: 2.0.0

[jira] [Updated] (SPARK-14315) GLMs model persistence in SparkR

2016-04-27 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14315: -- Target Version/s: 2.0.0 > GLMs model persistence in SparkR >

[jira] [Updated] (SPARK-14315) GLMs model persistence in SparkR

2016-04-27 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14315: -- Shepherd: Yanbo Liang > GLMs model persistence in SparkR > >

[jira] [Resolved] (SPARK-14313) AFTSurvivalRegression model persistence in SparkR

2016-04-26 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14313. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12685

[jira] [Updated] (SPARK-14313) AFTSurvivalRegression model persistence in SparkR

2016-04-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14313: -- Assignee: Yanbo Liang > AFTSurvivalRegression model persistence in SparkR >

[jira] [Updated] (SPARK-14313) AFTSurvivalRegression model persistence in SparkR

2016-04-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14313: -- Target Version/s: 2.0.0 > AFTSurvivalRegression model persistence in SparkR >

[jira] [Resolved] (SPARK-14312) NaiveBayes model persistence in SparkR

2016-04-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14312. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12573

[jira] [Updated] (SPARK-14850) VectorUDT/MatrixUDT should take primitive arrays without boxing

2016-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14850: -- Priority: Blocker (was: Critical) > VectorUDT/MatrixUDT should take primitive arrays without

[jira] [Updated] (SPARK-14850) VectorUDT/MatrixUDT should take primitive arrays without boxing

2016-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14850: -- Affects Version/s: 1.5.2 > VectorUDT/MatrixUDT should take primitive arrays without boxing >

[jira] [Commented] (SPARK-14850) VectorUDT/MatrixUDT should take primitive arrays without boxing

2016-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254538#comment-15254538 ] Xiangrui Meng commented on SPARK-14850: --- Ran the following code with different Spark versions:

[jira] [Commented] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254516#comment-15254516 ] Xiangrui Meng commented on SPARK-14831: --- 1. Please see my reply to Felix above for the issue with

[jira] [Commented] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254510#comment-15254510 ] Xiangrui Meng commented on SPARK-14831: --- We have been trying to mimic existing R APIs in SparkR.

[jira] [Created] (SPARK-14850) VectorUDT/MatrixUDT should take primitive arrays without boxing

2016-04-22 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14850: - Summary: VectorUDT/MatrixUDT should take primitive arrays without boxing Key: SPARK-14850 URL: https://issues.apache.org/jira/browse/SPARK-14850 Project: Spark

[jira] [Updated] (SPARK-14850) VectorUDT/MatrixUDT should take primitive arrays without boxing

2016-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14850: -- Description: In SPARK-9390, we switched to use GenericArrayData to store indices and values

[jira] [Commented] (SPARK-14314) K-means model persistence in SparkR

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253140#comment-15253140 ] Xiangrui Meng commented on SPARK-14314: --- Please hold until the naive Bayes one gets merged. On

[jira] [Updated] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14831: -- Description: In current master, we have 4 ML methods in SparkR: {code:none} glm(formula,

[jira] [Updated] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14831: -- Description: In current master, we have 4 ML methods in SparkR: {code:none} glm(formula,

[jira] [Created] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14831: - Summary: Make ML APIs in SparkR consistent Key: SPARK-14831 URL: https://issues.apache.org/jira/browse/SPARK-14831 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-14479) GLM supports output link prediction

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14479. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12287

[jira] [Updated] (SPARK-14479) GLM supports output link prediction

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14479: -- Assignee: Yanbo Liang > GLM supports output link prediction >

[jira] [Commented] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253031#comment-15253031 ] Xiangrui Meng commented on SPARK-7992: -- Thanks for making this work in the official repo! I'm going

[jira] [Updated] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7992: - Assignee: Jakob Odersky > Hide private classes/objects in in generated Java API doc >

[jira] [Resolved] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7992. -- Resolution: Fixed Fix Version/s: 2.0.0 > Hide private classes/objects in in generated

[jira] [Updated] (SPARK-14312) NaiveBayes model persistence in SparkR

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14312: -- Target Version/s: 2.0.0 > NaiveBayes model persistence in SparkR >

[jira] [Updated] (SPARK-14312) NaiveBayes model persistence in SparkR

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14312: -- Assignee: Yanbo Liang > NaiveBayes model persistence in SparkR >

[jira] [Updated] (SPARK-14312) NaiveBayes model persistence in SparkR

2016-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14312: -- Shepherd: Xiangrui Meng > NaiveBayes model persistence in SparkR >

[jira] [Updated] (SPARK-7264) SparkR API for parallel functions

2016-04-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7264: - Target Version/s: 2.0.0 > SparkR API for parallel functions > - >

[jira] [Updated] (SPARK-7264) SparkR API for parallel functions

2016-04-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7264: - Assignee: Timothy Hunter > SparkR API for parallel functions > -

[jira] [Updated] (SPARK-14299) Scala ML examples code merge and clean up

2016-04-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14299: -- Assignee: Xusen Yin > Scala ML examples code merge and clean up >

[jira] [Resolved] (SPARK-14299) Scala ML examples code merge and clean up

2016-04-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14299. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12366

[jira] [Updated] (SPARK-14440) Remove PySpark ml.pipeline's specific Reader and Writer

2016-04-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14440: -- Assignee: Xusen Yin > Remove PySpark ml.pipeline's specific Reader and Writer >

[jira] [Resolved] (SPARK-14440) Remove PySpark ml.pipeline's specific Reader and Writer

2016-04-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14440. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12216

[jira] [Commented] (SPARK-13944) Separate out local linear algebra as a standalone module without Spark dependency

2016-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243793#comment-15243793 ] Xiangrui Meng commented on SPARK-13944: --- `mllib-local` by the name is not scoped just for local

[jira] [Updated] (SPARK-14657) RFormula output wrong features when formula w/o intercept

2016-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14657: -- Target Version/s: 2.0.0 > RFormula output wrong features when formula w/o intercept >

[jira] [Updated] (SPARK-14657) RFormula output wrong features when formula w/o intercept

2016-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14657: -- Shepherd: Xiangrui Meng > RFormula output wrong features when formula w/o intercept >

[jira] [Updated] (SPARK-14657) RFormula output wrong features when formula w/o intercept

2016-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14657: -- Assignee: Yanbo Liang > RFormula output wrong features when formula w/o intercept >

[jira] [Updated] (SPARK-13925) Expose R-like summary statistics in SparkR::glm for more family and link functions

2016-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13925: -- Assignee: Yanbo Liang > Expose R-like summary statistics in SparkR::glm for more family and

[jira] [Resolved] (SPARK-13925) Expose R-like summary statistics in SparkR::glm for more family and link functions

2016-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13925. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12393

[jira] [Resolved] (SPARK-14549) Copy the Vector and Matrix classes from mllib to ml in mllib-local

2016-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14549. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12317

[jira] [Created] (SPARK-14653) Remove NumericParser and jackson dependency from mllib-local

2016-04-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14653: - Summary: Remove NumericParser and jackson dependency from mllib-local Key: SPARK-14653 URL: https://issues.apache.org/jira/browse/SPARK-14653 Project: Spark

[jira] [Resolved] (SPARK-14374) PySpark ml GBTClassifier, Regressor support export/import

2016-04-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14374. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12383

[jira] [Created] (SPARK-14646) k-means save/load should put one cluster per row

2016-04-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14646: - Summary: k-means save/load should put one cluster per row Key: SPARK-14646 URL: https://issues.apache.org/jira/browse/SPARK-14646 Project: Spark Issue

[jira] [Resolved] (SPARK-12869) Optimize conversion from BlockMatrix to IndexedRowMatrix

2016-04-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-12869. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10839

[jira] [Resolved] (SPARK-14565) RandomForest should use parseInt and parseDouble for feature subset size instead of regexes

2016-04-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14565. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12360

[jira] [Commented] (SPARK-13944) Separate out local linear algebra as a standalone module without Spark dependency

2016-04-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15240322#comment-15240322 ] Xiangrui Meng commented on SPARK-13944: --- There are more production workflows using RDD-based APIs

[jira] [Commented] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15239531#comment-15239531 ] Xiangrui Meng commented on SPARK-14154: --- [~yuhaoyan] Thanks for the benchmark! I reverted the

[jira] [Closed] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-14154. - Resolution: Not A Problem Fix Version/s: (was: 2.0.0) > Simplify the implementation

[jira] [Commented] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238172#comment-15238172 ] Xiangrui Meng commented on SPARK-14154: --- Changed the priority to critical since we should decide

[jira] [Updated] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14154: -- Priority: Critical (was: Minor) > Simplify the implementation for Kolmogorov–Smirnov test >

[jira] [Updated] (SPARK-14568) Log instrumentation in logistic regression as a first task

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14568: -- Shepherd: Joseph K. Bradley > Log instrumentation in logistic regression as a first task >

[jira] [Updated] (SPARK-14568) Log instrumentation in logistic regression as a first task

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14568: -- Target Version/s: 2.0.0 > Log instrumentation in logistic regression as a first task >

[jira] [Updated] (SPARK-14568) Log instrumentation in logistic regression as a first task

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14568: -- Assignee: Timothy Hunter > Log instrumentation in logistic regression as a first task >

[jira] [Commented] (SPARK-14311) Model persistence in SparkR

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15237918#comment-15237918 ] Xiangrui Meng commented on SPARK-14311: --- I think we can implement a generic load in a Scalar

[jira] [Updated] (SPARK-14549) Copy the Vector and Matrix classes from mllib to ml in mllib-local

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14549: -- Shepherd: Xiangrui Meng > Copy the Vector and Matrix classes from mllib to ml in mllib-local >

[jira] [Resolved] (SPARK-14147) SparkR - ML predictors return features with vector datatype, however SparkR doesn't support it

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14147. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11958

[jira] [Updated] (SPARK-14147) SparkR - ML predictors return features with vector datatype, however SparkR doesn't support it

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14147: -- Assignee: Yanbo Liang > SparkR - ML predictors return features with vector datatype, however

[jira] [Resolved] (SPARK-14563) SQLTransformer.transformSchema is not implemented correctly

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14563. --- Resolution: Fixed Fix Version/s: 1.6.2 2.0.0 Issue resolved by

[jira] [Updated] (SPARK-13597) Python API for GeneralizedLinearRegression

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13597: -- Assignee: Kai Jiang > Python API for GeneralizedLinearRegression >

[jira] [Resolved] (SPARK-13597) Python API for GeneralizedLinearRegression

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13597. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11468

[jira] [Resolved] (SPARK-13322) AFTSurvivalRegression should support feature standardization

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13322. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11365

[jira] [Updated] (SPARK-13590) Document the behavior of spark.ml logistic regression and AFT survival regression when there are constant features

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13590: -- Summary: Document the behavior of spark.ml logistic regression and AFT survival regression

[jira] [Updated] (SPARK-13590) Document the behavior of spark.ml logistic regression when there are constant features

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13590: -- Assignee: Yanbo Liang > Document the behavior of spark.ml logistic regression when there are

[jira] [Reopened] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-14154: --- Re-open this issue to continue discussion. > Simplify the implementation for Kolmogorov–Smirnov

[jira] [Updated] (SPARK-14565) RandomForest should use parseInt and parseDouble for feature subset size instead of regexes

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14565: -- Description: Using regex is not robust and hard to maintain. > RandomForest should use

[jira] [Created] (SPARK-14565) RandomForest should use parseInt and parseDouble for feature subset size instead of regexes

2016-04-12 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14565: - Summary: RandomForest should use parseInt and parseDouble for feature subset size instead of regexes Key: SPARK-14565 URL: https://issues.apache.org/jira/browse/SPARK-14565

[jira] [Commented] (SPARK-14154) Simplify the implementation for Kolmogorov–Smirnov test

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15237657#comment-15237657 ] Xiangrui Meng commented on SPARK-14154: --- [~yuhaoyan] The main purpose of the initial implementation

[jira] [Resolved] (SPARK-14324) Refactor GLMs code in SparkRWrappers

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14324. --- Resolution: Fixed Fix Version/s: 2.0.0 > Refactor GLMs code in SparkRWrappers >

[jira] [Resolved] (SPARK-12566) GLM model family, link function support in SparkR:::glm

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-12566. --- Resolution: Fixed Fix Version/s: 2.0.0 > GLM model family, link function support in

[jira] [Updated] (SPARK-14563) SQLTransformer.transformSchema is not implemented correctly

2016-04-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14563: -- Description: `transformSchema` uses `__THIS__` as a temp table name, which would cause errors

[jira] [Created] (SPARK-14563) SQLTransformer.transformSchema is not implemented correctly

2016-04-12 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14563: - Summary: SQLTransformer.transformSchema is not implemented correctly Key: SPARK-14563 URL: https://issues.apache.org/jira/browse/SPARK-14563 Project: Spark

[jira] [Resolved] (SPARK-13600) Use approxQuantile from DataFrame stats in QuantileDiscretizer

2016-04-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13600. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11553

[jira] [Updated] (SPARK-13322) AFTSurvivalRegression should support feature standardization

2016-04-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13322: -- Shepherd: Xiangrui Meng (was: DB Tsai) > AFTSurvivalRegression should support feature

[jira] [Updated] (SPARK-14324) Refactor GLMs code in SparkRWrappers

2016-04-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14324: -- Assignee: Yanbo Liang > Refactor GLMs code in SparkRWrappers >

[jira] [Updated] (SPARK-12566) GLM model family, link function support in SparkR:::glm

2016-04-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12566: -- Shepherd: Xiangrui Meng (was: Yanbo Liang) > GLM model family, link function support in

[jira] [Resolved] (SPARK-14462) Add the mllib-local build to maven pom

2016-04-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14462. --- Resolution: Fixed Issue resolved by pull request 12298

[jira] [Updated] (SPARK-14510) Add args-checking for LDA and StreamingKMeans

2016-04-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14510: -- Assignee: zhengruifeng > Add args-checking for LDA and StreamingKMeans >

[jira] [Resolved] (SPARK-14510) Add args-checking for LDA and StreamingKMeans

2016-04-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14510. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12062

[jira] [Updated] (SPARK-14510) Add args-checking for LDA and StreamingKMeans

2016-04-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14510: -- Priority: Minor (was: Major) > Add args-checking for LDA and StreamingKMeans >

[jira] [Resolved] (SPARK-14500) Accept Dataset[_] instead of DataFrame in MLlib APIs

2016-04-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14500. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12274

[jira] [Updated] (SPARK-14497) Use top instead of sortBy() to get top N frequent words as dict in ConutVectorizer

2016-04-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14497: -- Assignee: Feng Wang > Use top instead of sortBy() to get top N frequent words as dict in >

[jira] [Resolved] (SPARK-14497) Use top instead of sortBy() to get top N frequent words as dict in ConutVectorizer

2016-04-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14497. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12265

[jira] [Resolved] (SPARK-14339) Add python examples for DCT,MinMaxScaler,MaxAbsScaler

2016-04-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14339. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12063

[jira] [Updated] (SPARK-14339) Add python examples for DCT,MinMaxScaler,MaxAbsScaler

2016-04-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14339: -- Assignee: zhengruifeng > Add python examples for DCT,MinMaxScaler,MaxAbsScaler >

[jira] [Resolved] (SPARK-14462) Add the mllib-local build to maven pom

2016-04-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14462. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12241

[jira] [Created] (SPARK-14500) Accept Dataset[_] instead of DataFrame in MLlib APIs

2016-04-08 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14500: - Summary: Accept Dataset[_] instead of DataFrame in MLlib APIs Key: SPARK-14500 URL: https://issues.apache.org/jira/browse/SPARK-14500 Project: Spark Issue

[jira] [Updated] (SPARK-14305) PySpark ml.clustering BisectingKMeans support export/import

2016-04-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14305: -- Assignee: Yanbo Liang > PySpark ml.clustering BisectingKMeans support export/import >

[jira] [Resolved] (SPARK-14305) PySpark ml.clustering BisectingKMeans support export/import

2016-04-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14305. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12112

[jira] [Updated] (SPARK-14324) Refactor GLMs code in SparkRWrappers

2016-04-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14324: -- Assignee: (was: Yanbo Liang) > Refactor GLMs code in SparkRWrappers >

[jira] [Resolved] (SPARK-14303) Refactor SparkRWrappers

2016-04-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14303. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12039

[jira] [Updated] (SPARK-14324) Refactor GLMs code in SparkRWrappers

2016-04-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14324: -- Summary: Refactor GLMs code in SparkRWrappers (was: Refactor SparkRWrappers) > Refactor GLMs

[jira] [Updated] (SPARK-14303) Refactor k-means code in SparkRWrappers

2016-04-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-14303: -- Summary: Refactor k-means code in SparkRWrappers (was: Refactor SparkRWrappers) > Refactor

[jira] [Created] (SPARK-14324) Refactor SparkRWrappers

2016-04-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-14324: - Summary: Refactor SparkRWrappers Key: SPARK-14324 URL: https://issues.apache.org/jira/browse/SPARK-14324 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-11262) Unit test for gradient, loss layers, memory management for multilayer perceptron

2016-04-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11262. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 9229

[jira] [Resolved] (SPARK-14295) buildReader implementation for LibSVM

2016-04-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-14295. --- Resolution: Fixed Issue resolved by pull request 12088

<    5   6   7   8   9   10   11   12   13   14   >