[jira] [Resolved] (SPARK-22060) CrossValidator/TrainValidationSplit parallelism param persist/load bug

2017-09-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-22060. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19278

[jira] [Comment Edited] (SPARK-19357) Parallel Model Evaluation for ML Tuning: Scala

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175629#comment-16175629 ] Joseph K. Bradley edited comment on SPARK-19357 at 9/21/17 11:14 PM: -

[jira] [Commented] (SPARK-19357) Parallel Model Evaluation for ML Tuning: Scala

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175629#comment-16175629 ] Joseph K. Bradley commented on SPARK-19357: --- [~bryanc], [~nick.pentre...@gmail.com],

[jira] [Updated] (SPARK-22060) CrossValidator/TrainValidationSplit parallelism param persist/load bug

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22060: -- Target Version/s: 2.3.0 > CrossValidator/TrainValidationSplit parallelism param

[jira] [Assigned] (SPARK-22060) CrossValidator/TrainValidationSplit parallelism param persist/load bug

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-22060: - Assignee: Weichen Xu > CrossValidator/TrainValidationSplit parallelism param

[jira] [Updated] (SPARK-22060) CrossValidator/TrainValidationSplit parallelism param persist/load bug

2017-09-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22060: -- Shepherd: Joseph K. Bradley > CrossValidator/TrainValidationSplit parallelism param

[jira] [Updated] (SPARK-14371) OnlineLDAOptimizer should not collect stats for each doc in mini-batch to driver

2017-09-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14371: -- Shepherd: Joseph K. Bradley > OnlineLDAOptimizer should not collect stats for each doc

[jira] [Commented] (SPARK-21770) ProbabilisticClassificationModel: Improve normalization of all-zero raw predictions

2017-09-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16170734#comment-16170734 ] Joseph K. Bradley commented on SPARK-21770: --- Update from PR discussion: The new plan is to

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-09-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16163976#comment-16163976 ] Joseph K. Bradley commented on SPARK-21866: --- 1. For the namespace, here are my thoughts: I

[jira] [Resolved] (SPARK-18608) Spark ML algorithms that check RDD cache level for internal caching double-cache data

2017-09-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-18608. --- Resolution: Fixed Fix Version/s: 2.3.0 2.2.1 Issue

[jira] [Updated] (SPARK-18608) Spark ML algorithms that check RDD cache level for internal caching double-cache data

2017-09-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18608: -- Target Version/s: 2.2.1, 2.3.0 > Spark ML algorithms that check RDD cache level for

[jira] [Assigned] (SPARK-18608) Spark ML algorithms that check RDD cache level for internal caching double-cache data

2017-09-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-18608: - Assignee: zhengruifeng > Spark ML algorithms that check RDD cache level for

[jira] [Assigned] (SPARK-21027) Parallel One vs. Rest Classifier

2017-09-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21027: - Assignee: Ajay Saini > Parallel One vs. Rest Classifier >

[jira] [Resolved] (SPARK-21027) Parallel One vs. Rest Classifier

2017-09-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21027. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19110

[jira] [Commented] (SPARK-19422) Cache input data in algorithms

2017-09-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16162008#comment-16162008 ] Joseph K. Bradley commented on SPARK-19422: --- Linking [SPARK-21972], which may interact with

[jira] [Commented] (SPARK-21972) Allow users to control input data persistence in ML Estimators via a handlePersistence ml.Param

2017-09-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16162007#comment-16162007 ] Joseph K. Bradley commented on SPARK-21972: --- The issue (a) does not really conflict with or

[jira] [Commented] (SPARK-18608) Spark ML algorithms that check RDD cache level for internal caching double-cache data

2017-09-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16162004#comment-16162004 ] Joseph K. Bradley commented on SPARK-18608: --- Hi all, it looks like there has been confusion

[jira] [Closed] (SPARK-21799) KMeans performance regression (5-6x slowdown) in Spark 2.2

2017-09-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-21799. - Resolution: Duplicate > KMeans performance regression (5-6x slowdown) in Spark 2.2 >

[jira] [Commented] (SPARK-21799) KMeans performance regression (5-6x slowdown) in Spark 2.2

2017-09-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16161992#comment-16161992 ] Joseph K. Bradley commented on SPARK-21799: --- Now that I've caught up on these, this is just a

[jira] [Resolved] (SPARK-21729) Generic test for ProbabilisticClassifier to ensure consistent output columns

2017-09-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21729. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19065

[jira] [Assigned] (SPARK-21729) Generic test for ProbabilisticClassifier to ensure consistent output columns

2017-09-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21729: - Assignee: Weichen Xu > Generic test for ProbabilisticClassifier to ensure

[jira] [Issue Comment Deleted] (SPARK-21729) Generic test for ProbabilisticClassifier to ensure consistent output columns

2017-09-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21729: -- Comment: was deleted (was: User 'WeichenXu123' has created a pull request for this

[jira] [Commented] (SPARK-21770) ProbabilisticClassificationModel: Improve normalization of all-zero raw predictions

2017-09-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16150846#comment-16150846 ] Joseph K. Bradley commented on SPARK-21770: --- Linear models are the most likely to hit this

[jira] [Resolved] (SPARK-21862) Add overflow check in PCA

2017-08-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21862. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19078

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-08-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148268#comment-16148268 ] Joseph K. Bradley commented on SPARK-21866: --- It's a valid question, but overall, I'd support

[jira] [Updated] (SPARK-21866) SPIP: Image support in Spark

2017-08-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21866: -- Target Version/s: (was: 2.3.0) > SPIP: Image support in Spark >

[jira] [Updated] (SPARK-21862) Add overflow check in PCA

2017-08-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21862: -- Shepherd: Joseph K. Bradley > Add overflow check in PCA > - >

[jira] [Assigned] (SPARK-21862) Add overflow check in PCA

2017-08-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21862: - Assignee: Weichen Xu > Add overflow check in PCA > - >

[jira] [Resolved] (SPARK-17139) Add model summary for MultinomialLogisticRegression

2017-08-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-17139. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 15435

[jira] [Assigned] (SPARK-17139) Add model summary for MultinomialLogisticRegression

2017-08-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-17139: - Assignee: Weichen Xu > Add model summary for MultinomialLogisticRegression >

[jira] [Updated] (SPARK-21681) MLOR do not work correctly when featureStd contains zero

2017-08-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21681: -- Labels: correctness (was: ) > MLOR do not work correctly when featureStd contains

[jira] [Resolved] (SPARK-21681) MLOR do not work correctly when featureStd contains zero

2017-08-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21681. --- Resolution: Fixed Fix Version/s: 2.2.1 Issue resolved by pull request 19026

[jira] [Resolved] (SPARK-12664) Expose probability, rawPrediction in MultilayerPerceptronClassificationModel

2017-08-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-12664. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 17373

[jira] [Updated] (SPARK-12664) Expose probability, rawPrediction in MultilayerPerceptronClassificationModel

2017-08-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12664: -- Summary: Expose probability, rawPrediction in MultilayerPerceptronClassificationModel

[jira] [Commented] (SPARK-21535) Reduce memory requirement for CrossValidator and TrainValidationSplit

2017-08-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16137697#comment-16137697 ] Joseph K. Bradley commented on SPARK-21535: --- [~yuhaoyan] Parallel training of models can be

[jira] [Commented] (SPARK-21086) CrossValidator, TrainValidationSplit should preserve all models after fitting

2017-08-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16137690#comment-16137690 ] Joseph K. Bradley commented on SPARK-21086: --- My understanding is that they actually want these

[jira] [Updated] (SPARK-21681) MLOR do not work correctly when featureStd contains zero

2017-08-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21681: -- Fix Version/s: 2.3.0 > MLOR do not work correctly when featureStd contains zero >

[jira] [Commented] (SPARK-21681) MLOR do not work correctly when featureStd contains zero

2017-08-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16137638#comment-16137638 ] Joseph K. Bradley commented on SPARK-21681: --- I'll leave this open until it's been backported to

[jira] [Commented] (SPARK-21770) ProbabilisticClassificationModel: Improve normalization of all-zero raw predictions

2017-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16133483#comment-16133483 ] Joseph K. Bradley commented on SPARK-21770: --- I vaguely recall discussing this before but forget

[jira] [Commented] (SPARK-19747) Consolidate code in ML aggregators

2017-08-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16131340#comment-16131340 ] Joseph K. Bradley commented on SPARK-19747: --- Just saying: Thanks a lot for doing this reorg!

[jira] [Updated] (SPARK-21681) MLOR do not work correctly when featureStd contains zero

2017-08-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21681: -- Shepherd: Joseph K. Bradley > MLOR do not work correctly when featureStd contains zero

[jira] [Updated] (SPARK-21681) MLOR do not work correctly when featureStd contains zero

2017-08-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21681: -- Affects Version/s: 2.3.0 > MLOR do not work correctly when featureStd contains zero >

[jira] [Updated] (SPARK-21681) MLOR do not work correctly when featureStd contains zero

2017-08-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21681: -- Target Version/s: 2.2.1, 2.3.0 > MLOR do not work correctly when featureStd contains

[jira] [Assigned] (SPARK-21681) MLOR do not work correctly when featureStd contains zero

2017-08-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21681: - Assignee: Weichen Xu > MLOR do not work correctly when featureStd contains zero

[jira] [Commented] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2017-08-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16126583#comment-16126583 ] Joseph K. Bradley commented on SPARK-12664: --- [~yanboliang] I can take over shepherding this

[jira] [Updated] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2017-08-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12664: -- Shepherd: Joseph K. Bradley (was: Yanbo Liang) > Expose raw prediction scores in

[jira] [Created] (SPARK-21729) Generic test for ProbabilisticClassifier to ensure consistent output columns

2017-08-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-21729: - Summary: Generic test for ProbabilisticClassifier to ensure consistent output columns Key: SPARK-21729 URL: https://issues.apache.org/jira/browse/SPARK-21729

[jira] [Commented] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2017-08-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16124493#comment-16124493 ] Joseph K. Bradley commented on SPARK-17025: --- [~nchammas] I just merged

[jira] [Commented] (SPARK-21685) Params isSet in scala Transformer triggered by _setDefault in pyspark

2017-08-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16122492#comment-16122492 ] Joseph K. Bradley commented on SPARK-21685: --- Could you please point to more info, such as the

[jira] [Resolved] (SPARK-21542) Helper functions for custom Python Persistence

2017-08-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21542. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18742

[jira] [Assigned] (SPARK-21542) Helper functions for custom Python Persistence

2017-08-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21542: - Assignee: Ajay Saini > Helper functions for custom Python Persistence >

[jira] [Resolved] (SPARK-21633) Unary Transformer in Python

2017-08-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21633. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18746

[jira] [Assigned] (SPARK-21633) Unary Transformer in Python

2017-08-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21633: - Assignee: Ajay Saini > Unary Transformer in Python >

[jira] [Updated] (SPARK-21633) Unary Transformer in Python

2017-08-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21633: -- Shepherd: Joseph K. Bradley > Unary Transformer in Python >

[jira] [Updated] (SPARK-21542) Helper functions for custom Python Persistence

2017-08-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21542: -- Shepherd: Joseph K. Bradley > Helper functions for custom Python Persistence >

[jira] [Updated] (SPARK-21542) Helper functions for custom Python Persistence

2017-07-26 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21542: -- Description: Currently, there is no way to easily persist Json-serializable parameters

[jira] [Updated] (SPARK-21542) Helper functions for custom Python Persistence

2017-07-26 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21542: -- Component/s: ML > Helper functions for custom Python Persistence >

[jira] [Resolved] (SPARK-13786) Pyspark ml.tuning support export/import

2017-07-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-13786. --- Resolution: Duplicate > Pyspark ml.tuning support export/import >

[jira] [Commented] (SPARK-13786) Pyspark ml.tuning support export/import

2017-07-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16101057#comment-16101057 ] Joseph K. Bradley commented on SPARK-13786: --- This has been resolved now via [SPARK-11893],

[jira] [Commented] (SPARK-21523) Fix bug of strong wolfe linesearch `init` parameter lose effectiveness

2017-07-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16099229#comment-16099229 ] Joseph K. Bradley commented on SPARK-21523: --- CC [~yanboliang] [~yuhaoyan] [~dbtsai] making a

[jira] [Updated] (SPARK-21523) Fix bug of strong wolfe linesearch `init` parameter lose effectiveness

2017-07-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21523: -- Description: We need merge this breeze bugfix into spark because it influence a series

[jira] [Commented] (SPARK-15574) Python meta-algorithms in Scala

2017-07-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16090459#comment-16090459 ] Joseph K. Bradley commented on SPARK-15574: --- (Just commented on the PR; I'm uncertain about the

[jira] [Resolved] (SPARK-21221) CrossValidator and TrainValidationSplit Persist Nested Estimators such as OneVsRest

2017-07-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21221. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18428

[jira] [Commented] (SPARK-20090) Add StructType.fieldNames to Python API

2017-07-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16087666#comment-16087666 ] Joseph K. Bradley commented on SPARK-20090: --- Sorry for not seeing this. You're right about

[jira] [Commented] (SPARK-20099) Add transformSchema to pyspark.ml

2017-07-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16086422#comment-16086422 ] Joseph K. Bradley commented on SPARK-20099: --- [~holdenk] [~yanboliang] [~yuhaoyan] [~mlnick]

[jira] [Assigned] (SPARK-21221) CrossValidator and TrainValidationSplit Persist Nested Estimators

2017-07-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21221: - Assignee: Ajay Saini > CrossValidator and TrainValidationSplit Persist Nested

[jira] [Updated] (SPARK-21221) CrossValidator and TrainValidationSplit Persist Nested Estimators

2017-07-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21221: -- Affects Version/s: (was: 2.1.1) 2.2.0 > CrossValidator and

[jira] [Updated] (SPARK-20604) Allow Imputer to handle all numeric types

2017-07-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20604: -- Issue Type: Improvement (was: Bug) > Allow Imputer to handle all numeric types >

[jira] [Updated] (SPARK-21241) Add intercept to StreamingLinearRegressionWithSGD

2017-07-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21241: -- Issue Type: New Feature (was: Bug) > Add intercept to

[jira] [Commented] (SPARK-20133) User guide for spark.ml.stat.ChiSquareTest

2017-07-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16081503#comment-16081503 ] Joseph K. Bradley commented on SPARK-20133: --- Sorry for the slow response; please feel free to!

[jira] [Commented] (SPARK-21086) CrossValidator, TrainValidationSplit should preserve all models after fitting

2017-07-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16081500#comment-16081500 ] Joseph K. Bradley commented on SPARK-21086: --- I like the idea for that path, but it could become

[jira] [Commented] (SPARK-21341) Spark 2.1.1: I want to be able to serialize wordVectors on Word2VecModel

2017-07-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16081483#comment-16081483 ] Joseph K. Bradley commented on SPARK-21341: --- +1 for the built-in save/load. Saving as an

[jira] [Updated] (SPARK-21208) Ability to "setLocalProperty" from sc, in sparkR

2017-07-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21208: -- Issue Type: New Feature (was: Bug) > Ability to "setLocalProperty" from sc, in sparkR

[jira] [Resolved] (SPARK-21341) Spark 2.1.1: I want to be able to serialize wordVectors on Word2VecModel

2017-07-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21341. --- Resolution: Not A Problem > Spark 2.1.1: I want to be able to serialize wordVectors

[jira] [Updated] (SPARK-20929) LinearSVC should not use shared Param HasThresholds

2017-06-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20929: -- Target Version/s: 2.2.0 (was: 2.2.1, 2.3.0) > LinearSVC should not use shared Param

[jira] [Updated] (SPARK-20929) LinearSVC should not use shared Param HasThresholds

2017-06-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20929: -- Fix Version/s: (was: 2.2.1) (was: 2.3.0)

[jira] [Updated] (SPARK-21221) CrossValidator and TrainValidationSplit Persist Nested Estimators

2017-06-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21221: -- Shepherd: Joseph K. Bradley > CrossValidator and TrainValidationSplit Persist Nested

[jira] [Created] (SPARK-21166) Automated ML persistence

2017-06-21 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-21166: - Summary: Automated ML persistence Key: SPARK-21166 URL: https://issues.apache.org/jira/browse/SPARK-21166 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-20114) spark.ml parity for sequential pattern mining - PrefixSpan

2017-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20114: -- Issue Type: Sub-task (was: New Feature) Parent: SPARK-14501 > spark.ml parity

[jira] [Resolved] (SPARK-20929) LinearSVC should not use shared Param HasThresholds

2017-06-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-20929. --- Resolution: Fixed Fix Version/s: 2.3.0 2.2.1 Issue

[jira] [Updated] (SPARK-20929) LinearSVC should not use shared Param HasThresholds

2017-06-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20929: -- Target Version/s: 2.2.1, 2.3.0 > LinearSVC should not use shared Param HasThresholds >

[jira] [Created] (SPARK-21088) CrossValidator, TrainValidationSplit should preserve all models after fitting: Python

2017-06-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-21088: - Summary: CrossValidator, TrainValidationSplit should preserve all models after fitting: Python Key: SPARK-21088 URL: https://issues.apache.org/jira/browse/SPARK-21088

[jira] [Updated] (SPARK-21088) CrossValidator, TrainValidationSplit should preserve all models after fitting: Python

2017-06-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21088: -- Component/s: PySpark > CrossValidator, TrainValidationSplit should preserve all models

[jira] [Created] (SPARK-21087) CrossValidator, TrainValidationSplit should preserve all models after fitting: Scala

2017-06-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-21087: - Summary: CrossValidator, TrainValidationSplit should preserve all models after fitting: Scala Key: SPARK-21087 URL: https://issues.apache.org/jira/browse/SPARK-21087

[jira] [Created] (SPARK-21086) CrossValidator, TrainValidationSplit should preserve all models after fitting

2017-06-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-21086: - Summary: CrossValidator, TrainValidationSplit should preserve all models after fitting Key: SPARK-21086 URL: https://issues.apache.org/jira/browse/SPARK-21086

[jira] [Commented] (SPARK-13333) DataFrame filter + randn + unionAll has bad interaction

2017-06-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16048477#comment-16048477 ] Joseph K. Bradley commented on SPARK-1: --- Thanks for explaining! I just rediscovered this

[jira] [Updated] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21027: -- Shepherd: Joseph K. Bradley > Parallel One vs. Rest Classifier >

[jira] [Closed] (SPARK-14450) Python OneVsRest should train multiple models at once

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-14450. - Resolution: Duplicate > Python OneVsRest should train multiple models at once >

[jira] [Commented] (SPARK-14450) Python OneVsRest should train multiple models at once

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047245#comment-16047245 ] Joseph K. Bradley commented on SPARK-14450: --- See linked JIRA for new issue. > Python OneVsRest

[jira] [Commented] (SPARK-14450) Python OneVsRest should train multiple models at once

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047244#comment-16047244 ] Joseph K. Bradley commented on SPARK-14450: --- Scala already has parallelization. I just

[jira] [Commented] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047243#comment-16047243 ] Joseph K. Bradley commented on SPARK-21027: --- Copying from [ML-14450]: [SPARK-7861] adds a

[jira] [Comment Edited] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047243#comment-16047243 ] Joseph K. Bradley edited comment on SPARK-21027 at 6/12/17 11:54 PM: -

[jira] [Commented] (SPARK-21027) Parallel One vs. Rest Classifier

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16047241#comment-16047241 ] Joseph K. Bradley commented on SPARK-21027: --- Whoops! I realized I'd reported this long

[jira] [Resolved] (SPARK-21050) ml word2vec write has overflow issue in calculating numPartitions

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21050. --- Resolution: Fixed Fix Version/s: 2.3.0 2.2.1 Issue

[jira] [Updated] (SPARK-20499) Spark MLlib, GraphX 2.2 QA umbrella

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20499: -- Fix Version/s: 2.2.0 > Spark MLlib, GraphX 2.2 QA umbrella >

[jira] [Updated] (SPARK-20507) Update MLlib, GraphX websites for 2.2

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20507: -- Fix Version/s: 2.2.0 > Update MLlib, GraphX websites for 2.2 >

[jira] [Assigned] (SPARK-20511) SparkR 2.2 QA: Check for new R APIs requiring example code

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-20511: - Assignee: Felix Cheung > SparkR 2.2 QA: Check for new R APIs requiring example

[jira] [Updated] (SPARK-18864) Changes of MLlib and SparkR behavior for 2.2

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18864: -- Fix Version/s: 2.2.0 > Changes of MLlib and SparkR behavior for 2.2 >

[jira] [Updated] (SPARK-20511) SparkR 2.2 QA: Check for new R APIs requiring example code

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20511: -- Fix Version/s: 2.2.0 > SparkR 2.2 QA: Check for new R APIs requiring example code >

[jira] [Assigned] (SPARK-20508) Spark R 2.2 QA umbrella

2017-06-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-20508: - Assignee: Felix Cheung (was: Joseph K. Bradley) > Spark R 2.2 QA umbrella >

<    1   2   3   4   5   6   7   8   9   10   >