[jira] [Issue Comment Deleted] (SPARK-24114) improve instrumentation for spark.ml.recommendation

2018-05-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-24114: -- Comment: was deleted (was: User 'MrBago' has created a pull request for this issue:

[jira] [Resolved] (SPARK-24310) Instrumentation for frequent pattern mining

2018-05-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-24310. --- Resolution: Fixed Fix Version/s: 2.4.0 > Instrumentation for frequent pattern

[jira] [Commented] (SPARK-24310) Instrumentation for frequent pattern mining

2018-05-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16479684#comment-16479684 ] Joseph K. Bradley commented on SPARK-24310: --- The PR for this was linked to the wrong JIRA, but

[jira] [Created] (SPARK-24310) Instrumentation for frequent pattern mining

2018-05-17 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-24310: - Summary: Instrumentation for frequent pattern mining Key: SPARK-24310 URL: https://issues.apache.org/jira/browse/SPARK-24310 Project: Spark Issue

[jira] [Assigned] (SPARK-24114) improve instrumentation for spark.ml.recommendation

2018-05-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-24114: - Assignee: (was: Bago Amirbekian) > improve instrumentation for

[jira] [Updated] (SPARK-24114) improve instrumentation for spark.ml.recommendation

2018-05-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-24114: -- Shepherd: (was: Joseph K. Bradley) > improve instrumentation for

[jira] [Updated] (SPARK-24114) improve instrumentation for spark.ml.recommendation

2018-05-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-24114: -- Shepherd: Joseph K. Bradley > improve instrumentation for spark.ml.recommendation >

[jira] [Assigned] (SPARK-24114) improve instrumentation for spark.ml.recommendation

2018-05-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-24114: - Assignee: Bago Amirbekian > improve instrumentation for spark.ml.recommendation

[jira] [Commented] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-05-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16478328#comment-16478328 ] Joseph K. Bradley commented on SPARK-15784: --- [~shahid] Thanks for offering! If [~wm624] wants

[jira] [Resolved] (SPARK-22210) Online LDA variationalTopicInference should use random seed to have stable behavior

2018-05-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-22210. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21183

[jira] [Resolved] (SPARK-24058) Default Params in ML should be saved separately: Python API

2018-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-24058. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21153

[jira] [Assigned] (SPARK-24058) Default Params in ML should be saved separately: Python API

2018-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-24058: - Assignee: Liang-Chi Hsieh > Default Params in ML should be saved separately:

[jira] [Commented] (SPARK-24213) Power Iteration Clustering in the SparkML throws exception, when the ID is IntType

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16470705#comment-16470705 ] Joseph K. Bradley commented on SPARK-24213: --- On the topic of eating my words, please check out

[jira] [Commented] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16470704#comment-16470704 ] Joseph K. Bradley commented on SPARK-24217: --- On the topic of eating my words, please check out

[jira] [Comment Edited] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16470701#comment-16470701 ] Joseph K. Bradley edited comment on SPARK-15784 at 5/10/18 4:45 PM:

[jira] [Commented] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16470701#comment-16470701 ] Joseph K. Bradley commented on SPARK-15784: --- So... we originally agreed to make this a

[jira] [Comment Edited] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16469562#comment-16469562 ] Joseph K. Bradley edited comment on SPARK-24217 at 5/10/18 4:37 PM:

[jira] [Commented] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16469562#comment-16469562 ] Joseph K. Bradley commented on SPARK-24217: --- But the reason that the IDs are missing from the

[jira] [Commented] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some vertices.

2018-05-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16469230#comment-16469230 ] Joseph K. Bradley commented on SPARK-24217: --- I don't really think this is a bug. PIC's

[jira] [Resolved] (SPARK-14682) Provide evaluateEachIteration method or equivalent for spark.ml GBTs

2018-05-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14682. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21097

[jira] [Assigned] (SPARK-14682) Provide evaluateEachIteration method or equivalent for spark.ml GBTs

2018-05-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-14682: - Assignee: Weichen Xu > Provide evaluateEachIteration method or equivalent for

[jira] [Updated] (SPARK-7132) Add fit with validation set to spark.ml GBT

2018-05-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7132: - Shepherd: Joseph K. Bradley > Add fit with validation set to spark.ml GBT >

[jira] [Assigned] (SPARK-7132) Add fit with validation set to spark.ml GBT

2018-05-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-7132: Assignee: Weichen Xu > Add fit with validation set to spark.ml GBT >

[jira] [Commented] (SPARK-24213) Power Iteration Clustering in the SparkML throws exception, when the ID is IntType

2018-05-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468018#comment-16468018 ] Joseph K. Bradley commented on SPARK-24213: --- Thanks for reporting this issue! There is

[jira] [Created] (SPARK-24212) PrefixSpan in spark.ml: user guide section

2018-05-08 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-24212: - Summary: PrefixSpan in spark.ml: user guide section Key: SPARK-24212 URL: https://issues.apache.org/jira/browse/SPARK-24212 Project: Spark Issue

[jira] [Closed] (SPARK-24145) spark.ml parity for sequential pattern mining - PrefixSpan: Python API

2018-05-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-24145. - > spark.ml parity for sequential pattern mining - PrefixSpan: Python API >

[jira] [Resolved] (SPARK-24145) spark.ml parity for sequential pattern mining - PrefixSpan: Python API

2018-05-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-24145. --- Resolution: Duplicate > spark.ml parity for sequential pattern mining - PrefixSpan:

[jira] [Resolved] (SPARK-20114) spark.ml parity for sequential pattern mining - PrefixSpan

2018-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-20114. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20973

[jira] [Resolved] (SPARK-22885) ML test for StructuredStreaming: spark.ml.tuning

2018-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-22885. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20261

[jira] [Resolved] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified in pyspark

2018-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-15750. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 13493

[jira] [Commented] (SPARK-24152) SparkR CRAN feasibility check server problem

2018-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466513#comment-16466513 ] Joseph K. Bradley commented on SPARK-24152: --- Thank you all! > SparkR CRAN feasibility check

[jira] [Updated] (SPARK-24097) Instruments improvements - RandomForest and GradientBoostedTree

2018-05-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-24097: -- Shepherd: Joseph K. Bradley > Instruments improvements - RandomForest and

[jira] [Assigned] (SPARK-24097) Instruments improvements - RandomForest and GradientBoostedTree

2018-05-02 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-24097: - Assignee: Weichen Xu > Instruments improvements - RandomForest and

[jira] [Comment Edited] (SPARK-23686) Make better usage of org.apache.spark.ml.util.Instrumentation

2018-05-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16460164#comment-16460164 ] Joseph K. Bradley edited comment on SPARK-23686 at 5/2/18 12:21 AM:

[jira] [Comment Edited] (SPARK-23686) Make better usage of org.apache.spark.ml.util.Instrumentation

2018-05-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16460164#comment-16460164 ] Joseph K. Bradley edited comment on SPARK-23686 at 5/1/18 9:52 PM: ---

[jira] [Assigned] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified in pyspark

2018-05-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-15750: - Assignee: Jeff Zhang > Constructing FPGrowth fails when no numPartitions

[jira] [Updated] (SPARK-15750) Constructing FPGrowth fails when no numPartitions specified in pyspark

2018-05-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15750: -- Shepherd: Joseph K. Bradley > Constructing FPGrowth fails when no numPartitions

[jira] [Commented] (SPARK-23686) Make better usage of org.apache.spark.ml.util.Instrumentation

2018-05-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16460164#comment-16460164 ] Joseph K. Bradley commented on SPARK-23686: --- [~yogeshgarg] made the good point that we should

[jira] [Updated] (SPARK-22885) ML test for StructuredStreaming: spark.ml.tuning

2018-05-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22885: -- Shepherd: Joseph K. Bradley > ML test for StructuredStreaming: spark.ml.tuning >

[jira] [Assigned] (SPARK-22885) ML test for StructuredStreaming: spark.ml.tuning

2018-05-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-22885: - Assignee: Weichen Xu > ML test for StructuredStreaming: spark.ml.tuning >

[jira] [Commented] (SPARK-24115) improve instrumentation for spark.ml.tuning

2018-04-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16459033#comment-16459033 ] Joseph K. Bradley commented on SPARK-24115: --- Sounds good; go ahead. > improve instrumentation

[jira] [Assigned] (SPARK-22210) Online LDA variationalTopicInference should use random seed to have stable behavior

2018-04-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-22210: - Assignee: Lu Wang > Online LDA variationalTopicInference should use random

[jira] [Updated] (SPARK-22210) Online LDA variationalTopicInference should use random seed to have stable behavior

2018-04-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22210: -- Shepherd: Joseph K. Bradley > Online LDA variationalTopicInference should use random

[jira] [Commented] (SPARK-22210) Online LDA variationalTopicInference should use random seed to have stable behavior

2018-04-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453228#comment-16453228 ] Joseph K. Bradley commented on SPARK-22210: --- [~lu.DB] Would you like to do this? It should be

[jira] [Resolved] (SPARK-23824) Make inpurityStats publicly accessible in ml.tree.Node

2018-04-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23824. --- Resolution: Duplicate > Make inpurityStats publicly accessible in ml.tree.Node >

[jira] [Assigned] (SPARK-20114) spark.ml parity for sequential pattern mining - PrefixSpan

2018-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-20114: - Assignee: Weichen Xu > spark.ml parity for sequential pattern mining -

[jira] [Updated] (SPARK-20114) spark.ml parity for sequential pattern mining - PrefixSpan

2018-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20114: -- Shepherd: Joseph K. Bradley > spark.ml parity for sequential pattern mining -

[jira] [Updated] (SPARK-20114) spark.ml parity for sequential pattern mining - PrefixSpan

2018-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20114: -- Target Version/s: 2.4.0 > spark.ml parity for sequential pattern mining - PrefixSpan >

[jira] [Resolved] (SPARK-23990) Instruments logging improvements - ML regression package

2018-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23990. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21078

[jira] [Resolved] (SPARK-23455) Default Params in ML should be saved separately

2018-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23455. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20633

[jira] [Commented] (SPARK-23975) Allow Clustering to take Arrays of Double as input features

2018-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16450161#comment-16450161 ] Joseph K. Bradley commented on SPARK-23975: --- I merged

[jira] [Assigned] (SPARK-23975) Allow Clustering to take Arrays of Double as input features

2018-04-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23975: - Assignee: Lu Wang > Allow Clustering to take Arrays of Double as input features

[jira] [Updated] (SPARK-23455) Default Params in ML should be saved separately

2018-04-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23455: -- Target Version/s: 2.4.0 > Default Params in ML should be saved separately >

[jira] [Assigned] (SPARK-23455) Default Params in ML should be saved separately

2018-04-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23455: - Assignee: Liang-Chi Hsieh > Default Params in ML should be saved separately >

[jira] [Commented] (SPARK-24058) Default Params in ML should be saved separately: Python API

2018-04-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16448994#comment-16448994 ] Joseph K. Bradley commented on SPARK-24058: --- CCing [~viirya] since you're the natural one to

[jira] [Created] (SPARK-24058) Default Params in ML should be saved separately: Python API

2018-04-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-24058: - Summary: Default Params in ML should be saved separately: Python API Key: SPARK-24058 URL: https://issues.apache.org/jira/browse/SPARK-24058 Project: Spark

[jira] [Updated] (SPARK-23990) Instruments logging improvements - ML regression package

2018-04-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23990: -- Shepherd: Joseph K. Bradley > Instruments logging improvements - ML regression package

[jira] [Assigned] (SPARK-23990) Instruments logging improvements - ML regression package

2018-04-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23990: - Assignee: Weichen Xu > Instruments logging improvements - ML regression package

[jira] [Resolved] (SPARK-24026) spark.ml Scala/Java API for PIC

2018-04-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-24026. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21090

[jira] [Created] (SPARK-24026) spark.ml Scala/Java API for PIC

2018-04-19 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-24026: - Summary: spark.ml Scala/Java API for PIC Key: SPARK-24026 URL: https://issues.apache.org/jira/browse/SPARK-24026 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18693) BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441713#comment-16441713 ] Joseph K. Bradley commented on SPARK-18693: --- [~imatiach] Would you mind creating JIRA subtasks

[jira] [Commented] (SPARK-23990) Instruments logging improvements - ML regression package

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441701#comment-16441701 ] Joseph K. Bradley commented on SPARK-23990: --- A complication was brought up by this PR: Some

[jira] [Updated] (SPARK-22884) ML test for StructuredStreaming: spark.ml.clustering

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22884: -- Shepherd: Joseph K. Bradley > ML test for StructuredStreaming: spark.ml.clustering >

[jira] [Updated] (SPARK-8799) OneVsRestModel should extend ClassificationModel

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8799: - Shepherd: Joseph K. Bradley > OneVsRestModel should extend ClassificationModel >

[jira] [Commented] (SPARK-8799) OneVsRestModel should extend ClassificationModel

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441207#comment-16441207 ] Joseph K. Bradley commented on SPARK-8799: -- The missing functionality was added in [SPARK-9312],

[jira] [Updated] (SPARK-8799) OneVsRestModel should extend ClassificationModel

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8799: - Target Version/s: 3.0.0 > OneVsRestModel should extend ClassificationModel >

[jira] [Assigned] (SPARK-21741) Python API for DataFrame-based multivariate summarizer

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21741: - Assignee: Weichen Xu > Python API for DataFrame-based multivariate summarizer >

[jira] [Resolved] (SPARK-21741) Python API for DataFrame-based multivariate summarizer

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21741. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20695

[jira] [Updated] (SPARK-23975) Allow Clustering to take Arrays of Double as input features

2018-04-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23975: -- Shepherd: Joseph K. Bradley > Allow Clustering to take Arrays of Double as input

[jira] [Resolved] (SPARK-21088) CrossValidator, TrainValidationSplit should collect all models when fitting: Python API

2018-04-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21088. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 19627

[jira] [Assigned] (SPARK-21088) CrossValidator, TrainValidationSplit should collect all models when fitting: Python API

2018-04-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21088: - Assignee: Weichen Xu > CrossValidator, TrainValidationSplit should collect all

[jira] [Updated] (SPARK-9312) The OneVsRest model does not provide rawPrediction

2018-04-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9312: - Summary: The OneVsRest model does not provide rawPrediction (was: The OneVsRest model

[jira] [Assigned] (SPARK-9312) The OneVsRest model does not provide confidence factor(not probability) along with the prediction

2018-04-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-9312: Assignee: Lu Wang > The OneVsRest model does not provide confidence factor(not

[jira] [Resolved] (SPARK-9312) The OneVsRest model does not provide confidence factor(not probability) along with the prediction

2018-04-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-9312. -- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21044

[jira] [Resolved] (SPARK-22883) ML test for StructuredStreaming: spark.ml.feature, A-M

2018-04-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-22883. --- Resolution: Fixed Fix Version/s: 2.3.1 Issue resolved by pull request 21042

[jira] [Updated] (SPARK-22883) ML test for StructuredStreaming: spark.ml.feature, A-M

2018-04-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22883: -- Fix Version/s: 2.4.0 > ML test for StructuredStreaming: spark.ml.feature, A-M >

[jira] [Updated] (SPARK-22883) ML test for StructuredStreaming: spark.ml.feature, A-M

2018-04-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22883: -- Target Version/s: 2.3.1, 2.4.0 > ML test for StructuredStreaming: spark.ml.feature,

[jira] [Resolved] (SPARK-19947) RFormulaModel always throws Exception on transforming data with NULL or Unseen labels

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19947. --- Resolution: Fixed Fix Version/s: 2.4.0 I'll mark this as complete. Those

[jira] [Resolved] (SPARK-23562) RFormula handleInvalid should handle invalid values in non-string columns.

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23562. --- Resolution: Fixed Fix Version/s: 2.4.0 I think everything has been fixed, so

[jira] [Updated] (SPARK-23562) RFormula handleInvalid should handle invalid values in non-string columns.

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23562: -- Shepherd: Joseph K. Bradley > RFormula handleInvalid should handle invalid values in

[jira] [Resolved] (SPARK-23944) Add Param set functions to LSHModel types

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23944. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21015

[jira] [Assigned] (SPARK-23944) Add Param set functions to LSHModel types

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23944: - Assignee: Lu Wang > Add Param set functions to LSHModel types >

[jira] [Resolved] (SPARK-23871) add python api for VectorAssembler handleInvalid

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23871. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21003

[jira] [Updated] (SPARK-23871) add python api for VectorAssembler handleInvalid

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23871: -- Shepherd: Joseph K. Bradley > add python api for VectorAssembler handleInvalid >

[jira] [Assigned] (SPARK-23871) add python api for VectorAssembler handleInvalid

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23871: - Assignee: Huaxin Gao > add python api for VectorAssembler handleInvalid >

[jira] [Updated] (SPARK-21856) Update Python API for MultilayerPerceptronClassifierModel

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21856: -- Fix Version/s: 2.3.0 > Update Python API for MultilayerPerceptronClassifierModel >

[jira] [Assigned] (SPARK-23751) Kolmogorov-Smirnoff test Python API in pyspark.ml

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23751: - Assignee: Weichen Xu > Kolmogorov-Smirnoff test Python API in pyspark.ml >

[jira] [Resolved] (SPARK-23751) Kolmogorov-Smirnoff test Python API in pyspark.ml

2018-04-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23751. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20904

[jira] [Updated] (SPARK-23944) Add Param set functions to LSHModel types

2018-04-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23944: -- Fix Version/s: (was: 2.4.0) > Add Param set functions to LSHModel types >

[jira] [Resolved] (SPARK-14681) Provide label/impurity stats for spark.ml decision tree nodes

2018-04-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14681. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20786

[jira] [Assigned] (SPARK-14681) Provide label/impurity stats for spark.ml decision tree nodes

2018-04-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-14681: - Assignee: Weichen Xu > Provide label/impurity stats for spark.ml decision tree

[jira] [Commented] (SPARK-21005) VectorIndexerModel does not prepare output column field correctly

2018-04-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16431079#comment-16431079 ] Joseph K. Bradley commented on SPARK-21005: --- I don't actually see why this is a problem: If a

[jira] [Commented] (SPARK-18092) add type cast to avoid error "Column prediction must be of type DoubleType but was actually FloatType"

2018-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16429134#comment-16429134 ] Joseph K. Bradley commented on SPARK-18092: --- Can you please add a description and make the

[jira] [Updated] (SPARK-23751) Kolmogorov-Smirnoff test Python API in pyspark.ml

2018-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23751: -- Shepherd: Joseph K. Bradley > Kolmogorov-Smirnoff test Python API in pyspark.ml >

[jira] [Resolved] (SPARK-23859) Initial PR for Instrumentation improvements: UUID and logging levels

2018-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23859. --- Resolution: Fixed Fix Version/s: 2.4.0 Resolved with

[jira] [Commented] (SPARK-23686) Make better usage of org.apache.spark.ml.util.Instrumentation

2018-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16428612#comment-16428612 ] Joseph K. Bradley commented on SPARK-23686: --- I wanted to ping some other active MLlib

[jira] [Resolved] (SPARK-23870) Forward RFormula handleInvalid Param to VectorAssembler

2018-04-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23870. --- Resolution: Fixed Fix Version/s: 2.4.0 Resolved via

[jira] [Assigned] (SPARK-23870) Forward RFormula handleInvalid Param to VectorAssembler

2018-04-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23870: - Assignee: yogesh garg > Forward RFormula handleInvalid Param to

[jira] [Updated] (SPARK-23870) Forward RFormula handleInvalid Param to VectorAssembler

2018-04-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23870: -- Fix Version/s: (was: 2.4.0) > Forward RFormula handleInvalid Param to

[jira] [Resolved] (SPARK-22667) Fix model-specific optimization support for ML tuning: Python API

2018-04-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-22667. --- Resolution: Duplicate Fix Version/s: 2.3.0 > Fix model-specific optimization
