[jira] [Commented] (SPARK-18569) Support R formula arithmetic

2017-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15827021#comment-15827021 ] Joseph K. Bradley commented on SPARK-18569: --- +1 for putting together a design d

[jira] [Commented] (SPARK-18618) SparkR GLM model predict should support type as a argument

2017-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15827019#comment-15827019 ] Joseph K. Bradley commented on SPARK-18618: --- [~yanboliang] Will you be willing

[jira] [Updated] (SPARK-3162) Train DecisionTree locally when possible

2017-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3162: - Target Version/s: (was: 2.2.0) > Train DecisionTree locally when possible >

[jira] [Commented] (SPARK-12347) Write script to run all MLlib examples for testing

2017-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15826997#comment-15826997 ] Joseph K. Bradley commented on SPARK-12347: --- I really want this to get in but j

[jira] [Updated] (SPARK-12347) Write script to run all MLlib examples for testing

2017-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12347: -- Target Version/s: (was: 2.2.0) > Write script to run all MLlib examples for testing >

[jira] [Updated] (SPARK-18613) spark.ml LDA classes should not expose spark.mllib in APIs

2017-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18613: -- Shepherd: Joseph K. Bradley > spark.ml LDA classes should not expose spark.mllib in API

[jira] [Commented] (SPARK-18924) Improve collect/createDataFrame performance in SparkR

2017-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15826996#comment-15826996 ] Joseph K. Bradley commented on SPARK-18924: --- Per the 2.2 roadmap process, I'm g

[jira] [Updated] (SPARK-18924) Improve collect/createDataFrame performance in SparkR

2017-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18924: -- Shepherd: Xiangrui Meng > Improve collect/createDataFrame performance in SparkR > -

[jira] [Commented] (SPARK-19247) improve ml word2vec save/load

2017-01-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15826931#comment-15826931 ] Joseph K. Bradley commented on SPARK-19247: --- You're right; I forgot about that

[jira] [Commented] (SPARK-19247) improve ml word2vec save/load

2017-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15824807#comment-15824807 ] Joseph K. Bradley commented on SPARK-19247: --- Is this an actual problem? If thi

[jira] [Commented] (SPARK-11569) StringIndexer transform fails when column contains nulls

2017-01-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15813712#comment-15813712 ] Joseph K. Bradley commented on SPARK-11569: --- Hi all, I'm sorry for not followin

[jira] [Commented] (SPARK-13610) Create a Transformer to disassemble vectors in DataFrames

2017-01-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15810767#comment-15810767 ] Joseph K. Bradley commented on SPARK-13610: --- This sounds like a reasonable use

[jira] [Commented] (SPARK-11968) ALS recommend all methods spend most of time in GC

2017-01-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15809805#comment-15809805 ] Joseph K. Bradley commented on SPARK-11968: --- I disagree; this is a problem but

[jira] [Resolved] (SPARK-19110) DistributedLDAModel returns different logPrior for original and loaded model

2017-01-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19110. --- Resolution: Fixed Fix Version/s: 2.2.0 2.0.3

[jira] [Commented] (SPARK-18948) Add Mean Percentile Rank metric for ranking algorithms

2017-01-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15808030#comment-15808030 ] Joseph K. Bradley commented on SPARK-18948: --- Oh I agree we'd need to add the in

[jira] [Updated] (SPARK-19110) DistributedLDAModel returns different logPrior for original and loaded model

2017-01-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19110: -- Target Version/s: 2.0.3, 2.1.1, 2.2.0 (was: 1.6.4, 2.0.3, 2.1.1, 2.2.0) > DistributedL

[jira] [Updated] (SPARK-19110) DistributedLDAModel returns different logPrior for original and loaded model

2017-01-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19110: -- Assignee: Miao Wang > DistributedLDAModel returns different logPrior for original and l

[jira] [Updated] (SPARK-19110) DistributedLDAModel returns different logPrior for original and loaded model

2017-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19110: -- Shepherd: Joseph K. Bradley Target Version/s: 1.6.4, 2.0.3, 2.1.1, 2.2.0 >

[jira] [Updated] (SPARK-19110) DistributedLDAModel returns different logPrior for original and loaded model

2017-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19110: -- Affects Version/s: 2.2.0 1.3.1 1.4.1

[jira] [Resolved] (SPARK-18194) Log instrumentation in OneVsRest, CrossValidator, TrainValidationSplit

2017-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-18194. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16480 [h

[jira] [Commented] (SPARK-17455) IsotonicRegression takes non-polynomial time for some inputs

2017-01-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15802805#comment-15802805 ] Joseph K. Bradley commented on SPARK-17455: --- I'm changing the target version si

[jira] [Updated] (SPARK-17455) IsotonicRegression takes non-polynomial time for some inputs

2017-01-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17455: -- Target Version/s: 2.2.0 (was: 2.0.3, 2.1.1, 2.2.0) > IsotonicRegression takes non-poly

[jira] [Updated] (SPARK-18194) Log instrumentation in OneVsRest, CrossValidator, TrainValidationSplit

2017-01-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18194: -- Assignee: Sue Ann Hong > Log instrumentation in OneVsRest, CrossValidator, TrainValidat

[jira] [Updated] (SPARK-18194) Log instrumentation in OneVsRest, CrossValidator, TrainValidationSplit

2017-01-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18194: -- Shepherd: Joseph K. Bradley Target Version/s: 2.2.0 > Log instrumentation i

[jira] [Commented] (SPARK-5844) Optimize Pipeline.fit for ParamGrid

2017-01-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15802264#comment-15802264 ] Joseph K. Bradley commented on SPARK-5844: -- Thanks for the ideas! There is a key

[jira] [Commented] (SPARK-19071) Optimizations for ML Pipeline Tuning

2017-01-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15802260#comment-15802260 ] Joseph K. Bradley commented on SPARK-19071: --- Thanks @Bryan for the thoughtful d

[jira] [Commented] (SPARK-14804) Graph vertexRDD/EdgeRDD checkpoint results ClassCastException:

2017-01-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15799094#comment-15799094 ] Joseph K. Bradley commented on SPARK-14804: --- I think this is a separate issue f

[jira] [Commented] (SPARK-17265) EdgeRDD Difference throws an exception

2017-01-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15799091#comment-15799091 ] Joseph K. Bradley commented on SPARK-17265: --- [~shishir167] Could you please giv

[jira] [Updated] (SPARK-17747) WeightCol support non-double datatypes

2017-01-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17747: -- Priority: Minor (was: Major) > WeightCol support non-double datatypes > --

[jira] [Updated] (SPARK-18206) Log instrumentation in MPC, NB, LDA, AFT, GLR, Isotonic, LinReg

2017-01-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18206: -- Shepherd: Joseph K. Bradley > Log instrumentation in MPC, NB, LDA, AFT, GLR, Isotonic,

[jira] [Commented] (SPARK-17169) To use scala macros to update code when SharedParamsCodeGen.scala changed

2017-01-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15798989#comment-15798989 ] Joseph K. Bradley commented on SPARK-17169: --- Is this worthwhile? A lot of deve

[jira] [Commented] (SPARK-13435) Add Weighted Cohen's kappa to MulticlassMetrics

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15796965#comment-15796965 ] Joseph K. Bradley commented on SPARK-13435: --- Thanks! We just have not been abl

[jira] [Commented] (SPARK-13677) Support Tree-Based Feature Transformation for ML

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15796748#comment-15796748 ] Joseph K. Bradley commented on SPARK-13677: --- [~podongfeng] Apologies for the in

[jira] [Commented] (SPARK-19039) UDF ClosureCleaner bug when UDF, col applied in paste mode in REPL

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15796740#comment-15796740 ] Joseph K. Bradley commented on SPARK-19039: --- Whoops, thanks! Posted stack trac

[jira] [Updated] (SPARK-19039) UDF ClosureCleaner bug when UDF, col applied in paste mode in REPL

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19039: -- Description: When I try this: * Define UDF * Apply UDF to get Column * Use Column in a

[jira] [Commented] (SPARK-18948) Add Mean Percentile Rank metric for ranking algorithms

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15796730#comment-15796730 ] Joseph K. Bradley commented on SPARK-18948: --- OK, but please say if you'd like t

[jira] [Closed] (SPARK-16786) LDA topic distributions for new documents in PySpark

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-16786. - Resolution: Won't Fix > LDA topic distributions for new documents in PySpark > --

[jira] [Commented] (SPARK-16786) LDA topic distributions for new documents in PySpark

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15796648#comment-15796648 ] Joseph K. Bradley commented on SPARK-16786: --- [~supremekai] Thanks for the PR.

[jira] [Resolved] (SPARK-15163) Mark experimental algorithms experimental in PySpark

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-15163. --- Resolution: Fixed Assignee: holdenk Fix Version/s: 2.0.0 I'm resolvin

[jira] [Updated] (SPARK-19057) Instance weights must be non-negative

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19057: -- Summary: Instance weights must be non-negative (was: Instances' weight must be non-neg

[jira] [Comment Edited] (SPARK-5535) Add parameter for storage levels

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15796294#comment-15796294 ] Joseph K. Bradley edited comment on SPARK-5535 at 1/3/17 10:02 PM: -

[jira] [Updated] (SPARK-5535) Add parameter for storage levels

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5535: - Description: Add a special parameter type for storage levels that takes the string repres

[jira] [Commented] (SPARK-19007) Speedup and optimize the GradientBoostedTrees in the "data>memory" scene

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15796313#comment-15796313 ] Joseph K. Bradley commented on SPARK-19007: --- >From discussion on the linked PR:

[jira] [Created] (SPARK-19063) Add parameter for storage levels to LDA

2017-01-03 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-19063: - Summary: Add parameter for storage levels to LDA Key: SPARK-19063 URL: https://issues.apache.org/jira/browse/SPARK-19063 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5535) Add parameter for storage levels

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5535: - Summary: Add parameter for storage levels (was: Add parameter for storage levels.) > Add

[jira] [Commented] (SPARK-5535) Add parameter for storage levels.

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15796294#comment-15796294 ] Joseph K. Bradley commented on SPARK-5535: -- This issue came up in [SPARK-19007],

[jira] [Updated] (SPARK-5535) Add parameter for storage levels

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5535: - Description: Add a special parameter type for storage levels that takes both StorageLevels

[jira] [Updated] (SPARK-18454) Changes to improve Nearest Neighbor Search for LSH

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18454: -- Summary: Changes to improve Nearest Neighbor Search for LSH (was: Changes to fix Neare

[jira] [Commented] (SPARK-12757) Use reference counting to prevent blocks from being evicted during reads

2017-01-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15795770#comment-15795770 ] Joseph K. Bradley commented on SPARK-12757: --- [~joshrosen] Shall we downgrade th

[jira] [Created] (SPARK-19053) Supporting multiple evaluation metrics in DataFrame-based API: discussion

2017-01-02 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-19053: - Summary: Supporting multiple evaluation metrics in DataFrame-based API: discussion Key: SPARK-19053 URL: https://issues.apache.org/jira/browse/SPARK-19053 P

[jira] [Created] (SPARK-19039) UDF ClosureCleaner bug when UDF, col applied in paste mode in REPL

2016-12-30 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-19039: - Summary: UDF ClosureCleaner bug when UDF, col applied in paste mode in REPL Key: SPARK-19039 URL: https://issues.apache.org/jira/browse/SPARK-19039 Project:

[jira] [Commented] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15786319#comment-15786319 ] Joseph K. Bradley commented on SPARK-18813: --- I just added links to the categori

[jira] [Updated] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18813: -- Description: *PROPOSAL: This includes a proposal for the 2.2 roadmap process for MLlib.

[jira] [Updated] (SPARK-18698) public constructor with uid for IndexToString-class

2016-12-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18698: -- Assignee: Ilya Matiach > public constructor with uid for IndexToString-class >

[jira] [Resolved] (SPARK-18698) public constructor with uid for IndexToString-class

2016-12-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-18698. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16436 [h

[jira] [Updated] (SPARK-18698) public constructor with uid for IndexToString-class

2016-12-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18698: -- Shepherd: Joseph K. Bradley Affects Version/s: (was: 2.0.2) Target

[jira] [Updated] (SPARK-18698) public constructor with uid for IndexToString-class

2016-12-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18698: -- Issue Type: Improvement (was: Wish) > public constructor with uid for IndexToString-cl

[jira] [Updated] (SPARK-19007) Speedup and optimize the GradientBoostedTrees in the "data>memory" scene

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19007: -- Component/s: (was: MLlib) > Speedup and optimize the GradientBoostedTrees in the "d

[jira] [Commented] (SPARK-18948) Add Mean Percentile Rank metric for ranking algorithms

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15784216#comment-15784216 ] Joseph K. Bradley commented on SPARK-18948: --- Thanks [~danilo.ascione] for sugge

[jira] [Updated] (SPARK-18948) Add Mean Percentile Rank metric for ranking algorithms

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18948: -- Shepherd: (was: Xiangrui Meng) > Add Mean Percentile Rank metric for ranking algorith

[jira] [Updated] (SPARK-18929) Add Tweedie distribution in GLM

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18929: -- Affects Version/s: (was: 2.0.2) > Add Tweedie distribution in GLM > ---

[jira] [Commented] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15784190#comment-15784190 ] Joseph K. Bradley commented on SPARK-18862: --- I like the chosen organization too

[jira] [Updated] (SPARK-17847) Reduce shuffled data size of GaussianMixture & copy the implementation from mllib to ml

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17847: -- Target Version/s: 2.2.0 > Reduce shuffled data size of GaussianMixture & copy the imple

[jira] [Commented] (SPARK-18757) Models in Pyspark support column setters

2016-12-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15781096#comment-15781096 ] Joseph K. Bradley commented on SPARK-18757: --- I think it's useful to make the Py

[jira] [Updated] (SPARK-18757) Models in Pyspark support column setters

2016-12-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18757: -- Description: Recently, I found three places in which column setters are missing: KMean

[jira] [Commented] (SPARK-18618) SparkR GLM model predict should support type as a argument

2016-12-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15771417#comment-15771417 ] Joseph K. Bradley commented on SPARK-18618: --- Note that [~yanboliang]'s PR from

[jira] [Closed] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-12-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-18291. - Resolution: Duplicate Target Version/s: (was: 2.2.0) I'm closing this since [

[jira] [Updated] (SPARK-18618) SparkR GLM model predict should support type as a argument

2016-12-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18618: -- Description: SparkR GLM model {{predict}} should support {{type}} as a argument. This w

[jira] [Updated] (SPARK-18618) SparkR GLM model predict should support type as a argument

2016-12-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18618: -- Summary: SparkR GLM model predict should support type as a argument (was: SparkR model

[jira] [Updated] (SPARK-10413) ML models should support prediction on single instances

2016-12-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10413: -- Summary: ML models should support prediction on single instances (was: Model should su

[jira] [Updated] (SPARK-15572) ML persistence in R format: compatibility with other languages

2016-12-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15572: -- Summary: ML persistence in R format: compatibility with other languages (was: MLlib in

[jira] [Commented] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15752864#comment-15752864 ] Joseph K. Bradley commented on SPARK-18844: --- Note: Please don't set the Target

[jira] [Updated] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18844: -- Target Version/s: (was: 2.0.3) > Add more binary classification metrics to BinaryClas

[jira] [Updated] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18844: -- Fix Version/s: (was: 2.0.2) > Add more binary classification metrics to BinaryClass

[jira] [Updated] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18844: -- Issue Type: New Feature (was: Improvement) > Add more binary classification metrics to

[jira] [Updated] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18823: -- Fix Version/s: (was: 2.0.2) > Assignation by column name variable not available or

[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15752456#comment-15752456 ] Joseph K. Bradley commented on SPARK-18823: --- Note: Please don't set the Target

[jira] [Updated] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18823: -- Target Version/s: (was: 2.0.2) > Assignation by column name variable not available or

[jira] [Resolved] (SPARK-18329) Spark R 2.1 QA umbrella

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-18329. --- Resolution: Fixed Assignee: Joseph K. Bradley Fix Version/s: 2.1.0 Re

[jira] [Updated] (SPARK-18332) SparkR 2.1 QA: Programming guide, migration guide, vignettes updates

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18332: -- Assignee: Xiangrui Meng > SparkR 2.1 QA: Programming guide, migration guide, vignettes

[jira] [Resolved] (SPARK-18332) SparkR 2.1 QA: Programming guide, migration guide, vignettes updates

2016-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-18332. --- Resolution: Fixed Fix Version/s: 2.1.0 I'm going to declare victory on this is

[jira] [Closed] (SPARK-18783) ML StringIndexer does not work with nested fields

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-18783. - Resolution: Won't Fix > ML StringIndexer does not work with nested fields > -

[jira] [Commented] (SPARK-18783) ML StringIndexer does not work with nested fields

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15750057#comment-15750057 ] Joseph K. Bradley commented on SPARK-18783: --- I'd separate this into 2 issues: 1

[jira] [Commented] (SPARK-18795) SparkR vignette update: ksTest

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15750043#comment-15750043 ] Joseph K. Bradley commented on SPARK-18795: --- No problem, thanks for understandi

[jira] [Commented] (SPARK-18374) Incorrect words in StopWords/english.txt

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15750041#comment-15750041 ] Joseph K. Bradley commented on SPARK-18374: --- Oh nice, I didn't realize that was

[jira] [Updated] (SPARK-18849) Vignettes final checks for Spark 2.1

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18849: -- Target Version/s: 2.1.0 > Vignettes final checks for Spark 2.1 > --

[jira] [Updated] (SPARK-18865) SparkR vignettes MLP and LDA updates

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18865: -- Target Version/s: 2.1.0 (was: 2.1.1, 2.2.0) > SparkR vignettes MLP and LDA updates > -

[jira] [Updated] (SPARK-18865) SparkR vignettes MLP and LDA updates

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18865: -- Fix Version/s: 2.2.0 2.1.1 > SparkR vignettes MLP and LDA updates >

[jira] [Updated] (SPARK-18865) SparkR vignettes MLP and LDA updates

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18865: -- Issue Type: Documentation (was: Bug) > SparkR vignettes MLP and LDA updates >

[jira] [Resolved] (SPARK-18795) SparkR vignette update: ksTest

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-18795. --- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 Issue resolved

[jira] [Updated] (SPARK-18476) SparkR Logistic Regression should output original label

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18476: -- Affects Version/s: 2.1.0 > SparkR Logistic Regression should output original label > --

[jira] [Updated] (SPARK-18476) SparkR Logistic Regression should output original label

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18476: -- Summary: SparkR Logistic Regression should output original label (was: SparkR Logistic

[jira] [Updated] (SPARK-18612) Leaked broadcasted variable in LBFGS

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18612: -- Summary: Leaked broadcasted variable in LBFGS (was: Leaked broadcasted variable Mllib)

[jira] [Updated] (SPARK-18456) Use matrix abstraction for LogisticRegression coefficients during training

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18456: -- Summary: Use matrix abstraction for LogisticRegression coefficients during training (w

[jira] [Updated] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18813: -- Description: *PROPOSAL: This includes a proposal for the 2.2 roadmap process for MLlib.

[jira] [Commented] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749603#comment-15749603 ] Joseph K. Bradley commented on SPARK-18813: --- I added them to the description ab

[jira] [Commented] (SPARK-18795) SparkR vignette update: ksTest

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749450#comment-15749450 ] Joseph K. Bradley commented on SPARK-18795: --- But feel free to send an update la

[jira] [Assigned] (SPARK-18795) SparkR vignette update: ksTest

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-18795: - Assignee: Joseph K. Bradley (was: Miao Wang) > SparkR vignette update: ksTest >

[jira] [Commented] (SPARK-18795) SparkR vignette update: ksTest

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749447#comment-15749447 ] Joseph K. Bradley commented on SPARK-18795: --- [~wangmiao1981] I'm going to take

[jira] [Commented] (SPARK-18864) Changes of MLlib and SparkR behavior for 2.2

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749113#comment-15749113 ] Joseph K. Bradley commented on SPARK-18864: --- [SPARK-18374]: Change English stop

<    6   7   8   9   10   11   12   13   14   15   >