[jira] [Commented] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-09-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15526510#comment-15526510 ] Yanbo Liang commented on SPARK-17692: - cc [~mengxr] [~josephkb] [~dbtsai] [~mlnick] [~srowen] >

[jira] [Updated] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-09-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17692: Description: This JIRA records behavior changes of ML/MLlib between 2.0 and 2.1, so we can note

[jira] [Updated] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-09-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17692: Description: This JIRA records behavior changes of ML/MLlib between 2.0 and 2.1, so we can note

[jira] [Updated] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-09-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17692: Description: This JIRA records behavior changes of ML/MLlib between 2.0 and 2.1, so we can note

[jira] [Created] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-09-27 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17692: --- Summary: Document ML/MLlib behavior changes in Spark 2.1 Key: SPARK-17692 URL: https://issues.apache.org/jira/browse/SPARK-17692 Project: Spark Issue Type:

[jira] [Updated] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-09-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17692: Description: This JIRA keeps a list of MLlib behavior changes in Spark 2.1. So we can remember to

[jira] [Resolved] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17428. - Resolution: Done Assignee: Yanbo Liang > SparkR executors/workers support virtualenv >

[jira] [Resolved] (SPARK-16356) Add testImplicits for ML unit tests and promote toDF()

2016-09-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-16356. - Resolution: Fixed Fix Version/s: 2.1.0 > Add testImplicits for ML unit tests and promote

[jira] [Resolved] (SPARK-17281) Add treeAggregateDepth parameter for AFTSurvivalRegression

2016-09-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17281. - Resolution: Fixed Fix Version/s: 2.1.0 > Add treeAggregateDepth parameter for

[jira] [Updated] (SPARK-17281) Add treeAggregateDepth parameter for AFTSurvivalRegression

2016-09-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17281: Assignee: Weichen Xu > Add treeAggregateDepth parameter for AFTSurvivalRegression >

[jira] [Updated] (SPARK-16356) Add testImplicits for ML unit tests and promote toDF()

2016-09-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-16356: Shepherd: Yanbo Liang > Add testImplicits for ML unit tests and promote toDF() >

[jira] [Updated] (SPARK-16356) Add testImplicits for ML unit tests and promote toDF()

2016-09-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-16356: Assignee: Hyukjin Kwon > Add testImplicits for ML unit tests and promote toDF() >

[jira] [Comment Edited] (SPARK-14709) spark.ml API for linear SVM

2016-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15512085#comment-15512085 ] Yanbo Liang edited comment on SPARK-14709 at 9/22/16 4:27 AM: -- [~yuhaoyan]

[jira] [Commented] (SPARK-14709) spark.ml API for linear SVM

2016-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15512085#comment-15512085 ] Yanbo Liang commented on SPARK-14709: - [~yuhaoyan] Any update about this? I think providing

[jira] [Resolved] (SPARK-17577) SparkR support add files to Spark job and get by executors

2016-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17577. - Resolution: Fixed Assignee: Yanbo Liang Fix Version/s: 2.1.0 > SparkR support

[jira] [Assigned] (SPARK-17585) PySpark SparkContext.addFile supports adding files recursively

2016-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-17585: --- Assignee: Yanbo Liang > PySpark SparkContext.addFile supports adding files recursively >

[jira] [Resolved] (SPARK-17585) PySpark SparkContext.addFile supports adding files recursively

2016-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17585. - Resolution: Fixed Fix Version/s: 2.1.0 > PySpark SparkContext.addFile supports adding

[jira] [Commented] (SPARK-17588) java.lang.AssertionError: assertion failed: lapack.dppsv returned 105. when running glm using gaussian link function.

2016-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15509001#comment-15509001 ] Yanbo Liang commented on SPARK-17588: - [~sowen] See my comments at SPARK-11918. Thanks. >

[jira] [Commented] (SPARK-11918) WLS can not resolve some kinds of equation

2016-09-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15508996#comment-15508996 ] Yanbo Liang commented on SPARK-11918: - Cholesky decomposition is unstable (for near-singular and rank

[jira] [Updated] (SPARK-17585) PySpark SparkContext.addFile supports adding files recursively

2016-09-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17585: Description: Users would like to add a directory as dependency in some cases, they can use

[jira] [Updated] (SPARK-17585) PySpark SparkContext.addFile supports adding files recursively

2016-09-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17585: Description: PySpark {{SparkContext.addFile}} should support adding files recursively under a

[jira] [Updated] (SPARK-17577) SparkR support add files to Spark job and get by executors

2016-09-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17577: Description: Scala/Python users can add files to Spark job by submit options {{--files}} or

[jira] [Updated] (SPARK-17577) SparkR support add files to Spark job and get by executors

2016-09-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17577: Description: Scala/Python users can add files to Spark job by submit options {{--files}} or

[jira] [Updated] (SPARK-17585) PySpark SparkContext.addFile supports adding files recursively

2016-09-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17585: Component/s: Spark Core > PySpark SparkContext.addFile supports adding files recursively >

[jira] [Created] (SPARK-17585) PySpark SparkContext.addFile supports adding files recursively

2016-09-18 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17585: --- Summary: PySpark SparkContext.addFile supports adding files recursively Key: SPARK-17585 URL: https://issues.apache.org/jira/browse/SPARK-17585 Project: Spark

[jira] [Created] (SPARK-17577) SparkR support add files to Spark job and get by executors

2016-09-17 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17577: --- Summary: SparkR support add files to Spark job and get by executors Key: SPARK-17577 URL: https://issues.apache.org/jira/browse/SPARK-17577 Project: Spark

[jira] [Commented] (SPARK-17471) Add compressed method for Matrix class

2016-09-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487035#comment-15487035 ] Yanbo Liang commented on SPARK-17471: - [~sethah] I'm sorry that I have some emergent affairs to deal

[jira] [Comment Edited] (SPARK-17471) Add compressed method for Matrix class

2016-09-09 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15477376#comment-15477376 ] Yanbo Liang edited comment on SPARK-17471 at 9/9/16 3:46 PM: - [~sethah] I

[jira] [Commented] (SPARK-17471) Add compressed method for Matrix class

2016-09-09 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15477376#comment-15477376 ] Yanbo Liang commented on SPARK-17471: - [~sethah] I think this task is duplicated with SPARK-17137

[jira] [Comment Edited] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-09 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15477150#comment-15477150 ] Yanbo Liang edited comment on SPARK-17428 at 9/9/16 2:14 PM: - Yeah, I agree

[jira] [Commented] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-09 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15477150#comment-15477150 ] Yanbo Liang commented on SPARK-17428: - Yeah, I agree to start with something simple and iterate

[jira] [Resolved] (SPARK-17464) SparkR spark.als arguments reg should be 0.1 by default

2016-09-09 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17464. - Resolution: Fixed Assignee: Yanbo Liang Fix Version/s: 2.1.0 > SparkR spark.als

[jira] [Resolved] (SPARK-17456) Utility for parsing Spark versions

2016-09-09 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17456. - Resolution: Fixed Fix Version/s: 2.1.0 > Utility for parsing Spark versions >

[jira] [Created] (SPARK-17464) SparkR spark.als arguments reg should be 0.1 by default

2016-09-08 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17464: --- Summary: SparkR spark.als arguments reg should be 0.1 by default Key: SPARK-17464 URL: https://issues.apache.org/jira/browse/SPARK-17464 Project: Spark Issue

[jira] [Comment Edited] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-08 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15473643#comment-15473643 ] Yanbo Liang edited comment on SPARK-17428 at 9/8/16 11:46 AM: -- [~sunrui]

[jira] [Comment Edited] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-08 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15473643#comment-15473643 ] Yanbo Liang edited comment on SPARK-17428 at 9/8/16 11:45 AM: -- [~sunrui]

[jira] [Commented] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-08 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15473643#comment-15473643 ] Yanbo Liang commented on SPARK-17428: - [~sunrui] [~shivaram] [~felixcheung] Thanks for your reply.

[jira] [Comment Edited] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15469736#comment-15469736 ] Yanbo Liang edited comment on SPARK-17428 at 9/7/16 6:40 AM: - cc [~shivaram]

[jira] [Commented] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15469736#comment-15469736 ] Yanbo Liang commented on SPARK-17428: - cc [~shivaram] [~felixcheung] > SparkR executors/workers

[jira] [Updated] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17428: Description: Many users have requirements to use third party R packages in executors/workers, but

[jira] [Created] (SPARK-17428) SparkR executors/workers support virtualenv

2016-09-07 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17428: --- Summary: SparkR executors/workers support virtualenv Key: SPARK-17428 URL: https://issues.apache.org/jira/browse/SPARK-17428 Project: Spark Issue Type: New

[jira] [Resolved] (SPARK-17197) PySpark LiR/LoR supports tree aggregation level configurable

2016-08-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17197. - Resolution: Fixed Assignee: Yanbo Liang Fix Version/s: 2.1.0 > PySpark LiR/LoR

[jira] [Resolved] (SPARK-14378) Review spark.ml parity for regression, except trees

2016-08-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-14378. - Resolution: Done > Review spark.ml parity for regression, except trees >

[jira] [Commented] (SPARK-14378) Review spark.ml parity for regression, except trees

2016-08-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436529#comment-15436529 ] Yanbo Liang commented on SPARK-14378: - Yes, I think we can resolve this as DONE. Thanks! > Review

[jira] [Issue Comment Deleted] (SPARK-8519) Blockify distance computation in k-means

2016-08-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-8519: --- Comment: was deleted (was: User 'yanboliang' has created a pull request for this issue:

[jira] [Updated] (SPARK-14381) Review spark.ml parity for feature transformers

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-14381: Fix Version/s: (was: 2.1.0) > Review spark.ml parity for feature transformers >

[jira] [Commented] (SPARK-14381) Review spark.ml parity for feature transformers

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436268#comment-15436268 ] Yanbo Liang commented on SPARK-14381: - Resolved this, thanks for working on it. > Review spark.ml

[jira] [Resolved] (SPARK-14381) Review spark.ml parity for feature transformers

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-14381. - Resolution: Done Assignee: Xusen Yin Fix Version/s: 2.1.0 > Review spark.ml

[jira] [Comment Edited] (SPARK-14378) Review spark.ml parity for regression, except trees

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436264#comment-15436264 ] Yanbo Liang edited comment on SPARK-14378 at 8/25/16 4:30 AM: -- *

[jira] [Comment Edited] (SPARK-14378) Review spark.ml parity for regression, except trees

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436264#comment-15436264 ] Yanbo Liang edited comment on SPARK-14378 at 8/25/16 4:29 AM: -- *

[jira] [Comment Edited] (SPARK-14378) Review spark.ml parity for regression, except trees

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436264#comment-15436264 ] Yanbo Liang edited comment on SPARK-14378 at 8/25/16 4:26 AM: -- *

[jira] [Assigned] (SPARK-14378) Review spark.ml parity for regression, except trees

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-14378: --- Assignee: Yanbo Liang > Review spark.ml parity for regression, except trees >

[jira] [Comment Edited] (SPARK-14378) Review spark.ml parity for regression, except trees

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436264#comment-15436264 ] Yanbo Liang edited comment on SPARK-14378 at 8/25/16 4:25 AM: -- *

[jira] [Commented] (SPARK-14378) Review spark.ml parity for regression, except trees

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436264#comment-15436264 ] Yanbo Liang commented on SPARK-14378: - *

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436191#comment-15436191 ] Yanbo Liang edited comment on SPARK-17163 at 8/25/16 3:22 AM: -- Exposing a

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436191#comment-15436191 ] Yanbo Liang edited comment on SPARK-17163 at 8/25/16 3:22 AM: -- Exposing a

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436191#comment-15436191 ] Yanbo Liang edited comment on SPARK-17163 at 8/25/16 3:14 AM: -- Exposing a

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434412#comment-15434412 ] Yanbo Liang edited comment on SPARK-17163 at 8/25/16 3:12 AM: -- I think it's

[jira] [Commented] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15436191#comment-15436191 ] Yanbo Liang commented on SPARK-17163: - Exposing a {{family}} or similar parameter to control pivoting

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434798#comment-15434798 ] Yanbo Liang edited comment on SPARK-17163 at 8/24/16 12:12 PM: --- Think more

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434412#comment-15434412 ] Yanbo Liang edited comment on SPARK-17163 at 8/24/16 12:11 PM: --- I think

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434412#comment-15434412 ] Yanbo Liang edited comment on SPARK-17163 at 8/24/16 12:10 PM: --- I think

[jira] [Commented] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434798#comment-15434798 ] Yanbo Liang commented on SPARK-17163: - Think more about this problem, I change my mind to support

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434412#comment-15434412 ] Yanbo Liang edited comment on SPARK-17163 at 8/24/16 7:54 AM: -- I think it's

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434412#comment-15434412 ] Yanbo Liang edited comment on SPARK-17163 at 8/24/16 7:52 AM: -- I think it's

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434412#comment-15434412 ] Yanbo Liang edited comment on SPARK-17163 at 8/24/16 7:50 AM: -- I think it's

[jira] [Comment Edited] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434412#comment-15434412 ] Yanbo Liang edited comment on SPARK-17163 at 8/24/16 7:49 AM: -- I think it's

[jira] [Commented] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15434412#comment-15434412 ] Yanbo Liang commented on SPARK-17163: - I think it's hard to unify binary and multinomial logistic

[jira] [Updated] (SPARK-17197) PySpark LiR/LoR supports tree aggregation level configurable

2016-08-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17197: Priority: Minor (was: Major) > PySpark LiR/LoR supports tree aggregation level configurable >

[jira] [Created] (SPARK-17197) PySpark LiR/LoR supports tree aggregation level configurable

2016-08-22 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17197: --- Summary: PySpark LiR/LoR supports tree aggregation level configurable Key: SPARK-17197 URL: https://issues.apache.org/jira/browse/SPARK-17197 Project: Spark

[jira] [Assigned] (SPARK-11215) Add multiple columns support to StringIndexer

2016-08-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-11215: --- Assignee: Yanbo Liang > Add multiple columns support to StringIndexer >

[jira] [Commented] (SPARK-17169) To use scala macros to update code when SharedParamsCodeGen.scala changed

2016-08-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15430282#comment-15430282 ] Yanbo Liang commented on SPARK-17169: - Meanwhile, it's better we can do compile time code-gen for

[jira] [Commented] (SPARK-17086) QuantileDiscretizer throws InvalidArgumentException (parameter splits given invalid value) on valid data

2016-08-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15430009#comment-15430009 ] Yanbo Liang commented on SPARK-17086: - We should not throw exception in this case. If the number of

[jira] [Resolved] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-20 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-15018. - Resolution: Fixed Fix Version/s: 2.1.0 > PySpark ML Pipeline raises unclear error when no

[jira] [Commented] (SPARK-17138) Python API for multinomial logistic regression

2016-08-20 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429258#comment-15429258 ] Yanbo Liang commented on SPARK-17138: - [~WeichenXu123] Please hold on this task, since SPARK-17163

[jira] [Commented] (SPARK-17137) Add compressed support for multinomial logistic regression coefficients

2016-08-20 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429255#comment-15429255 ] Yanbo Liang commented on SPARK-17137: - Yes, I will do some performance test to weigh the trade-off.

[jira] [Commented] (SPARK-17136) Design optimizer interface for ML algorithms

2016-08-20 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429253#comment-15429253 ] Yanbo Liang commented on SPARK-17136: - Yes, only first order optimizer can scale well in number of

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429196#comment-15429196 ] Yanbo Liang commented on SPARK-17134: - [~qhuang] Please feel free to take this task and do the

[jira] [Resolved] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17141. - Resolution: Fixed Fix Version/s: 2.1.0 > MinMaxScaler behaves weird when min and max have

[jira] [Assigned] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-17141: --- Assignee: Yanbo Liang > MinMaxScaler behaves weird when min and max have the same value and

[jira] [Commented] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427860#comment-15427860 ] Yanbo Liang commented on SPARK-17141: - In the existing code, {{MinMaxScaler}} handle NaN value

[jira] [Comment Edited] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427860#comment-15427860 ] Yanbo Liang edited comment on SPARK-17141 at 8/19/16 9:01 AM: -- In the

[jira] [Updated] (SPARK-17141) MinMaxScaler behaves weird when min and max have the same value and some values are NaN

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17141: Priority: Minor (was: Trivial) > MinMaxScaler behaves weird when min and max have the same value

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline fails when no stages set

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15018: Shepherd: Yanbo Liang Assignee: Bryan Cutler > PySpark ML Pipeline fails when no stages set >

[jira] [Commented] (SPARK-17137) Add compressed support for multinomial logistic regression coefficients

2016-08-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427563#comment-15427563 ] Yanbo Liang commented on SPARK-17137: - I think we should provide transparent interface to users

[jira] [Commented] (SPARK-17136) Design optimizer interface for ML algorithms

2016-08-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427539#comment-15427539 ] Yanbo Liang commented on SPARK-17136: - I would like to know that users' own optimizers have some

[jira] [Comment Edited] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427529#comment-15427529 ] Yanbo Liang edited comment on SPARK-17134 at 8/19/16 3:04 AM: -- This is

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15427529#comment-15427529 ] Yanbo Liang commented on SPARK-17134: - This is interesting. We also trying to use BLAS to accelerate

[jira] [Comment Edited] (SPARK-17086) QuantileDiscretizer throws InvalidArgumentException (parameter splits given invalid value) on valid data

2016-08-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15426254#comment-15426254 ] Yanbo Liang edited comment on SPARK-17086 at 8/18/16 10:49 AM: --- [~sowen]

[jira] [Commented] (SPARK-17086) QuantileDiscretizer throws InvalidArgumentException (parameter splits given invalid value) on valid data

2016-08-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15426254#comment-15426254 ] Yanbo Liang commented on SPARK-17086: - [~sowen] The bucket defined by [1.0, 1.0) will only receive

[jira] [Commented] (SPARK-17086) QuantileDiscretizer throws InvalidArgumentException (parameter splits given invalid value) on valid data

2016-08-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15425949#comment-15425949 ] Yanbo Liang commented on SPARK-17086: - If the number of distinct input data is less than

[jira] [Commented] (SPARK-17090) Make tree aggregation level in linear/logistic regression configurable

2016-08-17 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15425913#comment-15425913 ] Yanbo Liang commented on SPARK-17090: - Making aggregation depth configurable is necessary when

[jira] [Commented] (SPARK-16993) model.transform without label column in random forest regression

2016-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15422384#comment-15422384 ] Yanbo Liang commented on SPARK-16993: - [~dulajrajitha] I can not reproduce your reported issue, the

[jira] [Commented] (SPARK-17048) ML model read for custom transformers in a pipeline does not work

2016-08-16 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15422365#comment-15422365 ] Yanbo Liang commented on SPARK-17048: - [~taras.matyashov...@gmail.com] Would you mind to share your

[jira] [Resolved] (SPARK-17033) GaussianMixture should use treeAggregate to improve performance

2016-08-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17033. - Resolution: Fixed Assignee: Yanbo Liang Fix Version/s: 2.1.0 > GaussianMixture

[jira] [Resolved] (SPARK-16934) Update LogisticCostAggregator serialization code to make it consistent with LinearRegression

2016-08-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-16934. - Resolution: Fixed Assignee: Weichen Xu Fix Version/s: 2.1.0 Target

[jira] [Updated] (SPARK-17033) GaussianMixture should use treeAggregate to improve performance

2016-08-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17033: Description: {{GaussianMixture}} should use {{treeAggregate}} rather than {{aggregate}} to improve

[jira] [Updated] (SPARK-17033) GaussianMixture should use treeAggregate to improve performance

2016-08-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17033: Description: {{GaussianMixture}} should use {{treeAggregate}} rather than {{aggregate}} to improve

[jira] [Updated] (SPARK-17033) GaussianMixture should use treeAggregate to improve performance

2016-08-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17033: Description: {{GaussianMixture}} should use {{treeAggregate}} rather than {{aggregate}} to improve

[jira] [Created] (SPARK-17033) GaussianMixture should use treeAggregate to improve performance

2016-08-12 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17033: --- Summary: GaussianMixture should use treeAggregate to improve performance Key: SPARK-17033 URL: https://issues.apache.org/jira/browse/SPARK-17033 Project: Spark

<    1   2   3   4   5   6   7   8   9   10   >