[jira] [Resolved] (SPARK-10231) Update @Since annotation for mllib.classification

2015-08-25 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-10231. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8421

[jira] [Resolved] (SPARK-10244) Update @Since annotation for mllib.util

2015-08-25 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-10244. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8430

[jira] [Resolved] (SPARK-10239) Update @Since annotation for mllib.pmml

2015-08-25 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-10239. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8430

[jira] [Updated] (SPARK-7780) The intercept in LogisticRegressionWithLBFGS should not be regularized

2015-08-18 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7780: --- Target Version/s: 1.6.0 (was: 1.5.0) The intercept in LogisticRegressionWithLBFGS should not be regularized

[jira] [Resolved] (SPARK-8916) Add @since tags to mllib.regression

2015-08-17 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-8916. Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7518

[jira] [Updated] (SPARK-9642) LinearRegression should supported weighted data

2015-08-06 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-9642: --- Labels: 1.6 (was: ) LinearRegression should supported weighted data

[jira] [Assigned] (SPARK-9612) Add instance weight support for GBTs

2015-08-04 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-9612: -- Assignee: DB Tsai Add instance weight support for GBTs

[jira] [Resolved] (SPARK-8522) Disable feature scaling in Linear and Logistic Regression

2015-08-04 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-8522. Resolution: Fixed Disable feature scaling in Linear and Logistic Regression

[jira] [Commented] (SPARK-6683) Handling feature scaling properly for GLMs

2015-08-02 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651269#comment-14651269 ] DB Tsai commented on SPARK-6683: I think we can close this one. They are addressed in

[jira] [Resolved] (SPARK-7518) Tests for comparing probability and prediction in binary logistic regression against R

2015-08-02 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-7518. Resolution: Won't Fix Tests for comparing probability and prediction in binary logistic regression

[jira] [Created] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2015-07-29 Thread DB Tsai (JIRA)
DB Tsai created SPARK-9442: -- Summary: java.lang.ArithmeticException: / by zero when reading Parquet Key: SPARK-9442 URL: https://issues.apache.org/jira/browse/SPARK-9442 Project: Spark Issue Type:

[jira] [Commented] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2015-07-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14646708#comment-14646708 ] DB Tsai commented on SPARK-9442: I will try to turn on the logging at the info level.

[jira] [Comment Edited] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2015-07-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14646705#comment-14646705 ] DB Tsai edited comment on SPARK-9442 at 7/29/15 8:28 PM: - Another

[jira] [Updated] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2015-07-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-9442: --- Description: I am counting how many records in my nested parquet file with this schema, {code} scala

[jira] [Updated] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2015-07-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-9442: --- Description: I am counting how many records in my nested parquet file with this schema, {code} scala

[jira] [Commented] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2015-07-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14646705#comment-14646705 ] DB Tsai commented on SPARK-9442: Another note: By explicitly looping through the data to

[jira] [Updated] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2015-07-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-9442: --- Description: I am counting how many records in my nested parquet file with this schema, {code} scala

[jira] [Updated] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet

2015-07-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-9442: --- Description: I am counting how many records in my nested parquet file with this schema, {code} scala

[jira] [Assigned] (SPARK-7685) Handle high imbalanced data and apply weights to different samples in Logistic Regression

2015-07-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-7685: -- Assignee: DB Tsai (was: Shuo Xiang) Handle high imbalanced data and apply weights to different

[jira] [Resolved] (SPARK-9204) Add default params test to linear regression

2015-07-20 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-9204. Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7553

[jira] [Resolved] (SPARK-8913) Follow-up on SPARK-8700. Cleanup the test

2015-07-09 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-8913. Resolution: Fixed Follow-up on SPARK-8700. Cleanup the test -

[jira] [Commented] (SPARK-8913) Follow-up on SPARK-8700. Cleanup the test

2015-07-09 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14621605#comment-14621605 ] DB Tsai commented on SPARK-8913: merged https://github.com/apache/spark/pull/7335

[jira] [Updated] (SPARK-8913) Follow-up on SPARK-8700. Cleanup the test

2015-07-09 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8913: --- Fix Version/s: 1.5.0 Follow-up on SPARK-8700. Cleanup the test -

[jira] [Resolved] (SPARK-8963) Improve Linear Regression tests to use Vectors

2015-07-09 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-8963. Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7327

[jira] [Updated] (SPARK-8963) Improve Linear Regression tests to use Vectors

2015-07-09 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8963: --- Assignee: holdenk Improve Linear Regression tests to use Vectors

[jira] [Resolved] (SPARK-8700) Disable feature scaling in Logistic Regression

2015-07-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-8700. Resolution: Fixed Issue resolved by pull request 7080 [https://github.com/apache/spark/pull/7080] Disable

[jira] [Assigned] (SPARK-7159) Support multiclass logistic regression in spark.ml

2015-07-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-7159: -- Assignee: DB Tsai Support multiclass logistic regression in spark.ml

[jira] [Created] (SPARK-8912) Follow-up on SPARK-8700. Documentation and cleanup the test

2015-07-08 Thread DB Tsai (JIRA)
DB Tsai created SPARK-8912: -- Summary: Follow-up on SPARK-8700. Documentation and cleanup the test Key: SPARK-8912 URL: https://issues.apache.org/jira/browse/SPARK-8912 Project: Spark Issue Type:

[jira] [Created] (SPARK-8913) Follow-up on SPARK-8700. Cleanup the test

2015-07-08 Thread DB Tsai (JIRA)
DB Tsai created SPARK-8913: -- Summary: Follow-up on SPARK-8700. Cleanup the test Key: SPARK-8913 URL: https://issues.apache.org/jira/browse/SPARK-8913 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-8912) Follow-up on SPARK-8700. Documentation

2015-07-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8912: --- Description: We need to clearly document the effectively objective function when `standardization` is

[jira] [Updated] (SPARK-8912) Follow-up on SPARK-8700. Documentation

2015-07-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8912: --- Summary: Follow-up on SPARK-8700. Documentation (was: Follow-up on SPARK-8700. Documentation and cleanup the

[jira] [Updated] (SPARK-8913) Follow-up on SPARK-8700. Cleanup the test

2015-07-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8913: --- Assignee: holdenk Follow-up on SPARK-8700. Cleanup the test -

[jira] [Updated] (SPARK-8912) Documentation on the effective objective function in LoR when `standardization` is true/false

2015-07-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8912: --- Summary: Documentation on the effective objective function in LoR when `standardization` is true/false (was:

[jira] [Updated] (SPARK-8912) Follow-up on SPARK-8700. Documentation

2015-07-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8912: --- Description: Follow-up on SPARK-8700. Documentation We need to clearly document the effectively objective

[jira] [Updated] (SPARK-8912) Documentation on the effective objective function in LoR when `standardization` is true/false

2015-07-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8912: --- Description: This is a follow-up PR to https://issues.apache.org/jira/browse/SPARK-8700 which will document

[jira] [Commented] (SPARK-8849) expose multiclass scores in LogisticRegression

2015-07-06 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615988#comment-14615988 ] DB Tsai commented on SPARK-8849: I hope that we can port MLOR to ML pipeline framework in

[jira] [Closed] (SPARK-2505) Weighted Regularizer

2015-06-30 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-2505. -- Resolution: Won't Fix Weighted Regularizer Key: SPARK-2505

[jira] [Resolved] (SPARK-8551) Python example code for elastic net

2015-06-30 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-8551. Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6946

[jira] [Updated] (SPARK-8551) Python example code for elastic net

2015-06-30 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8551: --- Assignee: Shuo Xiang Python example code for elastic net ---

[jira] [Created] (SPARK-8700) Disable feature scaling in Logistic Regression

2015-06-29 Thread DB Tsai (JIRA)
DB Tsai created SPARK-8700: -- Summary: Disable feature scaling in Logistic Regression Key: SPARK-8700 URL: https://issues.apache.org/jira/browse/SPARK-8700 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-8700) Disable feature scaling in Logistic Regression

2015-06-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-8700: -- Assignee: DB Tsai Disable feature scaling in Logistic Regression

[jira] [Assigned] (SPARK-8522) Disable feature scaling in Linear and Logistic Regression

2015-06-26 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-8522: -- Assignee: DB Tsai (was: holdenk) Disable feature scaling in Linear and Logistic Regression

[jira] [Resolved] (SPARK-8613) Add a param for disabling of feature scaling, default to true

2015-06-26 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-8613. Resolution: Fixed Assignee: holdenk Fix Version/s: 1.5.0 Target Version/s:

[jira] [Issue Comment Deleted] (SPARK-8522) Disable feature scaling in Linear and Logistic Regression

2015-06-26 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8522: --- Comment: was deleted (was: Issue resolved by pull request 7024 [https://github.com/apache/spark/pull/7024])

[jira] [Updated] (SPARK-8601) Disable feature scaling in Linear Regression

2015-06-26 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8601: --- Assignee: holdenk Disable feature scaling in Linear Regression

[jira] [Resolved] (SPARK-8522) Disable feature scaling in Linear and Logistic Regression

2015-06-26 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-8522. Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7024

[jira] [Reopened] (SPARK-8522) Disable feature scaling in Linear and Logistic Regression

2015-06-26 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reopened SPARK-8522: Disable feature scaling in Linear and Logistic Regression

[jira] [Closed] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-06-23 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-7888. -- Resolution: Done Fix Version/s: 1.5.0 Target Version/s: 1.5.0 Be able to disable intercept in

[jira] [Commented] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-06-23 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14598282#comment-14598282 ] DB Tsai commented on SPARK-7888: merged into master.

[jira] [Commented] (SPARK-7674) R-like stats for ML models

2015-06-22 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14596751#comment-14596751 ] DB Tsai commented on SPARK-7674: I mean should we have `trait` for shared properties like

[jira] [Commented] (SPARK-7674) R-like stats for ML models

2015-06-22 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14596920#comment-14596920 ] DB Tsai commented on SPARK-7674: sounds fair. How about the `estimate std error`, `t

[jira] [Commented] (SPARK-7674) R-like stats for ML models

2015-06-22 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14596923#comment-14596923 ] DB Tsai commented on SPARK-7674: Also, will be nice to have like training confusion matrix

[jira] [Updated] (SPARK-8522) Disable feature scaling in Linear and Logistic Regression

2015-06-22 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-8522: --- Assignee: holdenk Disable feature scaling in Linear and Logistic Regression

[jira] [Created] (SPARK-8522) Disable feature scaling in Linear and Logistic Regression

2015-06-22 Thread DB Tsai (JIRA)
DB Tsai created SPARK-8522: -- Summary: Disable feature scaling in Linear and Logistic Regression Key: SPARK-8522 URL: https://issues.apache.org/jira/browse/SPARK-8522 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-7456) Perf test for linear regression and logistic regression with elastic-net

2015-06-18 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14592645#comment-14592645 ] DB Tsai commented on SPARK-7456: PR was already submitted.

[jira] [Commented] (SPARK-7456) Perf test for linear regression and logistic regression with elastic-net

2015-06-18 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14592649#comment-14592649 ] DB Tsai commented on SPARK-7456: BTW, [~holdenk] did a benchmark comparing the

[jira] [Commented] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-06-17 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590380#comment-14590380 ] DB Tsai commented on SPARK-7888: Last night, I figured out how to do this. If you look at

[jira] [Commented] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-06-17 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590713#comment-14590713 ] DB Tsai commented on SPARK-7888: Yeah, we don't re-center but still scaling to unit

[jira] [Commented] (SPARK-7674) R-like stats for ML models

2015-06-16 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14588925#comment-14588925 ] DB Tsai commented on SPARK-7674: How do we store those statistical metadata in model

[jira] [Commented] (SPARK-7674) R-like stats for ML models

2015-06-16 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14589261#comment-14589261 ] DB Tsai commented on SPARK-7674: but lots of them share the same information. For example,

[jira] [Updated] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-06-15 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7888: --- Assignee: holdenk Be able to disable intercept in Linear Regression in ML package

[jira] [Updated] (SPARK-7685) Handle high imbalanced data and apply weights to different samples in Logistic Regression

2015-06-15 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7685: --- Assignee: Shuo Xiang Handle high imbalanced data and apply weights to different samples in Logistic

[jira] [Updated] (SPARK-7555) User guide update for ElasticNet

2015-06-15 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7555: --- Assignee: Shuo Xiang (was: DB Tsai) User guide update for ElasticNet

[jira] [Resolved] (SPARK-8314) improvement in performance of MLUtils.appendBias

2015-06-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-8314. Resolution: Fixed Merged into master

[jira] [Closed] (SPARK-8168) Add Python friendly constructor to PipelineModel

2015-06-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-8168. -- Resolution: Fixed Issue resolved by pull request https://github.com/apache/spark/pull/6709 Add Python

[jira] [Commented] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-06-04 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14573820#comment-14573820 ] DB Tsai commented on SPARK-7008: Do you see better convergence rate when LBFGS is used?

[jira] [Commented] (SPARK-7547) Example code for ElasticNet

2015-06-01 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14568498#comment-14568498 ] DB Tsai commented on SPARK-7547: Pong! Sorry for the delay. Was a busy weekend. Will send

[jira] [Created] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-05-27 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7888: -- Summary: Be able to disable intercept in Linear Regression in ML package Key: SPARK-7888 URL: https://issues.apache.org/jira/browse/SPARK-7888 Project: Spark Issue

[jira] [Updated] (SPARK-7852) Set the initial weights based on the previous when GLMs are run with multiple regParams

2015-05-27 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7852: --- Issue Type: New Feature (was: Bug) Set the initial weights based on the previous when GLMs are run with

[jira] [Commented] (SPARK-7888) Be able to disable intercept in Linear Regression in ML package

2015-05-27 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14562148#comment-14562148 ] DB Tsai commented on SPARK-7888: Sounds great. This requires some math to understand how R

[jira] [Commented] (SPARK-7547) Example code for ElasticNet

2015-05-26 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560061#comment-14560061 ] DB Tsai commented on SPARK-7547: Sorry for delay. I can write Scala/Java example code by

[jira] [Commented] (SPARK-7780) The intercept in LogisticRegressionWithLBFGS should not be regularized

2015-05-22 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14556828#comment-14556828 ] DB Tsai commented on SPARK-7780: ++1 Although I don't think there are many users

[jira] [Commented] (SPARK-7674) R-like stats for ML models

2015-05-22 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14556913#comment-14556913 ] DB Tsai commented on SPARK-7674: I implemented the stats for ML models when I was Alpine,

[jira] [Commented] (SPARK-7780) The intercept in LogisticRegressionWithLBFGS should not be regularized

2015-05-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14555156#comment-14555156 ] DB Tsai commented on SPARK-7780: @holdenk Sure. This one will be fun to work on. If it's

[jira] [Commented] (SPARK-7555) User guide update for ElasticNet

2015-05-20 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14553542#comment-14553542 ] DB Tsai commented on SPARK-7555: [~coderxiang] told me he will chime in and help out the

[jira] [Created] (SPARK-7780) The intercept in LogisticRegressionWithLBFGS should not be regularized

2015-05-20 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7780: -- Summary: The intercept in LogisticRegressionWithLBFGS should not be regularized Key: SPARK-7780 URL: https://issues.apache.org/jira/browse/SPARK-7780 Project: Spark

[jira] [Created] (SPARK-7685) Handle high imbalanced data or apply weights to different samples in Logistic Regression

2015-05-16 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7685: -- Summary: Handle high imbalanced data or apply weights to different samples in Logistic Regression Key: SPARK-7685 URL: https://issues.apache.org/jira/browse/SPARK-7685 Project:

[jira] [Updated] (SPARK-7685) Handle high imbalanced data and apply weights to different samples in Logistic Regression

2015-05-16 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7685: --- Summary: Handle high imbalanced data and apply weights to different samples in Logistic Regression (was:

[jira] [Created] (SPARK-7620) Removed calling size, length in while condition to avoid extra JVM call

2015-05-13 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7620: -- Summary: Removed calling size, length in while condition to avoid extra JVM call Key: SPARK-7620 URL: https://issues.apache.org/jira/browse/SPARK-7620 Project: Spark

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-13 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541416#comment-14541416 ] DB Tsai edited comment on SPARK-7568 at 5/13/15 6:06 AM: -

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-13 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541416#comment-14541416 ] DB Tsai edited comment on SPARK-7568 at 5/13/15 6:05 AM: -

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-13 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541416#comment-14541416 ] DB Tsai commented on SPARK-7568: `fitIntercept = false`, or in Spark 1.3, the training

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541332#comment-14541332 ] DB Tsai commented on SPARK-7568: Well, the third example is 0.0 in the old code. ``` (4,

[jira] [Issue Comment Deleted] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7568: --- Comment: was deleted (was: Oh... my bad. I guess you are referring the third example in the training set.

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541374#comment-14541374 ] DB Tsai commented on SPARK-7568: In 1.3,

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541374#comment-14541374 ] DB Tsai edited comment on SPARK-7568 at 5/13/15 5:37 AM: - In 1.3,

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541385#comment-14541385 ] DB Tsai commented on SPARK-7568: Default for R is true. ml.LogisticRegression doesn't

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541385#comment-14541385 ] DB Tsai edited comment on SPARK-7568 at 5/13/15 5:43 AM: - Default

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541352#comment-14541352 ] DB Tsai commented on SPARK-7568: Actually, with lambda = 0.001, the training accuracy is

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541301#comment-14541301 ] DB Tsai commented on SPARK-7568: This is because we regularize the intercept before which

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541334#comment-14541334 ] DB Tsai commented on SPARK-7568: Oh... my bad. I guess you are referring the third example

[jira] [Commented] (SPARK-7547) ElasticNet example code

2015-05-11 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14539216#comment-14539216 ] DB Tsai commented on SPARK-7547: Sure! I'll add the example code and documentation in this

[jira] [Created] (SPARK-7518) Tests for comparing probability and prediction in binary logistic regression against R

2015-05-10 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7518: -- Summary: Tests for comparing probability and prediction in binary logistic regression against R Key: SPARK-7518 URL: https://issues.apache.org/jira/browse/SPARK-7518 Project:

[jira] [Updated] (SPARK-7262) Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package

2015-05-06 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7262: --- Summary: Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package (was:

[jira] [Updated] (SPARK-7262) Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package

2015-05-06 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7262: --- Description: 1) Handle scaling and addBias internally. 2) L1/L2 elasticnet using OWLQN optimizer. was: 1)

[jira] [Created] (SPARK-7279) Removed diffSum which is theoretical zero in LinearRegression and coding formating

2015-04-30 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7279: -- Summary: Removed diffSum which is theoretical zero in LinearRegression and coding formating Key: SPARK-7279 URL: https://issues.apache.org/jira/browse/SPARK-7279 Project: Spark

[jira] [Updated] (SPARK-7222) Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7222: --- Description: Added detailed mathematical derivation of how scaling and LeastSquaresAggregator work. Also

[jira] [Created] (SPARK-7222) Added mathematical derivation in comment to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7222: -- Summary: Added mathematical derivation in comment to LinearRegression with ElasticNet Key: SPARK-7222 URL: https://issues.apache.org/jira/browse/SPARK-7222 Project: Spark

[jira] [Updated] (SPARK-7222) Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7222: --- Summary: Added mathematical derivation in comment and compressed the model to LinearRegression with

[jira] [Updated] (SPARK-7222) Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7222: --- Issue Type: Improvement (was: Documentation) Added mathematical derivation in comment and compressed the

<    1   2   3   4   5   6   7   >