[jira] [Updated] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2016-11-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15784: Shepherd: Yanbo Liang Assignee: Miao Wang > Add Power Iteration Clustering to spark.ml >

[jira] [Resolved] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-11-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18291. - Resolution: Fixed Assignee: Yanbo Liang Fix Version/s: 2.1.0 Target

[jira] [Resolved] (SPARK-18210) Pipeline.copy does not create an instance with the same UID

2016-11-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18210. - Resolution: Fixed Assignee: Wojciech Szymanski Fix Version/s: 2.1.0 >

[jira] [Updated] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-11-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18291: Description: SparkR spark.glm predict should output original label when family = "binomial". For

[jira] [Updated] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-11-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18291: Description: SparkR spark.glm predict should output original label when family = "binomial". For

[jira] [Updated] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-11-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18291: Description: SparkR glm predict should output original label when family = "binomial". For

[jira] [Updated] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-11-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18291: Description: SparkR glm predict should output original label when family = "binomial". For

[jira] [Updated] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-11-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18291: Description: SparkR glm predict should output original label when family = "binomial".

[jira] [Created] (SPARK-18291) SparkR glm predict should output original label when family = "binomial"

2016-11-05 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-18291: --- Summary: SparkR glm predict should output original label when family = "binomial" Key: SPARK-18291 URL: https://issues.apache.org/jira/browse/SPARK-18291 Project:

[jira] [Resolved] (SPARK-18276) Some ML training summaries are not copied when {{copy()}} is called.

2016-11-05 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18276. - Resolution: Fixed Assignee: Seth Hendrickson Fix Version/s: 2.1.0 > Some ML

[jira] [Created] (SPARK-18286) Add Scala/Java/Python examples for MinHash and RandomProjection

2016-11-05 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-18286: --- Summary: Add Scala/Java/Python examples for MinHash and RandomProjection Key: SPARK-18286 URL: https://issues.apache.org/jira/browse/SPARK-18286 Project: Spark

[jira] [Assigned] (SPARK-18080) Locality Sensitive Hashing (LSH) Python API

2016-11-04 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-18080: --- Assignee: Yanbo Liang > Locality Sensitive Hashing (LSH) Python API >

[jira] [Updated] (SPARK-18080) Locality Sensitive Hashing (LSH) Python API

2016-11-04 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18080: Shepherd: Joseph K. Bradley > Locality Sensitive Hashing (LSH) Python API >

[jira] [Updated] (SPARK-18218) Optimize BlockMatrix multiplication, which may cause OOM and low parallelism usage problem in several cases

2016-11-03 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18218: Shepherd: Yanbo Liang > Optimize BlockMatrix multiplication, which may cause OOM and low

[jira] [Updated] (SPARK-18210) Pipeline.copy does not create an instance with the same UID

2016-11-03 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18210: Shepherd: Yanbo Liang > Pipeline.copy does not create an instance with the same UID >

[jira] [Commented] (SPARK-18210) Pipeline.copy does not create an instance with the same UID

2016-11-03 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15632990#comment-15632990 ] Yanbo Liang commented on SPARK-18210: - This make sense, please feel free to send a PR. Thanks. >

[jira] [Resolved] (SPARK-18177) Add missing 'subsamplingRate' of pyspark GBTClassifier

2016-11-03 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18177. - Resolution: Fixed Fix Version/s: 2.1.0 > Add missing 'subsamplingRate' of pyspark

[jira] [Comment Edited] (SPARK-18080) Locality Sensitive Hashing (LSH) Python API

2016-11-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15628449#comment-15628449 ] Yanbo Liang edited comment on SPARK-18080 at 11/2/16 10:06 AM: --- Since

[jira] [Commented] (SPARK-18080) Locality Sensitive Hashing (LSH) Python API

2016-11-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15628449#comment-15628449 ] Yanbo Liang commented on SPARK-18080: - Since SPARK-5992 has been merged, it's better we can make the

[jira] [Updated] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-11-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17692: Description: This JIRA records behavior changes of ML/MLlib between 2.0 and 2.1, so we can note

[jira] [Updated] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2016-11-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-3181: --- Target Version/s: 2.2.0 > Add Robust Regression Algorithm with Huber Estimator >

[jira] [Commented] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2016-11-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15628380#comment-15628380 ] Yanbo Liang commented on SPARK-3181: [~josephkb] Thanks for retargeting. This task was blocked by a

[jira] [Comment Edited] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2016-11-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15628347#comment-15628347 ] Yanbo Liang edited comment on SPARK-15784 at 11/2/16 9:32 AM: -- I'm prefer to

[jira] [Commented] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2016-11-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15628347#comment-15628347 ] Yanbo Liang commented on SPARK-15784: - I'm prefer to #1 and #3, but it looks like we can achieve both

[jira] [Commented] (SPARK-16000) Make model loading backward compatible with saved models using old vector columns

2016-11-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15628001#comment-15628001 ] Yanbo Liang commented on SPARK-16000: - [~josephkb] Yep, all the sub tasks complete. We can close this

[jira] [Resolved] (SPARK-18133) Python ML Pipeline Example has syntax errors

2016-10-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18133. - Resolution: Fixed Assignee: Jagadeesan A S Fix Version/s: 2.1.0 > Python ML

[jira] [Resolved] (SPARK-18109) Log instrumentation in GMM

2016-10-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18109. - Resolution: Fixed Fix Version/s: 2.1.0 > Log instrumentation in GMM >

[jira] [Commented] (SPARK-18088) ChiSqSelector FPR PR cleanups

2016-10-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15608643#comment-15608643 ] Yanbo Liang commented on SPARK-18088: - +1 [~peng.m...@intel.com] I vote to change param

[jira] [Updated] (SPARK-15819) Add KMeanSummary in KMeans of PySpark

2016-10-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15819: Shepherd: Yanbo Liang Assignee: Jeff Zhang > Add KMeanSummary in KMeans of PySpark >

[jira] [Resolved] (SPARK-17748) One-pass algorithm for linear regression with L1 and elastic-net penalties

2016-10-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17748. - Resolution: Fixed Fix Version/s: 2.1.0 Target Version/s: 2.1.0 > One-pass

[jira] [Assigned] (SPARK-17847) Reduce shuffled data size of GaussianMixture & copy the implementation from mllib to ml

2016-10-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-17847: --- Assignee: Yanbo Liang > Reduce shuffled data size of GaussianMixture & copy the

[jira] [Updated] (SPARK-17847) Reduce shuffled data size of GaussianMixture & copy the implementation from mllib to ml

2016-10-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17847: Description: Copy {{GaussianMixture}} implementation from mllib to ml, then we can add new

[jira] [Updated] (SPARK-17847) Copy GaussianMixture implementation from mllib to ml

2016-10-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17847: Description: Copy {{GaussianMixture}} implementation from mllib to ml, then we can add new

[jira] [Updated] (SPARK-17847) Reduce shuffled data size of GaussianMixture & copy the implementation from mllib to ml

2016-10-24 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17847: Summary: Reduce shuffled data size of GaussianMixture & copy the implementation from mllib to ml

[jira] [Updated] (SPARK-17986) SQLTransformer leaks temporary tables

2016-10-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17986: Affects Version/s: (was: 2.0.1) > SQLTransformer leaks temporary tables >

[jira] [Resolved] (SPARK-17986) SQLTransformer leaks temporary tables

2016-10-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17986. - Resolution: Fixed Assignee: Drew Robb Fix Version/s: 2.1.0

[jira] [Updated] (SPARK-17645) Add feature selector methods based on: False Discovery Rate (FDR) and Family Wise Error rate (FWE)

2016-10-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17645: Shepherd: Yanbo Liang Assignee: Peng Meng > Add feature selector methods based on: False

[jira] [Comment Edited] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15575553#comment-15575553 ] Yanbo Liang edited comment on SPARK-17904 at 10/14/16 2:49 PM: ---

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15575553#comment-15575553 ] Yanbo Liang commented on SPARK-17904: - [~felixcheung] I think the proposal I made in this JIRA is

[jira] [Resolved] (SPARK-14634) Add BisectingKMeansSummary

2016-10-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-14634. - Resolution: Fixed Assignee: zhengruifeng Fix Version/s: 2.1.0 > Add

[jira] [Resolved] (SPARK-15402) PySpark ml.evaluation should support save/load

2016-10-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-15402. - Resolution: Fixed Assignee: Yanbo Liang Fix Version/s: 2.1.0 > PySpark

[jira] [Assigned] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-12664: --- Assignee: Yanbo Liang > Expose raw prediction scores in

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572108#comment-15572108 ] Yanbo Liang commented on SPARK-17904: - Thanks your comments. Yeah, I agree it's tricky in my example

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571414#comment-15571414 ] Yanbo Liang commented on SPARK-17904: - [~srowen] Yeah, this proposal is different from Python

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571387#comment-15571387 ] Yanbo Liang commented on SPARK-17904: - [~zjffdu] Thanks for your reply. For R users, they may install

[jira] [Comment Edited] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571337#comment-15571337 ] Yanbo Liang edited comment on SPARK-17904 at 10/13/16 9:07 AM: --- cc

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571337#comment-15571337 ] Yanbo Liang commented on SPARK-17904: - cc [~shivaram] [~felixcheung] [~sunrui] > Add a wrapper

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Summary: Add a wrapper function to install R packages on each executors. (was: Add a wrapper

[jira] [Created] (SPARK-17904) Add a wrapper function to download and install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17904: --- Summary: Add a wrapper function to download and install R packages on each executors. Key: SPARK-17904 URL: https://issues.apache.org/jira/browse/SPARK-17904 Project:

[jira] [Updated] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-12664: Shepherd: Yanbo Liang Target Version/s: 2.1.0 > Expose raw prediction scores in

[jira] [Commented] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571105#comment-15571105 ] Yanbo Liang commented on SPARK-12664: - [~GayathriMurali] Are you still working on this, if not, I can

[jira] [Updated] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17692: Description: This JIRA records behavior changes of ML/MLlib between 2.0 and 2.1, so we can note

[jira] [Updated] (SPARK-17748) One-pass algorithm for linear regression with L1 and elastic-net penalties

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17748: Shepherd: Yanbo Liang > One-pass algorithm for linear regression with L1 and elastic-net penalties

[jira] [Resolved] (SPARK-17835) Optimize NaiveBayes mllib wrapper to eliminate extra pass on data

2016-10-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17835. - Resolution: Fixed Assignee: Yanbo Liang Fix Version/s: 2.1.0 > Optimize

[jira] [Resolved] (SPARK-17745) Update Python API for NB to support weighted instances

2016-10-12 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17745. - Resolution: Fixed Assignee: Weichen Xu Fix Version/s: 2.1.0 > Update Python API

[jira] [Resolved] (SPARK-15957) RFormula supports forcing to index label

2016-10-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-15957. - Resolution: Fixed Fix Version/s: 2.1.0 > RFormula supports forcing to index label >

[jira] [Created] (SPARK-17847) Copy GaussianMixture implementation from mllib to ml

2016-10-09 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17847: --- Summary: Copy GaussianMixture implementation from mllib to ml Key: SPARK-17847 URL: https://issues.apache.org/jira/browse/SPARK-17847 Project: Spark Issue

[jira] [Closed] (SPARK-8780) Move Python doctest code example from models to algorithms

2016-10-09 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang closed SPARK-8780. -- Resolution: Won't Fix > Move Python doctest code example from models to algorithms >

[jira] [Commented] (SPARK-8780) Move Python doctest code example from models to algorithms

2016-10-09 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15560103#comment-15560103 ] Yanbo Liang commented on SPARK-8780: Since spark.mllib package is in maintenance mode, only bug fix

[jira] [Updated] (SPARK-17835) Optimize NaiveBayes mllib wrapper to eliminate extra pass on data

2016-10-08 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17835: Description: SPARK-14077 copied the {{NaiveBayes}} implementation from mllib to ml and left mllib

[jira] [Updated] (SPARK-17835) Optimize NaiveBayes mllib wrapper to eliminate extra pass on data

2016-10-08 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17835: Description: SPARK-14077 copied the {{NaiveBayes}} implementation from mllib to ml and left mllib

[jira] [Created] (SPARK-17835) Optimize NaiveBayes mllib wrapper to eliminate extra pass on data

2016-10-08 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17835: --- Summary: Optimize NaiveBayes mllib wrapper to eliminate extra pass on data Key: SPARK-17835 URL: https://issues.apache.org/jira/browse/SPARK-17835 Project: Spark

[jira] [Updated] (SPARK-17835) Optimize NaiveBayes mllib wrapper to eliminate extra pass on data

2016-10-08 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17835: Issue Type: Improvement (was: Bug) > Optimize NaiveBayes mllib wrapper to eliminate extra pass on

[jira] [Updated] (SPARK-17835) Optimize NaiveBayes mllib wrapper to eliminate extra pass on data

2016-10-08 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17835: Description: SPARK-14077 copied the {{NaiveBayes}} implementation from mllib to ml and left ml as

[jira] [Commented] (SPARK-17825) Expose log likelihood of EM algorithm in mllib

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15557281#comment-15557281 ] Yanbo Liang commented on SPARK-17825: - Sure. You can definitely contribute on this issue after my PR.

[jira] [Updated] (SPARK-17825) Expose log likelihood of EM algorithm in mllib

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17825: Component/s: (was: MLlib) ML > Expose log likelihood of EM algorithm in mllib

[jira] [Commented] (SPARK-17825) Expose log likelihood of EM algorithm in mllib

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15557137#comment-15557137 ] Yanbo Liang commented on SPARK-17825: - [~is03wlei] This task depends on copying the GaussianMixture

[jira] [Commented] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1460#comment-1460 ] Yanbo Liang commented on SPARK-17824: - [~sethah] That's cool. Let's work together and I will wait

[jira] [Comment Edited] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1448#comment-1448 ] Yanbo Liang edited comment on SPARK-17824 at 10/7/16 3:53 PM: -- [~sethah] I

[jira] [Commented] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1448#comment-1448 ] Yanbo Liang commented on SPARK-17824: - [~sethah] I saw your proposal at SPARK-17748: {code} class

[jira] [Commented] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15554886#comment-15554886 ] Yanbo Liang commented on SPARK-17824: - Sure, I updated the description to make the statement more

[jira] [Updated] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17824: Description: Cholesky decomposition is unstable (for near-singular and rank deficient matrices)

[jira] [Updated] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17824: Description: Cholesky decomposition is unstable (for near-singular and rank deficient matrices)

[jira] [Updated] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17824: Description: Cholesky decomposition is unstable (for near-singular and rank deficient matrices),

[jira] [Updated] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17824: Description: Cholesky decomposition is unstable (for near-singular and rank deficient matrices),

[jira] [Updated] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17824: Description: Cholesky decomposition is unstable (for near-singular and rank deficient matrices),

[jira] [Updated] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17824: Description: Cholesky decomposition is unstable (for near-singular and rank deficient matrices),

[jira] [Updated] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17824: Summary: QR solver for WeightedLeastSquares (was: QR solver for WeightedLeastSquare) > QR solver

[jira] [Created] (SPARK-17824) QR solver for WeightedLeastSquare

2016-10-07 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17824: --- Summary: QR solver for WeightedLeastSquare Key: SPARK-17824 URL: https://issues.apache.org/jira/browse/SPARK-17824 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-17792) L-BFGS solver for linear regression does not accept general numeric label column types

2016-10-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17792: Fix Version/s: 2.0.2 > L-BFGS solver for linear regression does not accept general numeric label

[jira] [Resolved] (SPARK-17792) L-BFGS solver for linear regression does not accept general numeric label column types

2016-10-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17792. - Resolution: Fixed Assignee: Seth Hendrickson Fix Version/s: 2.1.0 > L-BFGS

[jira] [Resolved] (SPARK-17744) Parity check between the ml and mllib test suites for NB

2016-10-04 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17744. - Resolution: Fixed Assignee: zhengruifeng Fix Version/s: 2.1.0 > Parity check

[jira] [Commented] (SPARK-17744) Parity check between the ml and mllib test suites for NB

2016-10-04 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15545428#comment-15545428 ] Yanbo Liang commented on SPARK-17744: - [~josephkb] Yes, we can copy the implementation to ml and

[jira] [Updated] (SPARK-17744) Parity check between the ml and mllib test suites for NB

2016-09-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17744: Description: We have moving {{NaiveBayes}} implementation from mllib to ml package in SPARK-14077.

[jira] [Updated] (SPARK-17744) Parity check between the ml and mllib test suites for NB

2016-09-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17744: Description: We have moving {{NaiveBayes}} implementation from mllib to ml package in SPARK-14077.

[jira] [Updated] (SPARK-17744) Parity check between the ml and mllib test suites for NB

2016-09-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17744: Description: We have moving {{NaiveBayes}} implementation from mllib to ml package in SPARK-14077.

[jira] [Updated] (SPARK-17744) Parity check between the ml and mllib test suites for NB

2016-09-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17744: Priority: Minor (was: Major) > Parity check between the ml and mllib test suites for NB >

[jira] [Commented] (SPARK-17744) Parity check between the ml and mllib test suites for NB

2016-09-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15535775#comment-15535775 ] Yanbo Liang commented on SPARK-17744: - [~srowen] We are working to copy algorithm implementations

[jira] [Updated] (SPARK-17745) Update Python API for NB to support weighted instances

2016-09-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17745: Priority: Minor (was: Major) > Update Python API for NB to support weighted instances >

[jira] [Updated] (SPARK-16872) Include Gaussian Naive Bayes Classifier

2016-09-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-16872: Shepherd: Yanbo Liang > Include Gaussian Naive Bayes Classifier >

[jira] [Resolved] (SPARK-14077) Support weighted instances in naive Bayes

2016-09-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-14077. - Resolution: Fixed Fix Version/s: 2.1.0 > Support weighted instances in naive Bayes >

[jira] [Created] (SPARK-17704) ChiSqSelector performance improvement.

2016-09-28 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-17704: --- Summary: ChiSqSelector performance improvement. Key: SPARK-17704 URL: https://issues.apache.org/jira/browse/SPARK-17704 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-09-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17692: Labels: 2.1.0 (was: ) > Document ML/MLlib behavior changes in Spark 2.1 >

<    1   2   3   4   5   6   7   8   9   10   >