[jira] [Created] (SPARK-17146) Add RandomizedSearch to the CrossValidator API

2016-08-18 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-17146: --- Summary: Add RandomizedSearch to the CrossValidator API Key: SPARK-17146 URL: https://issues.apache.org/jira/browse/SPARK-17146 Project: Spark Issue Type:

[jira] [Commented] (SPARK-17116) Allow params to be a {string, value} dict at fit time

2016-08-17 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15425522#comment-15425522 ] Manoj Kumar commented on SPARK-17116: - Haha, not really. I just found it odd that setParams accepts

[jira] [Created] (SPARK-17118) Make examples Python3 compatible

2016-08-17 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-17118: --- Summary: Make examples Python3 compatible Key: SPARK-17118 URL: https://issues.apache.org/jira/browse/SPARK-17118 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-17116) Allow params to be a {string, value} dict at fit time

2016-08-17 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-17116: Description: Currently, it is possible to override the default params set at constructor time by

[jira] [Comment Edited] (SPARK-17116) Allow params to be a {string, value} dict at fit time

2016-08-17 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15425494#comment-15425494 ] Manoj Kumar edited comment on SPARK-17116 at 8/17/16 10:17 PM: --- [~josephkb]

[jira] [Commented] (SPARK-17116) Allow params to be a {string, value} dict at fit time

2016-08-17 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15425494#comment-15425494 ] Manoj Kumar commented on SPARK-17116: - [~josephkb] [~mlnick] This is not super important, but I do

[jira] [Updated] (SPARK-17116) Allow params to be a {string, value} dict at fit time

2016-08-17 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-17116: Summary: Allow params to be a {string, value} dict at fit time (was: Allow params to be a

[jira] [Created] (SPARK-17116) Allow params to be a {string, value} dict

2016-08-17 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-17116: --- Summary: Allow params to be a {string, value} dict Key: SPARK-17116 URL: https://issues.apache.org/jira/browse/SPARK-17116 Project: Spark Issue Type:

[jira] [Commented] (SPARK-16365) Ideas for moving "mllib-local" forward

2016-07-08 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368602#comment-15368602 ] Manoj Kumar commented on SPARK-16365: - Could you be a bit more clearer about the first point? Is it

[jira] [Commented] (SPARK-3728) RandomForest: Learn models too large to store in memory

2016-07-08 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368216#comment-15368216 ] Manoj Kumar commented on SPARK-3728: Hi [~xusen]. Are you still working on this? > RandomForest:

[jira] [Commented] (SPARK-16365) Ideas for moving "mllib-local" forward

2016-07-07 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15366978#comment-15366978 ] Manoj Kumar commented on SPARK-16365: - Is the ultimate aim to make mllib-local, the scikit-learn of

[jira] [Commented] (SPARK-16399) Set PYSPARK_PYTHON to point to "python" instead of "python2.7"

2016-07-07 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15366392#comment-15366392 ] Manoj Kumar commented on SPARK-16399: - It would just run with the default python, that is in this

[jira] [Created] (SPARK-16399) Set PYSPARK_PYTHON to point to "python" instead of "python2.7"

2016-07-06 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-16399: --- Summary: Set PYSPARK_PYTHON to point to "python" instead of "python2.7" Key: SPARK-16399 URL: https://issues.apache.org/jira/browse/SPARK-16399 Project: Spark

[jira] [Created] (SPARK-16307) Improve testing for DecisionTree variances

2016-06-29 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-16307: --- Summary: Improve testing for DecisionTree variances Key: SPARK-16307 URL: https://issues.apache.org/jira/browse/SPARK-16307 Project: Spark Issue Type: Test

[jira] [Created] (SPARK-16306) Improve testing for DecisionTree variances

2016-06-29 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-16306: --- Summary: Improve testing for DecisionTree variances Key: SPARK-16306 URL: https://issues.apache.org/jira/browse/SPARK-16306 Project: Spark Issue Type: Test

[jira] [Comment Edited] (SPARK-14351) Optimize ImpurityAggregator for decision trees

2016-06-23 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15335165#comment-15335165 ] Manoj Kumar edited comment on SPARK-14351 at 6/23/16 11:43 PM: --- OK, so here

[jira] [Comment Edited] (SPARK-14351) Optimize ImpurityAggregator for decision trees

2016-06-23 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15335165#comment-15335165 ] Manoj Kumar edited comment on SPARK-14351 at 6/23/16 11:43 PM: --- OK, so here

[jira] [Issue Comment Deleted] (SPARK-14351) Optimize ImpurityAggregator for decision trees

2016-06-22 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-14351: Comment: was deleted (was: Here are my thoughts: Also ccing [~sethah] since he has seen this part

[jira] [Commented] (SPARK-14351) Optimize ImpurityAggregator for decision trees

2016-06-22 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15345182#comment-15345182 ] Manoj Kumar commented on SPARK-14351: - Here are my thoughts: Also ccing [~sethah] since he has seen

[jira] [Commented] (SPARK-14351) Optimize ImpurityAggregator for decision trees

2016-06-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15335165#comment-15335165 ] Manoj Kumar commented on SPARK-14351: - OK, so here are some benchmarks that validate your claims

[jira] [Commented] (SPARK-14351) Optimize ImpurityAggregator for decision trees

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328941#comment-15328941 ] Manoj Kumar commented on SPARK-14351: - I can try working on this. > Optimize ImpurityAggregator for

[jira] [Comment Edited] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328939#comment-15328939 ] Manoj Kumar edited comment on SPARK-3155 at 6/14/16 5:01 AM: - 1. I agree that

[jira] [Commented] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328939#comment-15328939 ] Manoj Kumar commented on SPARK-3155: 1. I agree that the use cases are limited to single trees. You

[jira] [Commented] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328592#comment-15328592 ] Manoj Kumar commented on SPARK-3155: I would like to add support for pruning DecisionTrees as part of

[jira] [Issue Comment Deleted] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-3155: --- Comment: was deleted (was: I would like to add support for pruning DecisionTrees as part of my

[jira] [Commented] (SPARK-3155) Support DecisionTree pruning

2016-06-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15328487#comment-15328487 ] Manoj Kumar commented on SPARK-3155: I would like to add support for pruning DecisionTrees as part of

[jira] [Commented] (SPARK-9623) RandomForestRegressor: provide variance of predictions

2016-06-07 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15319450#comment-15319450 ] Manoj Kumar commented on SPARK-9623: [~yanboliang] Are you still working on this? Would you mind if I

[jira] [Created] (SPARK-15761) pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an Python3

2016-06-03 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-15761: --- Summary: pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an Python3 Key: SPARK-15761 URL: https://issues.apache.org/jira/browse/SPARK-15761 Project:

[jira] [Commented] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-08-21 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14706722#comment-14706722 ] Manoj Kumar commented on SPARK-6192: [~rxin] It gets over in a few hours from now. I

[jira] [Commented] (SPARK-9848) Add @Since annotation to new public APIs in 1.5

2015-08-19 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14703133#comment-14703133 ] Manoj Kumar commented on SPARK-9848: Do we want to tag spark.ml in this release as

[jira] [Commented] (SPARK-9848) Add @since tag to new public APIs in 1.5

2015-08-19 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14702808#comment-14702808 ] Manoj Kumar commented on SPARK-9848: Well, actually in the JIRA linked to I could find

[jira] [Commented] (SPARK-7751) Add @since to stable and experimental methods in MLlib

2015-08-19 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14702816#comment-14702816 ] Manoj Kumar commented on SPARK-7751: Did you forget to add mllib.feature? Add @since

[jira] [Created] (SPARK-10108) Add @since tags to mllib.feature

2015-08-19 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-10108: --- Summary: Add @since tags to mllib.feature Key: SPARK-10108 URL: https://issues.apache.org/jira/browse/SPARK-10108 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-10082) Validate i, j in apply (Dense and Sparse Matrices)

2015-08-18 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-10082: Component/s: MLlib Validate i, j in apply (Dense and Sparse Matrices)

[jira] [Commented] (SPARK-9911) User guide for MulticlassClassificationEvaluator

2015-08-18 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700802#comment-14700802 ] Manoj Kumar commented on SPARK-9911: Umm. What additional advantage does the

[jira] [Created] (SPARK-10082) Validate i, j in apply (Dense and Sparse Matrices)

2015-08-18 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-10082: --- Summary: Validate i, j in apply (Dense and Sparse Matrices) Key: SPARK-10082 URL: https://issues.apache.org/jira/browse/SPARK-10082 Project: Spark Issue Type:

[jira] [Commented] (SPARK-9911) User guide for MulticlassClassificationEvaluator

2015-08-18 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701573#comment-14701573 ] Manoj Kumar commented on SPARK-9911: Ah I see. Thanks for the clarification. Where

[jira] [Commented] (SPARK-9911) User guide for MulticlassClassificationEvaluator

2015-08-18 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701572#comment-14701572 ] Manoj Kumar commented on SPARK-9911: Ah I see. Thanks for the clarification. Where

[jira] [Issue Comment Deleted] (SPARK-9911) User guide for MulticlassClassificationEvaluator

2015-08-18 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-9911: --- Comment: was deleted (was: Ah I see. Thanks for the clarification. Where should the user guide for

[jira] [Commented] (SPARK-9911) User guide for MulticlassClassificationEvaluator

2015-08-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698841#comment-14698841 ] Manoj Kumar commented on SPARK-9911: Can I work on this? User guide for

[jira] [Commented] (SPARK-9906) User guide for LogisticRegressionSummary

2015-08-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695156#comment-14695156 ] Manoj Kumar commented on SPARK-9906: Sure ! User guide for LogisticRegressionSummary

[jira] [Commented] (SPARK-6364) hashCode and equals for Matrices

2015-08-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695468#comment-14695468 ] Manoj Kumar commented on SPARK-6364: It is all right, there is enough work for

[jira] [Commented] (SPARK-9919) Matrices should respect Java's equals and hashCode contract

2015-08-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695491#comment-14695491 ] Manoj Kumar commented on SPARK-9919: OK, but I need to make some changes. I'll

[jira] [Commented] (SPARK-9919) Matrices should respect Java's equals and hashCode contract

2015-08-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695492#comment-14695492 ] Manoj Kumar commented on SPARK-9919: OK, but I need to make some changes. I'll

[jira] [Resolved] (SPARK-8633) List missing model methods in Python Pipeline API

2015-08-06 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar resolved SPARK-8633. Resolution: Fixed List missing model methods in Python Pipeline API

[jira] [Commented] (SPARK-8633) List missing model methods in Python Pipeline API

2015-08-06 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14660372#comment-14660372 ] Manoj Kumar commented on SPARK-8633: Should I mark this as resolved? List missing

[jira] [Commented] (SPARK-6488) Support addition/multiplication in PySpark's BlockMatrix

2015-08-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14658600#comment-14658600 ] Manoj Kumar commented on SPARK-6488: I'll create a JIRA in a while. I am just adding

[jira] [Created] (SPARK-9655) Add missing methods to linalg.distributed

2015-08-05 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-9655: -- Summary: Add missing methods to linalg.distributed Key: SPARK-9655 URL: https://issues.apache.org/jira/browse/SPARK-9655 Project: Spark Issue Type: Improvement

[jira] [Closed] (SPARK-9655) Add missing methods to linalg.distributed

2015-08-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar closed SPARK-9655. -- Resolution: Duplicate Add missing methods to linalg.distributed

[jira] [Commented] (SPARK-9656) Add missing methods to linalg.distributed

2015-08-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14658937#comment-14658937 ] Manoj Kumar commented on SPARK-9656: cc: [~mwdus...@us.ibm.com] Add missing methods

[jira] [Created] (SPARK-9656) Add missing methods to linalg.distributed

2015-08-05 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-9656: -- Summary: Add missing methods to linalg.distributed Key: SPARK-9656 URL: https://issues.apache.org/jira/browse/SPARK-9656 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-9533) Add missing methods in Word2Vec ML (Python API)

2015-08-04 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-9533: --- Component/s: PySpark Add missing methods in Word2Vec ML (Python API)

[jira] [Commented] (SPARK-9484) Word2Vec import/export for original binary format

2015-08-03 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651899#comment-14651899 ] Manoj Kumar commented on SPARK-9484: I just went through the C code that does the .bin

[jira] [Commented] (SPARK-8874) Add missing methods in Word2Vec ML

2015-08-02 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14650619#comment-14650619 ] Manoj Kumar commented on SPARK-8874: Done. Thanks. Add missing methods in Word2Vec

[jira] [Created] (SPARK-9533) Add missing methods in Word2Vec ML (Python API)

2015-08-02 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-9533: -- Summary: Add missing methods in Word2Vec ML (Python API) Key: SPARK-9533 URL: https://issues.apache.org/jira/browse/SPARK-9533 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6227) PCA and SVD for PySpark

2015-08-01 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14650314#comment-14650314 ] Manoj Kumar commented on SPARK-6227: [~mengxr] Can this be assigned to me? Since the

[jira] [Updated] (SPARK-9525) Optimize SparseVector initializations in linalg

2015-08-01 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-9525: --- Priority: Major (was: Minor) Optimize SparseVector initializations in linalg

[jira] [Created] (SPARK-9525) Optimize SparseVector initializations in linalg

2015-08-01 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-9525: -- Summary: Optimize SparseVector initializations in linalg Key: SPARK-9525 URL: https://issues.apache.org/jira/browse/SPARK-9525 Project: Spark Issue Type:

[jira] [Commented] (SPARK-9277) SparseVector constructor must throw an error when declared number of elements less than array length

2015-07-30 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14647380#comment-14647380 ] Manoj Kumar commented on SPARK-9277: I will not have access to a development

[jira] [Created] (SPARK-9408) Refactor mllib/linalg.py to mllib/linalg

2015-07-28 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-9408: -- Summary: Refactor mllib/linalg.py to mllib/linalg Key: SPARK-9408 URL: https://issues.apache.org/jira/browse/SPARK-9408 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-9277) SparseVector constructor must throw an error when declared number of elements less than array lenght

2015-07-24 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14640055#comment-14640055 ] Manoj Kumar commented on SPARK-9277: I have labelled this as started. Will fix this in

[jira] [Updated] (SPARK-9277) SparseVector constructor must throw an error when declared number of elements less than array lenght

2015-07-24 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-9277: --- Labels: starter (was: ) SparseVector constructor must throw an error when declared number of

[jira] [Commented] (SPARK-7105) Support model save/load in Python's GaussianMixture

2015-07-21 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14635424#comment-14635424 ] Manoj Kumar commented on SPARK-7105: Hi, Are you still working on this? Support

[jira] [Created] (SPARK-9223) Support model save/load in Python's LDA

2015-07-21 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-9223: -- Summary: Support model save/load in Python's LDA Key: SPARK-9223 URL: https://issues.apache.org/jira/browse/SPARK-9223 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-9222) Make class instantiation variables in DistributedLDAModel [private] clustering

2015-07-21 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-9222: -- Summary: Make class instantiation variables in DistributedLDAModel [private] clustering Key: SPARK-9222 URL: https://issues.apache.org/jira/browse/SPARK-9222 Project:

[jira] [Commented] (SPARK-6486) Add BlockMatrix in PySpark

2015-07-17 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14631182#comment-14631182 ] Manoj Kumar commented on SPARK-6486: Great, I will start on this after the weekend.

[jira] [Commented] (SPARK-9112) Implement LogisticRegressionSummary similar to LinearRegressionSummary

2015-07-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630053#comment-14630053 ] Manoj Kumar commented on SPARK-9112: Yes, that is the idea. Also we need not port it

[jira] [Commented] (SPARK-6001) K-Means clusterer should return the assignments of input points to clusters

2015-07-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629808#comment-14629808 ] Manoj Kumar commented on SPARK-6001: I just started to work on this. K-Means

[jira] [Commented] (SPARK-9112) Implement LogisticRegressionSummary similar to LinearRegressionSummary

2015-07-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630014#comment-14630014 ] Manoj Kumar commented on SPARK-9112: Indeed, it seems so. But the merged

[jira] [Commented] (SPARK-6001) K-Means clusterer should return the assignments of input points to clusters

2015-07-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629981#comment-14629981 ] Manoj Kumar commented on SPARK-6001: Oops. I just figured out we do not have a KMeans

[jira] [Created] (SPARK-9112) Implement LogisticRegressionSummary similar to LinearRegressionSummary

2015-07-16 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-9112: -- Summary: Implement LogisticRegressionSummary similar to LinearRegressionSummary Key: SPARK-9112 URL: https://issues.apache.org/jira/browse/SPARK-9112 Project: Spark

[jira] [Commented] (SPARK-9112) Implement LogisticRegressionSummary similar to LinearRegressionSummary

2015-07-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630132#comment-14630132 ] Manoj Kumar commented on SPARK-9112: I see that these are fields in the transformed

[jira] [Commented] (SPARK-8996) Add Python API for Kolmogorov-Smirnov Test

2015-07-14 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14626409#comment-14626409 ] Manoj Kumar commented on SPARK-8996: Hi, Can I work on this? Add Python API for

[jira] [Commented] (SPARK-3703) Ensemble learning methods

2015-07-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625859#comment-14625859 ] Manoj Kumar commented on SPARK-3703: Hi, I am interested in working on ensemble

[jira] [Comment Edited] (SPARK-7126) For spark.ml Classifiers, automatically index labels if they are not yet indexed

2015-07-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625289#comment-14625289 ] Manoj Kumar edited comment on SPARK-7126 at 7/13/15 8:36 PM: -

[jira] [Commented] (SPARK-6261) Python MLlib API missing items: Feature

2015-07-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625241#comment-14625241 ] Manoj Kumar commented on SPARK-6261: We can mark this as resolved. I think? Python

[jira] [Commented] (SPARK-7126) For spark.ml Classifiers, automatically index labels if they are not yet indexed

2015-07-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625289#comment-14625289 ] Manoj Kumar commented on SPARK-7126: [~josephkb] 1. In scikit-learn predict outputs

[jira] [Updated] (SPARK-8704) Add missing methods in StandardScaler (ML and PySpark)

2015-07-07 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-8704: --- Summary: Add missing methods in StandardScaler (ML and PySpark) (was: Add missing methods in

[jira] [Updated] (SPARK-8704) Add missing methods in StandardScaler

2015-07-07 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-8704: --- Summary: Add missing methods in StandardScaler (was: Add additional methods to wrappers in

[jira] [Updated] (SPARK-8874) Add missing methods in Word2Vec ML

2015-07-07 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-8874: --- Component/s: PySpark ML Add missing methods in Word2Vec ML

[jira] [Created] (SPARK-8874) Add missing methods in Word2Vec ML

2015-07-07 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-8874: -- Summary: Add missing methods in Word2Vec ML Key: SPARK-8874 URL: https://issues.apache.org/jira/browse/SPARK-8874 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-8823) Optimizations for sparse vector products in pyspark.mllib.linalg

2015-07-04 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-8823: -- Summary: Optimizations for sparse vector products in pyspark.mllib.linalg Key: SPARK-8823 URL: https://issues.apache.org/jira/browse/SPARK-8823 Project: Spark

[jira] [Commented] (SPARK-8706) Implement Pylint / Prospector checks for PySpark

2015-07-02 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14611611#comment-14611611 ] Manoj Kumar commented on SPARK-8706: Sorry for sounding dumb, but the present code

[jira] [Updated] (SPARK-7401) Dot product and squared_distances should be vectorized in Vectors

2015-07-02 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-7401: --- Priority: Major (was: Minor) Dot product and squared_distances should be vectorized in Vectors

[jira] [Closed] (SPARK-8291) Add parse functionality to LabeledPoint in PySpark

2015-07-01 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar closed SPARK-8291. -- Resolution: Won't Fix Add parse functionality to LabeledPoint in PySpark

[jira] [Resolved] (SPARK-8265) Add LinearDataGenerator to pyspark.mllib.utils

2015-07-01 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar resolved SPARK-8265. Resolution: Fixed Fix Version/s: 1.5.0 Add LinearDataGenerator to pyspark.mllib.utils

[jira] [Commented] (SPARK-3258) Python API for streaming MLlib algorithms

2015-06-30 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14608757#comment-14608757 ] Manoj Kumar commented on SPARK-3258: [~mengxr] We can mark this as resolved. Python

[jira] [Resolved] (SPARK-3258) Python API for streaming MLlib algorithms

2015-06-30 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar resolved SPARK-3258. Resolution: Fixed Fix Version/s: 1.5.0 Python API for streaming MLlib algorithms

[jira] [Created] (SPARK-8704) Add additional methods to wrappers in ml.pyspark.feature

2015-06-29 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-8704: -- Summary: Add additional methods to wrappers in ml.pyspark.feature Key: SPARK-8704 URL: https://issues.apache.org/jira/browse/SPARK-8704 Project: Spark Issue

[jira] [Commented] (SPARK-8706) Implement Pylint / Prospector checks for PySpark

2015-06-29 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606311#comment-14606311 ] Manoj Kumar commented on SPARK-8706: Mind if I hack on this? Implement Pylint /

[jira] [Commented] (SPARK-8633) List missing model methods in Python Pipeline API

2015-06-29 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606266#comment-14606266 ] Manoj Kumar commented on SPARK-8633: I think that should be it. List missing model

[jira] [Created] (SPARK-8711) Add additional methods to JavaModel wrappers in trees

2015-06-29 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-8711: -- Summary: Add additional methods to JavaModel wrappers in trees Key: SPARK-8711 URL: https://issues.apache.org/jira/browse/SPARK-8711 Project: Spark Issue Type:

[jira] [Created] (SPARK-8678) Default values in Pipeline API should be immutable

2015-06-27 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-8678: -- Summary: Default values in Pipeline API should be immutable Key: SPARK-8678 URL: https://issues.apache.org/jira/browse/SPARK-8678 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-6724) Model import/export for FPGrowth

2015-06-24 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14599753#comment-14599753 ] Manoj Kumar edited comment on SPARK-6724 at 6/24/15 5:13 PM: -

[jira] [Commented] (SPARK-6791) Model export/import for spark.ml: meta-algorithms

2015-06-24 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14599777#comment-14599777 ] Manoj Kumar commented on SPARK-6791: Oh sorry, I read block on as block. Do you have

[jira] [Comment Edited] (SPARK-6724) Model import/export for FPGrowth

2015-06-24 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14599753#comment-14599753 ] Manoj Kumar edited comment on SPARK-6724 at 6/24/15 5:12 PM: -

[jira] [Commented] (SPARK-6724) Model import/export for FPGrowth

2015-06-24 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14599753#comment-14599753 ] Manoj Kumar commented on SPARK-6724: [~hrishikesh] Are you actively working on this?

[jira] [Resolved] (SPARK-5694) Python API for evaluation metrics

2015-06-23 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar resolved SPARK-5694. Resolution: Fixed Python API for evaluation metrics -

[jira] [Commented] (SPARK-6791) Model export/import for spark.ml: meta-algorithms

2015-06-20 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14594802#comment-14594802 ] Manoj Kumar commented on SPARK-6791: [~josephkb] I would like to work on this. Which

[jira] [Created] (SPARK-8479) Add numNonzeros and numActives to linalg.Matrices

2015-06-19 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-8479: -- Summary: Add numNonzeros and numActives to linalg.Matrices Key: SPARK-8479 URL: https://issues.apache.org/jira/browse/SPARK-8479 Project: Spark Issue Type:

  1   2   3   >