[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-14 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278381#comment-14278381 ] Pedro Rodriguez commented on SPARK-1405: Worked on some preliminary testing result

[jira] [Updated] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient and fail on SparseVectors with large size

2015-01-14 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Derrick Burns updated SPARK-5186: - Summary: Vector.equals and Vector.hashCode are very inefficient and fail on SparseVectors with la

[jira] [Commented] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient

2015-01-14 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278374#comment-14278374 ] Derrick Burns commented on SPARK-5186: -- The aforementioned pull request does fix part

[jira] [Commented] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-14 Thread Muhammad-Ali A'rabi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278305#comment-14278305 ] Muhammad-Ali A'rabi commented on SPARK-5226: Yeah, of course. It will take me

[jira] [Created] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-01-14 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-5261: -- Summary: In some cases ,The value of word's vector representation is too big Key: SPARK-5261 URL: https://issues.apache.org/jira/browse/SPARK-5261 Project: Spark

[jira] [Commented] (SPARK-5193) Make Spark SQL API usable in Java and remove the Java-specific API

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278249#comment-14278249 ] Apache Spark commented on SPARK-5193: - User 'rxin' has created a pull request for this

[jira] [Updated] (SPARK-5247) Enable javadoc/scaladoc for public classes in catalyst project

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5247: --- Assignee: Michael Armbrust > Enable javadoc/scaladoc for public classes in catalyst project >

[jira] [Updated] (SPARK-5260) Expose JsonRDD.allKeysWithValueTypes() in a utility class

2015-01-14 Thread Corey J. Nolet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Corey J. Nolet updated SPARK-5260: -- Description: I have found this method extremely useful when implementing my own strategy for inf

[jira] [Created] (SPARK-5260) Expose JsonRDD.allKeysWithValueTypes() in a utility class

2015-01-14 Thread Corey J. Nolet (JIRA)
Corey J. Nolet created SPARK-5260: - Summary: Expose JsonRDD.allKeysWithValueTypes() in a utility class Key: SPARK-5260 URL: https://issues.apache.org/jira/browse/SPARK-5260 Project: Spark Is

[jira] [Comment Edited] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278228#comment-14278228 ] RJ Nowling edited comment on SPARK-4894 at 1/15/15 4:21 AM: [~

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278228#comment-14278228 ] RJ Nowling commented on SPARK-4894: --- [~josephkb], after some thought, I've come around a

[jira] [Updated] (SPARK-5259) Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry

2015-01-14 Thread SuYan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SuYan updated SPARK-5259: - Description: 1. while shuffle stage was retry, there may have 2 taskSet running. we call the 2 taskSet:taskSet0.

[jira] [Updated] (SPARK-5259) Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry

2015-01-14 Thread SuYan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SuYan updated SPARK-5259: - Description: 1. while shuffle stage was retry, there may have 2 taskSet running. we call the 2 taskSet:taskSet0.

[jira] [Updated] (SPARK-5259) Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry

2015-01-14 Thread SuYan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SuYan updated SPARK-5259: - Description: 1. while shuffle stage was retry, there may have 2 taskSet running. we call the 2 taskSet:taskSet0.

[jira] [Commented] (SPARK-5259) Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278222#comment-14278222 ] Apache Spark commented on SPARK-5259: - User 'suyanNone' has created a pull request for

[jira] [Created] (SPARK-5259) Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry

2015-01-14 Thread SuYan (JIRA)
SuYan created SPARK-5259: Summary: Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry Key: SPARK-5259 URL: https://issues.apache.org/jira/browse/SPARK-5259 Project

[jira] [Commented] (SPARK-5193) Make Spark SQL API usable in Java and remove the Java-specific API

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278163#comment-14278163 ] Apache Spark commented on SPARK-5193: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5254) Update the user guide to make clear that spark.mllib is not being deprecated

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278160#comment-14278160 ] Apache Spark commented on SPARK-5254: - User 'mengxr' has created a pull request for th

[jira] [Commented] (SPARK-5147) write ahead logs from streaming receiver are not purged because cleanupOldBlocks in WriteAheadLogBasedBlockHandler is never called

2015-01-14 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14278161#comment-14278161 ] Saisai Shao commented on SPARK-5147: 1. Currently detecting whether to delete the WAL

[jira] [Created] (SPARK-5258) Clean up exposed classes in sql.hive package

2015-01-14 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5258: -- Summary: Clean up exposed classes in sql.hive package Key: SPARK-5258 URL: https://issues.apache.org/jira/browse/SPARK-5258 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-5257) SparseVector indices must be non-negative

2015-01-14 Thread Derrick Burns (JIRA)
Derrick Burns created SPARK-5257: Summary: SparseVector indices must be non-negative Key: SPARK-5257 URL: https://issues.apache.org/jira/browse/SPARK-5257 Project: Spark Issue Type: Documenta

[jira] [Resolved] (SPARK-5254) Update the user guide to make clear that spark.mllib is not being deprecated

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5254. -- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Issue resolved by pull re

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277988#comment-14277988 ] Alexander Ulanov commented on SPARK-5256: - Also, asynchronous gradient update migh

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277986#comment-14277986 ] Alexander Ulanov commented on SPARK-5256: - I would like to improve Gradient interf

[jira] [Commented] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277932#comment-14277932 ] Xiangrui Meng commented on SPARK-5226: -- [~angellandros] Before you start coding, coul

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277933#comment-14277933 ] Joseph K. Bradley commented on SPARK-4894: -- +1 for small changes, but occasionall

[jira] [Updated] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5226: - Affects Version/s: (was: 1.2.0) > Add DBSCAN Clustering Algorithm to MLlib > -

[jira] [Updated] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5226: - Target Version/s: (was: 1.2.0) > Add DBSCAN Clustering Algorithm to MLlib >

[jira] [Updated] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort)

2015-01-14 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-3655: -- Priority: Major (was: Minor) > Support sorting of values in addition to keys (i.e. secondary sort) > --

[jira] [Issue Comment Deleted] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5256: - Comment: was deleted (was: Generalization: grouped optimization) > Improving MLlib optimi

[jira] [Issue Comment Deleted] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5256: - Comment: was deleted (was: Generalization: grouped optimization) > Improving MLlib optimi

[jira] [Issue Comment Deleted] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5256: - Comment: was deleted (was: Improving "Updater" concept) > Improving MLlib optimization AP

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277861#comment-14277861 ] Joseph K. Bradley commented on SPARK-5256: -- Generalization: grouped optimization

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277858#comment-14277858 ] Joseph K. Bradley commented on SPARK-5256: -- Generalization: grouped optimization

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277857#comment-14277857 ] Joseph K. Bradley commented on SPARK-5256: -- Improving "Updater" concept > Improv

[jira] [Created] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5256: Summary: Improving MLlib optimization APIs Key: SPARK-5256 URL: https://issues.apache.org/jira/browse/SPARK-5256 Project: Spark Issue Type: Umbrella

[jira] [Commented] (SPARK-4906) Spark master OOMs with exception stack trace stored in JobProgressListener

2015-01-14 Thread Mingyu Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277849#comment-14277849 ] Mingyu Kim commented on SPARK-4906: --- "typically once a few tasks have failed the stage w

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1420#comment-1420 ] RJ Nowling commented on SPARK-4894: --- Hi [~josephkb], lots to think about! In general, I

[jira] [Commented] (SPARK-5254) Update the user guide to make clear that spark.mllib is not being deprecated

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277768#comment-14277768 ] Apache Spark commented on SPARK-5254: - User 'mengxr' has created a pull request for th

[jira] [Updated] (SPARK-5255) Use python doc "note" for experimental tags in tree.py

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5255: - Description: spark/python/pyspark/mllib/tree.py currently has several "EXPERIMENTAL" tags

[jira] [Created] (SPARK-5255) Use python doc "note" for experimental tags in tree.py

2015-01-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5255: Summary: Use python doc "note" for experimental tags in tree.py Key: SPARK-5255 URL: https://issues.apache.org/jira/browse/SPARK-5255 Project: Spark

[jira] [Commented] (SPARK-4585) Spark dynamic executor allocation shouldn't use maxExecutors as initial number

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277716#comment-14277716 ] Apache Spark commented on SPARK-4585: - User 'sryza' has created a pull request for thi

[jira] [Created] (SPARK-5254) Update the user guide to make clear that spark.mllib is not being deprecated

2015-01-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5254: Summary: Update the user guide to make clear that spark.mllib is not being deprecated Key: SPARK-5254 URL: https://issues.apache.org/jira/browse/SPARK-5254 Project: S

[jira] [Updated] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3726: - Assignee: Manoj Kumar > RandomForest: Support for bootstrap options >

[jira] [Commented] (SPARK-5199) Input metrics should show up for InputFormats that return CombineFileSplits

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277704#comment-14277704 ] Apache Spark commented on SPARK-5199: - User 'sryza' has created a pull request for thi

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277702#comment-14277702 ] Joseph K. Bradley commented on SPARK-4894: -- [~rnowling] Thanks for looking into t

[jira] [Created] (SPARK-5253) LinearRegression with L1/L2 (elastic net) using OWLQN in new ML pacakge

2015-01-14 Thread DB Tsai (JIRA)
DB Tsai created SPARK-5253: -- Summary: LinearRegression with L1/L2 (elastic net) using OWLQN in new ML pacakge Key: SPARK-5253 URL: https://issues.apache.org/jira/browse/SPARK-5253 Project: Spark Is

[jira] [Commented] (SPARK-5193) Make Spark SQL API usable in Java and remove the Java-specific API

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277656#comment-14277656 ] Apache Spark commented on SPARK-5193: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-4746) integration tests should be separated from faster unit tests

2015-01-14 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277651#comment-14277651 ] Imran Rashid commented on SPARK-4746: - btw if anybody else wants to futz around with t

[jira] [Commented] (SPARK-4746) integration tests should be separated from faster unit tests

2015-01-14 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277646#comment-14277646 ] Imran Rashid commented on SPARK-4746: - submitted a PR: https://github.com/apache/spark

[jira] [Comment Edited] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277631#comment-14277631 ] RJ Nowling edited comment on SPARK-4894 at 1/14/15 8:50 PM: Th

[jira] [Commented] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-14 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277643#comment-14277643 ] Manoj Kumar commented on SPARK-3726: [~josephkb] You seem to report issues that I alwa

[jira] [Commented] (SPARK-4746) integration tests should be separated from faster unit tests

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277633#comment-14277633 ] Apache Spark commented on SPARK-4746: - User 'squito' has created a pull request for th

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277631#comment-14277631 ] RJ Nowling commented on SPARK-4894: --- Thanks [~lmcguire]! I'll wait until next week in c

[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277611#comment-14277611 ] Alex Baretta commented on SPARK-5235: - [~rxin] I see your point. Well, listen, I appre

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Leah McGuire (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277590#comment-14277590 ] Leah McGuire commented on SPARK-4894: - Thanks [~rnowling]! I can take a look at it th

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277578#comment-14277578 ] Xiangrui Meng commented on SPARK-4894: -- [~rnowling] I've assigned this to you. Let's

[jira] [Updated] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4894: - Assignee: RJ Nowling > Add Bernoulli-variant of Naive Bayes >

[jira] [Updated] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4894: - Affects Version/s: (was: 1.1.1) 1.2.0 > Add Bernoulli-variant of Naive

[jira] [Updated] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5234: - Assignee: yuhao yang > examples for ml don't have sparkContext.stop >

[jira] [Updated] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5234: - Target Version/s: 1.3.0, 1.2.1 (was: 1.3.0) > examples for ml don't have sparkContext.stop >

[jira] [Updated] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5234: - Fix Version/s: 1.2.1 > examples for ml don't have sparkContext.stop >

[jira] [Resolved] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5234. -- Resolution: Fixed Issue resolved by pull request 4044 [https://github.com/apache/spark/pull/4044

[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277561#comment-14277561 ] Reynold Xin commented on SPARK-5235: I will merge your PR and we can continue having t

[jira] [Commented] (SPARK-5206) Accumulators are not re-registered during recovering from checkpoint

2015-01-14 Thread vincent ye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277554#comment-14277554 ] vincent ye commented on SPARK-5206: --- Hi Tathagata, Accumulator object is created after t

[jira] [Commented] (SPARK-5206) Accumulators are not re-registered during recovering from checkpoint

2015-01-14 Thread vincent ye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277552#comment-14277552 ] vincent ye commented on SPARK-5206: --- Hi Tathagata, Accumulator object is created after t

[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277555#comment-14277555 ] Alex Baretta commented on SPARK-5235: - [~rxin] Could be. All I'm saying is that your c

[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277553#comment-14277553 ] Sean Owen commented on SPARK-5235: -- [~alexbaretta] I suppose my point is that no code can

[jira] [Issue Comment Deleted] (SPARK-5206) Accumulators are not re-registered during recovering from checkpoint

2015-01-14 Thread vincent ye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vincent ye updated SPARK-5206: -- Comment: was deleted (was: Hi Tathagata, Accumulator object is created after the StreamingContext (ssc)

[jira] [Updated] (SPARK-4014) TaskContext.attemptId returns taskId

2015-01-14 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4014: -- Target Version/s: 1.0.3, 1.1.2, 1.2.1 Assignee: Josh Rosen Labels: backport-nee

[jira] [Resolved] (SPARK-4014) TaskContext.attemptId returns taskId

2015-01-14 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4014. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3849 [https://github.com/

[jira] [Updated] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5235: --- Summary: Determine serializability of SQLContext (was: java.io.NotSerializableException: org.apache.s

[jira] [Updated] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5235: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-5166 > Determine serializability of SQLContext

[jira] [Commented] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277538#comment-14277538 ] Joseph K. Bradley commented on SPARK-5019: -- [~lewuathe] Will you be able to upda

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277539#comment-14277539 ] Reynold Xin commented on SPARK-5235: How would we deprecate that though? Are you sugge

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277533#comment-14277533 ] Alex Baretta commented on SPARK-5235: - [~sowen] I would much rather the decision of ma

[jira] [Commented] (SPARK-3717) DecisionTree, RandomForest: Partition by feature

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277485#comment-14277485 ] Joseph K. Bradley commented on SPARK-3717: -- [~bbnsumanth] Please do not misunder

[jira] [Updated] (SPARK-5228) Hide tables for "Active Jobs/Completed Jobs/Failed Jobs" when they are empty

2015-01-14 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5228: -- Assignee: Kousuke Saruta > Hide tables for "Active Jobs/Completed Jobs/Failed Jobs" when they are empty

[jira] [Resolved] (SPARK-5228) Hide tables for "Active Jobs/Completed Jobs/Failed Jobs" when they are empty

2015-01-14 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5228. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4028 [https://github.com/

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277475#comment-14277475 ] Sean Owen commented on SPARK-5235: -- [~alexbaretta] Well at least that explains why tests

[jira] [Updated] (SPARK-2909) Indexing for SparseVector in pyspark

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2909: - Assignee: Manoj Kumar > Indexing for SparseVector in pyspark > ---

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277460#comment-14277460 ] Alex Baretta commented on SPARK-5235: - [~rxin] [~sowen] My bad! Indeed the SQLContext

[jira] [Resolved] (SPARK-2909) Indexing for SparseVector in pyspark

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2909. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4025 [https://githu

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277449#comment-14277449 ] Apache Spark commented on SPARK-1405: - User 'jkbradley' has created a pull request for

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277414#comment-14277414 ] Alex Baretta commented on SPARK-5235: - [~rxin] I'm sorry to say it's not that easy, es

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-14 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277334#comment-14277334 ] Shivaram Venkataraman commented on SPARK-3821: -- Regarding the pre-built distr

[jira] [Commented] (SPARK-5211) Restore HiveMetastoreTypes.toDataType

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277290#comment-14277290 ] Reynold Xin commented on SPARK-5211: BTW who are the developers using it? > Restore H

[jira] [Commented] (SPARK-5211) Restore HiveMetastoreTypes.toDataType

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277292#comment-14277292 ] Reynold Xin commented on SPARK-5211: I'm under the impression that everything in the H

[jira] [Updated] (SPARK-5245) Move Decimal from types.decimal to types package

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5245: --- Summary: Move Decimal from types.decimal to types package (was: Move Decimal and Date into o.a.s.sql.

[jira] [Resolved] (SPARK-5245) Move Decimal from types.decimal to types package

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5245. Resolution: Fixed Fix Version/s: 1.3.0 Fixed in https://github.com/apache/spark/pull/4041 >

[jira] [Updated] (SPARK-5245) Move Decimal from types.decimal to types package

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5245: --- Assignee: Adrian Wang > Move Decimal from types.decimal to types package > ---

[jira] [Resolved] (SPARK-5248) moving Decimal from types.decimal to types package

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5248. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Adrian Wang Fixed in https://github

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277270#comment-14277270 ] Reynold Xin commented on SPARK-5235: I can merge your PR soon, but [~alexbaretta] can

[jira] [Updated] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5235: --- Description: The SQLConf field in SQLContext is neither Serializable nor transient. Here's the stack

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277258#comment-14277258 ] Alex Baretta commented on SPARK-5235: - Yes, there is a need for a hotfix. [~rxin] comm

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277251#comment-14277251 ] Sean Owen commented on SPARK-5235: -- @Alex Baretta what version worked? If you're saying a

[jira] [Updated] (SPARK-5252) Streaming StatefulNetworkWordCount example hangs

2015-01-14 Thread Lutz Buech (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lutz Buech updated SPARK-5252: -- Attachment: debug.txt log at DEBUG level > Streaming StatefulNetworkWordCount example hangs > -

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277240#comment-14277240 ] Alex Baretta commented on SPARK-5235: - [~sowen] I agree with you that contexts have no

[jira] [Created] (SPARK-5252) Streaming StatefulNetworkWordCount example hangs

2015-01-14 Thread Lutz Buech (JIRA)
Lutz Buech created SPARK-5252: - Summary: Streaming StatefulNetworkWordCount example hangs Key: SPARK-5252 URL: https://issues.apache.org/jira/browse/SPARK-5252 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277217#comment-14277217 ] Sean Owen commented on SPARK-5235: -- [~alexbaretta] It certainly may not be your code of c

  1   2   >