[jira] [Created] (SPARK-7752) NaiveBayes.modelType should use lowercase letters

2015-05-19 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7752: Summary: NaiveBayes.modelType should use lowercase letters Key: SPARK-7752 URL: https://issues.apache.org/jira/browse/SPARK-7752 Project: Spark Issue Type: S

[jira] [Updated] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7537: - Issue Type: Umbrella (was: Sub-task) Parent: (was: SPARK-7443) > Audit new public Sca

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-05-19 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551898#comment-14551898 ] Sandy Ryza commented on SPARK-4352: --- I don't think we should kill executors in order to

[jira] [Comment Edited] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551863#comment-14551863 ] Xiangrui Meng edited comment on SPARK-7537 at 5/20/15 6:05 AM: -

[jira] [Assigned] (SPARK-6094) Add MultilabelMetrics in PySpark/MLlib

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6094: --- Assignee: (was: Apache Spark) > Add MultilabelMetrics in PySpark/MLlib >

[jira] [Commented] (SPARK-6094) Add MultilabelMetrics in PySpark/MLlib

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551870#comment-14551870 ] Apache Spark commented on SPARK-6094: - User 'yanboliang' has created a pull request fo

[jira] [Assigned] (SPARK-6094) Add MultilabelMetrics in PySpark/MLlib

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6094: --- Assignee: Apache Spark > Add MultilabelMetrics in PySpark/MLlib > ---

[jira] [Commented] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551863#comment-14551863 ] Xiangrui Meng commented on SPARK-7537: -- 1. toPMML should be experimental. Shall we ke

[jira] [Updated] (SPARK-7165) Sort Merge Join for outer joins

2015-05-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7165: --- Target Version/s: 1.5.0 > Sort Merge Join for outer joins > --- > >

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-05-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551836#comment-14551836 ] Saisai Shao commented on SPARK-4352: Hi Sandy, I retrieved back the old code which sup

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-19 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551795#comment-14551795 ] Nicholas Chammas commented on SPARK-7640: - [~brdwrd] - According to [this doc on A

[jira] [Commented] (SPARK-7750) Rename "json" endpoints to "api" endpoints

2015-05-19 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551718#comment-14551718 ] Mark Hamstra commented on SPARK-7750: - Including `@Produces` annotations is also proba

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-05-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551713#comment-14551713 ] Saisai Shao commented on SPARK-4352: Hi Sandy, thanks a lot for your comments, I will

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-05-19 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551677#comment-14551677 ] Sandy Ryza commented on SPARK-4352: --- Thanks for posting this Saisai. Can you export and

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-05-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551674#comment-14551674 ] Saisai Shao commented on SPARK-4352: Hi [~kevincox], thanks a lot for your comments. I

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-05-19 Thread Kevin Cox (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551667#comment-14551667 ] Kevin Cox commented on SPARK-4352: -- I believe the idea of this issue was to determine the

[jira] [Updated] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7741: - Description: In the 1.4 branch, this results in java.io.NotSerializableExceptions when trying to use opera

[jira] [Updated] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7741: - Description: In the 1.4.0 branch, this results in java.io.NotSerializableExceptions when trying to use ope

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2015-05-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551649#comment-14551649 ] Saisai Shao commented on SPARK-4352: Hi all, I'd like to take a crack at this, Here is

[jira] [Commented] (SPARK-6880) Spark Shutdowns with NoSuchElementException when running parallel collect on cachedRDD

2015-05-19 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551647#comment-14551647 ] Mark Hamstra commented on SPARK-6880: - This fix should also be applied as far back as

[jira] [Updated] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7741: - Description: This results in java.io.NotSerializableExceptions when trying to use operators wrapped in `wi

[jira] [Commented] (SPARK-7455) Perf test for LDA (EM/online)

2015-05-19 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551624#comment-14551624 ] yuhao yang commented on SPARK-7455: --- work in progress https://github.com/databricks/spar

[jira] [Created] (SPARK-7751) Add @since to stable methods in MLlib

2015-05-19 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7751: Summary: Add @since to stable methods in MLlib Key: SPARK-7751 URL: https://issues.apache.org/jira/browse/SPARK-7751 Project: Spark Issue Type: New Feature

[jira] [Assigned] (SPARK-7750) Rename "json" endpoints to "api" endpoints

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7750: --- Assignee: (was: Apache Spark) > Rename "json" endpoints to "api" endpoints >

[jira] [Assigned] (SPARK-7750) Rename "json" endpoints to "api" endpoints

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7750: --- Assignee: Apache Spark > Rename "json" endpoints to "api" endpoints > ---

[jira] [Commented] (SPARK-7750) Rename "json" endpoints to "api" endpoints

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551584#comment-14551584 ] Apache Spark commented on SPARK-7750: - User 'harishreedharan' has created a pull reque

[jira] [Commented] (SPARK-7743) Upgrade parquet dependency

2015-05-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551579#comment-14551579 ] Cheng Lian commented on SPARK-7743: --- We probably want to upgrade to the newly released 1.

[jira] [Resolved] (SPARK-7656) use CatalystConf in FunctionRegistry

2015-05-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7656. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6164 [https:/

[jira] [Created] (SPARK-7750) Rename "json" endpoints to "api" endpoints

2015-05-19 Thread Hari Shreedharan (JIRA)
Hari Shreedharan created SPARK-7750: --- Summary: Rename "json" endpoints to "api" endpoints Key: SPARK-7750 URL: https://issues.apache.org/jira/browse/SPARK-7750 Project: Spark Issue Type: Bu

[jira] [Updated] (SPARK-7744) "Distributed matrix" section in MLlib "Data Types" documentation should be reordered.

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7744: - Assignee: Mike Dusenberry > "Distributed matrix" section in MLlib "Data Types" documentation shoul

[jira] [Resolved] (SPARK-7744) "Distributed matrix" section in MLlib "Data Types" documentation should be reordered.

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7744. -- Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 Issue resolved by pull re

[jira] [Updated] (SPARK-7737) parquet schema discovery should not fail because of empty _temporary dir

2015-05-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7737: Assignee: Cheng Lian (was: Yin Huai) > parquet schema discovery should not fail because of empty _temporary

[jira] [Created] (SPARK-7749) Parquet metastore conversion does not use metastore cache

2015-05-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-7749: --- Summary: Parquet metastore conversion does not use metastore cache Key: SPARK-7749 URL: https://issues.apache.org/jira/browse/SPARK-7749 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-7747) Document spark.sql.planner.externalSort option

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7747: --- Assignee: (was: Apache Spark) > Document spark.sql.planner.externalSort option >

[jira] [Assigned] (SPARK-7747) Document spark.sql.planner.externalSort option

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7747: --- Assignee: Apache Spark > Document spark.sql.planner.externalSort option > ---

[jira] [Commented] (SPARK-7747) Document spark.sql.planner.externalSort option

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551524#comment-14551524 ] Apache Spark commented on SPARK-7747: - User 'lucamartinetti' has created a pull reques

[jira] [Created] (SPARK-7748) Graduate spark.ml from alpha

2015-05-19 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7748: Summary: Graduate spark.ml from alpha Key: SPARK-7748 URL: https://issues.apache.org/jira/browse/SPARK-7748 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-7443) MLlib 1.4 QA plan

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7443: - Description: TODO: create JIRAs for each task and assign them accordingly. h2. API * Check API c

[jira] [Created] (SPARK-7747) Document spark.sql.planner.externalSort option

2015-05-19 Thread Luca Martinetti (JIRA)
Luca Martinetti created SPARK-7747: -- Summary: Document spark.sql.planner.externalSort option Key: SPARK-7747 URL: https://issues.apache.org/jira/browse/SPARK-7747 Project: Spark Issue Type:

[jira] [Commented] (SPARK-7711) startTime() is missing

2015-05-19 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551498#comment-14551498 ] holdenk commented on SPARK-7711: I can add this. > startTime() is missing > -

[jira] [Commented] (SPARK-6246) spark-ec2 can't handle clusters with > 100 nodes

2015-05-19 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551486#comment-14551486 ] Shivaram Venkataraman commented on SPARK-6246: -- [~srowen] Could you add [~aly

[jira] [Created] (SPARK-7746) SetFetchSize for JDBCRDD's prepareStatement

2015-05-19 Thread Paul Wu (JIRA)
Paul Wu created SPARK-7746: -- Summary: SetFetchSize for JDBCRDD's prepareStatement Key: SPARK-7746 URL: https://issues.apache.org/jira/browse/SPARK-7746 Project: Spark Issue Type: New Feature

[jira] [Resolved] (SPARK-6246) spark-ec2 can't handle clusters with > 100 nodes

2015-05-19 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-6246. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 626

[jira] [Updated] (SPARK-5480) GraphX pageRank: java.lang.ArrayIndexOutOfBoundsException:

2015-05-19 Thread Niklas Wilcke (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niklas Wilcke updated SPARK-5480: - Affects Version/s: 1.3.1 > GraphX pageRank: java.lang.ArrayIndexOutOfBoundsException: > -

[jira] [Comment Edited] (SPARK-5480) GraphX pageRank: java.lang.ArrayIndexOutOfBoundsException:

2015-05-19 Thread Niklas Wilcke (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551366#comment-14551366 ] Niklas Wilcke edited comment on SPARK-5480 at 5/19/15 11:36 PM:

[jira] [Assigned] (SPARK-7745) Replace assertions with requires (IllegalArgumentException) and modify other state checks

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7745: --- Assignee: (was: Apache Spark) > Replace assertions with requires (IllegalArgumentExceptio

[jira] [Assigned] (SPARK-7745) Replace assertions with requires (IllegalArgumentException) and modify other state checks

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7745: --- Assignee: Apache Spark > Replace assertions with requires (IllegalArgumentException) and modi

[jira] [Commented] (SPARK-7745) Replace assertions with requires (IllegalArgumentException) and modify other state checks

2015-05-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551412#comment-14551412 ] Sean Owen commented on SPARK-7745: -- I tend to agree. There aren't many uses of the assert

[jira] [Commented] (SPARK-7745) Replace assertions with requires (IllegalArgumentException) and modify other state checks

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551409#comment-14551409 ] Apache Spark commented on SPARK-7745: - User 'brkyvz' has created a pull request for th

[jira] [Created] (SPARK-7745) Replace assertions with requires (IllegalArgumentException) and modify other state checks

2015-05-19 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7745: -- Summary: Replace assertions with requires (IllegalArgumentException) and modify other state checks Key: SPARK-7745 URL: https://issues.apache.org/jira/browse/SPARK-7745 P

[jira] [Commented] (SPARK-5480) GraphX pageRank: java.lang.ArrayIndexOutOfBoundsException:

2015-05-19 Thread Niklas Wilcke (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551366#comment-14551366 ] Niklas Wilcke commented on SPARK-5480: -- I'm running Spark 1.3.1 and I'm facing the sa

[jira] [Resolved] (SPARK-7662) Exception of multi-attribute generator anlysis in projection

2015-05-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7662. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6178 [https:/

[jira] [Assigned] (SPARK-7744) "Distributed matrix" section in MLlib "Data Types" documentation should be reordered.

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7744: --- Assignee: (was: Apache Spark) > "Distributed matrix" section in MLlib "Data Types" docume

[jira] [Assigned] (SPARK-7744) "Distributed matrix" section in MLlib "Data Types" documentation should be reordered.

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7744: --- Assignee: Apache Spark > "Distributed matrix" section in MLlib "Data Types" documentation sho

[jira] [Commented] (SPARK-7744) "Distributed matrix" section in MLlib "Data Types" documentation should be reordered.

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551309#comment-14551309 ] Apache Spark commented on SPARK-7744: - User 'dusenberrymw' has created a pull request

[jira] [Created] (SPARK-7744) "Distributed matrix" section in MLlib "Data Types" documentation should be reordered.

2015-05-19 Thread Mike Dusenberry (JIRA)
Mike Dusenberry created SPARK-7744: -- Summary: "Distributed matrix" section in MLlib "Data Types" documentation should be reordered. Key: SPARK-7744 URL: https://issues.apache.org/jira/browse/SPARK-7744

[jira] [Updated] (SPARK-7402) JSON serialization of params

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7402: - Target Version/s: 1.4.1 (was: 1.4.0) > JSON serialization of params > ---

[jira] [Updated] (SPARK-7402) JSON serialization of params

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7402: - Target Version/s: 1.4.1, 1.5.0 (was: 1.4.1) > JSON serialization of params >

[jira] [Closed] (SPARK-7688) PySpark + ipython throws port out of range exception

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-7688. Resolution: Not A Problem It only happens with ipython 3.0.0. Upgrading to ipython 3.1.0 resolved th

[jira] [Updated] (SPARK-7743) Upgrade parquet dependency

2015-05-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7743: --- Component/s: SQL > Upgrade parquet dependency > -- > >

[jira] [Commented] (SPARK-6323) Large rank matrix factorization with Nonlinear loss and constraints

2015-05-19 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551294#comment-14551294 ] Debasish Das commented on SPARK-6323: - Petuum paper that got released today mentioned

[jira] [Created] (SPARK-7743) Upgrade parquet dependency

2015-05-19 Thread Thomas Omans (JIRA)
Thomas Omans created SPARK-7743: --- Summary: Upgrade parquet dependency Key: SPARK-7743 URL: https://issues.apache.org/jira/browse/SPARK-7743 Project: Spark Issue Type: Bug Reporter:

[jira] [Commented] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551285#comment-14551285 ] Apache Spark commented on SPARK-7741: - User 'andrewor14' has created a pull request fo

[jira] [Commented] (SPARK-7237) Many user provided closures are not actually cleaned

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551284#comment-14551284 ] Apache Spark commented on SPARK-7237: - User 'andrewor14' has created a pull request fo

[jira] [Created] (SPARK-7742) Figure out what to do with insertInto w.r.t. DataFrameWriter API

2015-05-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-7742: -- Summary: Figure out what to do with insertInto w.r.t. DataFrameWriter API Key: SPARK-7742 URL: https://issues.apache.org/jira/browse/SPARK-7742 Project: Spark I

[jira] [Resolved] (SPARK-7654) DataFrameReader and DataFrameWriter for input/output API

2015-05-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7654. Resolution: Fixed Fix Version/s: 1.4.0 > DataFrameReader and DataFrameWriter for input/output

[jira] [Updated] (SPARK-7738) DataFrame reader/writer API in Python

2015-05-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7738: --- Issue Type: Sub-task (was: New Feature) Parent: SPARK-6116 > DataFrame reader/writer API in P

[jira] [Resolved] (SPARK-7738) DataFrame reader/writer API in Python

2015-05-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7738. Resolution: Fixed Fix Version/s: 1.4.0 > DataFrame reader/writer API in Python >

[jira] [Commented] (SPARK-7721) Generate test coverage report from Python

2015-05-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551255#comment-14551255 ] Josh Rosen commented on SPARK-7721: --- Codacy doesn't require repo hook access in order to

[jira] [Commented] (SPARK-7721) Generate test coverage report from Python

2015-05-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551252#comment-14551252 ] Reynold Xin commented on SPARK-7721: Would we have permission to use this? > Generate

[jira] [Commented] (SPARK-7721) Generate test coverage report from Python

2015-05-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551249#comment-14551249 ] Josh Rosen commented on SPARK-7721: --- Actually, we should check out Codacy, since they su

[jira] [Commented] (SPARK-7721) Generate test coverage report from Python

2015-05-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551240#comment-14551240 ] Josh Rosen commented on SPARK-7721: --- If we just want to be able to view the coverage rep

[jira] [Resolved] (SPARK-7652) Performance regression in naive Bayes prediction

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7652. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6189 [https://githu

[jira] [Updated] (SPARK-7652) Performance regression in naive Bayes prediction

2015-05-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7652: - Assignee: Liang-Chi Hsieh > Performance regression in naive Bayes prediction > ---

[jira] [Commented] (SPARK-7721) Generate test coverage report from Python

2015-05-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551198#comment-14551198 ] Davies Liu commented on SPARK-7721: --- There are some tools to generate test coverage for

[jira] [Comment Edited] (SPARK-7688) PySpark + ipython throws port out of range exception

2015-05-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551187#comment-14551187 ] Davies Liu edited comment on SPARK-7688 at 5/19/15 8:45 PM: [~

[jira] [Commented] (SPARK-7688) PySpark + ipython throws port out of range exception

2015-05-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551187#comment-14551187 ] Davies Liu commented on SPARK-7688: --- @mengxr, could you work around it, or should be fix

[jira] [Resolved] (SPARK-7586) User guide update for spark.ml Word2Vec

2015-05-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7586. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6181 [https

[jira] [Issue Comment Deleted] (SPARK-7338) Survival Modelling - Cox proportional hazards

2015-05-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7338: - Comment: was deleted (was: User 'davies' has created a pull request for this issue: https:

[jira] [Commented] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551162#comment-14551162 ] Apache Spark commented on SPARK-7741: - User 'tdas' has created a pull request for this

[jira] [Assigned] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7741: --- Assignee: Tathagata Das (was: Apache Spark) > ContextCleaner not used by many DStream operat

[jira] [Assigned] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7741: --- Assignee: Apache Spark (was: Tathagata Das) > ContextCleaner not used by many DStream operat

[jira] [Commented] (SPARK-6246) spark-ec2 can't handle clusters with > 100 nodes

2015-05-19 Thread Alex (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551157#comment-14551157 ] Alex commented on SPARK-6246: - [~shivaram] Done. This is my first PR. Do I have to do anything

[jira] [Commented] (SPARK-6246) spark-ec2 can't handle clusters with > 100 nodes

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551150#comment-14551150 ] Apache Spark commented on SPARK-6246: - User 'alyaxey' has created a pull request for t

[jira] [Assigned] (SPARK-6246) spark-ec2 can't handle clusters with > 100 nodes

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6246: --- Assignee: (was: Apache Spark) > spark-ec2 can't handle clusters with > 100 nodes > --

[jira] [Created] (SPARK-7741) ContextCleaner not used by many DStream operations

2015-05-19 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-7741: Summary: ContextCleaner not used by many DStream operations Key: SPARK-7741 URL: https://issues.apache.org/jira/browse/SPARK-7741 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-6246) spark-ec2 can't handle clusters with > 100 nodes

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6246: --- Assignee: Apache Spark > spark-ec2 can't handle clusters with > 100 nodes > -

[jira] [Created] (SPARK-7740) Improve Evaluator doc to state that higher metric values are better.

2015-05-19 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7740: Summary: Improve Evaluator doc to state that higher metric values are better. Key: SPARK-7740 URL: https://issues.apache.org/jira/browse/SPARK-7740 Project: Spark

[jira] [Created] (SPARK-7739) Improve ChiSqSelector example code in the user guide

2015-05-19 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7739: Summary: Improve ChiSqSelector example code in the user guide Key: SPARK-7739 URL: https://issues.apache.org/jira/browse/SPARK-7739 Project: Spark Issue Type

[jira] [Resolved] (SPARK-7092) Update spark scala version to 2.11.6

2015-05-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7092. Resolution: Fixed Okay this was "re merged" in the SPARK-7726 fix: https://github.com/apach

[jira] [Resolved] (SPARK-7726) Maven Install Breaks When Upgrading Scala 2.11.2-->[2.11.3 or higher]

2015-05-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7726. Resolution: Fixed Fix Version/s: 1.4.0 > Maven Install Breaks When Upgrading Scala 2.

[jira] [Resolved] (SPARK-7701) UDT not working

2015-05-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7701. - Resolution: Not A Problem > UDT not working > --- > > Key: SPA

[jira] [Updated] (SPARK-7738) DataFrame reader/writer API in Python

2015-05-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7738: Summary: DataFrame reader/writer API in Python (was: DataFramer reader/writer API in Python

[jira] [Assigned] (SPARK-7738) DataFramer reader/writer API in Python

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7738: --- Assignee: Apache Spark (was: Davies Liu) > DataFramer reader/writer API in Python >

[jira] [Assigned] (SPARK-7738) DataFramer reader/writer API in Python

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7738: --- Assignee: Davies Liu (was: Apache Spark) > DataFramer reader/writer API in Python >

[jira] [Commented] (SPARK-7738) DataFramer reader/writer API in Python

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551031#comment-14551031 ] Apache Spark commented on SPARK-7738: - User 'davies' has created a pull request for th

[jira] [Comment Edited] (SPARK-7701) UDT not working

2015-05-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551015#comment-14551015 ] Liang-Chi Hsieh edited comment on SPARK-7701 at 5/19/15 6:57 PM: ---

[jira] [Commented] (SPARK-7701) UDT not working

2015-05-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551015#comment-14551015 ] Liang-Chi Hsieh commented on SPARK-7701: I think this ticket can be closed now. >

[jira] [Assigned] (SPARK-7338) Survival Modelling - Cox proportional hazards

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7338: --- Assignee: Apache Spark > Survival Modelling - Cox proportional hazards >

[jira] [Commented] (SPARK-7338) Survival Modelling - Cox proportional hazards

2015-05-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551004#comment-14551004 ] Apache Spark commented on SPARK-7338: - User 'davies' has created a pull request for th
