[jira] [Updated] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-7591: -- Description: # Renaming {{FSBasedRelation}} to {{HadoopFsRelation}} Since itss all coupled with

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Commented] (SPARK-7540) PMML correctness check

2015-05-13 Thread Villu Ruusmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541766#comment-14541766 ] Villu Ruusmann commented on SPARK-7540: --- There are two kinds of tests that should be

[jira] [Resolved] (SPARK-6568) spark-shell.cmd --jars option does not accept the jar that has space in its path

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6568. -- Resolution: Fixed Issue resolved by pull request 5447 [https://github.com/apache/spark/pull/5447]

[jira] [Updated] (SPARK-7598) Add aliveWorkers metrics in Master

2015-05-13 Thread Rex Xiong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rex Xiong updated SPARK-7598: - Summary: Add aliveWorkers metrics in Master (was: Add aliveWorker metrics in Master) Add aliveWorkers

[jira] [Updated] (SPARK-6568) spark-shell.cmd --jars option does not accept the jar that has space in its path

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6568: - Assignee: Masayoshi TSUZUKI spark-shell.cmd --jars option does not accept the jar that has space in its

[jira] [Commented] (SPARK-7556) User guide update for feature transformer: Binarizer

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541640#comment-14541640 ] Apache Spark commented on SPARK-7556: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-7556) User guide update for feature transformer: Binarizer

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7556: --- Assignee: Liang-Chi Hsieh (was: Apache Spark) User guide update for feature transformer:

[jira] [Assigned] (SPARK-7556) User guide update for feature transformer: Binarizer

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7556: --- Assignee: Apache Spark (was: Liang-Chi Hsieh) User guide update for feature transformer:

[jira] [Updated] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-7591: -- Description: # Renaming {{FSBasedRelation}} to {{HadoopFsRelation}} Since itss all coupled with

[jira] [Updated] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-7591: -- Description: - Renaming {{FSBasedRelation}} to {{HadoopFsRelation}} Since itss all coupled with

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-7591: -- Description: # Renaming {{FSBasedRelation}} to {{HadoopFsRelation}} Since itss all coupled with

[jira] [Commented] (SPARK-1537) Add integration with Yarn's Application Timeline Server

2015-05-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541589#comment-14541589 ] Steve Loughran commented on SPARK-1537: --- + YARN-3539 is resolved; the [v1 timeline

[jira] [Created] (SPARK-7598) Add aliveWorker metrics in Master

2015-05-13 Thread Rex Xiong (JIRA)
Rex Xiong created SPARK-7598: Summary: Add aliveWorker metrics in Master Key: SPARK-7598 URL: https://issues.apache.org/jira/browse/SPARK-7598 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Resolved] (SPARK-7522) ML Examples option for dataFormat should not be enclosed in angle brackets

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7522. -- Resolution: Fixed Fix Version/s: 1.3.2 1.2.3 ML Examples option for

[jira] [Comment Edited] (SPARK-5185) pyspark --jars does not add classes to driver class path

2015-05-13 Thread Leonardo Trabuco (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541652#comment-14541652 ] Leonardo Trabuco edited comment on SPARK-5185 at 5/13/15 9:29 AM:

[jira] [Commented] (SPARK-5185) pyspark --jars does not add classes to driver class path

2015-05-13 Thread Leonardo Trabuco (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541652#comment-14541652 ] Leonardo Trabuco commented on SPARK-5185: - We've been setting

[jira] [Commented] (SPARK-7598) Add aliveWorkers metrics in Master

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541659#comment-14541659 ] Apache Spark commented on SPARK-7598: - User 'twilightgod' has created a pull request

[jira] [Assigned] (SPARK-7598) Add aliveWorkers metrics in Master

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7598: --- Assignee: (was: Apache Spark) Add aliveWorkers metrics in Master

[jira] [Commented] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541725#comment-14541725 ] Yanbo Liang commented on SPARK-7536: [MLLIB] Python support for Power Iteration

[jira] [Updated] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-7591: -- Description: # Renaming {{FSBasedRelation}} to {{HadoopFsRelation}} Since itss all coupled with

[jira] [Commented] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541726#comment-14541726 ] Yanbo Liang commented on SPARK-7536: Python API for LDA Audit MLlib Python API for

[jira] [Assigned] (SPARK-7598) Add aliveWorkers metrics in Master

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7598: --- Assignee: Apache Spark Add aliveWorkers metrics in Master

[jira] [Created] (SPARK-7599) Don't restrict customized FSBasedRelation OutputCommitter to be subclass of FileOutputFormat

2015-05-13 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-7599: - Summary: Don't restrict customized FSBasedRelation OutputCommitter to be subclass of FileOutputFormat Key: SPARK-7599 URL: https://issues.apache.org/jira/browse/SPARK-7599

[jira] [Updated] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-7591: -- Summary: FSBasedRelation interface tweaks (was: Rename FsBasedRelation - HadoopFsRelation)

[jira] [Updated] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-7591: -- Description: # Renaming {{FSBasedRelation}} to {{HadoopFsRelation}} Since itss all coupled with

[jira] [Comment Edited] (SPARK-3702) Standardize MLlib classes for learners, models

2015-05-13 Thread Jao Rabary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541864#comment-14541864 ] Jao Rabary edited comment on SPARK-3702 at 5/13/15 12:51 PM: -

[jira] [Commented] (SPARK-4758) Make metastore_db in-memory for HiveContext

2015-05-13 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541865#comment-14541865 ] Santiago M. Mola commented on SPARK-4758: - This could also make testing more

[jira] [Resolved] (SPARK-7561) Install Junit Attachment Plugin on Jenkins

2015-05-13 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp resolved SPARK-7561. Resolution: Pending Closed ok, this is done Install Junit Attachment Plugin on Jenkins

[jira] [Commented] (SPARK-3702) Standardize MLlib classes for learners, models

2015-05-13 Thread Jao Rabary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541864#comment-14541864 ] Jao Rabary commented on SPARK-3702: --- Are unsupervised learning algorithm also concerned

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Resolved] (SPARK-7141) saveAsTextFile() on S3 first creates empty prefix

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7141. -- Resolution: Not A Problem saveAsTextFile() on S3 first creates empty prefix

[jira] [Created] (SPARK-7600) Stopping Streaming Context (sometimes) crashes master

2015-05-13 Thread Marius Soutier (JIRA)
Marius Soutier created SPARK-7600: - Summary: Stopping Streaming Context (sometimes) crashes master Key: SPARK-7600 URL: https://issues.apache.org/jira/browse/SPARK-7600 Project: Spark Issue

[jira] [Assigned] (SPARK-7599) Don't restrict customized FSBasedRelation OutputCommitter to be subclass of FileOutputFormat

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7599: --- Assignee: Cheng Lian (was: Apache Spark) Don't restrict customized FSBasedRelation

[jira] [Assigned] (SPARK-7599) Don't restrict customized FSBasedRelation OutputCommitter to be subclass of FileOutputFormat

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7599: --- Assignee: Apache Spark (was: Cheng Lian) Don't restrict customized FSBasedRelation

[jira] [Assigned] (SPARK-7601) Support Insert into JDBC Datasource

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7601: --- Assignee: (was: Apache Spark) Support Insert into JDBC Datasource

[jira] [Commented] (SPARK-7599) Don't restrict customized FSBasedRelation OutputCommitter to be subclass of FileOutputFormat

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542006#comment-14542006 ] Apache Spark commented on SPARK-7599: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-7601) Support Insert into JDBC Datasource

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542007#comment-14542007 ] Apache Spark commented on SPARK-7601: - User 'gvramana' has created a pull request for

[jira] [Assigned] (SPARK-7601) Support Insert into JDBC Datasource

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7601: --- Assignee: Apache Spark Support Insert into JDBC Datasource

[jira] [Resolved] (SPARK-7599) Don't restrict customized FSBasedRelation OutputCommitter to be subclass of FileOutputFormat

2015-05-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-7599. - Resolution: Fixed Issue resolved by pull request 6118 [https://github.com/apache/spark/pull/6118] Don't

[jira] [Updated] (SPARK-7599) Don't restrict customized FSBasedRelation OutputCommitter to be subclass of FileOutputFormat

2015-05-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7599: Fix Version/s: 1.4.0 Don't restrict customized FSBasedRelation OutputCommitter to be subclass of

[jira] [Commented] (SPARK-7576) User guide update for spark.ml ElementwiseProduct

2015-05-13 Thread Octavian Geagla (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542073#comment-14542073 ] Octavian Geagla commented on SPARK-7576: [~josephkb] Yup, I think this is an easy

[jira] [Created] (SPARK-7601) Support Insert into JDBC Datasource

2015-05-13 Thread Venkata Ramana G (JIRA)
Venkata Ramana G created SPARK-7601: --- Summary: Support Insert into JDBC Datasource Key: SPARK-7601 URL: https://issues.apache.org/jira/browse/SPARK-7601 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6981) [SQL] SparkPlanner and QueryExecution should be factored out from SQLContext

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542038#comment-14542038 ] Apache Spark commented on SPARK-6981: - User 'evacchi' has created a pull request for

[jira] [Commented] (SPARK-6613) Starting stream from checkpoint causes Streaming tab to throw error

2015-05-13 Thread Marius Soutier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14541951#comment-14541951 ] Marius Soutier commented on SPARK-6613: --- It's still happening with 1.3.1. Starting

[jira] [Updated] (SPARK-6613) Starting stream from checkpoint causes Streaming tab to throw error

2015-05-13 Thread Marius Soutier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marius Soutier updated SPARK-6613: -- Affects Version/s: 1.3.1 Starting stream from checkpoint causes Streaming tab to throw error

[jira] [Updated] (SPARK-7601) Support Insert into JDBC Datasource

2015-05-13 Thread Venkata Ramana G (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Venkata Ramana G updated SPARK-7601: Description: Support Insert into JDBCDataSource. Following are usage examples {code}

[jira] [Issue Comment Deleted] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Comment: was deleted (was: Python API for LDA) Audit MLlib Python API for 1.4

[jira] [Issue Comment Deleted] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Comment: was deleted (was: [MLLIB] Python support for Power Iteration Clustering) Audit MLlib

[jira] [Reopened] (SPARK-5081) Shuffle write increases

2015-05-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reopened SPARK-5081: --- Re-opening this for further investigation. [~cbbetz], I think that your issue could be caused by a

[jira] [Created] (SPARK-7611) Support HashJoin if the join condition uses eqNullSafe/=

2015-05-13 Thread David Tolnay (JIRA)
David Tolnay created SPARK-7611: --- Summary: Support HashJoin if the join condition uses eqNullSafe/= Key: SPARK-7611 URL: https://issues.apache.org/jira/browse/SPARK-7611 Project: Spark Issue

[jira] [Comment Edited] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542701#comment-14542701 ] Nicholas Chammas edited comment on SPARK-7606 at 5/13/15 8:57 PM:

[jira] [Assigned] (SPARK-7608) Memory leak in RDDOperationGraphListener

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7608: --- Assignee: (was: Apache Spark) Memory leak in RDDOperationGraphListener

[jira] [Assigned] (SPARK-7608) Memory leak in RDDOperationGraphListener

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7608: --- Assignee: Apache Spark Memory leak in RDDOperationGraphListener

[jira] [Commented] (SPARK-7608) Memory leak in RDDOperationGraphListener

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542382#comment-14542382 ] Apache Spark commented on SPARK-7608: - User 'andrewor14' has created a pull request

[jira] [Created] (SPARK-7609) Add standardized checks for (Model, Estimator) unit tests

2015-05-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7609: Summary: Add standardized checks for (Model, Estimator) unit tests Key: SPARK-7609 URL: https://issues.apache.org/jira/browse/SPARK-7609 Project: Spark

[jira] [Commented] (SPARK-3702) Standardize MLlib classes for learners, models

2015-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542416#comment-14542416 ] Joseph K. Bradley commented on SPARK-3702: -- I would call it a sub-task, but we

[jira] [Commented] (SPARK-7579) User guide update for OneHotEncoder

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542412#comment-14542412 ] Apache Spark commented on SPARK-7579: - User 'sryza' has created a pull request for

[jira] [Assigned] (SPARK-7579) User guide update for OneHotEncoder

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7579: --- Assignee: Apache Spark (was: Sandy Ryza) User guide update for OneHotEncoder

[jira] [Assigned] (SPARK-7579) User guide update for OneHotEncoder

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7579: --- Assignee: Sandy Ryza (was: Apache Spark) User guide update for OneHotEncoder

[jira] [Created] (SPARK-7610) Design clustering abstractions for Pipelines API

2015-05-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7610: Summary: Design clustering abstractions for Pipelines API Key: SPARK-7610 URL: https://issues.apache.org/jira/browse/SPARK-7610 Project: Spark Issue

[jira] [Updated] (SPARK-7576) User guide update for spark.ml ElementwiseProduct

2015-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7576: - Component/s: Documentation User guide update for spark.ml ElementwiseProduct

[jira] [Commented] (SPARK-7576) User guide update for spark.ml ElementwiseProduct

2015-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542431#comment-14542431 ] Joseph K. Bradley commented on SPARK-7576: -- Great, thanks! User guide update

[jira] [Created] (SPARK-7628) DAG visualization: position graphs with semantic awareness

2015-05-13 Thread Andrew Or (JIRA)
Andrew Or created SPARK-7628: Summary: DAG visualization: position graphs with semantic awareness Key: SPARK-7628 URL: https://issues.apache.org/jira/browse/SPARK-7628 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-7614) CLONE - Master fails on 2.11 with compilation error

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7614. -- Resolution: Duplicate Fix Version/s: (was: 1.4.0) There's no need to open a duplicate. If

[jira] [Commented] (SPARK-7614) CLONE - Master fails on 2.11 with compilation error

2015-05-13 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542596#comment-14542596 ] Jianshi Huang commented on SPARK-7614: -- Yeah, it seems I cannot reopen 7399. That's

[jira] [Commented] (SPARK-7399) Master fails on 2.11 with compilation error

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542637#comment-14542637 ] Apache Spark commented on SPARK-7399: - User 'andrewor14' has created a pull request

[jira] [Updated] (SPARK-7615) Word2Vec wordVectors divided by Euclidean Norm equals to zero

2015-05-13 Thread Eric Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Li updated SPARK-7615: --- Summary: Word2Vec wordVectors divided by Euclidean Norm equals to zero (was: WordVector divided by

[jira] [Updated] (SPARK-7615) WordVector divided by Euclidean Norm equals to zero

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7615: - Priority: Minor (was: Major) WordVector divided by Euclidean Norm equals to zero

[jira] [Commented] (SPARK-6785) DateUtils can not handle date before 1970/01/01 correctly

2015-05-13 Thread Christian Kadner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542697#comment-14542697 ] Christian Kadner commented on SPARK-6785: - Hi Patrick, I would like to work on

[jira] [Updated] (SPARK-7545) Bernoulli NaiveBayes should validate data

2015-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7545: - Fix Version/s: 1.4.0 Bernoulli NaiveBayes should validate data

[jira] [Commented] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542615#comment-14542615 ] Reynold Xin commented on SPARK-7606: Doesn't look like there is a standard way to

[jira] [Created] (SPARK-7618) Word2VecModel cache normalized wordVectors to speed up findSynonyms

2015-05-13 Thread Eric Li (JIRA)
Eric Li created SPARK-7618: -- Summary: Word2VecModel cache normalized wordVectors to speed up findSynonyms Key: SPARK-7618 URL: https://issues.apache.org/jira/browse/SPARK-7618 Project: Spark Issue

[jira] [Resolved] (SPARK-7551) Don't split by dot if within backticks for DataFrame attribute resolution

2015-05-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7551. Resolution: Fixed Fix Version/s: 1.4.0 Don't split by dot if within backticks for DataFrame

[jira] [Commented] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542701#comment-14542701 ] Nicholas Chammas commented on SPARK-7606: - Just looked into this. If we are using

[jira] [Created] (SPARK-7616) Overwriting a partitioned parquet table corrupt data

2015-05-13 Thread Yin Huai (JIRA)
Yin Huai created SPARK-7616: --- Summary: Overwriting a partitioned parquet table corrupt data Key: SPARK-7616 URL: https://issues.apache.org/jira/browse/SPARK-7616 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-7593) Python API for Bucketizer

2015-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7593: - Fix Version/s: 1.4.0 Python API for Bucketizer -

[jira] [Closed] (SPARK-7510) DAG visualization: Arrows should not cover text

2015-05-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7510. Resolution: Won't Fix Closing this as a won't fix because I tried fixing it and the effects aren't really

[jira] [Updated] (SPARK-7399) Master fails on 2.11 with compilation error

2015-05-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7399: - Priority: Blocker (was: Major) Master fails on 2.11 with compilation error

[jira] [Updated] (SPARK-7615) MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2015-05-13 Thread Eric Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Li updated SPARK-7615: --- Summary: MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero (was: Word2Vec wordVectors

[jira] [Assigned] (SPARK-7578) User guide update for spark.ml IDF, Normalizer, StandardScaler

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7578: --- Assignee: Joseph K. Bradley (was: Apache Spark) User guide update for spark.ml IDF,

[jira] [Commented] (SPARK-7578) User guide update for spark.ml IDF, Normalizer, StandardScaler

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542459#comment-14542459 ] Apache Spark commented on SPARK-7578: - User 'jkbradley' has created a pull request for

[jira] [Created] (SPARK-7612) Use BLAS in naive Bayes training

2015-05-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7612: Summary: Use BLAS in naive Bayes training Key: SPARK-7612 URL: https://issues.apache.org/jira/browse/SPARK-7612 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4356) Test Scala 2.11 on Jenkins

2015-05-13 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542603#comment-14542603 ] Jianshi Huang commented on SPARK-4356: -- When can we have 2.11 build tests in Jenkins?

[jira] [Updated] (SPARK-7606) Document all PySpark SQL/DataFrame public methods with @since tag

2015-05-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7606: --- Assignee: (was: Reynold Xin) Document all PySpark SQL/DataFrame public methods with @since tag

[jira] [Reopened] (SPARK-7399) Master fails on 2.11 with compilation error

2015-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-7399: -- Reopening due to https://github.com/apache/spark/pull/5966#issuecomment-101712549 ; [~andrewor14] is on

[jira] [Created] (SPARK-7617) Word2VecModel fVector not normalized

2015-05-13 Thread Eric Li (JIRA)
Eric Li created SPARK-7617: -- Summary: Word2VecModel fVector not normalized Key: SPARK-7617 URL: https://issues.apache.org/jira/browse/SPARK-7617 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-6837) SparkR failure in processClosure

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6837: --- Assignee: Apache Spark SparkR failure in processClosure

[jira] [Created] (SPARK-7613) Serialization fails in pyspark for lambdas referencing class data members

2015-05-13 Thread Nate Crosswhite (JIRA)
Nate Crosswhite created SPARK-7613: -- Summary: Serialization fails in pyspark for lambdas referencing class data members Key: SPARK-7613 URL: https://issues.apache.org/jira/browse/SPARK-7613 Project:

[jira] [Resolved] (SPARK-7593) Python API for Bucketizer

2015-05-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7593. -- Resolution: Fixed Issue resolved by pull request 6124

[jira] [Comment Edited] (SPARK-1865) Improve behavior of cleanup of disk state

2015-05-13 Thread Nick Poorman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542641#comment-14542641 ] Nick Poorman edited comment on SPARK-1865 at 5/13/15 8:29 PM: --

[jira] [Commented] (SPARK-7567) Migrating Parquet data source to FSBasedRelation

2015-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14542668#comment-14542668 ] Apache Spark commented on SPARK-7567: - User 'yhuai' has created a pull request for

  1   2   3   >