[jira] [Updated] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-15736: --- Summary: Gracefully handle loss of DiskStore files (was: Gracefully handle loss of cached RDDs'

[jira] [Resolved] (SPARK-15515) Error Handling in Running SQL Directly On Files

2016-06-02 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-15515. -- Resolution: Fixed Fix Version/s: 2.0.0 > Error Handling in Running SQL Directly On

[jira] [Created] (SPARK-15737) Fix Jetty server start warning

2016-06-02 Thread Bo Meng (JIRA)
Bo Meng created SPARK-15737: --- Summary: Fix Jetty server start warning Key: SPARK-15737 URL: https://issues.apache.org/jira/browse/SPARK-15737 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-15547) Encoder validation is too strict for inner nested structs

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15547: Assignee: Wenchen Fan (was: Apache Spark) > Encoder validation is too strict for inner

[jira] [Assigned] (SPARK-15547) Encoder validation is too strict for inner nested structs

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15547: Assignee: Apache Spark (was: Wenchen Fan) > Encoder validation is too strict for inner

[jira] [Created] (SPARK-15738) PySpark ml.feature RFormula missing string representation displaying formula

2016-06-02 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-15738: Summary: PySpark ml.feature RFormula missing string representation displaying formula Key: SPARK-15738 URL: https://issues.apache.org/jira/browse/SPARK-15738

[jira] [Assigned] (SPARK-15735) Allow specifying min time to run in microbenchmarks

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15735: Assignee: (was: Apache Spark) > Allow specifying min time to run in microbenchmarks >

[jira] [Assigned] (SPARK-15735) Allow specifying min time to run in microbenchmarks

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15735: Assignee: Apache Spark > Allow specifying min time to run in microbenchmarks >

[jira] [Commented] (SPARK-15735) Allow specifying min time to run in microbenchmarks

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15312908#comment-15312908 ] Apache Spark commented on SPARK-15735: -- User 'ericl' has created a pull request for this issue:

[jira] [Issue Comment Deleted] (SPARK-15722) Wrong data when CTAS specifies schema

2016-06-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15722: -- Comment: was deleted (was: User 'andrewor14' has created a pull request for this issue:

[jira] [Updated] (SPARK-15716) Memory usage keep growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Chen updated SPARK-15716: - Description: Code: {code:java} import org.apache.hadoop.io.LongWritable; import

[jira] [Commented] (SPARK-15722) Wrong data when CTAS specifies schema

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313131#comment-15313131 ] Apache Spark commented on SPARK-15722: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-15716) Memory usage keep growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15312450#comment-15312450 ] Yan Chen edited comment on SPARK-15716 at 6/2/16 9:58 PM: -- We actually ran jmap

[jira] [Updated] (SPARK-15716) Memory usage of driver keep growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Chen updated SPARK-15716: - Summary: Memory usage of driver keep growing up in Spark Streaming (was: Memory usage keep growing up

[jira] [Comment Edited] (SPARK-15716) Memory usage of driver keeps growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313116#comment-15313116 ] Yan Chen edited comment on SPARK-15716 at 6/2/16 9:59 PM: -- I tried to run it

[jira] [Created] (SPARK-15739) Expose aggregateMessagesWithActiveSet to users.

2016-06-02 Thread Anderson de Andrade (JIRA)
Anderson de Andrade created SPARK-15739: --- Summary: Expose aggregateMessagesWithActiveSet to users. Key: SPARK-15739 URL: https://issues.apache.org/jira/browse/SPARK-15739 Project: Spark

[jira] [Updated] (SPARK-15684) Not mask startsWith and endsWith in R

2016-06-02 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miao Wang updated SPARK-15684: -- Priority: Major (was: Minor) > Not mask startsWith and endsWith in R >

[jira] [Updated] (SPARK-15737) Fix Jetty server start warning

2016-06-02 Thread Bo Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Meng updated SPARK-15737: Component/s: (was: SQL) Spark Core > Fix Jetty server start warning >

[jira] [Updated] (SPARK-15716) Memory usage of driver keeps growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Chen updated SPARK-15716: - Summary: Memory usage of driver keeps growing up in Spark Streaming (was: Memory usage of driver keep

[jira] [Updated] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-15736: --- Affects Version/s: 1.6.0 > Gracefully handle loss of DiskStore files >

[jira] [Commented] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313196#comment-15313196 ] Apache Spark commented on SPARK-15736: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Updated] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-15736: --- Target Version/s: 1.6.2, 2.0.0 (was: 2.0.0) > Gracefully handle loss of DiskStore files >

[jira] [Updated] (SPARK-15716) Memory usage keep growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Chen updated SPARK-15716: - Description: Code: {code:java} import org.apache.hadoop.io.LongWritable; import

[jira] [Commented] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313088#comment-15313088 ] Apache Spark commented on SPARK-15736: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Commented] (SPARK-15737) Fix Jetty server start warning

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313108#comment-15313108 ] Apache Spark commented on SPARK-15737: -- User 'bomeng' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15737) Fix Jetty server start warning

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15737: Assignee: (was: Apache Spark) > Fix Jetty server start warning >

[jira] [Updated] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-15740: -- Description: [~andrewor14] noticed some OOM errors caused by "test big model load / save" in

[jira] [Created] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-15740: - Summary: Word2VecSuite "big model load / save" caused OOM in maven jenkins builds Key: SPARK-15740 URL: https://issues.apache.org/jira/browse/SPARK-15740 Project:

[jira] [Assigned] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15740: Assignee: Apache Spark > Word2VecSuite "big model load / save" caused OOM in maven

[jira] [Commented] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313195#comment-15313195 ] Apache Spark commented on SPARK-15740: -- User 'mengxr' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15740: Assignee: (was: Apache Spark) > Word2VecSuite "big model load / save" caused OOM in

[jira] [Created] (SPARK-15735) Allow specifying min time to run in microbenchmarks

2016-06-02 Thread Eric Liang (JIRA)
Eric Liang created SPARK-15735: -- Summary: Allow specifying min time to run in microbenchmarks Key: SPARK-15735 URL: https://issues.apache.org/jira/browse/SPARK-15735 Project: Spark Issue Type:

[jira] [Updated] (SPARK-15734) Avoids printing internal row in explain output

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15734: --- Assignee: Sean Zhong > Avoids printing internal row in explain output >

[jira] [Commented] (SPARK-15547) Encoder validation is too strict for inner nested structs

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313100#comment-15313100 ] Apache Spark commented on SPARK-15547: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15684) Not mask startsWith and endsWith in R

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15684: Assignee: Apache Spark > Not mask startsWith and endsWith in R >

[jira] [Comment Edited] (SPARK-15716) Memory usage keep growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313116#comment-15313116 ] Yan Chen edited comment on SPARK-15716 at 6/2/16 9:43 PM: -- I tried to run it

[jira] [Assigned] (SPARK-15684) Not mask startsWith and endsWith in R

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15684: Assignee: (was: Apache Spark) > Not mask startsWith and endsWith in R >

[jira] [Commented] (SPARK-15684) Not mask startsWith and endsWith in R

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313130#comment-15313130 ] Apache Spark commented on SPARK-15684: -- User 'wangmiao1981' has created a pull request for this

[jira] [Commented] (SPARK-15710) Exception with WHERE clause in SQL for non-default Hive database

2016-06-02 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313193#comment-15313193 ] Xin Wu commented on SPARK-15710: hmm.. after another rebase of the master. it seems that the problem is

[jira] [Resolved] (SPARK-15711) Ban CREATE TEMP TABLE USING AS SELECT for now

2016-06-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15711. --- Resolution: Fixed Fix Version/s: 2.0.0 > Ban CREATE TEMP TABLE USING AS SELECT for now >

[jira] [Created] (SPARK-15736) Gracefully handle loss of cached RDDs' on-disk files

2016-06-02 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-15736: -- Summary: Gracefully handle loss of cached RDDs' on-disk files Key: SPARK-15736 URL: https://issues.apache.org/jira/browse/SPARK-15736 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15736: Assignee: Josh Rosen (was: Apache Spark) > Gracefully handle loss of DiskStore files >

[jira] [Assigned] (SPARK-15547) Encoder validation is too strict for inner nested structs

2016-06-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-15547: --- Assignee: Wenchen Fan (was: Cheng Lian) > Encoder validation is too strict for inner

[jira] [Assigned] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15736: Assignee: Apache Spark (was: Josh Rosen) > Gracefully handle loss of DiskStore files >

[jira] [Commented] (SPARK-15716) Memory usage keep growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313116#comment-15313116 ] Yan Chen commented on SPARK-15716: -- I tried to run it again, with only 500M of memory on both driver and

[jira] [Comment Edited] (SPARK-15716) Memory usage keep growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313116#comment-15313116 ] Yan Chen edited comment on SPARK-15716 at 6/2/16 9:57 PM: -- I tried to run it

[jira] [Assigned] (SPARK-15739) Expose aggregateMessagesWithActiveSet to users.

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15739: Assignee: Apache Spark > Expose aggregateMessagesWithActiveSet to users. >

[jira] [Commented] (SPARK-15739) Expose aggregateMessagesWithActiveSet to users.

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313188#comment-15313188 ] Apache Spark commented on SPARK-15739: -- User 'adeandrade' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15739) Expose aggregateMessagesWithActiveSet to users.

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15739: Assignee: (was: Apache Spark) > Expose aggregateMessagesWithActiveSet to users. >

[jira] [Commented] (SPARK-15734) Avoids printing internal row in explain output

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15312889#comment-15312889 ] Apache Spark commented on SPARK-15734: -- User 'clockfly' has created a pull request for this issue:

[jira] [Commented] (SPARK-15009) PySpark CountVectorizerModel should be able to construct from vocabulary list

2016-06-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313030#comment-15313030 ] Bryan Cutler commented on SPARK-15009: -- note - also similar constructor for StringIndexerModel >

[jira] [Updated] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-15736: --- Description: If an RDD partition is cached on disk and the DiskStore file is lost, then reads of

[jira] [Comment Edited] (SPARK-15716) Memory usage keep growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313116#comment-15313116 ] Yan Chen edited comment on SPARK-15716 at 6/2/16 9:51 PM: -- I tried to run it

[jira] [Resolved] (SPARK-15728) Rename aggregate operators: HashAggregate and SortAggregate

2016-06-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15728. - Resolution: Fixed Fix Version/s: 2.0.0 > Rename aggregate operators: HashAggregate and

[jira] [Assigned] (SPARK-15737) Fix Jetty server start warning

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15737: Assignee: Apache Spark > Fix Jetty server start warning > --

[jira] [Comment Edited] (SPARK-15716) Memory usage keep growing up in Spark Streaming

2016-06-02 Thread Yan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313116#comment-15313116 ] Yan Chen edited comment on SPARK-15716 at 6/2/16 9:45 PM: -- I tried to run it

[jira] [Updated] (SPARK-15739) Expose aggregateMessagesWithActiveSet to users.

2016-06-02 Thread Anderson de Andrade (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anderson de Andrade updated SPARK-15739: Description: The current version of Pregel has some flaws: * Each iteration

[jira] [Resolved] (SPARK-15719) Disable writing Parquet summary files by default

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15719. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13455

[jira] [Comment Edited] (SPARK-15705) Spark won't read ORC schema from metastore for partitioned tables

2016-06-02 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313257#comment-15313257 ] Xin Wu edited comment on SPARK-15705 at 6/2/16 11:15 PM: - I can recreate it now.

[jira] [Assigned] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15688: Assignee: (was: Apache Spark) > RelationalGroupedDataset.toDF should not add group by

[jira] [Commented] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313306#comment-15313306 ] Apache Spark commented on SPARK-15688: -- User 'dilipbiswal' has created a pull request for this

[jira] [Assigned] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15688: Assignee: Apache Spark > RelationalGroupedDataset.toDF should not add group by

[jira] [Resolved] (SPARK-15668) ml.feature: update check schema to avoid confusion when user use MLlib.vector as input type

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-15668. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13411

[jira] [Created] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-02 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-15742: -- Summary: Reduce collections allocations in Catalyst tree transformation methods Key: SPARK-15742 URL: https://issues.apache.org/jira/browse/SPARK-15742 Project: Spark

[jira] [Commented] (SPARK-15741) PySpark Cleanup of _setDefault with seed=None

2016-06-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313376#comment-15313376 ] Bryan Cutler commented on SPARK-15741: -- >From what I gathered, explicitly setting a seed to {{None}}

[jira] [Resolved] (SPARK-15734) Avoids printing internal row in explain output

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15734. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13471

[jira] [Commented] (SPARK-15725) Dynamic allocation hangs YARN app when executors time out

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313285#comment-15313285 ] Apache Spark commented on SPARK-15725: -- User 'rdblue' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15725) Dynamic allocation hangs YARN app when executors time out

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15725: Assignee: (was: Apache Spark) > Dynamic allocation hangs YARN app when executors time

[jira] [Assigned] (SPARK-15725) Dynamic allocation hangs YARN app when executors time out

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15725: Assignee: Apache Spark > Dynamic allocation hangs YARN app when executors time out >

[jira] [Commented] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313321#comment-15313321 ] Wenchen Fan commented on SPARK-15732: - It's really hard to support it, and I don't think this corner

[jira] [Created] (SPARK-15743) Prevent saving with all-column partitioning

2016-06-02 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-15743: - Summary: Prevent saving with all-column partitioning Key: SPARK-15743 URL: https://issues.apache.org/jira/browse/SPARK-15743 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-15718) better error message for writing bucketing data

2016-06-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15718. --- Resolution: Fixed Fix Version/s: 2.0.0 > better error message for writing bucketing data >

[jira] [Comment Edited] (SPARK-13868) Random forest accuracy exploration

2016-06-02 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313400#comment-15313400 ] Xusen Yin edited comment on SPARK-13868 at 6/3/16 12:40 AM: [~josephkb]

[jira] [Updated] (SPARK-15744) Rename two TungstenAggregation*Suites and update error messages/comments

2016-06-02 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-15744: -- Description: For consistency, this issue updates some remaining

[jira] [Updated] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15732: --- Assignee: Wenchen Fan > Dataset generated code "generated.java" Fails with Certain Case Classes >

[jira] [Resolved] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15732. Resolution: Fixed Fix Version/s: 2.0.0 Resolved by

[jira] [Comment Edited] (SPARK-14811) ML, Graph 2.0 QA: API: New Scala APIs, docs

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313208#comment-15313208 ] Nick Pentreath edited comment on SPARK-14811 at 6/2/16 10:31 PM: -

[jira] [Comment Edited] (SPARK-14811) ML, Graph 2.0 QA: API: New Scala APIs, docs

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313208#comment-15313208 ] Nick Pentreath edited comment on SPARK-14811 at 6/2/16 10:31 PM: -

[jira] [Commented] (SPARK-14811) ML, Graph 2.0 QA: API: New Scala APIs, docs

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313208#comment-15313208 ] Nick Pentreath commented on SPARK-14811: Question on this - we seem to be inconsistent with the

[jira] [Commented] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Bo Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313230#comment-15313230 ] Bo Meng commented on SPARK-15732: - There is no easy way to work around this issue since "abstract" is a

[jira] [Resolved] (SPARK-15139) PySpark TreeEnsemble missing methods

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-15139. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12919

[jira] [Updated] (SPARK-15139) PySpark TreeEnsemble missing methods

2016-06-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-15139: --- Assignee: holdenk > PySpark TreeEnsemble missing methods >

[jira] [Created] (SPARK-15744) Rename two TungstenAggregation*Suites and update error messages/comments

2016-06-02 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-15744: - Summary: Rename two TungstenAggregation*Suites and update error messages/comments Key: SPARK-15744 URL: https://issues.apache.org/jira/browse/SPARK-15744 Project:

[jira] [Resolved] (SPARK-15736) Gracefully handle loss of DiskStore files

2016-06-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15736. --- Resolution: Fixed Fix Version/s: 2.0.0 1.6.2 > Gracefully handle loss of

[jira] [Commented] (SPARK-15722) Wrong data when CTAS specifies schema

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313438#comment-15313438 ] Apache Spark commented on SPARK-15722: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Commented] (SPARK-15738) PySpark ml.feature RFormula missing string representation displaying formula

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313222#comment-15313222 ] Apache Spark commented on SPARK-15738: -- User 'BryanCutler' has created a pull request for this

[jira] [Assigned] (SPARK-15738) PySpark ml.feature RFormula missing string representation displaying formula

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15738: Assignee: (was: Apache Spark) > PySpark ml.feature RFormula missing string

[jira] [Assigned] (SPARK-15738) PySpark ml.feature RFormula missing string representation displaying formula

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15738: Assignee: Apache Spark > PySpark ml.feature RFormula missing string representation

[jira] [Created] (SPARK-15741) PySpark Cleanup of _setDefault with seed=None

2016-06-02 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-15741: Summary: PySpark Cleanup of _setDefault with seed=None Key: SPARK-15741 URL: https://issues.apache.org/jira/browse/SPARK-15741 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-15743) Prevent saving with all-column partitioning

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15743: Assignee: Apache Spark > Prevent saving with all-column partitioning >

[jira] [Commented] (SPARK-15743) Prevent saving with all-column partitioning

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313381#comment-15313381 ] Apache Spark commented on SPARK-15743: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-15743) Prevent saving with all-column partitioning

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15743: Assignee: (was: Apache Spark) > Prevent saving with all-column partitioning >

[jira] [Created] (SPARK-15745) Use classloader's getResource() for reading resource files in HiveTests

2016-06-02 Thread Sameer Agarwal (JIRA)
Sameer Agarwal created SPARK-15745: -- Summary: Use classloader's getResource() for reading resource files in HiveTests Key: SPARK-15745 URL: https://issues.apache.org/jira/browse/SPARK-15745 Project:

[jira] [Commented] (SPARK-15623) 2.0 python coverage ml.feature

2016-06-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313226#comment-15313226 ] Bryan Cutler commented on SPARK-15623: -- I took another spin through this and linked a couple of

[jira] [Commented] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313355#comment-15313355 ] Apache Spark commented on SPARK-15742: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15732: Assignee: Apache Spark > Dataset generated code "generated.java" Fails with Certain Case

[jira] [Assigned] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15732: Assignee: (was: Apache Spark) > Dataset generated code "generated.java" Fails with

[jira] [Assigned] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15742: Assignee: Josh Rosen (was: Apache Spark) > Reduce collections allocations in Catalyst

[jira] [Assigned] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15742: Assignee: Apache Spark (was: Josh Rosen) > Reduce collections allocations in Catalyst

[jira] [Commented] (SPARK-15732) Dataset generated code "generated.java" Fails with Certain Case Classes

2016-06-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313354#comment-15313354 ] Apache Spark commented on SPARK-15732: -- User 'cloud-fan' has created a pull request for this issue:

  1   2   3   >