[jira] [Commented] (SPARK-16664) Spark 1.6.2 - Persist call on Data frames with more than 200 columns is wiping out the data.

2016-07-22 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390562#comment-15390562 ] Dongjoon Hyun commented on SPARK-16664: --- Oh, I missed the 201 cases. Sorry. > Spark 1.6.2 -

[jira] [Updated] (SPARK-16692) multilabel classification to DataFrame, ML

2016-07-22 Thread Weizhi Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weizhi Li updated SPARK-16692: -- Summary: multilabel classification to DataFrame, ML (was: multi labels evaluations to Dataframe. )

[jira] [Created] (SPARK-16692) multi labels evaluations to Dataframe.

2016-07-22 Thread Weizhi Li (JIRA)
Weizhi Li created SPARK-16692: - Summary: multi labels evaluations to Dataframe. Key: SPARK-16692 URL: https://issues.apache.org/jira/browse/SPARK-16692 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-16664) Spark 1.6.2 - Persist call on Data frames with more than 200 columns is wiping out the data.

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16664: Assignee: (was: Apache Spark) > Spark 1.6.2 - Persist call on Data frames with more

[jira] [Assigned] (SPARK-16664) Spark 1.6.2 - Persist call on Data frames with more than 200 columns is wiping out the data.

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16664: Assignee: Apache Spark > Spark 1.6.2 - Persist call on Data frames with more than 200

[jira] [Commented] (SPARK-16664) Spark 1.6.2 - Persist call on Data frames with more than 200 columns is wiping out the data.

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390534#comment-15390534 ] Apache Spark commented on SPARK-16664: -- User 'breakdawn' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-16664) Spark 1.6.2 - Persist call on Data frames with more than 200 columns is wiping out the data.

2016-07-22 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390525#comment-15390525 ] Liwei Lin edited comment on SPARK-16664 at 7/23/16 4:24 AM: I think I've

[jira] [Commented] (SPARK-16664) Spark 1.6.2 - Persist call on Data frames with more than 200 columns is wiping out the data.

2016-07-22 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390525#comment-15390525 ] Liwei Lin commented on SPARK-16664: --- I've found the root cause; will submit a patch shortly. > Spark

[jira] [Created] (SPARK-16691) move BucketSpec to catalyst module and use it in CatalogTable

2016-07-22 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-16691: --- Summary: move BucketSpec to catalyst module and use it in CatalogTable Key: SPARK-16691 URL: https://issues.apache.org/jira/browse/SPARK-16691 Project: Spark

[jira] [Assigned] (SPARK-16675) Avoid per-record type dispatch in JDBC when writing

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16675: Assignee: Apache Spark > Avoid per-record type dispatch in JDBC when writing >

[jira] [Assigned] (SPARK-16675) Avoid per-record type dispatch in JDBC when writing

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16675: Assignee: (was: Apache Spark) > Avoid per-record type dispatch in JDBC when writing >

[jira] [Commented] (SPARK-16675) Avoid per-record type dispatch in JDBC when writing

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390489#comment-15390489 ] Apache Spark commented on SPARK-16675: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Updated] (SPARK-16622) Fix NullPointerException when the returned value of the called method in Invoke is null

2016-07-22 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-16622: Assignee: Liang-Chi Hsieh > Fix NullPointerException when the returned value of the called method

[jira] [Resolved] (SPARK-16622) Fix NullPointerException when the returned value of the called method in Invoke is null

2016-07-22 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-16622. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14259

[jira] [Assigned] (SPARK-16690) rename SQLTestUtils.withTempTable to withTempView

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16690: Assignee: Apache Spark (was: Wenchen Fan) > rename SQLTestUtils.withTempTable to

[jira] [Commented] (SPARK-16690) rename SQLTestUtils.withTempTable to withTempView

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390474#comment-15390474 ] Apache Spark commented on SPARK-16690: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-16690) rename SQLTestUtils.withTempTable to withTempView

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16690: Assignee: Wenchen Fan (was: Apache Spark) > rename SQLTestUtils.withTempTable to

[jira] [Created] (SPARK-16690) rename SQLTestUtils.withTempTable to withTempView

2016-07-22 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-16690: --- Summary: rename SQLTestUtils.withTempTable to withTempView Key: SPARK-16690 URL: https://issues.apache.org/jira/browse/SPARK-16690 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-16646) LEAST doesn't accept numeric arguments with different data types

2016-07-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389273#comment-15389273 ] Hyukjin Kwon edited comment on SPARK-16646 at 7/23/16 1:53 AM: --- It seems

[jira] [Commented] (SPARK-16689) FileSourceStrategy: Pruning Partition Columns When No Partition Column Exist in Project

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390449#comment-15390449 ] Apache Spark commented on SPARK-16689: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-16689) FileSourceStrategy: Pruning Partition Columns When No Partition Column Exist in Project

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16689: Assignee: (was: Apache Spark) > FileSourceStrategy: Pruning Partition Columns When No

[jira] [Assigned] (SPARK-16689) FileSourceStrategy: Pruning Partition Columns When No Partition Column Exist in Project

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16689: Assignee: Apache Spark > FileSourceStrategy: Pruning Partition Columns When No Partition

[jira] [Created] (SPARK-16689) FileSourceStrategy: Pruning Partition Columns When No Partition Column Exist in Project

2016-07-22 Thread Xiao Li (JIRA)
Xiao Li created SPARK-16689: --- Summary: FileSourceStrategy: Pruning Partition Columns When No Partition Column Exist in Project Key: SPARK-16689 URL: https://issues.apache.org/jira/browse/SPARK-16689

[jira] [Commented] (SPARK-16589) Chained cartesian produces incorrect number of records

2016-07-22 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390299#comment-15390299 ] holdenk commented on SPARK-16589: - Yah I think we should explore what's going on in a bit more detail here -

[jira] [Created] (SPARK-16688) OpenHashSet.MAX_CAPACITY is always based on Int even when using Long

2016-07-22 Thread Ben McCann (JIRA)
Ben McCann created SPARK-16688: -- Summary: OpenHashSet.MAX_CAPACITY is always based on Int even when using Long Key: SPARK-16688 URL: https://issues.apache.org/jira/browse/SPARK-16688 Project: Spark

[jira] [Commented] (SPARK-8971) Support balanced class labels when splitting train/cross validation sets

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390242#comment-15390242 ] Apache Spark commented on SPARK-8971: - User 'sethah' has created a pull request for this issue:

[jira] [Commented] (SPARK-16595) Spark History server Rest Api gives Application not found error for yarn-cluster mode

2016-07-22 Thread Weiqing Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390162#comment-15390162 ] Weiqing Yang commented on SPARK-16595: -- This issue is not reproduced. > Spark History server Rest

[jira] [Commented] (SPARK-11702) Guava ClassLoading Issue When Using Different Hive Metastore Version

2016-07-22 Thread Sabyasachi Nayak (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390138#comment-15390138 ] Sabyasachi Nayak commented on SPARK-11702: -- Hi Joey, I am facing the similar issue when I am

[jira] [Resolved] (SPARK-16687) build/mvn fails when fetching mvn

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16687. --- Resolution: Not A Problem

[jira] [Commented] (SPARK-16687) build/mvn fails when fetching mvn

2016-07-22 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15390014#comment-15390014 ] Michael Gummelt commented on SPARK-16687: - thanks! > build/mvn fails when fetching mvn >

[jira] [Updated] (SPARK-16687) build/mvn fails when fetching mvn

2016-07-22 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Gummelt updated SPARK-16687: Description: mvn 3.3.3 no longer exists in the apache.org mirror used by `build/mvn`

[jira] [Created] (SPARK-16687) build/mvn fails when fetching mvn

2016-07-22 Thread Michael Gummelt (JIRA)
Michael Gummelt created SPARK-16687: --- Summary: build/mvn fails when fetching mvn Key: SPARK-16687 URL: https://issues.apache.org/jira/browse/SPARK-16687 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-16685) audit release docs are ambiguous

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16685: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) > audit release docs are

[jira] [Commented] (SPARK-16685) audit release docs are ambiguous

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389956#comment-15389956 ] Sean Owen commented on SPARK-16685: --- I'm not actually sure where this is used now. I don't think even

[jira] [Closed] (SPARK-16431) Add a unified method that accepts single instances to feature transformers and predictors

2016-07-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-16431. - Resolution: Duplicate > Add a unified method that accepts single instances to feature

[jira] [Commented] (SPARK-16431) Add a unified method that accepts single instances to feature transformers and predictors

2016-07-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389954#comment-15389954 ] Joseph K. Bradley commented on SPARK-16431: --- Single-row prediction is something we are working

[jira] [Updated] (SPARK-16686) Dataset.sample with seed: result seems to depend on downstream usage

2016-07-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-16686: -- Attachment: DataFrame.sample bug - 2.0.html > Dataset.sample with seed: result seems

[jira] [Updated] (SPARK-16686) Dataset.sample with seed: result seems to depend on downstream usage

2016-07-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-16686: -- Affects Version/s: 1.6.2 Environment: Spark 1.6.2 and Spark 2.0 - RC4

[jira] [Created] (SPARK-16686) Dataset.sample with seed: result seems to depend on downstream usage

2016-07-22 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-16686: - Summary: Dataset.sample with seed: result seems to depend on downstream usage Key: SPARK-16686 URL: https://issues.apache.org/jira/browse/SPARK-16686

[jira] [Created] (SPARK-16685) audit release docs are ambiguous

2016-07-22 Thread jay vyas (JIRA)
jay vyas created SPARK-16685: Summary: audit release docs are ambiguous Key: SPARK-16685 URL: https://issues.apache.org/jira/browse/SPARK-16685 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-16197) Cleanup PySpark status api and example

2016-07-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-16197: - Description: Cleanup of Status API example to use SparkSession and be more consistent with

[jira] [Assigned] (SPARK-16421) Improve output from ML examples

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16421: Assignee: (was: Apache Spark) > Improve output from ML examples >

[jira] [Assigned] (SPARK-16421) Improve output from ML examples

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16421: Assignee: Apache Spark > Improve output from ML examples >

[jira] [Commented] (SPARK-16421) Improve output from ML examples

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389825#comment-15389825 ] Apache Spark commented on SPARK-16421: -- User 'BryanCutler' has created a pull request for this

[jira] [Comment Edited] (SPARK-4240) Refine Tree Predictions in Gradient Boosting to Improve Prediction Accuracy.

2016-07-22 Thread Vladimir Feinberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15366502#comment-15366502 ] Vladimir Feinberg edited comment on SPARK-4240 at 7/22/16 4:47 PM: ---

[jira] [Commented] (SPARK-16416) Logging in shutdown hook does not work properly with Log4j 2.x

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389800#comment-15389800 ] Apache Spark commented on SPARK-16416: -- User 'mikaelstaldal' has created a pull request for this

[jira] [Assigned] (SPARK-16416) Logging in shutdown hook does not work properly with Log4j 2.x

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16416: Assignee: (was: Apache Spark) > Logging in shutdown hook does not work properly with

[jira] [Assigned] (SPARK-16416) Logging in shutdown hook does not work properly with Log4j 2.x

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16416: Assignee: Apache Spark > Logging in shutdown hook does not work properly with Log4j 2.x >

[jira] [Commented] (SPARK-16416) Logging in shutdown hook does not work properly with Log4j 2.x

2016-07-22 Thread Mikael Ståldal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389802#comment-15389802 ] Mikael Ståldal commented on SPARK-16416: https://github.com/apache/spark/pull/14320 > Logging in

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-07-22 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389797#comment-15389797 ] Cody Koeninger commented on SPARK-12177: This has already been merged for the upcoming Spark 2.0

[jira] [Commented] (SPARK-2183) Avoid loading/shuffling data twice in self-join query

2016-07-22 Thread Khaled Hammouda (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389751#comment-15389751 ] Khaled Hammouda commented on SPARK-2183: [~etang] For us caching the dataframe before performing

[jira] [Created] (SPARK-16684) Standalone mode local dirs not properly cleaned if job is killed

2016-07-22 Thread Dean Wampler (JIRA)
Dean Wampler created SPARK-16684: Summary: Standalone mode local dirs not properly cleaned if job is killed Key: SPARK-16684 URL: https://issues.apache.org/jira/browse/SPARK-16684 Project: Spark

[jira] [Commented] (SPARK-7481) Add spark-cloud module to pull in aws+azure object store FS accessors; test integration

2016-07-22 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389730#comment-15389730 ] Steve Loughran commented on SPARK-7481: --- ps, latest s3a state # [Object stores in

[jira] [Commented] (SPARK-16676) Spark jobs stay in pending

2016-07-22 Thread Joe Chong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389726#comment-15389726 ] Joe Chong commented on SPARK-16676: --- It didn't. How do I troubleshoot. From the attached picture, the

[jira] [Commented] (SPARK-16676) Spark jobs stay in pending

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389704#comment-15389704 ] Sean Owen commented on SPARK-16676: --- Meaning, did the executors actually start? if they didn't, and are

[jira] [Commented] (SPARK-7481) Add spark-cloud module to pull in aws+azure object store FS accessors; test integration

2016-07-22 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389707#comment-15389707 ] Steve Loughran commented on SPARK-7481: --- Sad but true. * The PR I've put up adds the hadoop-aws and

[jira] [Commented] (SPARK-16676) Spark jobs stay in pending

2016-07-22 Thread Joe Chong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389677#comment-15389677 ] Joe Chong commented on SPARK-16676: --- Don't understand what you meant by "Did your executors schedule?".

[jira] [Commented] (SPARK-16635) Provide Session support in the Spark UI

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389673#comment-15389673 ] Apache Spark commented on SPARK-16635: -- User 'nblintao' has created a pull request for this issue:

[jira] [Assigned] (SPARK-16635) Provide Session support in the Spark UI

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16635: Assignee: (was: Apache Spark) > Provide Session support in the Spark UI >

[jira] [Assigned] (SPARK-16635) Provide Session support in the Spark UI

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16635: Assignee: Apache Spark > Provide Session support in the Spark UI >

[jira] [Comment Edited] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-07-22 Thread David Sabater (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389668#comment-15389668 ] David Sabater edited comment on SPARK-12177 at 7/22/16 3:21 PM: I am with

[jira] [Comment Edited] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-07-22 Thread David Sabater (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389668#comment-15389668 ] David Sabater edited comment on SPARK-12177 at 7/22/16 3:19 PM: I am with

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-07-22 Thread David Sabater (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389668#comment-15389668 ] David Sabater commented on SPARK-12177: --- I am with you guys, this is an important feature required

[jira] [Commented] (SPARK-16683) Group by does not work after multiple joins of the same dataframe

2016-07-22 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389628#comment-15389628 ] Jacek Laskowski commented on SPARK-16683: - Hey Witek, can you include the sample code and the

[jira] [Updated] (SPARK-16683) Group by does not work after multiple joins of the same dataframe

2016-07-22 Thread Witold Jędrzejewski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Witold Jędrzejewski updated SPARK-16683: Attachment: Duplicates Problem Presentation.json > Group by does not work after

[jira] [Created] (SPARK-16683) Group by does not work after multiple joins of the same dataframe

2016-07-22 Thread Witold Jędrzejewski (JIRA)
Witold Jędrzejewski created SPARK-16683: --- Summary: Group by does not work after multiple joins of the same dataframe Key: SPARK-16683 URL: https://issues.apache.org/jira/browse/SPARK-16683

[jira] [Commented] (SPARK-7481) Add spark-cloud module to pull in aws+azure object store FS accessors; test integration

2016-07-22 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389589#comment-15389589 ] Nicholas Chammas commented on SPARK-7481: - [~ste...@apache.org] - Some relevant reading for you

[jira] [Created] (SPARK-16682) pyspark 1.6.0 not handling multiple level import when the necessary files are zipped

2016-07-22 Thread Santosh Balasubramanya (JIRA)
Santosh Balasubramanya created SPARK-16682: -- Summary: pyspark 1.6.0 not handling multiple level import when the necessary files are zipped Key: SPARK-16682 URL:

[jira] [Created] (SPARK-16681) Optimizer changes order of filter predicates involving UDFs, which changes semantics

2016-07-22 Thread Stefan Fehrenbach (JIRA)
Stefan Fehrenbach created SPARK-16681: - Summary: Optimizer changes order of filter predicates involving UDFs, which changes semantics Key: SPARK-16681 URL: https://issues.apache.org/jira/browse/SPARK-16681

[jira] [Commented] (SPARK-16595) Spark History server Rest Api gives Application not found error for yarn-cluster mode

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389412#comment-15389412 ] Sean Owen commented on SPARK-16595: --- [~jerryshao] [~WeiqingYang] do you have any thoughts on this in

[jira] [Commented] (SPARK-16664) Spark 1.6.2 - Persist call on Data frames with more than 200 columns is wiping out the data.

2016-07-22 Thread Satish Kolli (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389397#comment-15389397 ] Satish Kolli commented on SPARK-16664: -- [~dongjoon] Problem exists in master also. I tried a nightly

[jira] [Resolved] (SPARK-16651) Document no exception using DataFrame.withColumnRenamed when existing column doesn't exist

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16651. --- Resolution: Fixed Fix Version/s: 2.0.1 Issue resolved by pull request 14288

[jira] [Updated] (SPARK-16651) Document no exception using DataFrame.withColumnRenamed when existing column doesn't exist

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16651: -- Assignee: Dongjoon Hyun Priority: Minor (was: Major) Component/s: Documentation

[jira] [Commented] (SPARK-16380) Update SQL examples and programming guide for Python language binding

2016-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389373#comment-15389373 ] Apache Spark commented on SPARK-16380: -- User 'liancheng' has created a pull request for this issue:

[jira] [Commented] (SPARK-16664) Spark 1.6.2 - Persist call on Data frames with more than 200 columns is wiping out the data.

2016-07-22 Thread Wesley Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389367#comment-15389367 ] Wesley Tang commented on SPARK-16664: - According to the original post, the size should be 201 to

[jira] [Updated] (SPARK-16650) Improve documentation of spark.task.maxFailures

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16650: -- Assignee: Thomas Graves Priority: Minor (was: Major) > Improve documentation of

[jira] [Resolved] (SPARK-16650) Improve documentation of spark.task.maxFailures

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16650. --- Resolution: Fixed Fix Version/s: 2.0.1 Issue resolved by pull request 14287

[jira] [Updated] (SPARK-16487) Some batches might not get marked as fully processed in JobGenerator

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16487: -- Assignee: Ahmed Mahran Priority: Major (was: Trivial) > Some batches might not get marked as

[jira] [Commented] (SPARK-16635) Provide Session support in the Spark UI

2016-07-22 Thread Tao Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389360#comment-15389360 ] Tao Lin commented on SPARK-16635: - Notice that the configuration of SparkSession could be changed during

[jira] [Resolved] (SPARK-16487) Some batches might not get marked as fully processed in JobGenerator

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16487. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14145

[jira] [Commented] (SPARK-16659) use Maven project to submit spark application via yarn-client

2016-07-22 Thread Jack Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389331#comment-15389331 ] Jack Jiang commented on SPARK-16659: i have fixed it > use Maven project to submit spark application

[jira] [Comment Edited] (SPARK-5992) Locality Sensitive Hashing (LSH) for MLlib

2016-07-22 Thread tony (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389321#comment-15389321 ] tony edited comment on SPARK-5992 at 7/22/16 11:08 AM: --- Hi, I am new to spark. How

[jira] [Comment Edited] (SPARK-5992) Locality Sensitive Hashing (LSH) for MLlib

2016-07-22 Thread tony (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389321#comment-15389321 ] tony edited comment on SPARK-5992 at 7/22/16 11:03 AM: --- Hi, I am new to spark. How

[jira] [Commented] (SPARK-5992) Locality Sensitive Hashing (LSH) for MLlib

2016-07-22 Thread tony (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389321#comment-15389321 ] tony commented on SPARK-5992: - Hi, I am new to spark. How can I make a contribution to Spark LSH

[jira] [Commented] (SPARK-8871) Add maximal frequent itemsets filter in Spark MLib FPGrowth

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389312#comment-15389312 ] Sean Owen commented on SPARK-8871: -- Is this the same as https://issues.apache.org/jira/browse/SPARK-6143

[jira] [Commented] (SPARK-16646) LEAST doesn't accept numeric arguments with different data types

2016-07-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389276#comment-15389276 ] Hyukjin Kwon commented on SPARK-16646: -- [~lian cheng] Should we also follow this? I will follow your

[jira] [Commented] (SPARK-16646) LEAST doesn't accept numeric arguments with different data types

2016-07-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389273#comment-15389273 ] Hyukjin Kwon commented on SPARK-16646: -- It seems basically comparison between numbers and decimal,

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2016-07-22 Thread Avik Sil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389189#comment-15389189 ] Avik Sil edited comment on SPARK-15544 at 7/22/16 9:11 AM: --- I am also seeing

[jira] [Commented] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2016-07-22 Thread Avik Sil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389189#comment-15389189 ] Avik Sil commented on SPARK-15544: -- I am also seeing the same issue with spark 1.3.0, ubuntu 14.04,

[jira] [Updated] (SPARK-8871) Add maximal frequent itemsets filter in Spark MLib FPGrowth

2016-07-22 Thread Jonathan Svirsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Svirsky updated SPARK-8871: Issue Type: Improvement (was: New Feature) > Add maximal frequent itemsets filter in Spark

[jira] [Commented] (SPARK-16628) OrcConversions should not convert an ORC table represented by MetastoreRelation to HadoopFsRelation if metastore schema does not match schema stored in ORC files

2016-07-22 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389173#comment-15389173 ] Liang-Chi Hsieh commented on SPARK-16628: - I think it depends whether Hive also writes wrong

[jira] [Commented] (SPARK-16680) Set spark.driver.userClassPathFirst=true, and run spark-sql failed

2016-07-22 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389168#comment-15389168 ] Dongjoon Hyun commented on SPARK-16680: --- Hi, [~KaiXinXIaoLei]. According to the log, it looks like

[jira] [Created] (SPARK-16680) Set spark.driver.userClassPathFirst=true, and run spark-sql failed

2016-07-22 Thread KaiXinXIaoLei (JIRA)
KaiXinXIaoLei created SPARK-16680: - Summary: Set spark.driver.userClassPathFirst=true, and run spark-sql failed Key: SPARK-16680 URL: https://issues.apache.org/jira/browse/SPARK-16680 Project: Spark

[jira] [Commented] (SPARK-16646) LEAST doesn't accept numeric arguments with different data types

2016-07-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389145#comment-15389145 ] Hyukjin Kwon commented on SPARK-16646: -- My pleasure! Let me look into this and will bring some

[jira] [Commented] (SPARK-16646) LEAST doesn't accept numeric arguments with different data types

2016-07-22 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389141#comment-15389141 ] Cheng Lian commented on SPARK-16646: Could you please help check Hive's behavior here? Especially

[jira] [Commented] (SPARK-16665) python import pyspark fails in context.py

2016-07-22 Thread Andrew Jefferson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389135#comment-15389135 ] Andrew Jefferson commented on SPARK-16665: -- This was the result of a previous failed import in

[jira] [Resolved] (SPARK-16665) python import pyspark fails in context.py

2016-07-22 Thread Andrew Jefferson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Jefferson resolved SPARK-16665. -- Resolution: Cannot Reproduce > python import pyspark fails in context.py >

[jira] [Commented] (SPARK-16676) Spark jobs stay in pending

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389124#comment-15389124 ] Sean Owen commented on SPARK-16676: --- Did your executors schedule? your other operations sound like

[jira] [Commented] (SPARK-16677) Strange Error when Issuing Load Table Against A View

2016-07-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15389113#comment-15389113 ] Sean Owen commented on SPARK-16677: --- Is there more to this error? InvocationTargetException is just a

[jira] [Commented] (SPARK-16670) Partition pruning doesn't work when queries contain disjunctions involving partition columns

2016-07-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15388989#comment-15388989 ] Xiao Li commented on SPARK-16670: - https://github.com/apache/spark/pull/13585 A related PR was already
