[jira] [Commented] (SPARK-17142) Complex query triggers binding error in HashAggregateExec

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15489503#comment-15489503 ] Apache Spark commented on SPARK-17142: -- User 'jiangxb1987' has created a pull request for this

[jira] [Updated] (SPARK-17460) Dataset.joinWith broadcasts gigabyte sized table, causes OOM Exception

2016-09-13 Thread Chris Perluss (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Perluss updated SPARK-17460: -- Summary: Dataset.joinWith broadcasts gigabyte sized table, causes OOM Exception (was:

[jira] [Updated] (SPARK-17317) Add package vignette to SparkR

2016-09-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-17317: -- Assignee: Junyang Qian > Add package vignette to SparkR >

[jira] [Resolved] (SPARK-17317) Add package vignette to SparkR

2016-09-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-17317. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request

[jira] [Commented] (SPARK-17073) generate basic stats for column

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15489326#comment-15489326 ] Apache Spark commented on SPARK-17073: -- User 'wzhfy' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17073) generate basic stats for column

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17073: Assignee: (was: Apache Spark) > generate basic stats for column >

[jira] [Assigned] (SPARK-17073) generate basic stats for column

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17073: Assignee: Apache Spark > generate basic stats for column >

[jira] [Commented] (SPARK-2365) Add IndexedRDD, an efficient updatable key-value store

2016-09-13 Thread Ganesh Krishnan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15489250#comment-15489250 ] Ganesh Krishnan commented on SPARK-2365: Is this only for RDD or can we use it with DataFrames?

[jira] [Commented] (SPARK-16460) Spark 2.0 CSV ignores NULL value in Date format

2016-09-13 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15489193#comment-15489193 ] Liwei Lin commented on SPARK-16460: --- [~marcelboldt] Oh cool! Thanks for the feedback! > Spark 2.0 CSV

[jira] [Comment Edited] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15489169#comment-15489169 ] Reynold Xin edited comment on SPARK-15406 at 9/14/16 2:50 AM: -- Finally back

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15489169#comment-15489169 ] Reynold Xin commented on SPARK-15406: - Finally back from vacation. FWIW, I want to cut 2.0.1 rc in

[jira] [Commented] (SPARK-17510) Set Streaming MaxRate Independently For Multiple Streams

2016-09-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15489102#comment-15489102 ] Cody Koeninger commented on SPARK-17510: This would require a constructor change and another

[jira] [Commented] (SPARK-6593) Provide option for HadoopRDD to skip corrupted files

2016-09-13 Thread Charles Pritchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1543#comment-1543 ] Charles Pritchard commented on SPARK-6593: -- Something appears to have changed between 2.0 and

[jira] [Assigned] (SPARK-15621) BatchEvalPythonExec fails with OOM

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15621: Assignee: Apache Spark (was: Davies Liu) > BatchEvalPythonExec fails with OOM >

[jira] [Assigned] (SPARK-15621) BatchEvalPythonExec fails with OOM

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15621: Assignee: Davies Liu (was: Apache Spark) > BatchEvalPythonExec fails with OOM >

[jira] [Commented] (SPARK-15621) BatchEvalPythonExec fails with OOM

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488869#comment-15488869 ] Apache Spark commented on SPARK-15621: -- User 'davies' has created a pull request for this issue:

[jira] [Commented] (SPARK-14482) Change default compression codec for Parquet from gzip to snappy

2016-09-13 Thread Charles Pritchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488848#comment-15488848 ] Charles Pritchard commented on SPARK-14482: --- I don't think this fully made it into the manual;

[jira] [Updated] (SPARK-17531) Don't initialize Hive Listeners for the Execution Client

2016-09-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-17531: - Fix Version/s: 1.6.3 > Don't initialize Hive Listeners for the Execution Client >

[jira] [Resolved] (SPARK-17530) Add Statistics into DESCRIBE FORMATTED

2016-09-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-17530. --- Resolution: Fixed Assignee: Xiao Li Fix Version/s: 2.1.0 > Add

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488642#comment-15488642 ] Cody Koeninger commented on SPARK-15406: 1. How can we avoid duplicate work like this? There was

[jira] [Updated] (SPARK-17445) Reference an ASF page as the main place to find third-party packages

2016-09-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-17445: - Target Version/s: 2.0.1, 2.1.0 > Reference an ASF page as the main place to find

[jira] [Commented] (SPARK-17097) Pregel does not keep vertex state properly; fails to terminate

2016-09-13 Thread Kevin Rossi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488612#comment-15488612 ] Kevin Rossi commented on SPARK-17097: - I am away and I will return on 16 September 2016. Thank you,

[jira] [Commented] (SPARK-17097) Pregel does not keep vertex state properly; fails to terminate

2016-09-13 Thread ding (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488609#comment-15488609 ] ding commented on SPARK-17097: -- I am afraid the attached sample code fail to terminate with case class is

[jira] [Resolved] (SPARK-17531) Don't initialize Hive Listeners for the Execution Client

2016-09-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-17531. -- Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull request

[jira] [Updated] (SPARK-17531) Don't initialize Hive Listeners for the Execution Client

2016-09-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-17531: - Assignee: Burak Yavuz > Don't initialize Hive Listeners for the Execution Client >

[jira] [Assigned] (SPARK-17532) Add thread lock information from JMX to thread dump UI

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17532: Assignee: (was: Apache Spark) > Add thread lock information from JMX to thread dump

[jira] [Commented] (SPARK-17532) Add thread lock information from JMX to thread dump UI

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488601#comment-15488601 ] Apache Spark commented on SPARK-17532: -- User 'rdblue' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17532) Add thread lock information from JMX to thread dump UI

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17532: Assignee: Apache Spark > Add thread lock information from JMX to thread dump UI >

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488595#comment-15488595 ] Tathagata Das commented on SPARK-15406: --- 1. We will have a PR very soon so that you can start

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488558#comment-15488558 ] Cody Koeninger commented on SPARK-15406: So you're saying the type of K and V will always be

[jira] [Created] (SPARK-17532) Add thread lock information from JMX to thread dump UI

2016-09-13 Thread Ryan Blue (JIRA)
Ryan Blue created SPARK-17532: - Summary: Add thread lock information from JMX to thread dump UI Key: SPARK-17532 URL: https://issues.apache.org/jira/browse/SPARK-17532 Project: Spark Issue Type:

[jira] [Commented] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2016-09-13 Thread Dhruve Ashar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488528#comment-15488528 ] Dhruve Ashar commented on SPARK-16441: -- So all of these are related in one way or the other. I will

[jira] [Commented] (SPARK-17531) Don't initialize Hive Listeners for the Execution Client

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488498#comment-15488498 ] Apache Spark commented on SPARK-17531: -- User 'brkyvz' has created a pull request for this issue:

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488496#comment-15488496 ] Michael Armbrust commented on SPARK-15406: -- For the types that are coming out, the SQL way would

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488477#comment-15488477 ] Michael Armbrust commented on SPARK-15406: -- Streaming is labeled experimental, we can continue

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Ofir Manor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488487#comment-15488487 ] Ofir Manor commented on SPARK-15406: Cody, I think you are right. Now is the right time to spend a

[jira] [Commented] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2016-09-13 Thread Ashwin Shankar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488465#comment-15488465 ] Ashwin Shankar commented on SPARK-16441: hey, Thanks! I don't think either of the patches would

[jira] [Comment Edited] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488452#comment-15488452 ] Cody Koeninger edited comment on SPARK-15406 at 9/13/16 9:18 PM: - So I

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488452#comment-15488452 ] Cody Koeninger commented on SPARK-15406: So if I asked this twice with no answer, so I'll ask it

[jira] [Comment Edited] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488420#comment-15488420 ] Tathagata Das edited comment on SPARK-15406 at 9/13/16 9:04 PM:

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488420#comment-15488420 ] Tathagata Das commented on SPARK-15406: --- [Combining the comments in the doc and on the JIRA] Thank

[jira] [Assigned] (SPARK-17531) Don't initialize Hive Listeners for the Execution Client

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17531: Assignee: Apache Spark > Don't initialize Hive Listeners for the Execution Client >

[jira] [Commented] (SPARK-17531) Don't initialize Hive Listeners for the Execution Client

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488337#comment-15488337 ] Apache Spark commented on SPARK-17531: -- User 'brkyvz' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17531) Don't initialize Hive Listeners for the Execution Client

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17531: Assignee: (was: Apache Spark) > Don't initialize Hive Listeners for the Execution

[jira] [Comment Edited] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488312#comment-15488312 ] Cody Koeninger edited comment on SPARK-15406 at 9/13/16 8:26 PM: - Unless

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488312#comment-15488312 ] Cody Koeninger commented on SPARK-15406: Unless I'm misunderstanding, you answered regarding a

[jira] [Assigned] (SPARK-17484) Race condition when cancelling a job during a cache write can lead to block fetch failures

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17484: Assignee: Josh Rosen (was: Apache Spark) > Race condition when cancelling a job during a

[jira] [Assigned] (SPARK-17484) Race condition when cancelling a job during a cache write can lead to block fetch failures

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17484: Assignee: Apache Spark (was: Josh Rosen) > Race condition when cancelling a job during a

[jira] [Commented] (SPARK-17484) Race condition when cancelling a job during a cache write can lead to block fetch failures

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488273#comment-15488273 ] Apache Spark commented on SPARK-17484: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Updated] (SPARK-17531) Don't initialize Hive Listeners for the Execution Client

2016-09-13 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-17531: Description: If a user provides listeners inside the Hive Conf, the configuration for these

[jira] [Created] (SPARK-17531) Don't initialize Hive Listeners for the Execution Client

2016-09-13 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-17531: --- Summary: Don't initialize Hive Listeners for the Execution Client Key: SPARK-17531 URL: https://issues.apache.org/jira/browse/SPARK-17531 Project: Spark Issue

[jira] [Commented] (SPARK-17529) On highly skewed data, outer join merges are slow

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488106#comment-15488106 ] Apache Spark commented on SPARK-17529: -- User 'davidnavas' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17529) On highly skewed data, outer join merges are slow

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17529: Assignee: Apache Spark > On highly skewed data, outer join merges are slow >

[jira] [Assigned] (SPARK-17529) On highly skewed data, outer join merges are slow

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17529: Assignee: (was: Apache Spark) > On highly skewed data, outer join merges are slow >

[jira] [Updated] (SPARK-17529) On highly skewed data, outer join merges are slow

2016-09-13 Thread Mark Hamstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Hamstra updated SPARK-17529: - Priority: Major (was: Trivial) > On highly skewed data, outer join merges are slow >

[jira] [Assigned] (SPARK-17530) Add Statistics into DESCRIBE FORMATTED

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17530: Assignee: (was: Apache Spark) > Add Statistics into DESCRIBE FORMATTED >

[jira] [Assigned] (SPARK-17530) Add Statistics into DESCRIBE FORMATTED

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17530: Assignee: Apache Spark > Add Statistics into DESCRIBE FORMATTED >

[jira] [Commented] (SPARK-17530) Add Statistics into DESCRIBE FORMATTED

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488090#comment-15488090 ] Apache Spark commented on SPARK-17530: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Created] (SPARK-17530) Add Statistics into DESCRIBE FORMATTED

2016-09-13 Thread Xiao Li (JIRA)
Xiao Li created SPARK-17530: --- Summary: Add Statistics into DESCRIBE FORMATTED Key: SPARK-17530 URL: https://issues.apache.org/jira/browse/SPARK-17530 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488079#comment-15488079 ] Tathagata Das commented on SPARK-15406: --- Here are some thoughts. - The key-value types should be

[jira] [Created] (SPARK-17529) On highly skewed data, outer join merges are slow

2016-09-13 Thread David C Navas (JIRA)
David C Navas created SPARK-17529: - Summary: On highly skewed data, outer join merges are slow Key: SPARK-17529 URL: https://issues.apache.org/jira/browse/SPARK-17529 Project: Spark Issue

[jira] [Commented] (SPARK-10816) API design: window and session specification

2016-09-13 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488035#comment-15488035 ] Maciej Bryński commented on SPARK-10816: Hi, Any updates on Session Window ? > API design:

[jira] [Commented] (SPARK-17450) spark sql rownumber OOM

2016-09-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15488005#comment-15488005 ] Herman van Hovell commented on SPARK-17450: --- https://github.com/apache/spark/pull/10605 >

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487930#comment-15487930 ] Cody Koeninger commented on SPARK-15406: Specific examples: Kafka has a type for a key, and a

[jira] [Updated] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-15406: -- Description: This is the parent JIRA to track all the work for the building a Kafka source

[jira] [Commented] (SPARK-17528) MutableProjection should not cache content from the input row

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487907#comment-15487907 ] Apache Spark commented on SPARK-17528: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17528) MutableProjection should not cache content from the input row

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17528: Assignee: Apache Spark (was: Wenchen Fan) > MutableProjection should not cache content

[jira] [Assigned] (SPARK-17528) MutableProjection should not cache content from the input row

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17528: Assignee: Wenchen Fan (was: Apache Spark) > MutableProjection should not cache content

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487875#comment-15487875 ] Michael Armbrust commented on SPARK-15406: -- Hey Cody, thanks for the input and for sharing your

[jira] [Created] (SPARK-17528) MutableProjection should not cache content from the input row

2016-09-13 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-17528: --- Summary: MutableProjection should not cache content from the input row Key: SPARK-17528 URL: https://issues.apache.org/jira/browse/SPARK-17528 Project: Spark

[jira] [Created] (SPARK-17527) mergeSchema with `_OPTIONAL_` metadata fails

2016-09-13 Thread Gaurav Shah (JIRA)
Gaurav Shah created SPARK-17527: --- Summary: mergeSchema with `_OPTIONAL_` metadata fails Key: SPARK-17527 URL: https://issues.apache.org/jira/browse/SPARK-17527 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-17525) SparkContext.clearFiles() still present in the PySpark bindings though the underlying Scala method was removed in Spark 2.0

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487789#comment-15487789 ] Apache Spark commented on SPARK-17525: -- User 'sjakthol' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17525) SparkContext.clearFiles() still present in the PySpark bindings though the underlying Scala method was removed in Spark 2.0

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17525: Assignee: (was: Apache Spark) > SparkContext.clearFiles() still present in the

[jira] [Assigned] (SPARK-17525) SparkContext.clearFiles() still present in the PySpark bindings though the underlying Scala method was removed in Spark 2.0

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17525: Assignee: Apache Spark > SparkContext.clearFiles() still present in the PySpark bindings

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Frederick Reiss (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487780#comment-15487780 ] Frederick Reiss commented on SPARK-15406: - +1 for taking the simple route in the short term. I'm

[jira] [Commented] (SPARK-17252) Performing arithmetic in VALUES can lead to ClassCastException / MatchErrors during query parsing

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487749#comment-15487749 ] Apache Spark commented on SPARK-17252: -- User 'sjakthol' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17526) Display the executor log links with the job failure message on Spark UI and Console

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17526: Assignee: (was: Apache Spark) > Display the executor log links with the job failure

[jira] [Assigned] (SPARK-17526) Display the executor log links with the job failure message on Spark UI and Console

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17526: Assignee: Apache Spark > Display the executor log links with the job failure message on

[jira] [Commented] (SPARK-17526) Display the executor log links with the job failure message on Spark UI and Console

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487705#comment-15487705 ] Apache Spark commented on SPARK-17526: -- User 'zhzhan' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-17397) Show example of what to do when awaitTermination() throws an Exception

2016-09-13 Thread Spiro Michaylov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487691#comment-15487691 ] Spiro Michaylov edited comment on SPARK-17397 at 9/13/16 4:37 PM: -- In

[jira] [Assigned] (SPARK-10408) Autoencoder

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10408: Assignee: Apache Spark (was: Alexander Ulanov) > Autoencoder > --- > >

[jira] [Commented] (SPARK-10408) Autoencoder

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487696#comment-15487696 ] Apache Spark commented on SPARK-10408: -- User 'avulanov' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10408) Autoencoder

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10408: Assignee: Alexander Ulanov (was: Apache Spark) > Autoencoder > --- > >

[jira] [Commented] (SPARK-17397) Show example of what to do when awaitTermination() throws an Exception

2016-09-13 Thread Spiro Michaylov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487691#comment-15487691 ] Spiro Michaylov commented on SPARK-17397: - In the form of a PR? Sure, I can do that. If I don't

[jira] [Created] (SPARK-17526) Display the executor log links with the job failure message on Spark UI and Console

2016-09-13 Thread Zhan Zhang (JIRA)
Zhan Zhang created SPARK-17526: -- Summary: Display the executor log links with the job failure message on Spark UI and Console Key: SPARK-17526 URL: https://issues.apache.org/jira/browse/SPARK-17526

[jira] [Resolved] (SPARK-17142) Complex query triggers binding error in HashAggregateExec

2016-09-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-17142. --- Resolution: Fixed Assignee: Jiang Xingbo Fix Version/s: 2.1.0 >

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

2016-09-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487310#comment-15487310 ] Cody Koeninger commented on SPARK-15406: So we can do the easiest thing possible for 2.0.1, which

[jira] [Commented] (SPARK-16441) Spark application hang when dynamic allocation is enabled

2016-09-13 Thread Dhruve Ashar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487224#comment-15487224 ] Dhruve Ashar commented on SPARK-16441: -- There was a patch which was recently contributed which

[jira] [Commented] (SPARK-17321) YARN shuffle service should use good disk from yarn.nodemanager.local-dirs

2016-09-13 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487203#comment-15487203 ] Thomas Graves commented on SPARK-17321: --- yes that makes sense and as I stated I think the fix for

[jira] [Comment Edited] (SPARK-16938) Cannot resolve column name after a join

2016-09-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487185#comment-15487185 ] Dongjoon Hyun edited comment on SPARK-16938 at 9/13/16 1:23 PM: Hi,

[jira] [Commented] (SPARK-16938) Cannot resolve column name after a join

2016-09-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487185#comment-15487185 ] Dongjoon Hyun commented on SPARK-16938: --- Hi, @cloud-fan . Could you review this issue and PR? >

[jira] [Commented] (SPARK-17321) YARN shuffle service should use good disk from yarn.nodemanager.local-dirs

2016-09-13 Thread Alexander Kasper (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487166#comment-15487166 ] Alexander Kasper commented on SPARK-17321: -- No, we're not using NM recovery. What we observed is

[jira] [Commented] (SPARK-17525) SparkContext.clearFiles() still present in the PySpark bindings though the underlying Scala method was removed in Spark 2.0

2016-09-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487117#comment-15487117 ] Sean Owen commented on SPARK-17525: --- Oops, yeah this was missed in

[jira] [Assigned] (SPARK-17524) RowBasedKeyValueBatchSuite always uses 64 mb page size

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17524: Assignee: Apache Spark > RowBasedKeyValueBatchSuite always uses 64 mb page size >

[jira] [Commented] (SPARK-17524) RowBasedKeyValueBatchSuite always uses 64 mb page size

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487093#comment-15487093 ] Apache Spark commented on SPARK-17524: -- User 'a-roberts' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17524) RowBasedKeyValueBatchSuite always uses 64 mb page size

2016-09-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17524: Assignee: (was: Apache Spark) > RowBasedKeyValueBatchSuite always uses 64 mb page

[jira] [Created] (SPARK-17525) SparkContext.clearFiles() still present in the PySpark bindings though the underlying Scala method was removed in Spark 2.0

2016-09-13 Thread Sami Jaktholm (JIRA)
Sami Jaktholm created SPARK-17525: - Summary: SparkContext.clearFiles() still present in the PySpark bindings though the underlying Scala method was removed in Spark 2.0 Key: SPARK-17525 URL:

[jira] [Commented] (SPARK-17521) Error when I use sparkContext.makeRDD(Seq())

2016-09-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487089#comment-15487089 ] Sean Owen commented on SPARK-17521: --- Reaching way back to page [~pwendell] -- do you have any view on

[jira] [Created] (SPARK-17524) RowBasedKeyValueBatchSuite always uses 64 mb page size

2016-09-13 Thread Adam Roberts (JIRA)
Adam Roberts created SPARK-17524: Summary: RowBasedKeyValueBatchSuite always uses 64 mb page size Key: SPARK-17524 URL: https://issues.apache.org/jira/browse/SPARK-17524 Project: Spark Issue

[jira] [Commented] (SPARK-17471) Add compressed method for Matrix class

2016-09-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15487035#comment-15487035 ] Yanbo Liang commented on SPARK-17471: - [~sethah] I'm sorry that I have some emergent affairs to deal
