[jira] [Comment Edited] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16569071#comment-16569071 ] Hyukjin Kwon edited comment on SPARK-24924 at 8/4/18 5:44 AM: -- If it

[jira] [Commented] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16569072#comment-16569072 ] Hyukjin Kwon commented on SPARK-24924: -- Also, for clarification, we already issue warnings: {code}

[jira] [Comment Edited] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16569071#comment-16569071 ] Hyukjin Kwon edited comment on SPARK-24924 at 8/4/18 5:42 AM: -- If it

[jira] [Commented] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16569071#comment-16569071 ] Hyukjin Kwon commented on SPARK-24924: -- If it already throws an error for CSV case too, I would

[jira] [Assigned] (SPARK-24888) spark-submit --master spark://host:port --status driver-id does not work

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24888: Assignee: (was: Apache Spark) > spark-submit --master spark://host:port --status

[jira] [Commented] (SPARK-24888) spark-submit --master spark://host:port --status driver-id does not work

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568969#comment-16568969 ] Apache Spark commented on SPARK-24888: -- User 'devaraj-kavali' has created a pull request for this

[jira] [Assigned] (SPARK-24888) spark-submit --master spark://host:port --status driver-id does not work

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24888: Assignee: Apache Spark > spark-submit --master spark://host:port --status driver-id does

[jira] [Commented] (SPARK-24928) spark sql cross join running time too long

2018-08-03 Thread Matthew Normyle (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568893#comment-16568893 ] Matthew Normyle commented on SPARK-24928: - In CartesianRDD.compute, changing:

[jira] [Commented] (SPARK-18057) Update structured streaming kafka from 0.10.0.1 to 2.0.0

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568863#comment-16568863 ] Apache Spark commented on SPARK-18057: -- User 'srowen' has created a pull request for this issue:

[jira] [Commented] (SPARK-25024) Update mesos documentation to be clear about security supported

2018-08-03 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568810#comment-16568810 ] Imran Rashid commented on SPARK-25024: -- [~rvesse] [~arand] maybe you could take a stab at this

[jira] [Assigned] (SPARK-24529) Add spotbugs into maven build process

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24529: Assignee: Kazuaki Ishizaki (was: Apache Spark) > Add spotbugs into maven build process

[jira] [Commented] (SPARK-24529) Add spotbugs into maven build process

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568806#comment-16568806 ] Apache Spark commented on SPARK-24529: -- User 'kiszk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24529) Add spotbugs into maven build process

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24529: Assignee: Apache Spark (was: Kazuaki Ishizaki) > Add spotbugs into maven build process

[jira] [Commented] (SPARK-25023) Clarify Spark security documentation

2018-08-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568805#comment-16568805 ] Thomas Graves commented on SPARK-25023: --- note some of this was already updated with

[jira] [Issue Comment Deleted] (SPARK-25024) Update mesos documentation to be clear about security supported

2018-08-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-25024: -- Comment: was deleted (was: I'm going to work on this.) > Update mesos documentation to be

[jira] [Commented] (SPARK-25023) Clarify Spark security documentation

2018-08-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568767#comment-16568767 ] Thomas Graves commented on SPARK-25023: --- I'm going to work on this > Clarify Spark security

[jira] [Commented] (SPARK-25024) Update mesos documentation to be clear about security supported

2018-08-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568766#comment-16568766 ] Thomas Graves commented on SPARK-25024: --- I'm going to work on this. > Update mesos documentation

[jira] [Created] (SPARK-25024) Update mesos documentation to be clear about security supported

2018-08-03 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-25024: - Summary: Update mesos documentation to be clear about security supported Key: SPARK-25024 URL: https://issues.apache.org/jira/browse/SPARK-25024 Project: Spark

[jira] [Created] (SPARK-25023) Clarify Spark security documentation

2018-08-03 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-25023: - Summary: Clarify Spark security documentation Key: SPARK-25023 URL: https://issues.apache.org/jira/browse/SPARK-25023 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-24983) Collapsing multiple project statements with dependent When-Otherwise statements on the same column can OOM the driver

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568730#comment-16568730 ] Apache Spark commented on SPARK-24983: -- User 'dvogelbacher' has created a pull request for this

[jira] [Assigned] (SPARK-24983) Collapsing multiple project statements with dependent When-Otherwise statements on the same column can OOM the driver

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24983: Assignee: Apache Spark > Collapsing multiple project statements with dependent

[jira] [Assigned] (SPARK-24983) Collapsing multiple project statements with dependent When-Otherwise statements on the same column can OOM the driver

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24983: Assignee: (was: Apache Spark) > Collapsing multiple project statements with

[jira] [Commented] (SPARK-21986) QuantileDiscretizer picks wrong split point for data with lots of 0's

2018-08-03 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568718#comment-16568718 ] Barry Becker commented on SPARK-21986: -- Here are a couple more test cases that show the problem:

[jira] [Commented] (SPARK-24375) Design sketch: support barrier scheduling in Apache Spark

2018-08-03 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568664#comment-16568664 ] Mridul Muralidharan commented on SPARK-24375: - {quote} It's not desired behavior to catch

[jira] [Updated] (SPARK-25020) Unable to Perform Graceful Shutdown in Spark Streaming with Hadoop 2.8

2018-08-03 Thread Ricky Saltzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ricky Saltzer updated SPARK-25020: -- Description: Opening this up to give you guys some insight in an issue that will occur when

[jira] [Created] (SPARK-25022) Add spark.executor.pyspark.memory support to Mesos

2018-08-03 Thread Ryan Blue (JIRA)
Ryan Blue created SPARK-25022: - Summary: Add spark.executor.pyspark.memory support to Mesos Key: SPARK-25022 URL: https://issues.apache.org/jira/browse/SPARK-25022 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-25021) Add spark.executor.pyspark.memory support to Kubernetes

2018-08-03 Thread Ryan Blue (JIRA)
Ryan Blue created SPARK-25021: - Summary: Add spark.executor.pyspark.memory support to Kubernetes Key: SPARK-25021 URL: https://issues.apache.org/jira/browse/SPARK-25021 Project: Spark Issue

[jira] [Created] (SPARK-25020) Unable to Perform Graceful Shutdown in Spark Streaming with Hadoop 2.8

2018-08-03 Thread Ricky Saltzer (JIRA)
Ricky Saltzer created SPARK-25020: - Summary: Unable to Perform Graceful Shutdown in Spark Streaming with Hadoop 2.8 Key: SPARK-25020 URL: https://issues.apache.org/jira/browse/SPARK-25020 Project:

[jira] [Commented] (SPARK-25019) The published spark sql pom does not exclude the normal version of orc-core

2018-08-03 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568554#comment-16568554 ] Yin Huai commented on SPARK-25019: -- [~dongjoon] can you help us fix this issue? Or there is a reason

[jira] [Created] (SPARK-25019) The published spark sql pom does not exclude the normal version of orc-core

2018-08-03 Thread Yin Huai (JIRA)
Yin Huai created SPARK-25019: Summary: The published spark sql pom does not exclude the normal version of orc-core Key: SPARK-25019 URL: https://issues.apache.org/jira/browse/SPARK-25019 Project: Spark

[jira] [Assigned] (SPARK-25018) Use `Co-Authored-By` git trailer in `merge_spark_pr.py`

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25018: Assignee: Apache Spark > Use `Co-Authored-By` git trailer in `merge_spark_pr.py` >

[jira] [Assigned] (SPARK-25018) Use `Co-Authored-By` git trailer in `merge_spark_pr.py`

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25018: Assignee: (was: Apache Spark) > Use `Co-Authored-By` git trailer in

[jira] [Commented] (SPARK-25018) Use `Co-Authored-By` git trailer in `merge_spark_pr.py`

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568545#comment-16568545 ] Apache Spark commented on SPARK-25018: -- User 'dbtsai' has created a pull request for this issue:

[jira] [Created] (SPARK-25018) Use `Co-Authored-By` git trailer in `merge_spark_pr.py`

2018-08-03 Thread DB Tsai (JIRA)
DB Tsai created SPARK-25018: --- Summary: Use `Co-Authored-By` git trailer in `merge_spark_pr.py` Key: SPARK-25018 URL: https://issues.apache.org/jira/browse/SPARK-25018 Project: Spark Issue Type:

[jira] [Created] (SPARK-25017) Add test suite for ContextBarrierState

2018-08-03 Thread Jiang Xingbo (JIRA)
Jiang Xingbo created SPARK-25017: Summary: Add test suite for ContextBarrierState Key: SPARK-25017 URL: https://issues.apache.org/jira/browse/SPARK-25017 Project: Spark Issue Type: Test

[jira] [Created] (SPARK-25016) remove Support for hadoop 2.6

2018-08-03 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-25016: - Summary: remove Support for hadoop 2.6 Key: SPARK-25016 URL: https://issues.apache.org/jira/browse/SPARK-25016 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-25016) remove Support for hadoop 2.6

2018-08-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-25016: -- Target Version/s: 3.0.0 > remove Support for hadoop 2.6

[jira] [Updated] (SPARK-24918) Executor Plugin API

2018-08-03 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-24918: - Labels: SPIP memory-analysis (was: memory-analysis) > Executor Plugin API >

[jira] [Commented] (SPARK-24918) Executor Plugin API

2018-08-03 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568471#comment-16568471 ] Imran Rashid commented on SPARK-24918: -- attached an [spip

[jira] [Assigned] (SPARK-24954) Fail fast on job submit if run a barrier stage with dynamic resource allocation enabled

2018-08-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-24954: - Assignee: Jiang Xingbo > Fail fast on job submit if run a barrier stage with dynamic

[jira] [Resolved] (SPARK-24954) Fail fast on job submit if run a barrier stage with dynamic resource allocation enabled

2018-08-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-24954. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21915

[jira] [Commented] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568454#comment-16568454 ] Reynold Xin commented on SPARK-24924: - I like the improved error message (I didn't read the earlier

[jira] [Commented] (SPARK-20696) tf-idf document clustering with K-means in Apache Spark putting points into one cluster

2018-08-03 Thread Aditya Kamath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568439#comment-16568439 ] Aditya Kamath commented on SPARK-20696: --- [~rajanimaski] Please let me know which implementation of

[jira] [Updated] (SPARK-25011) Add PrefixSpan to __all__ in fpm.py

2018-08-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-25011: --- Summary: Add PrefixSpan to __all__ in fpm.py (was: Add PrefixSpan to __all__) > Add PrefixSpan to

[jira] [Commented] (SPARK-25003) Pyspark Does not use Spark Sql Extensions

2018-08-03 Thread Russell Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568415#comment-16568415 ] Russell Spitzer commented on SPARK-25003: - [~holden.karau] , Wrote up a PR for each branch

[jira] [Commented] (SPARK-25003) Pyspark Does not use Spark Sql Extensions

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568414#comment-16568414 ] Apache Spark commented on SPARK-25003: -- User 'RussellSpitzer' has created a pull request for this

[jira] [Commented] (SPARK-25003) Pyspark Does not use Spark Sql Extensions

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568405#comment-16568405 ] Apache Spark commented on SPARK-25003: -- User 'RussellSpitzer' has created a pull request for this

[jira] [Assigned] (SPARK-25003) Pyspark Does not use Spark Sql Extensions

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25003: Assignee: (was: Apache Spark) > Pyspark Does not use Spark Sql Extensions >

[jira] [Assigned] (SPARK-25003) Pyspark Does not use Spark Sql Extensions

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25003: Assignee: Apache Spark > Pyspark Does not use Spark Sql Extensions >

[jira] [Commented] (SPARK-25003) Pyspark Does not use Spark Sql Extensions

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568396#comment-16568396 ] Apache Spark commented on SPARK-25003: -- User 'RussellSpitzer' has created a pull request for this

[jira] [Assigned] (SPARK-25015) Update Hadoop 2.7 to 2.7.7

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25015: Assignee: Apache Spark (was: Sean Owen) > Update Hadoop 2.7 to 2.7.7 >

[jira] [Commented] (SPARK-25015) Update Hadoop 2.7 to 2.7.7

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568392#comment-16568392 ] Apache Spark commented on SPARK-25015: -- User 'srowen' has created a pull request for this issue:

[jira] [Commented] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568393#comment-16568393 ] Thomas Graves commented on SPARK-24924: --- | It wouldn't be very different for 2.4.0. It could be

[jira] [Assigned] (SPARK-25015) Update Hadoop 2.7 to 2.7.7

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25015: Assignee: Sean Owen (was: Apache Spark) > Update Hadoop 2.7 to 2.7.7 >

[jira] [Created] (SPARK-25015) Update Hadoop 2.7 to 2.7.7

2018-08-03 Thread Sean Owen (JIRA)
Sean Owen created SPARK-25015: - Summary: Update Hadoop 2.7 to 2.7.7 Key: SPARK-25015 URL: https://issues.apache.org/jira/browse/SPARK-25015 Project: Spark Issue Type: Task Components:

[jira] [Updated] (SPARK-18057) Update structured streaming kafka from 0.10.0.1 to 2.0.0

2018-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18057: -- Priority: Major (was: Blocker) > Update structured streaming kafka from 0.10.0.1 to 2.0.0 >

[jira] [Resolved] (SPARK-18057) Update structured streaming kafka from 0.10.0.1 to 2.0.0

2018-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18057. --- Resolution: Fixed Fix Version/s: 2.4.0 > Update structured streaming kafka from 0.10.0.1 to

[jira] [Updated] (SPARK-25014) When we tried to read kafka topic through spark streaming spark submit is getting failed with Python worker exited unexpectedly (crashed) error

2018-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-25014: -- Priority: Major (was: Blocker) Fix Version/s: (was: 2.3.2) Please read

[jira] [Resolved] (SPARK-25014) When we tried to read kafka topic through spark streaming spark submit is getting failed with Python worker exited unexpectedly (crashed) error

2018-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-25014. --- Resolution: Invalid > When we tried to read kafka topic through spark streaming spark submit is >

[jira] [Commented] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568352#comment-16568352 ] Hyukjin Kwon commented on SPARK-24924: -- cc [~cloud_fan] since we talked about this for CSV, and

[jira] [Comment Edited] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568348#comment-16568348 ] Hyukjin Kwon edited comment on SPARK-24924 at 8/3/18 3:29 PM: -- {quote} but

[jira] [Commented] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568348#comment-16568348 ] Hyukjin Kwon commented on SPARK-24924: -- {quote} but at the same time we aren't adding the

[jira] [Assigned] (SPARK-23937) High-order function: map_filter(map, function) → MAP

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23937: Assignee: (was: Apache Spark) > High-order function: map_filter(map, function) → MAP

[jira] [Commented] (SPARK-23937) High-order function: map_filter(map, function) → MAP

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568337#comment-16568337 ] Apache Spark commented on SPARK-23937: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23937) High-order function: map_filter(map, function) → MAP

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23937: Assignee: Apache Spark > High-order function: map_filter(map, function) → MAP >

[jira] [Created] (SPARK-25014) When we tried to read kafka topic through spark streaming spark submit is getting failed with Python worker exited unexpectedly (crashed) error

2018-08-03 Thread KARTHIKEYAN RASIPALAYAM DURAIRAJ (JIRA)
KARTHIKEYAN RASIPALAYAM DURAIRAJ created SPARK-25014: Summary: When we tried to read kafka topic through spark streaming spark submit is getting failed with Python worker exited unexpectedly (crashed) error

[jira] [Commented] (SPARK-14220) Build and test Spark against Scala 2.12

2018-08-03 Thread Nick Poorman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568238#comment-16568238 ] Nick Poorman commented on SPARK-14220: -- Awesome job! (y) > Build and test Spark against Scala 2.12

[jira] [Assigned] (SPARK-24884) Implement regexp_extract_all

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24884: Assignee: (was: Apache Spark) > Implement regexp_extract_all >

[jira] [Commented] (SPARK-24884) Implement regexp_extract_all

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568215#comment-16568215 ] Apache Spark commented on SPARK-24884: -- User 'xueyumusic' has created a pull request for this

[jira] [Assigned] (SPARK-24884) Implement regexp_extract_all

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24884: Assignee: Apache Spark > Implement regexp_extract_all

[jira] [Commented] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568204#comment-16568204 ] Thomas Graves commented on SPARK-24924: --- [~felixcheung] did your discussion on the same thing with

[jira] [Commented] (SPARK-24924) Add mapping for built-in Avro data source

2018-08-03 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568199#comment-16568199 ] Thomas Graves commented on SPARK-24924: --- Hmm, so we are adding this for ease of upgrading I guess

[jira] [Commented] (SPARK-23937) High-order function: map_filter(map, function) → MAP

2018-08-03 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568152#comment-16568152 ] Marco Gaido commented on SPARK-23937: - I am working on this, thanks. > High-order function:

[jira] [Commented] (SPARK-24772) support reading AVRO logical types - Decimal

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568139#comment-16568139 ] Apache Spark commented on SPARK-24772: -- User 'gengliangwang' has created a pull request for this

[jira] [Assigned] (SPARK-24772) support reading AVRO logical types - Decimal

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24772: Assignee: (was: Apache Spark) > support reading AVRO logical types - Decimal >

[jira] [Assigned] (SPARK-24772) support reading AVRO logical types - Decimal

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24772: Assignee: Apache Spark > support reading AVRO logical types - Decimal >

[jira] [Updated] (SPARK-24998) spark-sql will scan the same table repeatedly when doing multi-insert

2018-08-03 Thread ice bai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ice bai updated SPARK-24998: Summary: spark-sql will scan the same table repeatedly when doing multi-insert (was: spark-sql will scan

[jira] [Updated] (SPARK-24998) spark-sql will scan the same table repeatedly when doing multi-insert"

2018-08-03 Thread ice bai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ice bai updated SPARK-24998: Description: Such as the query likes "From xx SELECT yy INSERT INTO a INSERT INTO b INSERT INTO c ..." .

[jira] [Updated] (SPARK-24998) spark-sql will scan the same table repeatedly when doing multi-insert"

2018-08-03 Thread ice bai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ice bai updated SPARK-24998: Summary: spark-sql will scan the same table repeatedly when doing multi-insert" (was: spark-sql will

[jira] [Created] (SPARK-25013) JDBC urls with jdbc:mariadb don't work as expected

2018-08-03 Thread Dieter Vekeman (JIRA)
Dieter Vekeman created SPARK-25013: -- Summary: JDBC urls with jdbc:mariadb don't work as expected Key: SPARK-25013 URL: https://issues.apache.org/jira/browse/SPARK-25013 Project: Spark Issue

[jira] [Commented] (SPARK-23932) High-order function: zip_with(array, array, function) → array

2018-08-03 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568059#comment-16568059 ] Takuya Ueshin commented on SPARK-23932: --- Hi [~crafty-coder], Are you still working on this?

[jira] [Assigned] (SPARK-24987) Kafka Cached Consumer Leaking File Descriptors

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24987: Assignee: (was: Apache Spark) > Kafka Cached Consumer Leaking File Descriptors >

[jira] [Assigned] (SPARK-24987) Kafka Cached Consumer Leaking File Descriptors

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24987: Assignee: Apache Spark > Kafka Cached Consumer Leaking File Descriptors >

[jira] [Commented] (SPARK-24987) Kafka Cached Consumer Leaking File Descriptors

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568022#comment-16568022 ] Apache Spark commented on SPARK-24987: -- User 'YuvalItzchakov' has created a pull request for this

[jira] [Commented] (SPARK-24598) SPARK SQL:Datatype overflow conditions gives incorrect result

2018-08-03 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568005#comment-16568005 ] Marco Gaido commented on SPARK-24598: - [~smilegator] as we just enhanced the doc, but we have not

[jira] [Created] (SPARK-25012) dataframe creation results in matcherror

2018-08-03 Thread uwe (JIRA)
uwe created SPARK-25012: --- Summary: dataframe creation results in matcherror Key: SPARK-25012 URL: https://issues.apache.org/jira/browse/SPARK-25012 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-24928) spark sql cross join running time too long

2018-08-03 Thread LIFULONG (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16567986#comment-16567986 ] LIFULONG commented on SPARK-24928: -- for (x <- rdd1.iterator(currSplit.s1, context); y <-

[jira] [Commented] (SPARK-23911) High-order function: reduce(array, initialState S, inputFunction, outputFunction) → R

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16567963#comment-16567963 ] Apache Spark commented on SPARK-23911: -- User 'ueshin' has created a pull request for this issue:

[jira] [Commented] (SPARK-23909) High-order function: filter(array, function) → array

2018-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16567941#comment-16567941 ] Apache Spark commented on SPARK-23909: -- User 'ueshin' has created a pull request for this issue:

[jira] [Resolved] (SPARK-24993) Make Avro fast again

2018-08-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-24993. - Resolution: Fixed Fix Version/s: 2.4.0 > Make Avro fast again

[jira] [Updated] (SPARK-24993) Make Avro fast again

2018-08-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-24993: Affects Version/s: (was: 2.3.0) 2.4.0 > Make Avro fast again >

[jira] [Resolved] (SPARK-24989) BlockFetcher should retry while getting OutOfDirectMemoryError

2018-08-03 Thread Li Yuanjian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Yuanjian resolved SPARK-24989. - Resolution: Not A Problem The param `spark.reducer.maxBlocksInFlightPerAddress` added in

[jira] [Resolved] (SPARK-25009) Standalone Cluster mode application submit is not working

2018-08-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-25009. - Resolution: Fixed Assignee: Devaraj K Fix Version/s: 2.4.0 > Standalone Cluster mode

[jira] [Resolved] (SPARK-25011) Add PrefixSpan to __all__

2018-08-03 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25011. -- Resolution: Fixed Assignee: yuhao yang Fix Version/s: 2.4.0 Fixed in