[jira] [Created] (SPARK-27532) Correct the default value in the Documentation for "spark.redaction.regex"

2019-04-20 Thread Shivu Sondur (JIRA)
Shivu Sondur created SPARK-27532: Summary: Correct the default value in the Documentation for "spark.redaction.regex" Key: SPARK-27532 URL: https://issues.apache.org/jira/browse/SPARK-27532 Project:

[jira] [Commented] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread binwei yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822616#comment-16822616 ] binwei yang commented on SPARK-27396: - This is the same proposal we are working on. I have a topic

[jira] [Resolved] (SPARK-27524) Remove the parquet-provided support

2019-04-20 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-27524. - Resolution: Not A Problem > Remove the parquet-provided support >

[jira] [Comment Edited] (SPARK-27421) RuntimeException when querying a view on a partitioned parquet table

2019-04-20 Thread Shivu Sondur (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16816851#comment-16816851 ] Shivu Sondur edited comment on SPARK-27421 at 4/21/19 1:30 AM: ---

[jira] [Created] (SPARK-27531) Improve explain output of describe table command to show the inputs to the command.

2019-04-20 Thread Dilip Biswal (JIRA)
Dilip Biswal created SPARK-27531: Summary: Improve explain output of describe table command to show the inputs to the command. Key: SPARK-27531 URL: https://issues.apache.org/jira/browse/SPARK-27531

[jira] [Commented] (SPARK-26970) Can't load PipelineModel that was created in Scala with Python due to missing Interaction transformer

2019-04-20 Thread Andrew Crosby (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822548#comment-16822548 ] Andrew Crosby commented on SPARK-26970: --- The code changes required for this looked relatively

[jira] [Commented] (SPARK-27512) Decimal parsing leads to unexpected type inference

2019-04-20 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822525#comment-16822525 ] koert kuipers commented on SPARK-27512: --- default locale is US, which now has this logic for

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Summary: Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches for the

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Summary: Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches (was: Allow

[jira] [Comment Edited] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822521#comment-16822521 ] Xiangrui Meng edited comment on SPARK-26412 at 4/20/19 5:13 PM:

[jira] [Comment Edited] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822521#comment-16822521 ] Xiangrui Meng edited comment on SPARK-26412 at 4/20/19 5:13 PM:

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Description: Pandas UDF is the ideal connection between PySpark and DL model inference

[jira] [Commented] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822521#comment-16822521 ] Xiangrui Meng commented on SPARK-26412: --- [~bryanc] It handles the data exchange for DL model

[jira] [Comment Edited] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822517#comment-16822517 ] Xiangrui Meng edited comment on SPARK-27396 at 4/20/19 5:03 PM:

[jira] [Comment Edited] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822517#comment-16822517 ] Xiangrui Meng edited comment on SPARK-27396 at 4/20/19 5:02 PM:

[jira] [Comment Edited] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822517#comment-16822517 ] Xiangrui Meng edited comment on SPARK-27396 at 4/20/19 5:01 PM:

[jira] [Commented] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822517#comment-16822517 ] Xiangrui Meng commented on SPARK-27396: --- [~revans2] Thanks for clarifying the proposal! If your

[jira] [Updated] (SPARK-27529) Spark Streaming consumer dies with kafka.common.OffsetOutOfRangeException

2019-04-20 Thread Dmitry Goldenberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Goldenberg updated SPARK-27529: -- Description: We have a Spark Streaming consumer which at a certain point started

[jira] [Updated] (SPARK-27530) FetchFailedException: Received a zero-size buffer for block shuffle

2019-04-20 Thread Adrian Muraru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrian Muraru updated SPARK-27530: -- Description: I'm getting this in a large shuffle:

[jira] [Created] (SPARK-27530) FetchFailedException: Received a zero-size buffer for block shuffle

2019-04-20 Thread Adrian Muraru (JIRA)
Adrian Muraru created SPARK-27530: - Summary: FetchFailedException: Received a zero-size buffer for block shuffle Key: SPARK-27530 URL: https://issues.apache.org/jira/browse/SPARK-27530 Project: Spark

[jira] [Updated] (SPARK-27529) Spark Streaming consumer dies with kafka.common.OffsetOutOfRangeException

2019-04-20 Thread Dmitry Goldenberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Goldenberg updated SPARK-27529: -- Description: We have a Spark Streaming consumer which at a certain point started

[jira] [Created] (SPARK-27529) Spark Streaming consumer dies with kafka.common.OffsetOutOfRangeException

2019-04-20 Thread Dmitry Goldenberg (JIRA)
Dmitry Goldenberg created SPARK-27529: - Summary: Spark Streaming consumer dies with kafka.common.OffsetOutOfRangeException Key: SPARK-27529 URL: https://issues.apache.org/jira/browse/SPARK-27529

[jira] [Created] (SPARK-27528) Use Parquet logical type TIMESTAMP_MICROS by default

2019-04-20 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-27528: -- Summary: Use Parquet logical type TIMESTAMP_MICROS by default Key: SPARK-27528 URL: https://issues.apache.org/jira/browse/SPARK-27528 Project: Spark Issue Type:

[jira] [Created] (SPARK-27527) Improve description of Timestamp and Date types

2019-04-20 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-27527: -- Summary: Improve description of Timestamp and Date types Key: SPARK-27527 URL: https://issues.apache.org/jira/browse/SPARK-27527 Project: Spark Issue Type:

[jira] [Commented] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Robert Joseph Evans (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822411#comment-16822411 ] Robert Joseph Evans commented on SPARK-27396: - [~mengxr], My goal is to provide a framework

[jira] [Updated] (SPARK-27526) Driver OOM error occurs while writing parquet file with Append mode

2019-04-20 Thread senyoung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] senyoung updated SPARK-27526: - Description: As this user code below {code:java} someDataFrame.write .mode(SaveMode.Append)

[jira] [Created] (SPARK-27526) Driver OOM error occurs while writing parquet file with Append mode

2019-04-20 Thread senyoung (JIRA)
senyoung created SPARK-27526: Summary: Driver OOM error occurs while writing parquet file with Append mode Key: SPARK-27526 URL: https://issues.apache.org/jira/browse/SPARK-27526 Project: Spark

[jira] [Created] (SPARK-27525) Exclude commons-httpclient when interacting with different versions of the HiveMetastoreClient

2019-04-20 Thread Yuming Wang (JIRA)
Yuming Wang created SPARK-27525: --- Summary: Exclude commons-httpclient when interacting with different versions of the HiveMetastoreClient Key: SPARK-27525 URL: https://issues.apache.org/jira/browse/SPARK-27525