[jira] [Commented] (SPARK-26312) Converting converters in RDDConversions into arrays to improve their access performance

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713576#comment-16713576 ] Apache Spark commented on SPARK-26312: -- User 'eatoncys' has created a pull request for this issue:

[jira] [Commented] (SPARK-26312) Converting converters in RDDConversions into arrays to improve their access performance

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713577#comment-16713577 ] Apache Spark commented on SPARK-26312: -- User 'eatoncys' has created a pull request for this issue:

[jira] [Assigned] (SPARK-26312) Converting converters in RDDConversions into arrays to improve their access performance

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26312: Assignee: (was: Apache Spark) > Converting converters in RDDConversions into arrays

[jira] [Assigned] (SPARK-26312) Converting converters in RDDConversions into arrays to improve their access performance

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26312: Assignee: Apache Spark > Converting converters in RDDConversions into arrays to improve

[jira] [Created] (SPARK-26312) Converting converters in RDDConversions into arrays to improve their access performance

2018-12-07 Thread eaton (JIRA)
eaton created SPARK-26312: - Summary: Converting converters in RDDConversions into arrays to improve their access performance Key: SPARK-26312 URL: https://issues.apache.org/jira/browse/SPARK-26312 Project:

[jira] [Commented] (SPARK-26311) [YARN] New feature: custom log URL for stdout/stderr

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713565#comment-16713565 ] Apache Spark commented on SPARK-26311: -- User 'HeartSaVioR' has created a pull request for this

[jira] [Commented] (SPARK-23674) Add Spark ML Listener for Tracking ML Pipeline Status

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713568#comment-16713568 ] Apache Spark commented on SPARK-23674: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-23674) Add Spark ML Listener for Tracking ML Pipeline Status

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713567#comment-16713567 ] Apache Spark commented on SPARK-23674: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-26311) [YARN] New feature: custom log URL for stdout/stderr

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26311: Assignee: (was: Apache Spark) > [YARN] New feature: custom log URL for stdout/stderr

[jira] [Assigned] (SPARK-26311) [YARN] New feature: custom log URL for stdout/stderr

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26311: Assignee: Apache Spark > [YARN] New feature: custom log URL for stdout/stderr >

[jira] [Commented] (SPARK-26311) [YARN] New feature: custom log URL for stdout/stderr

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713564#comment-16713564 ] Apache Spark commented on SPARK-26311: -- User 'HeartSaVioR' has created a pull request for this

[jira] [Created] (SPARK-26311) [YARN] New feature: custom log URL for stdout/stderr

2018-12-07 Thread Jungtaek Lim (JIRA)
Jungtaek Lim created SPARK-26311: Summary: [YARN] New feature: custom log URL for stdout/stderr Key: SPARK-26311 URL: https://issues.apache.org/jira/browse/SPARK-26311 Project: Spark Issue

[jira] [Updated] (SPARK-26224) Results in stackOverFlowError when trying to add 3000 new columns using withColumn function of dataframe.

2018-12-07 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-26224: - Component/s: (was: Spark Core) SQL > Results in stackOverFlowError

[jira] [Commented] (SPARK-26215) define reserved keywords after SQL standard

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713540#comment-16713540 ] Apache Spark commented on SPARK-26215: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-26215) define reserved keywords after SQL standard

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26215: Assignee: Apache Spark > define reserved keywords after SQL standard >

[jira] [Commented] (SPARK-26215) define reserved keywords after SQL standard

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713538#comment-16713538 ] Apache Spark commented on SPARK-26215: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-26215) define reserved keywords after SQL standard

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26215: Assignee: (was: Apache Spark) > define reserved keywords after SQL standard >

[jira] [Commented] (SPARK-23375) Optimizer should remove unneeded Sort

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713516#comment-16713516 ] Apache Spark commented on SPARK-23375: -- User 'seancxmao' has created a pull request for this issue:

[jira] [Commented] (SPARK-23375) Optimizer should remove unneeded Sort

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713514#comment-16713514 ] Apache Spark commented on SPARK-23375: -- User 'seancxmao' has created a pull request for this issue:

[jira] [Commented] (SPARK-26224) Results in stackOverFlowError when trying to add 3000 new columns using withColumn function of dataframe.

2018-12-07 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713494#comment-16713494 ] Liang-Chi Hsieh commented on SPARK-26224: - I think it is not specified to withColumn. withColumn

[jira] [Resolved] (SPARK-23734) InvalidSchemaException While Saving ALSModel

2018-12-07 Thread Stanley Poon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanley Poon resolved SPARK-23734. -- Resolution: Fixed Fix Version/s: 2.3.1 > InvalidSchemaException While Saving ALSModel

[jira] [Commented] (SPARK-23734) InvalidSchemaException While Saving ALSModel

2018-12-07 Thread Stanley Poon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713403#comment-16713403 ] Stanley Poon commented on SPARK-23734: -- Just confirmed the problem is fixed in Spark 2.3.1. The

[jira] [Commented] (SPARK-26282) Update JVM to 8u191 on jenkins workers

2018-12-07 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713388#comment-16713388 ] Dongjoon Hyun commented on SPARK-26282: --- Great, thanks again for this and email notifications. >

[jira] [Resolved] (SPARK-19526) Spark should raise an exception when it tries to read a Hive view but it doesn't have read access on the corresponding table(s)

2018-12-07 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-19526. Resolution: Cannot Reproduce > Spark should raise an exception when it tries to read a

[jira] [Commented] (SPARK-19526) Spark should raise an exception when it tries to read a Hive view but it doesn't have read access on the corresponding table(s)

2018-12-07 Thread Reza Safi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713368#comment-16713368 ] Reza Safi commented on SPARK-19526: --- It seems that this can be resolved since we can't reproduce the

[jira] [Commented] (SPARK-26282) Update JVM to 8u191 on jenkins workers

2018-12-07 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713353#comment-16713353 ] shane knapp commented on SPARK-26282: - test build passed!  

[jira] [Resolved] (SPARK-26282) Update JVM to 8u191 on jenkins workers

2018-12-07 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp resolved SPARK-26282. - Resolution: Fixed > Update JVM to 8u191 on jenkins workers >

[jira] [Commented] (SPARK-26282) Update JVM to 8u191 on jenkins workers

2018-12-07 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713359#comment-16713359 ] shane knapp commented on SPARK-26282: - done.  about to email dev@ for a heads-up. > Update JVM to

[jira] [Resolved] (SPARK-24333) Add fit with validation set to spark.ml GBT: Python API

2018-12-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-24333. -- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 21465

[jira] [Assigned] (SPARK-26304) Add default value to spark.kafka.sasl.kerberos.service.name parameter

2018-12-07 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-26304: -- Assignee: Gabor Somogyi > Add default value to

[jira] [Resolved] (SPARK-26304) Add default value to spark.kafka.sasl.kerberos.service.name parameter

2018-12-07 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-26304. Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 23254

[jira] [Assigned] (SPARK-24333) Add fit with validation set to spark.ml GBT: Python API

2018-12-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler reassigned SPARK-24333: Assignee: Huaxin Gao > Add fit with validation set to spark.ml GBT: Python API >

[jira] [Assigned] (SPARK-26310) Verification of JSON options

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26310: Assignee: (was: Apache Spark) > Verification of JSON options >

[jira] [Commented] (SPARK-26310) Verification of JSON options

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713285#comment-16713285 ] Apache Spark commented on SPARK-26310: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-26310) Verification of JSON options

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26310: Assignee: Apache Spark > Verification of JSON options > > >

[jira] [Commented] (SPARK-26310) Verification of JSON options

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713283#comment-16713283 ] Apache Spark commented on SPARK-26310: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-25696) The storage memory displayed on spark Application UI is incorrect.

2018-12-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-25696: - Docs Text: In Spark 3.0, the web UI and log statements now consistently report units in

[jira] [Created] (SPARK-26310) Verification of JSON options

2018-12-07 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-26310: -- Summary: Verification of JSON options Key: SPARK-26310 URL: https://issues.apache.org/jira/browse/SPARK-26310 Project: Spark Issue Type: Sub-task

[jira] [Resolved] (SPARK-26196) Total tasks message in the stage is incorrect, when there are failed or killed tasks

2018-12-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-26196. --- Resolution: Fixed Assignee: shahid Fix Version/s: 3.0.0 Resolved by

[jira] [Resolved] (SPARK-26281) Duration column of task table should be executor run time instead of real duration

2018-12-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-26281. --- Resolution: Fixed Assignee: shahid Fix Version/s: 3.0.0 Resolved by

[jira] [Created] (SPARK-26309) Verification of Data source options

2018-12-07 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-26309: -- Summary: Verification of Data source options Key: SPARK-26309 URL: https://issues.apache.org/jira/browse/SPARK-26309 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-26281) Duration column of task table should be executor run time instead of real duration

2018-12-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-26281: -- Priority: Minor (was: Major) Issue Type: Bug (was: Improvement) > Duration column of task

[jira] [Assigned] (SPARK-25299) Use remote storage for persisting shuffle data

2018-12-07 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25299: -- Assignee: (was: Marcelo Vanzin) > Use remote storage for persisting shuffle data

[jira] [Assigned] (SPARK-25299) Use remote storage for persisting shuffle data

2018-12-07 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25299: -- Assignee: Marcelo Vanzin > Use remote storage for persisting shuffle data >

[jira] [Resolved] (SPARK-26294) Delete Unnecessary If statement

2018-12-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-26294. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 23247

[jira] [Assigned] (SPARK-26294) Delete Unnecessary If statement

2018-12-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-26294: - Assignee: wangjiaochun > Delete Unnecessary If statement > --- > >

[jira] [Commented] (SPARK-24207) PrefixSpan: R API

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713242#comment-16713242 ] Apache Spark commented on SPARK-24207: -- User 'huaxingao' has created a pull request for this issue:

[jira] [Commented] (SPARK-26306) Flaky test: org.apache.spark.util.collection.SorterSuite

2018-12-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713231#comment-16713231 ] Gabor Somogyi commented on SPARK-26306: --- Tested it on my local machine in a loop and never

[jira] [Created] (SPARK-26308) Large BigDecimal value is converted to null when passed into a UDF

2018-12-07 Thread Jay Pranavamurthi (JIRA)
Jay Pranavamurthi created SPARK-26308: - Summary: Large BigDecimal value is converted to null when passed into a UDF Key: SPARK-26308 URL: https://issues.apache.org/jira/browse/SPARK-26308

[jira] [Assigned] (SPARK-24243) Expose exceptions from InProcessAppHandle

2018-12-07 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-24243: -- Assignee: Sahil Takiar > Expose exceptions from InProcessAppHandle >

[jira] [Resolved] (SPARK-24243) Expose exceptions from InProcessAppHandle

2018-12-07 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24243. Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 23221

[jira] [Commented] (SPARK-26282) Update JVM to 8u191 on jenkins workers

2018-12-07 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713159#comment-16713159 ] Dongjoon Hyun commented on SPARK-26282: --- Thank you for sharing, [~shaneknapp]! > Update JVM to

[jira] [Commented] (SPARK-26282) Update JVM to 8u191 on jenkins workers

2018-12-07 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713142#comment-16713142 ] shane knapp commented on SPARK-26282: - btw, all of the compile and lint jobs have been running on

[jira] [Commented] (SPARK-26307) Fix CTAS when INSERT a partitioned table using Hive serde

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713140#comment-16713140 ] Apache Spark commented on SPARK-26307: -- User 'gatorsmile' has created a pull request for this

[jira] [Assigned] (SPARK-26307) Fix CTAS when INSERT a partitioned table using Hive serde

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26307: Assignee: Apache Spark (was: Xiao Li) > Fix CTAS when INSERT a partitioned table using

[jira] [Assigned] (SPARK-26307) Fix CTAS when INSERT a partitioned table using Hive serde

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26307: Assignee: Xiao Li (was: Apache Spark) > Fix CTAS when INSERT a partitioned table using

[jira] [Updated] (SPARK-26267) Kafka source may reprocess data

2018-12-07 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-26267: - Priority: Blocker (was: Major) > Kafka source may reprocess data >

[jira] [Updated] (SPARK-26267) Kafka source may reprocess data

2018-12-07 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-26267: - Labels: correctness (was: ) > Kafka source may reprocess data >

[jira] [Commented] (SPARK-26282) Update JVM to 8u191 on jenkins workers

2018-12-07 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713134#comment-16713134 ] shane knapp commented on SPARK-26282: - test build now running:

[jira] [Created] (SPARK-26307) Fix CTAS when INSERT a partitioned table using Hive serde

2018-12-07 Thread Xiao Li (JIRA)
Xiao Li created SPARK-26307: --- Summary: Fix CTAS when INSERT a partitioned table using Hive serde Key: SPARK-26307 URL: https://issues.apache.org/jira/browse/SPARK-26307 Project: Spark Issue Type:

[jira] [Commented] (SPARK-26282) Update JVM to 8u191 on jenkins workers

2018-12-07 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713118#comment-16713118 ] shane knapp commented on SPARK-26282: - i'm waiting on someone from databricks to merge.  i pinged

[jira] [Commented] (SPARK-26282) Update JVM to 8u191 on jenkins workers

2018-12-07 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713093#comment-16713093 ] Dongjoon Hyun commented on SPARK-26282: --- Hi, [~shaneknapp]. Is there any update on your PR to

[jira] [Updated] (SPARK-26283) When zstd compression enabled, Inprogress application in the history server appUI showing finished job as running

2018-12-07 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shahid updated SPARK-26283: --- Priority: Major (was: Minor) > When zstd compression enabled, Inprogress application in the history server

[jira] [Commented] (SPARK-25331) Structured Streaming File Sink duplicates records in case of driver failure

2018-12-07 Thread Mihaly Toth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16713030#comment-16713030 ] Mihaly Toth commented on SPARK-25331: - I have closed my PR. I guess it should be documented that we

[jira] [Comment Edited] (SPARK-26305) Breakthrough the memory limitation of broadcast join

2018-12-07 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712991#comment-16712991 ] Dongjoon Hyun edited comment on SPARK-26305 at 12/7/18 3:43 PM: +1 for

[jira] [Commented] (SPARK-26305) Breakthrough the memory limitation of broadcast join

2018-12-07 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712991#comment-16712991 ] Dongjoon Hyun commented on SPARK-26305: --- +1 for the idea. > Breakthrough the memory limitation of

[jira] [Updated] (SPARK-26305) Breakthrough the memory limitation of broadcast join

2018-12-07 Thread Lantao Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lantao Jin updated SPARK-26305: --- Description: If the join between a big table and a small one faces data skewing issue, we usually

[jira] [Commented] (SPARK-26306) Flaky test: org.apache.spark.util.collection.SorterSuite

2018-12-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712975#comment-16712975 ] Gabor Somogyi commented on SPARK-26306: --- No idea, I've seen it only in PR builder and thought file

[jira] [Commented] (SPARK-26306) Flaky test: org.apache.spark.util.collection.SorterSuite

2018-12-07 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712970#comment-16712970 ] Liang-Chi Hsieh commented on SPARK-26306: - Besides above build, is there any build that this

[jira] [Updated] (SPARK-26306) Flaky test: org.apache.spark.util.collection.SorterSuite

2018-12-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26306: -- Component/s: (was: Spark Core) Tests > Flaky test:

[jira] [Created] (SPARK-26306) Flaky test: org.apache.spark.util.collection.SorterSuite

2018-12-07 Thread Gabor Somogyi (JIRA)
Gabor Somogyi created SPARK-26306: - Summary: Flaky test: org.apache.spark.util.collection.SorterSuite Key: SPARK-26306 URL: https://issues.apache.org/jira/browse/SPARK-26306 Project: Spark

[jira] [Commented] (SPARK-26265) deadlock between TaskMemoryManager and BytesToBytesMap$MapIterator

2018-12-07 Thread qian han (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712905#comment-16712905 ] qian han commented on SPARK-26265: -- Okay > deadlock between TaskMemoryManager and

[jira] [Commented] (SPARK-25401) Reorder the required ordering to match the table's output ordering for bucket join

2018-12-07 Thread Wang, Gang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712873#comment-16712873 ] Wang, Gang commented on SPARK-25401: Yeah. I think so.  And please make sure the outputOrdering of 

[jira] [Commented] (SPARK-26305) Breakthrough the memory limitation of broadcast join

2018-12-07 Thread Lantao Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712855#comment-16712855 ] Lantao Jin commented on SPARK-26305: CC [~jiangxb1987] [~cloud_fan] [~dongjoon] [~hyukjin.kwon],

[jira] [Created] (SPARK-26305) Breakthrough the memory limitation of broadcast join

2018-12-07 Thread Lantao Jin (JIRA)
Lantao Jin created SPARK-26305: -- Summary: Breakthrough the memory limitation of broadcast join Key: SPARK-26305 URL: https://issues.apache.org/jira/browse/SPARK-26305 Project: Spark Issue Type:

[jira] [Commented] (SPARK-26254) Move delegation token providers into a separate project

2018-12-07 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712843#comment-16712843 ] Steve Loughran commented on SPARK-26254: maybe ask the Kafka people for opinions [~jkreps] can

[jira] [Updated] (SPARK-26266) Update to Scala 2.12.8

2018-12-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-26266: -- Docs Text: Use Spark with the latest maintenance release of Java, for security and bug fixes, and to

[jira] [Comment Edited] (SPARK-25401) Reorder the required ordering to match the table's output ordering for bucket join

2018-12-07 Thread David Vrba (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712350#comment-16712350 ] David Vrba edited comment on SPARK-25401 at 12/7/18 10:59 AM: -- I was

[jira] [Assigned] (SPARK-26304) Add default value to spark.kafka.sasl.kerberos.service.name parameter

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26304: Assignee: (was: Apache Spark) > Add default value to

[jira] [Commented] (SPARK-26304) Add default value to spark.kafka.sasl.kerberos.service.name parameter

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712639#comment-16712639 ] Apache Spark commented on SPARK-26304: -- User 'gaborgsomogyi' has created a pull request for this

[jira] [Assigned] (SPARK-26304) Add default value to spark.kafka.sasl.kerberos.service.name parameter

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26304: Assignee: Apache Spark > Add default value to spark.kafka.sasl.kerberos.service.name

[jira] [Created] (SPARK-26304) Add default value to spark.kafka.sasl.kerberos.service.name parameter

2018-12-07 Thread Gabor Somogyi (JIRA)
Gabor Somogyi created SPARK-26304: - Summary: Add default value to spark.kafka.sasl.kerberos.service.name parameter Key: SPARK-26304 URL: https://issues.apache.org/jira/browse/SPARK-26304 Project:

[jira] [Updated] (SPARK-26290) [K8s] Driver Pods no mounted volumes on submissions from older spark versions

2018-12-07 Thread Martin Buchleitner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martin Buchleitner updated SPARK-26290: --- Environment: Kuberentes: 1.10.6 Container: Spark 2.4.0  Spark containers are built

[jira] [Assigned] (SPARK-26303) Return partial results for bad JSON records

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26303: Assignee: (was: Apache Spark) > Return partial results for bad JSON records >

[jira] [Created] (SPARK-26303) Return partial results for bad JSON records

2018-12-07 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-26303: -- Summary: Return partial results for bad JSON records Key: SPARK-26303 URL: https://issues.apache.org/jira/browse/SPARK-26303 Project: Spark Issue Type:

[jira] [Commented] (SPARK-26303) Return partial results for bad JSON records

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712603#comment-16712603 ] Apache Spark commented on SPARK-26303: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-26303) Return partial results for bad JSON records

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26303: Assignee: Apache Spark > Return partial results for bad JSON records >

[jira] [Commented] (SPARK-26303) Return partial results for bad JSON records

2018-12-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712600#comment-16712600 ] Apache Spark commented on SPARK-26303: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Created] (SPARK-26302) retainedBatches configuration can cause memory leak

2018-12-07 Thread Behroz Sikander (JIRA)
Behroz Sikander created SPARK-26302: --- Summary: retainedBatches configuration can cause memory leak Key: SPARK-26302 URL: https://issues.apache.org/jira/browse/SPARK-26302 Project: Spark

[jira] [Commented] (SPARK-26302) retainedBatches configuration can cause memory leak

2018-12-07 Thread Behroz Sikander (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712559#comment-16712559 ] Behroz Sikander commented on SPARK-26302: - I am willing to do a PR for documentation once

[jira] [Updated] (SPARK-26302) retainedBatches configuration can cause memory leak

2018-12-07 Thread Behroz Sikander (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Behroz Sikander updated SPARK-26302: Attachment: heap_dump_detail.png > retainedBatches configuration can cause memory leak >

[jira] [Updated] (SPARK-26295) [K8S] serviceAccountName is not set in client mode

2018-12-07 Thread Adrian Tanase (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrian Tanase updated SPARK-26295: -- Description: When deploying spark apps in client mode (in my case from inside the driver

[jira] [Commented] (SPARK-26295) [K8S] serviceAccountName is not set in client mode

2018-12-07 Thread Adrian Tanase (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712493#comment-16712493 ] Adrian Tanase commented on SPARK-26295: --- [~vanzin] I'm not sure how it applies. I'd be happy to