[jira] [Created] (SPARK-32504) Shuffle Storage API: Dynamic updates of shuffle metadata

2020-07-30 Thread Matt Cheah (Jira)
Matt Cheah created SPARK-32504: -- Summary: Shuffle Storage API: Dynamic updates of shuffle metadata Key: SPARK-32504 URL: https://issues.apache.org/jira/browse/SPARK-32504 Project: Spark Issue

[jira] [Commented] (SPARK-28210) Shuffle Storage API: Reads

2020-07-30 Thread Matt Cheah (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17168287#comment-17168287 ] Matt Cheah commented on SPARK-28210: [~devaraj] [~tianczha] Thanks for expressing interest in this!

[jira] [Created] (SPARK-31801) Register shuffle map output metadata with a shuffle output tracker

2020-05-22 Thread Matt Cheah (Jira)
Matt Cheah created SPARK-31801: -- Summary: Register shuffle map output metadata with a shuffle output tracker Key: SPARK-31801 URL: https://issues.apache.org/jira/browse/SPARK-31801 Project: Spark

[jira] [Updated] (SPARK-31798) Return map output metadata from shuffle writers

2020-05-22 Thread Matt Cheah (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-31798: --- Description: Part of the overall sub-push for shuffle metadata management as proposed in [this

[jira] [Updated] (SPARK-31798) Return map output metadata from shuffle writers

2020-05-22 Thread Matt Cheah (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-31798: --- Description: Part of the overall sub-push for shuffle metadata management as proposed in [this

[jira] [Updated] (SPARK-31798) Return map output metadata from shuffle writers

2020-05-22 Thread Matt Cheah (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-31798: --- Description: Part of the overall sub-push for shuffle metadata management as proposed in [this

[jira] [Created] (SPARK-31798) Return map output metadata from shuffle writers

2020-05-22 Thread Matt Cheah (Jira)
Matt Cheah created SPARK-31798: -- Summary: Return map output metadata from shuffle writers Key: SPARK-31798 URL: https://issues.apache.org/jira/browse/SPARK-31798 Project: Spark Issue Type:

[jira] [Updated] (SPARK-29072) Properly track shuffle write time with refactor

2019-09-12 Thread Matt Cheah (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-29072: --- Description: From SPARK-28209, SPARK-28570, and SPARK-28571, we used the new shuffle writer plugin

[jira] [Updated] (SPARK-29072) Properly track shuffle write time with refactor

2019-09-12 Thread Matt Cheah (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-29072: --- Description: From SPARK-28209, SPARK-28570, and SPARK-28571, we used the new shuffle writer

[jira] [Updated] (SPARK-29072) Properly track shuffle write time with refactor

2019-09-12 Thread Matt Cheah (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-29072: --- Description: From SPARK-28209, SPARK-28570, and SPARK-28571, we used the new shuffle writer

[jira] [Created] (SPARK-29072) Properly track shuffle write time with refactor

2019-09-12 Thread Matt Cheah (Jira)
Matt Cheah created SPARK-29072: -- Summary: Properly track shuffle write time with refactor Key: SPARK-29072 URL: https://issues.apache.org/jira/browse/SPARK-29072 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-28764) Remove unnecessary writePartitionedFile method from ExternalSorter

2019-08-16 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28764: -- Summary: Remove unnecessary writePartitionedFile method from ExternalSorter Key: SPARK-28764 URL: https://issues.apache.org/jira/browse/SPARK-28764 Project: Spark

[jira] [Updated] (SPARK-28607) Don't hold a reference to two partitionLengths arrays

2019-08-02 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-28607: --- Description: SPARK-28209 introduced the new shuffle writer API and its usage in

[jira] [Created] (SPARK-28607) Don't hold a reference to two partitionLengths arrays

2019-08-02 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28607: -- Summary: Don't hold a reference to two partitionLengths arrays Key: SPARK-28607 URL: https://issues.apache.org/jira/browse/SPARK-28607 Project: Spark Issue

[jira] [Updated] (SPARK-28209) Shuffle Storage API: Writer API and usage in BypassMergeSortShuffleWriter

2019-07-30 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-28209: --- Description: Adds the write-side API for storing shuffle data in arbitrary storage systems. Also

[jira] [Created] (SPARK-28571) Shuffle storage API: Use API in SortShuffleWriter

2019-07-30 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28571: -- Summary: Shuffle storage API: Use API in SortShuffleWriter Key: SPARK-28571 URL: https://issues.apache.org/jira/browse/SPARK-28571 Project: Spark Issue Type:

[jira] [Created] (SPARK-28570) Shuffle Storage API: Use writer API in UnsafeShuffleWriter

2019-07-30 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28570: -- Summary: Shuffle Storage API: Use writer API in UnsafeShuffleWriter Key: SPARK-28570 URL: https://issues.apache.org/jira/browse/SPARK-28570 Project: Spark Issue

[jira] [Updated] (SPARK-28209) Shuffle Storage API: Writer API and usage in BypassMergeSortShuffleWriter

2019-07-30 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-28209: --- Summary: Shuffle Storage API: Writer API and usage in BypassMergeSortShuffleWriter (was: Shuffle

[jira] [Updated] (SPARK-28209) Shuffle Storage API: Writer API

2019-07-30 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-28209: --- Summary: Shuffle Storage API: Writer API (was: Shuffle Storage API: Writes) > Shuffle Storage API:

[jira] [Created] (SPARK-28568) Make Javadoc in org.apache.spark.shuffle.api visible

2019-07-30 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28568: -- Summary: Make Javadoc in org.apache.spark.shuffle.api visible Key: SPARK-28568 URL: https://issues.apache.org/jira/browse/SPARK-28568 Project: Spark Issue Type:

[jira] [Created] (SPARK-28238) DESCRIBE TABLE for Data Source V2 tables

2019-07-02 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28238: -- Summary: DESCRIBE TABLE for Data Source V2 tables Key: SPARK-28238 URL: https://issues.apache.org/jira/browse/SPARK-28238 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-28212) Shuffle Storage API: Shuffle Cleanup

2019-06-28 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28212: -- Summary: Shuffle Storage API: Shuffle Cleanup Key: SPARK-28212 URL: https://issues.apache.org/jira/browse/SPARK-28212 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-28211) Shuffle Storage API: Driver Lifecycle

2019-06-28 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28211: -- Summary: Shuffle Storage API: Driver Lifecycle Key: SPARK-28211 URL: https://issues.apache.org/jira/browse/SPARK-28211 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-28210) Shuffle Storage API: Reads

2019-06-28 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28210: -- Summary: Shuffle Storage API: Reads Key: SPARK-28210 URL: https://issues.apache.org/jira/browse/SPARK-28210 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-25299) Use remote storage for persisting shuffle data

2019-06-28 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-25299: --- Description: In Spark, the shuffle primitive requires Spark executors to persist data to the local

[jira] [Commented] (SPARK-25299) Use remote storage for persisting shuffle data

2019-06-28 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16875106#comment-16875106 ] Matt Cheah commented on SPARK-25299: I also noticed the SPIP document wasn't ever posted on this

[jira] [Created] (SPARK-28209) Shuffle Storage API: Writes

2019-06-28 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-28209: -- Summary: Shuffle Storage API: Writes Key: SPARK-28209 URL: https://issues.apache.org/jira/browse/SPARK-28209 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-25299) Use remote storage for persisting shuffle data

2019-06-28 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16875101#comment-16875101 ] Matt Cheah commented on SPARK-25299: Let's start by making sub-issues. I have a patch staged for

[jira] [Resolved] (SPARK-26874) With PARQUET-1414, Spark can erroneously write empty pages

2019-02-14 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-26874. Resolution: Not A Problem I did some more digging with [~rdblue] and we discovered that this is a

[jira] [Updated] (SPARK-26874) With PARQUET-1414, Spark can erroneously write empty pages

2019-02-13 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-26874: --- Description: This issue will only come up when Spark upgrades its Parquet dependency to the latest

[jira] [Updated] (SPARK-26874) When we upgrade Parquet to 1.11+, Spark can erroneously write empty pages

2019-02-13 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-26874: --- Priority: Critical (was: Major) > When we upgrade Parquet to 1.11+, Spark can erroneously write

[jira] [Updated] (SPARK-26874) When we upgrade Parquet to 1.11+, Spark can erroneously write empty pages

2019-02-13 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-26874: --- Description: This issue will only come up when Spark upgrades its Parquet dependency to the

[jira] [Commented] (SPARK-26874) When we upgrade Parquet to 1.11+, Spark can erroneously write empty pages

2019-02-13 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16767736#comment-16767736 ] Matt Cheah commented on SPARK-26874: [~rdblue] [~cloud_fan] - was wondering if you had any thoughts

[jira] [Created] (SPARK-26874) When we upgrade Parquet to 1.11+, Spark can erroneously write empty pages

2019-02-13 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-26874: -- Summary: When we upgrade Parquet to 1.11+, Spark can erroneously write empty pages Key: SPARK-26874 URL: https://issues.apache.org/jira/browse/SPARK-26874 Project: Spark

[jira] [Resolved] (SPARK-26625) spark.redaction.regex should include oauthToken

2019-01-16 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-26625. Resolution: Fixed Fix Version/s: 3.0.0 > spark.redaction.regex should include oauthToken >

[jira] [Resolved] (SPARK-25877) Put all feature-related code in the feature step itself

2018-12-12 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-25877. Resolution: Fixed Fix Version/s: 3.0.0 > Put all feature-related code in the feature step

[jira] [Created] (SPARK-26301) Consider switching from putting secret in environment variable directly to using secret reference

2018-12-06 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-26301: -- Summary: Consider switching from putting secret in environment variable directly to using secret reference Key: SPARK-26301 URL: https://issues.apache.org/jira/browse/SPARK-26301

[jira] [Resolved] (SPARK-26194) Support automatic spark.authenticate secret in Kubernetes backend

2018-12-06 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-26194. Resolution: Fixed Fix Version/s: 3.0.0 > Support automatic spark.authenticate secret in

[jira] [Commented] (SPARK-26239) Add configurable auth secret source in k8s backend

2018-12-03 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16707962#comment-16707962 ] Matt Cheah commented on SPARK-26239: It could work in client mode but is less useful there overall

[jira] [Resolved] (SPARK-25876) Simplify configuration types in k8s backend

2018-11-30 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-25876. Resolution: Fixed Fix Version/s: 3.0.0 > Simplify configuration types in k8s backend >

[jira] [Comment Edited] (SPARK-26239) Add configurable auth secret source in k8s backend

2018-11-30 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16705273#comment-16705273 ] Matt Cheah edited comment on SPARK-26239 at 11/30/18 8:59 PM: -- Could we add

[jira] [Commented] (SPARK-26239) Add configurable auth secret source in k8s backend

2018-11-30 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16705273#comment-16705273 ] Matt Cheah commented on SPARK-26239: Would a simple addition just to point to file paths for the

[jira] [Resolved] (SPARK-25957) Skip building spark-r docker image if spark distribution does not have R support

2018-11-21 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-25957. Resolution: Fixed Fix Version/s: 3.0.0 > Skip building spark-r docker image if spark

[jira] [Updated] (SPARK-26078) WHERE .. IN fails to filter rows when used in combination with UNION

2018-11-15 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-26078: --- Labels: Correctness (was: ) > WHERE .. IN fails to filter rows when used in combination with UNION

[jira] [Updated] (SPARK-26078) WHERE .. IN fails to filter rows when used in combination with UNION

2018-11-15 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-26078: --- Labels: Correctness correctness (was: Correctness) > WHERE .. IN fails to filter rows when used in

[jira] [Updated] (SPARK-26078) WHERE .. IN fails to filter rows when used in combination with UNION

2018-11-15 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-26078: --- Target Version/s: 2.4.1, 3.0.0 > WHERE .. IN fails to filter rows when used in combination with

[jira] [Updated] (SPARK-26078) WHERE .. IN fails to filter rows when used in combination with UNION

2018-11-15 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-26078: --- Labels: correctness (was: Correctness correctness) > WHERE .. IN fails to filter rows when used in

[jira] [Updated] (SPARK-26078) WHERE .. IN fails to filter rows when used in combination with UNION

2018-11-15 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-26078: --- Priority: Blocker (was: Major) > WHERE .. IN fails to filter rows when used in combination with

[jira] [Commented] (SPARK-26078) WHERE .. IN fails to filter rows when used in combination with UNION

2018-11-15 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16688460#comment-16688460 ] Matt Cheah commented on SPARK-26078: As per other correctness issues we have seen as of late and as

[jira] [Commented] (SPARK-25821) Remove SQLContext methods deprecated as of Spark 1.4

2018-11-13 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16685705#comment-16685705 ] Matt Cheah commented on SPARK-25821: Should we call this out in release notes? It seems pretty

[jira] [Updated] (SPARK-25862) Remove rangeBetween APIs introduced in SPARK-21608

2018-11-13 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-25862: --- Labels: release-notes (was: ) > Remove rangeBetween APIs introduced in SPARK-21608 >

[jira] [Commented] (SPARK-25862) Remove rangeBetween APIs introduced in SPARK-21608

2018-11-13 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16685704#comment-16685704 ] Matt Cheah commented on SPARK-25862: Adding release-notes label because this seems like a breaking

[jira] [Updated] (SPARK-25908) Remove old deprecated items in Spark 3

2018-11-13 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-25908: --- Labels: release-notes (was: ) > Remove old deprecated items in Spark 3 >

[jira] [Commented] (SPARK-24834) Utils#nanSafeCompare{Double,Float} functions do not differ from normal java double/float comparison

2018-11-08 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680170#comment-16680170 ] Matt Cheah commented on SPARK-24834: [~srowen] - I know this is an old ticket but I wanted to

[jira] [Resolved] (SPARK-25875) Merge code to set up driver features for different languages

2018-11-02 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-25875. Resolution: Fixed > Merge code to set up driver features for different languages >

[jira] [Resolved] (SPARK-25809) Support additional K8S cluster types for integration tests

2018-11-01 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-25809. Resolution: Fixed > Support additional K8S cluster types for integration tests >

[jira] [Resolved] (SPARK-24434) Support user-specified driver and executor pod templates

2018-10-30 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-24434. Resolution: Fixed > Support user-specified driver and executor pod templates >

[jira] [Commented] (SPARK-18278) SPIP: Support native submission of spark jobs to a kubernetes cluster

2018-10-15 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16650524#comment-16650524 ] Matt Cheah commented on SPARK-18278: The fork is no longer being maintained, because Kubernetes

[jira] [Commented] (SPARK-18278) SPIP: Support native submission of spark jobs to a kubernetes cluster

2018-10-12 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16648208#comment-16648208 ] Matt Cheah commented on SPARK-18278: [~liushaohui] a significantly large prerequisite for dynamic

[jira] [Resolved] (SPARK-23429) Add executor memory metrics to heartbeat and expose in executors REST API

2018-09-07 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-23429. Resolution: Fixed > Add executor memory metrics to heartbeat and expose in executors REST API >

[jira] [Commented] (SPARK-25262) Make Spark local dir volumes configurable with Spark on Kubernetes

2018-09-06 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16606522#comment-16606522 ] Matt Cheah commented on SPARK-25262: For [https://github.com/apache/spark/pull/22323] we allow using

[jira] [Resolved] (SPARK-25222) Spark on Kubernetes Pod Watcher dumps raw container status

2018-09-06 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-25222. Resolution: Fixed > Spark on Kubernetes Pod Watcher dumps raw container status >

[jira] [Commented] (SPARK-25299) Use remote storage for persisting shuffle data

2018-09-04 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16603807#comment-16603807 ] Matt Cheah commented on SPARK-25299: (Changed the title to "remote storage" for a little more

[jira] [Updated] (SPARK-25299) Use remote storage for persisting shuffle data

2018-09-04 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-25299: --- Summary: Use remote storage for persisting shuffle data (was: Use distributed storage for

[jira] [Commented] (SPARK-24434) Support user-specified driver and executor pod templates

2018-09-01 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16599697#comment-16599697 ] Matt Cheah commented on SPARK-24434: Everyone, thank you for your contribution to this discussion.

[jira] [Commented] (SPARK-25299) Use distributed storage for persisting shuffle data

2018-08-31 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16599428#comment-16599428 ] Matt Cheah commented on SPARK-25299: Note that SPARK-1529 was a much earlier feature request that

[jira] [Comment Edited] (SPARK-25299) Use distributed storage for persisting shuffle data

2018-08-31 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16599428#comment-16599428 ] Matt Cheah edited comment on SPARK-25299 at 9/1/18 12:27 AM: - Note that

[jira] [Created] (SPARK-25299) Use distributed storage for persisting shuffle data

2018-08-31 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-25299: -- Summary: Use distributed storage for persisting shuffle data Key: SPARK-25299 URL: https://issues.apache.org/jira/browse/SPARK-25299 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-25264) Fix comma-delineated arguments passed into PythonRunner and RRunner

2018-08-31 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-25264. Resolution: Fixed Fix Version/s: 2.4.0 > Fix comma-delineated arguments passed into

[jira] [Created] (SPARK-25152) Enable Spark on Kubernetes R Integration Tests

2018-08-17 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-25152: -- Summary: Enable Spark on Kubernetes R Integration Tests Key: SPARK-25152 URL: https://issues.apache.org/jira/browse/SPARK-25152 Project: Spark Issue Type: Test

[jira] [Updated] (SPARK-24433) Add Spark R support

2018-08-17 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-24433: --- Fix Version/s: 2.4.0 > Add Spark R support > --- > > Key:

[jira] [Resolved] (SPARK-24433) Add Spark R support

2018-08-17 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-24433. Resolution: Fixed > Add Spark R support > --- > > Key:

[jira] [Resolved] (SPARK-24960) k8s: explicitly expose ports on driver container

2018-08-01 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-24960. Resolution: Fixed Fix Version/s: 2.4.0 > k8s: explicitly expose ports on driver container

[jira] [Resolved] (SPARK-24963) Integration tests will fail if they run in a namespace not being the default

2018-07-30 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-24963. Resolution: Fixed Fix Version/s: 2.4.0 > Integration tests will fail if they run in a

[jira] [Resolved] (SPARK-23146) Support client mode for Kubernetes cluster backend

2018-07-25 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-23146. Resolution: Fixed Fix Version/s: 2.4.0 > Support client mode for Kubernetes cluster

[jira] [Commented] (SPARK-24825) [K8S][TEST] Kubernetes integration tests don't trace the maven project dependency structure

2018-07-17 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16547137#comment-16547137 ] Matt Cheah commented on SPARK-24825: We're looking into this now, this particular phase was built

[jira] [Updated] (SPARK-24825) [K8S][TEST] Kubernetes integration tests don't trace the maven project dependency structure

2018-07-16 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-24825: --- Issue Type: Bug (was: Improvement) > [K8S][TEST] Kubernetes integration tests don't trace the

[jira] [Created] (SPARK-24825) [K8S][TEST] Kubernetes integration tests don't trace the maven project dependency structure

2018-07-16 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-24825: -- Summary: [K8S][TEST] Kubernetes integration tests don't trace the maven project dependency structure Key: SPARK-24825 URL: https://issues.apache.org/jira/browse/SPARK-24825

[jira] [Updated] (SPARK-24825) [K8S][TEST] Kubernetes integration tests don't trace the maven project dependency structure

2018-07-16 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-24825: --- Priority: Critical (was: Major) > [K8S][TEST] Kubernetes integration tests don't trace the maven

[jira] [Resolved] (SPARK-24683) SparkLauncher.NO_RESOURCE doesn't work with Java applications

2018-07-02 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-24683. Resolution: Fixed Fix Version/s: 2.4.0 > SparkLauncher.NO_RESOURCE doesn't work with Java

[jira] [Created] (SPARK-24683) SparkLauncher.NO_RESOURCE doesn't work with Java applications

2018-06-28 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-24683: -- Summary: SparkLauncher.NO_RESOURCE doesn't work with Java applications Key: SPARK-24683 URL: https://issues.apache.org/jira/browse/SPARK-24683 Project: Spark

[jira] [Created] (SPARK-24655) [K8S] Custom Docker Image Expectations and Documentation

2018-06-25 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-24655: -- Summary: [K8S] Custom Docker Image Expectations and Documentation Key: SPARK-24655 URL: https://issues.apache.org/jira/browse/SPARK-24655 Project: Spark Issue

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-06-18 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516368#comment-16516368 ] Matt Cheah commented on SPARK-24248: I've summarized what we ended up going with after some

[jira] [Commented] (SPARK-24584) [K8s] More efficient storage of executor pod state

2018-06-18 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16516300#comment-16516300 ] Matt Cheah commented on SPARK-24584: Related to https://issues.apache.org/jira/browse/SPARK-24248 >

[jira] [Created] (SPARK-24584) [K8s] More efficient storage of executor pod state

2018-06-18 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-24584: -- Summary: [K8s] More efficient storage of executor pod state Key: SPARK-24584 URL: https://issues.apache.org/jira/browse/SPARK-24584 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-06-14 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-24248. Resolution: Fixed Fix Version/s: 2.4.0 > [K8S] Use the Kubernetes cluster as the backing

[jira] [Resolved] (SPARK-23010) Add integration testing for Kubernetes backend into the apache/spark repository

2018-06-08 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-23010. Resolution: Fixed Fix Version/s: 2.4.0 Some tests are missing but we got a basic set of

[jira] [Resolved] (SPARK-23984) PySpark Bindings for K8S

2018-06-08 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah resolved SPARK-23984. Resolution: Fixed Fix Version/s: 2.4.0 > PySpark Bindings for K8S >

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16471295#comment-16471295 ] Matt Cheah commented on SPARK-24248: I see - I suppose if the watch connection drops, we should try

[jira] [Comment Edited] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16471280#comment-16471280 ] Matt Cheah edited comment on SPARK-24248 at 5/10/18 11:18 PM: -- I thought

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16471280#comment-16471280 ] Matt Cheah commented on SPARK-24248: I thought about it a bit more, and believe that we can do most

[jira] [Commented] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16471206#comment-16471206 ] Matt Cheah commented on SPARK-24248: [~foxish] [~liyinan926] curious as to what you think about this

[jira] [Created] (SPARK-24248) [K8S] Use the Kubernetes cluster as the backing store for the state of pods

2018-05-10 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-24248: -- Summary: [K8S] Use the Kubernetes cluster as the backing store for the state of pods Key: SPARK-24248 URL: https://issues.apache.org/jira/browse/SPARK-24248 Project:

[jira] [Created] (SPARK-24247) [K8S] currentNodeToLocalTaskCount is unused in KubernetesClusterSchedulerBackend

2018-05-10 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-24247: -- Summary: [K8S] currentNodeToLocalTaskCount is unused in KubernetesClusterSchedulerBackend Key: SPARK-24247 URL: https://issues.apache.org/jira/browse/SPARK-24247

[jira] [Commented] (SPARK-24135) [K8s] Executors that fail to start up because of init-container errors are not retried and limit the executor pool size

2018-05-04 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16464467#comment-16464467 ] Matt Cheah commented on SPARK-24135: Put up the PR, see above - created a separate setting for this

[jira] [Comment Edited] (SPARK-24135) [K8s] Executors that fail to start up because of init-container errors are not retried and limit the executor pool size

2018-05-03 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16462106#comment-16462106 ] Matt Cheah edited comment on SPARK-24135 at 5/3/18 8:35 AM: Not necessarily -

[jira] [Commented] (SPARK-24135) [K8s] Executors that fail to start up because of init-container errors are not retried and limit the executor pool size

2018-05-03 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16462106#comment-16462106 ] Matt Cheah commented on SPARK-24135: Not necessarily - if the pods fail to start up, we should retry

[jira] [Comment Edited] (SPARK-24135) [K8s] Executors that fail to start up because of init-container errors are not retried and limit the executor pool size

2018-05-02 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16461188#comment-16461188 ] Matt Cheah edited comment on SPARK-24135 at 5/2/18 3:37 PM: {quote}Restarting

[jira] [Comment Edited] (SPARK-24135) [K8s] Executors that fail to start up because of init-container errors are not retried and limit the executor pool size

2018-05-02 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16460047#comment-16460047 ] Matt Cheah edited comment on SPARK-24135 at 5/2/18 3:37 PM: {quote}But I'm

[jira] [Commented] (SPARK-24135) [K8s] Executors that fail to start up because of init-container errors are not retried and limit the executor pool size

2018-05-02 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16461188#comment-16461188 ] Matt Cheah commented on SPARK-24135: > Restarting seems like it would eventually be limited by the
