[jira] [Commented] (SPARK-31751) spark serde property path overwrites table property location

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141317#comment-17141317 ] Apache Spark commented on SPARK-31751: -- User 'TJX2014' has created a pull request f

[jira] [Assigned] (SPARK-31751) spark serde property path overwrites table property location

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-31751: Assignee: (was: Apache Spark) > spark serde property path overwrites table property l

[jira] [Assigned] (SPARK-31751) spark serde property path overwrites table property location

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-31751: Assignee: Apache Spark > spark serde property path overwrites table property location > -

[jira] [Commented] (SPARK-32041) Exchange reuse won't work in cases when DPP, subqueries are involved

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141306#comment-17141306 ] Apache Spark commented on SPARK-32041: -- User 'prakharjain09' has created a pull req

[jira] [Assigned] (SPARK-32041) Exchange reuse won't work in cases when DPP, subqueries are involved

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32041: Assignee: (was: Apache Spark) > Exchange reuse won't work in cases when DPP, subqueri

[jira] [Commented] (SPARK-32041) Exchange reuse won't work in cases when DPP, subqueries are involved

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141304#comment-17141304 ] Apache Spark commented on SPARK-32041: -- User 'prakharjain09' has created a pull req

[jira] [Assigned] (SPARK-32041) Exchange reuse won't work in cases when DPP, subqueries are involved

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32041: Assignee: Apache Spark > Exchange reuse won't work in cases when DPP, subqueries are invo

[jira] [Created] (SPARK-32041) Exchange reuse won't work in cases when DPP, subqueries are involved

2020-06-20 Thread Prakhar Jain (Jira)
Prakhar Jain created SPARK-32041: Summary: Exchange reuse won't work in cases when DPP, subqueries are involved Key: SPARK-32041 URL: https://issues.apache.org/jira/browse/SPARK-32041 Project: Spark

[jira] [Commented] (SPARK-32039) Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141291#comment-17141291 ] Apache Spark commented on SPARK-32039: -- User 'rajatahujaatinmobi' has created a pul

[jira] [Commented] (SPARK-32039) Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141290#comment-17141290 ] Apache Spark commented on SPARK-32039: -- User 'rajatahujaatinmobi' has created a pul

[jira] [Commented] (SPARK-32016) Why spark does not preserve the original timestamp format while writing dataset to file or hdfs

2020-06-20 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141265#comment-17141265 ] Hyukjin Kwon commented on SPARK-32016: -- I am not very clear what original format yo

[jira] [Resolved] (SPARK-32022) Can many executors share one gpu for spark3.0?

2020-06-20 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32022. -- Resolution: Invalid > Can many executors share one gpu for spark3.0? > ---

[jira] [Commented] (SPARK-32022) Can many executors share one gpu for spark3.0?

2020-06-20 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141261#comment-17141261 ] Hyukjin Kwon commented on SPARK-32022: -- Let's loop the mailing list when it's a que

[jira] [Updated] (SPARK-32040) Idle cores not being allocated

2020-06-20 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-32040: - Description: *Background:* I have a cluster (2.4.5) using standalone mode orchestrated by Nomad

[jira] [Updated] (SPARK-32040) Idle cores not being allocated

2020-06-20 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-32040: - Description: *Background:* I have a cluster (2.4.5) using standalone mode orchestrated by Nomad

[jira] [Commented] (SPARK-32010) Thread leaks in pinned thread mode

2020-06-20 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141259#comment-17141259 ] Hyukjin Kwon commented on SPARK-32010: -- cc [~irashid] FYI > Thread leaks in pinned

[jira] [Assigned] (SPARK-27702) Allow using some alternatives for service accounts

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-27702: - Assignee: Udbhav Agrawal > Allow using some alternatives for service accounts > ---

[jira] [Resolved] (SPARK-27702) Allow using some alternatives for service accounts

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-27702. --- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 24601 [https://

[jira] [Updated] (SPARK-24266) Spark client terminates while driver is still running

2020-06-20 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-24266: - Target Version/s: 3.1.0, 2.4.7 > Spark client terminates while driver is still running > ---

[jira] [Commented] (SPARK-31887) Date casting to string is giving wrong value

2020-06-20 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141246#comment-17141246 ] Hyukjin Kwon commented on SPARK-31887: -- Very likely there won't be 2.5 and 2.6. So

[jira] [Updated] (SPARK-24266) Spark client terminates while driver is still running

2020-06-20 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-24266: - Target Version/s: (was: 3.1.0, 2.4.7) > Spark client terminates while driver is still running

[jira] [Assigned] (SPARK-32019) Add spark.sql.files.minPartitionNum config

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-32019: - Assignee: ulysses you > Add spark.sql.files.minPartitionNum config > --

[jira] [Resolved] (SPARK-32019) Add spark.sql.files.minPartitionNum config

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-32019. --- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 28853 [https://

[jira] [Commented] (SPARK-32038) Regression in handling NaN values in COUNT(DISTINCT)

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141218#comment-17141218 ] Dongjoon Hyun commented on SPARK-32038: --- Since this is a correctness regression, I

[jira] [Updated] (SPARK-32038) Regression in handling NaN values in COUNT(DISTINCT)

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32038: -- Priority: Blocker (was: Major) > Regression in handling NaN values in COUNT(DISTINCT) > -

[jira] [Updated] (SPARK-32038) Regression in handling NaN values in COUNT(DISTINCT)

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32038: -- Target Version/s: 3.0.1 > Regression in handling NaN values in COUNT(DISTINCT) > -

[jira] [Updated] (SPARK-32038) Regression in handling NaN values in COUNT(DISTINCT)

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32038: -- Labels: correctness (was: ) > Regression in handling NaN values in COUNT(DISTINCT) >

[jira] [Updated] (SPARK-32038) Regression in handling NaN values in COUNT(DISTINCT)

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32038: -- Component/s: (was: Optimizer) > Regression in handling NaN values in COUNT(DISTINCT) > ---

[jira] [Commented] (SPARK-32025) CSV schema inference with boolean & integer

2020-06-20 Thread Pablo Langa Blanco (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141213#comment-17141213 ] Pablo Langa Blanco commented on SPARK-32025: I'm looking for the problem, as

[jira] [Created] (SPARK-32040) Idle cores not being allocated

2020-06-20 Thread t oo (Jira)
t oo created SPARK-32040: Summary: Idle cores not being allocated Key: SPARK-32040 URL: https://issues.apache.org/jira/browse/SPARK-32040 Project: Spark Issue Type: Bug Components: Schedule

[jira] [Updated] (SPARK-32021) make_interval does not accept seconds >100

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32021: -- Fix Version/s: (was: 3.0.0) 3.0.1 > make_interval does not accept secon

[jira] [Updated] (SPARK-32021) make_interval does not accept seconds >100

2020-06-20 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32021: -- Fix Version/s: 3.0.0 > make_interval does not accept seconds >100 > --

[jira] [Commented] (SPARK-32037) Rename blacklisting feature to avoid language with racist connotation

2020-06-20 Thread Meniluca (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141091#comment-17141091 ] Meniluca commented on SPARK-32037: -- I second the idea and I prefer healthy over other w

[jira] [Resolved] (SPARK-31893) Add a generic ClassificationSummary trait

2020-06-20 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-31893. -- Fix Version/s: 3.1.0 Assignee: Huaxin Gao Resolution: Fixed Resolved by https:

[jira] [Commented] (SPARK-32039) Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141065#comment-17141065 ] Apache Spark commented on SPARK-32039: -- User 'rajatahujaatinmobi' has created a pul

[jira] [Assigned] (SPARK-32039) Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32039: Assignee: Apache Spark > Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode

[jira] [Assigned] (SPARK-32039) Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32039: Assignee: (was: Apache Spark) > Unable to Set `spark.ui.port` configuration in Yarn C

[jira] [Commented] (SPARK-32039) Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141064#comment-17141064 ] Apache Spark commented on SPARK-32039: -- User 'rajatahujaatinmobi' has created a pul

[jira] [Updated] (SPARK-32039) Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode

2020-06-20 Thread rajat (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] rajat updated SPARK-32039: -- Description: Spark Web UI port in Yarn cluster mode always gets a random number since we disable the configur

[jira] [Created] (SPARK-32039) Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode

2020-06-20 Thread rajat (Jira)
rajat created SPARK-32039: - Summary: Unable to Set `spark.ui.port` configuration in Yarn Cluster Mode Key: SPARK-32039 URL: https://issues.apache.org/jira/browse/SPARK-32039 Project: Spark Issue Typ

[jira] [Commented] (SPARK-32021) make_interval does not accept seconds >100

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141016#comment-17141016 ] Apache Spark commented on SPARK-32021: -- User 'MaxGekk' has created a pull request f

[jira] [Commented] (SPARK-32021) make_interval does not accept seconds >100

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17141015#comment-17141015 ] Apache Spark commented on SPARK-32021: -- User 'MaxGekk' has created a pull request f

[jira] [Commented] (SPARK-31980) Spark sequence() fails if start and end of range are identical dates

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17140995#comment-17140995 ] Apache Spark commented on SPARK-31980: -- User 'TJX2014' has created a pull request f

[jira] [Assigned] (SPARK-32038) Regression in handling NaN values in COUNT(DISTINCT)

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32038: Assignee: (was: Apache Spark) > Regression in handling NaN values in COUNT(DISTINCT)

[jira] [Commented] (SPARK-32038) Regression in handling NaN values in COUNT(DISTINCT)

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17140987#comment-17140987 ] Apache Spark commented on SPARK-32038: -- User 'viirya' has created a pull request fo

[jira] [Assigned] (SPARK-32038) Regression in handling NaN values in COUNT(DISTINCT)

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32038: Assignee: Apache Spark > Regression in handling NaN values in COUNT(DISTINCT) > -

[jira] [Commented] (SPARK-32038) Regression in handling NaN values in COUNT(DISTINCT)

2020-06-20 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17140988#comment-17140988 ] Apache Spark commented on SPARK-32038: -- User 'viirya' has created a pull request fo