[jira] [Commented] (SPARK-30817) SparkR ML algorithms parity

2020-07-29 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17166940#comment-17166940 ] Hyukjin Kwon commented on SPARK-30817: -- [~zero323] I will leave this JIRA resolved

[jira] [Resolved] (SPARK-30817) SparkR ML algorithms parity

2020-07-29 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-30817. -- Resolution: Done > SparkR ML algorithms parity

[jira] [Created] (SPARK-32477) JsonProtocol.accumulablesToJson should be deterministic

2020-07-29 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-32477: - Summary: JsonProtocol.accumulablesToJson should be deterministic Key: SPARK-32477 URL: https://issues.apache.org/jira/browse/SPARK-32477 Project: Spark Iss

[jira] [Assigned] (SPARK-32476) ResourceAllocator.availableAddrs should be deterministic

2020-07-29 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-32476: - Assignee: Dongjoon Hyun > ResourceAllocator.availableAddrs should be deterministic > --

[jira] [Assigned] (SPARK-32477) JsonProtocol.accumulablesToJson should be deterministic

2020-07-29 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-32477: - Assignee: Dongjoon Hyun > JsonProtocol.accumulablesToJson should be deterministic > ---

[jira] [Assigned] (SPARK-32477) JsonProtocol.accumulablesToJson should be deterministic

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32477: Assignee: (was: Apache Spark) > JsonProtocol.accumulablesToJson should be determinist

[jira] [Commented] (SPARK-32477) JsonProtocol.accumulablesToJson should be deterministic

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17166950#comment-17166950 ] Apache Spark commented on SPARK-32477: -- User 'dongjoon-hyun' has created a pull req

[jira] [Assigned] (SPARK-32477) JsonProtocol.accumulablesToJson should be deterministic

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32477: Assignee: Apache Spark > JsonProtocol.accumulablesToJson should be deterministic > --

[jira] [Updated] (SPARK-32478) Error message to show the schema mismatch in gapply with Arrow vectorization

2020-07-29 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-32478: - Description: Currently, the error message is confusing when the output schema type is not match

[jira] [Commented] (SPARK-32341) add mutiple filter in rdd function

2020-07-29 Thread gaokui (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167043#comment-17167043 ] gaokui commented on SPARK-32341: Yes, I can do that. But at that situation, I need creat

[jira] [Created] (SPARK-32478) Error message to show the schema mismatch in gapply with Arrow vectorization

2020-07-29 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-32478: Summary: Error message to show the schema mismatch in gapply with Arrow vectorization Key: SPARK-32478 URL: https://issues.apache.org/jira/browse/SPARK-32478 Project:

[jira] [Commented] (SPARK-32478) Error message to show the schema mismatch in gapply with Arrow vectorization

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167059#comment-17167059 ] Apache Spark commented on SPARK-32478: -- User 'HyukjinKwon' has created a pull reque

[jira] [Assigned] (SPARK-32478) Error message to show the schema mismatch in gapply with Arrow vectorization

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32478: Assignee: (was: Apache Spark) > Error message to show the schema mismatch in gapply w

[jira] [Assigned] (SPARK-32478) Error message to show the schema mismatch in gapply with Arrow vectorization

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32478: Assignee: Apache Spark > Error message to show the schema mismatch in gapply with Arrow v

[jira] [Commented] (SPARK-30519) Executor can't use spark.executorEnv.HADOOP_USER_NAME to change the user accessing to hdfs

2020-07-29 Thread Laurenceau Julien (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167075#comment-17167075 ] Laurenceau Julien commented on SPARK-30519: --- As I understand a possible work a

[jira] [Resolved] (SPARK-32355) Cannot implement topN with Structured Streaming window aggregation

2020-07-29 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32355. -- Resolution: Incomplete > Cannot implement topN with Structured Streaming window aggregation >

[jira] [Created] (SPARK-32479) Fix the slicing logic in createDataFrame when converting pandas dataframe to arrow table

2020-07-29 Thread Liang Zhang (Jira)
Liang Zhang created SPARK-32479: --- Summary: Fix the slicing logic in createDataFrame when converting pandas dataframe to arrow table Key: SPARK-32479 URL: https://issues.apache.org/jira/browse/SPARK-32479

[jira] [Assigned] (SPARK-32479) Fix the slicing logic in createDataFrame when converting pandas dataframe to arrow table

2020-07-29 Thread Weichen Xu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weichen Xu reassigned SPARK-32479: -- Assignee: Liang Zhang > Fix the slicing logic in createDataFrame when converting pandas dataf

[jira] [Updated] (SPARK-32479) Fix the slicing logic in createDataFrame when converting pandas dataframe to arrow table

2020-07-29 Thread Liang Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang Zhang updated SPARK-32479: Description: h1. Problem: In [https://github.com/databricks/runtime/blob/84a952313ae73e3df32f065

[jira] [Created] (SPARK-32480) Support insert overwrite to move the data to trash

2020-07-29 Thread jobit mathew (Jira)
jobit mathew created SPARK-32480: Summary: Support insert overwrite to move the data to trash Key: SPARK-32480 URL: https://issues.apache.org/jira/browse/SPARK-32480 Project: Spark Issue Typ

[jira] [Created] (SPARK-32481) Support truncate table to move the data to trash

2020-07-29 Thread jobit mathew (Jira)
jobit mathew created SPARK-32481: Summary: Support truncate table to move the data to trash Key: SPARK-32481 URL: https://issues.apache.org/jira/browse/SPARK-32481 Project: Spark Issue Type:

[jira] [Commented] (SPARK-32481) Support truncate table to move the data to trash

2020-07-29 Thread Udbhav Agrawal (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167106#comment-17167106 ] Udbhav Agrawal commented on SPARK-32481: I will raise MR to support this. > Sup

[jira] [Commented] (SPARK-32480) Support insert overwrite to move the data to trash

2020-07-29 Thread Udbhav Agrawal (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167105#comment-17167105 ] Udbhav Agrawal commented on SPARK-32480: I will raise a MR to support this. > S

[jira] [Commented] (SPARK-32479) Fix the slicing logic in createDataFrame when converting pandas dataframe to arrow table

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167112#comment-17167112 ] Apache Spark commented on SPARK-32479: -- User 'liangz1' has created a pull request f

[jira] [Commented] (SPARK-32479) Fix the slicing logic in createDataFrame when converting pandas dataframe to arrow table

2020-07-29 Thread Liang Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167113#comment-17167113 ] Liang Zhang commented on SPARK-32479: - https://github.com/apache/spark/pull/29284 is

[jira] [Issue Comment Deleted] (SPARK-32479) Fix the slicing logic in createDataFrame when converting pandas dataframe to arrow table

2020-07-29 Thread Liang Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang Zhang updated SPARK-32479: Comment: was deleted (was: https://github.com/apache/spark/pull/29284 is ready for review. [~weic

[jira] [Created] (SPARK-32482) Eliminate deprecated poll(long) API calls to avoid infinite wait in tests

2020-07-29 Thread Gabor Somogyi (Jira)
Gabor Somogyi created SPARK-32482: - Summary: Eliminate deprecated poll(long) API calls to avoid infinite wait in tests Key: SPARK-32482 URL: https://issues.apache.org/jira/browse/SPARK-32482 Project:

[jira] [Created] (SPARK-32483) spark-shell: error: value topByKey is not a member of org.apache.spark.rdd.RDD[(String, (String, Double))]

2020-07-29 Thread manley (Jira)
manley created SPARK-32483: -- Summary: spark-shell: error: value topByKey is not a member of org.apache.spark.rdd.RDD[(String, (String, Double))] Key: SPARK-32483 URL: https://issues.apache.org/jira/browse/SPARK-32483

[jira] [Commented] (SPARK-32483) spark-shell: error: value topByKey is not a member of org.apache.spark.rdd.RDD[(String, (String, Double))]

2020-07-29 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167158#comment-17167158 ] JinxinTang commented on SPARK-32483: [~ukiml] Please `import org.apache.spark.mllib

[jira] [Commented] (SPARK-21708) use sbt 1.x

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167168#comment-17167168 ] Apache Spark commented on SPARK-21708: -- User 'gemelen' has created a pull request f

[jira] [Commented] (SPARK-27830) Show Spark version at app lists of Spark History UI

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167171#comment-17167171 ] Apache Spark commented on SPARK-27830: -- User 'liucht-inspur' has created a pull req

[jira] [Created] (SPARK-32484) Not accurate Log Info in BroadcastExchangeExec.scala

2020-07-29 Thread pp (Jira)
pp created SPARK-32484: -- Summary: Not accurate Log Info in BroadcastExchangeExec.scala Key: SPARK-32484 URL: https://issues.apache.org/jira/browse/SPARK-32484 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-32484) Not accurate Log Info in BroadcastExchangeExec.scala

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32484: Assignee: (was: Apache Spark) > Not accurate Log Info in BroadcastExchangeExec.scal

[jira] [Commented] (SPARK-32484) Not accurate Log Info in BroadcastExchangeExec.scala

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167183#comment-17167183 ] Apache Spark commented on SPARK-32484: -- User 'prgitpr' has created a pull request f

[jira] [Assigned] (SPARK-32484) Not accurate Log Info in BroadcastExchangeExec.scala

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32484: Assignee: Apache Spark > Not accurate Log Info in BroadcastExchangeExec.scala > ---

[jira] [Commented] (SPARK-32484) Not accurate Log Info in BroadcastExchangeExec.scala

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167184#comment-17167184 ] Apache Spark commented on SPARK-32484: -- User 'prgitpr' has created a pull request f

[jira] [Assigned] (SPARK-32482) Eliminate deprecated poll(long) API calls to avoid infinite wait in tests

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32482: Assignee: (was: Apache Spark) > Eliminate deprecated poll(long) API calls to avoid in

[jira] [Commented] (SPARK-32482) Eliminate deprecated poll(long) API calls to avoid infinite wait in tests

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167190#comment-17167190 ] Apache Spark commented on SPARK-32482: -- User 'gaborgsomogyi' has created a pull req

[jira] [Assigned] (SPARK-32482) Eliminate deprecated poll(long) API calls to avoid infinite wait in tests

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32482: Assignee: Apache Spark > Eliminate deprecated poll(long) API calls to avoid infinite wait

[jira] [Commented] (SPARK-32484) Not accurate Log Info in BroadcastExchangeExec.scala

2020-07-29 Thread pp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167192#comment-17167192 ] pp commented on SPARK-32484: [https://github.com/apache/spark/pull/29290] > Not accurate Lo

[jira] [Commented] (SPARK-32484) Not accurate Log Info in BroadcastExchangeExec.scala

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167194#comment-17167194 ] Apache Spark commented on SPARK-32484: -- User 'prgitpr' has created a pull request f

[jira] [Updated] (SPARK-32032) Eliminate deprecated poll(long) API calls to avoid infinite wait in driver

2020-07-29 Thread Gabor Somogyi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-32032: -- Summary: Eliminate deprecated poll(long) API calls to avoid infinite wait in driver (was: Use

[jira] [Commented] (SPARK-32032) Eliminate deprecated poll(long) API calls to avoid infinite wait in driver

2020-07-29 Thread Gabor Somogyi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167213#comment-17167213 ] Gabor Somogyi commented on SPARK-32032: --- I've renamed the jira because the solutio

[jira] [Commented] (SPARK-32032) Eliminate deprecated poll(long) API calls to avoid infinite wait in driver

2020-07-29 Thread Gabor Somogyi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167214#comment-17167214 ] Gabor Somogyi commented on SPARK-32032: --- I'm working on the solution. > Eliminate

[jira] [Resolved] (SPARK-32175) Fix the order between initialization for ExecutorPlugin and starting heartbeat thread

2020-07-29 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-32175. --- Fix Version/s: 3.1.0, 3.0.1 Resolution: Fixed > Fix the order betwe

[jira] [Commented] (SPARK-32470) Remove task result size check for shuffle map stage

2020-07-29 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167233#comment-17167233 ] Thomas Graves commented on SPARK-32470: --- Please add a description to the Jira as t

[jira] [Created] (SPARK-32485) RecordBinaryComparatorSuite test failures on big-endian systems

2020-07-29 Thread Michael Munday (Jira)
Michael Munday created SPARK-32485: -- Summary: RecordBinaryComparatorSuite test failures on big-endian systems Key: SPARK-32485 URL: https://issues.apache.org/jira/browse/SPARK-32485 Project: Spark

[jira] [Commented] (SPARK-29314) ProgressReporter.extractStateOperatorMetrics should not overwrite updated as 0 when it actually runs a batch even with no data

2020-07-29 Thread Sandeep Katta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167243#comment-17167243 ] Sandeep Katta commented on SPARK-29314: --- [~kabhwan] [~brkyvz] this is required to

[jira] [Commented] (SPARK-30276) Support Filter expression allows simultaneous use of DISTINCT

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167265#comment-17167265 ] Apache Spark commented on SPARK-30276: -- User 'beliefer' has created a pull request

[jira] [Resolved] (SPARK-32477) JsonProtocol.accumulablesToJson should be deterministic

2020-07-29 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-32477. --- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29282 [https://

[jira] [Updated] (SPARK-30322) Add stage level scheduling docs

2020-07-29 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-30322: -- Description: Add stage level scheduling docs. > Add stage level scheduling docs >

[jira] [Created] (SPARK-32486) Issue with deserialization and persist api in latest spark java versions

2020-07-29 Thread Dinesh Kumar (Jira)
Dinesh Kumar created SPARK-32486: Summary: Issue with deserialization and persist api in latest spark java versions Key: SPARK-32486 URL: https://issues.apache.org/jira/browse/SPARK-32486 Project: Spa

[jira] [Resolved] (SPARK-32449) Add summary to MultilayerPerceptronClassificationModel

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32449. -- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29250 [https://gi

[jira] [Assigned] (SPARK-32449) Add summary to MultilayerPerceptronClassificationModel

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-32449: Assignee: Huaxin Gao > Add summary to MultilayerPerceptronClassificationModel > -

[jira] [Updated] (SPARK-32346) Support filters pushdown in Avro datasource

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen updated SPARK-32346: - Fix Version/s: (was: 3.1.0) > Support filters pushdown in Avro datasource >

[jira] [Resolved] (SPARK-32486) Issue with deserialization and persist api in latest spark java versions

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32486. -- Fix Version/s: (was: 2.3.3) Resolution: Invalid Please don't set Blocker, Target ve

[jira] [Updated] (SPARK-32299) Decide SMJ Join Orientation adaptively

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen updated SPARK-32299: - Fix Version/s: (was: 3.1.0) Target Version/s: (was: 3.1.0) > Decide SMJ Join Orient

[jira] [Updated] (SPARK-32227) Bug in load-spark-env.cmd with Spark 3.0.0

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen updated SPARK-32227: - Fix Version/s: (was: 3.0.1) > Bug in load-spark-env.cmd with Spark 3.0.0 >

[jira] [Resolved] (SPARK-32208) SparkSQL throw Illegal character exception when load certain abnormal path of HDFS

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32208. -- Fix Version/s: (was: 2.4.3) Resolution: Invalid > SparkSQL throw Illegal character

[jira] [Commented] (SPARK-32481) Support truncate table to move the data to trash

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167300#comment-17167300 ] Sean R. Owen commented on SPARK-32481: -- Please add a description > Support truncat

[jira] [Resolved] (SPARK-12172) Consider removing SparkR internal RDD APIs

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-12172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-12172. -- Resolution: Won't Fix Sounds like a WontFix for the foreseeable future, but could be reopened

[jira] [Commented] (SPARK-30322) Add stage level scheduling docs

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167306#comment-17167306 ] Apache Spark commented on SPARK-30322: -- User 'tgravescs' has created a pull request

[jira] [Assigned] (SPARK-30322) Add stage level scheduling docs

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-30322: Assignee: (was: Apache Spark) > Add stage level scheduling docs > ---

[jira] [Assigned] (SPARK-30322) Add stage level scheduling docs

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-30322: Assignee: Apache Spark > Add stage level scheduling docs > --

[jira] [Commented] (SPARK-30322) Add stage level scheduling docs

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167307#comment-17167307 ] Apache Spark commented on SPARK-30322: -- User 'tgravescs' has created a pull request

[jira] [Commented] (SPARK-30255) Support explain mode in SparkR df.explain

2020-07-29 Thread S Daniel Zafar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167316#comment-17167316 ] S Daniel Zafar commented on SPARK-30255: Hello- I would like to knock this one o

[jira] [Commented] (SPARK-30255) Support explain mode in SparkR df.explain

2020-07-29 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167320#comment-17167320 ] Hyukjin Kwon commented on SPARK-30255: -- Please go ahead and directly open a PR. We

[jira] [Commented] (SPARK-25770) support SparkDataFrame pretty print

2020-07-29 Thread S Daniel Zafar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167324#comment-17167324 ] S Daniel Zafar commented on SPARK-25770: [~adrian555], what would your preferred

[jira] [Updated] (SPARK-32470) Remove task result size check for shuffle map stage

2020-07-29 Thread Wei Xue (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wei Xue updated SPARK-32470: Description: The task result of a shuffle map stage is not the query result but instead is only map statu

[jira] [Resolved] (SPARK-32346) Support filters pushdown in Avro datasource

2020-07-29 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang resolved SPARK-32346. Resolution: Fixed > Support filters pushdown in Avro datasource >

[jira] [Resolved] (SPARK-32476) ResourceAllocator.availableAddrs should be deterministic

2020-07-29 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-32476. --- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29281 [https://

[jira] [Commented] (SPARK-32346) Support filters pushdown in Avro datasource

2020-07-29 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167390#comment-17167390 ] Gengliang Wang commented on SPARK-32346: This issue is resolved in https://githu

[jira] [Updated] (SPARK-32470) Remove task result size check for shuffle map stage

2020-07-29 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-32470: -- Affects Version/s: 2.4.6, 3.0.0 > Remove task result size check for shuf

[jira] [Commented] (SPARK-32485) RecordBinaryComparatorSuite test failures on big-endian systems

2020-07-29 Thread Rohit Mishra (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167440#comment-17167440 ] Rohit Mishra commented on SPARK-32485: -- [~mundaym], can you please add environment

[jira] [Resolved] (SPARK-30322) Add stage level scheduling docs

2020-07-29 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-30322. --- Fix Version/s: 3.1.0 Assignee: Thomas Graves Resolution: Fixed > Add stage l

[jira] [Commented] (SPARK-32445) Make NullType.sql as VOID to support hive

2020-07-29 Thread Rohit Mishra (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167447#comment-17167447 ] Rohit Mishra commented on SPARK-32445: -- [~ulysses], Can you please add a descriptio

[jira] [Commented] (SPARK-32444) Infer filters from DPP

2020-07-29 Thread Rohit Mishra (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167451#comment-17167451 ] Rohit Mishra commented on SPARK-32444: -- [~yumwang], Can you please add a descriptio

[jira] [Commented] (SPARK-32469) ApplyColumnarRulesAndInsertTransitions should be idempotent

2020-07-29 Thread Rohit Mishra (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167455#comment-17167455 ] Rohit Mishra commented on SPARK-32469: -- [~cloud_fan], Can you please add a descript

[jira] [Resolved] (SPARK-32332) AQE doesn't adequately allow for Columnar Processing extension

2020-07-29 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-32332. --- Fix Version/s: 3.1.0 Assignee: Wenchen Fan Resolution: Fixed > AQE doesn't a

[jira] [Commented] (SPARK-32403) SCRIP TRANSFORM Extract common method from process row to avoid repeated judgement

2020-07-29 Thread Rohit Mishra (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167461#comment-17167461 ] Rohit Mishra commented on SPARK-32403: -- [~angerszhuuu], Can you please add a descri

[jira] [Commented] (SPARK-32465) How do I get the SPARK shuffle monitoring indicator?

2020-07-29 Thread Rohit Mishra (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167471#comment-17167471 ] Rohit Mishra commented on SPARK-32465: -- [~MOBIN], Can you please ask questions usin

[jira] [Commented] (SPARK-27335) cannot collect() from Correlation.corr

2020-07-29 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167475#comment-17167475 ] Ian Cook commented on SPARK-27335: -- Regarding the workaround code that [~natalinobusa]

[jira] [Resolved] (SPARK-32465) How do I get the SPARK shuffle monitoring indicator?

2020-07-29 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32465. -- Resolution: Invalid > How do I get the SPARK shuffle monitoring indicator? > -

[jira] [Comment Edited] (SPARK-32385) Publish a "bill of materials" (BOM) descriptor for Spark with correct versions of various dependencies

2020-07-29 Thread DB Tsai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167504#comment-17167504 ] DB Tsai edited comment on SPARK-32385 at 7/29/20, 9:04 PM: --- +1

[jira] [Commented] (SPARK-32385) Publish a "bill of materials" (BOM) descriptor for Spark with correct versions of various dependencies

2020-07-29 Thread DB Tsai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167504#comment-17167504 ] DB Tsai commented on SPARK-32385: - +1 This will be very useful for users to include Spar

[jira] [Created] (SPARK-32487) Remove javax.ws.rs.NotFoundException from `import` in StagesResource/OneApplicationResource

2020-07-29 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-32487: - Summary: Remove javax.ws.rs.NotFoundException from `import` in StagesResource/OneApplicationResource Key: SPARK-32487 URL: https://issues.apache.org/jira/browse/SPARK-32487

[jira] [Commented] (SPARK-32487) Remove javax.ws.rs.NotFoundException from `import` in StagesResource/OneApplicationResource

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167509#comment-17167509 ] Apache Spark commented on SPARK-32487: -- User 'dongjoon-hyun' has created a pull req

[jira] [Assigned] (SPARK-32487) Remove javax.ws.rs.NotFoundException from `import` in StagesResource/OneApplicationResource

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32487: Assignee: (was: Apache Spark) > Remove javax.ws.rs.NotFoundException from `import` in

[jira] [Assigned] (SPARK-32487) Remove javax.ws.rs.NotFoundException from `import` in StagesResource/OneApplicationResource

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32487: Assignee: Apache Spark > Remove javax.ws.rs.NotFoundException from `import` in > StagesR

[jira] [Resolved] (SPARK-32397) Snapshot artifacts can have differing timestamps, making it hard to consume

2020-07-29 Thread DB Tsai (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-32397. - Fix Version/s: 3.1.0 2.4.7 3.0.1 Resolution: Fixed Issue re

[jira] [Assigned] (SPARK-32487) Remove javax.ws.rs.NotFoundException from `import` in StagesResource/OneApplicationResource

2020-07-29 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-32487: - Assignee: Dongjoon Hyun > Remove javax.ws.rs.NotFoundException from `import` in > Stag

[jira] [Commented] (SPARK-32385) Publish a "bill of materials" (BOM) descriptor for Spark with correct versions of various dependencies

2020-07-29 Thread Vladimir Matveev (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167527#comment-17167527 ] Vladimir Matveev commented on SPARK-32385: -- [~hyukjin.kwon] almost: those are j

[jira] [Commented] (SPARK-32160) Executors should not be able to create SparkContext.

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167564#comment-17167564 ] Apache Spark commented on SPARK-32160: -- User 'ueshin' has created a pull request fo

[jira] [Commented] (SPARK-32160) Executors should not be able to create SparkContext.

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167563#comment-17167563 ] Apache Spark commented on SPARK-32160: -- User 'ueshin' has created a pull request fo

[jira] [Commented] (SPARK-32248) Recover JDK 11 builds in Github Actions

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167566#comment-17167566 ] Apache Spark commented on SPARK-32248: -- User 'dongjoon-hyun' has created a pull req

[jira] [Assigned] (SPARK-32248) Recover JDK 11 builds in Github Actions

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32248: Assignee: (was: Apache Spark) > Recover JDK 11 builds in Github Actions > ---

[jira] [Assigned] (SPARK-32248) Recover JDK 11 builds in Github Actions

2020-07-29 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32248: Assignee: Apache Spark > Recover JDK 11 builds in Github Actions > --

[jira] [Resolved] (SPARK-32487) Remove javax.ws.rs.NotFoundException from `import` in StagesResource/OneApplicationResource

2020-07-29 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-32487. --- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29293 [https://

[jira] [Assigned] (SPARK-32248) Recover JDK 11 builds in Github Actions

2020-07-29 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-32248: - Assignee: Hyukjin Kwon > Recover JDK 11 builds in Github Actions >

[jira] [Assigned] (SPARK-32248) Recover JDK 11 builds in Github Actions

2020-07-29 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-32248: - Assignee: Dongjoon Hyun (was: Hyukjin Kwon) > Recover JDK 11 builds in Github Actions
