[jira] [Assigned] (SPARK-32298) tree models prediction optimization

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32298: Assignee: Apache Spark > tree models prediction optimization >

[jira] [Commented] (SPARK-32298) tree models prediction optimization

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157144#comment-17157144 ] Apache Spark commented on SPARK-32298: -- User 'zhengruifeng' has created a pull request for this

[jira] [Assigned] (SPARK-32298) tree models prediction optimization

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32298: Assignee: (was: Apache Spark) > tree models prediction optimization >

[jira] [Created] (SPARK-32298) tree models prediction optimization

2020-07-13 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-32298: Summary: tree models prediction optimization Key: SPARK-32298 URL: https://issues.apache.org/jira/browse/SPARK-32298 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-32241) Remove empty children of union

2020-07-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-32241: --- Assignee: Peter Toth > Remove empty children of union > -- > >

[jira] [Resolved] (SPARK-32241) Remove empty children of union

2020-07-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-32241. - Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29053

[jira] [Commented] (SPARK-24983) Collapsing multiple project statements with dependent When-Otherwise statements on the same column can OOM the driver

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157132#comment-17157132 ] Apache Spark commented on SPARK-24983: -- User 'constzhou' has created a pull request for this issue:

[jira] [Commented] (SPARK-32266) Run smoke tests after a commit is pushed

2020-07-13 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157129#comment-17157129 ] Gengliang Wang commented on SPARK-32266: [~hyukjin.kwon]Thanks for the update. > Run smoke

[jira] [Commented] (SPARK-31356) Splitting Aggregate node into separate Aggregate and Serialize for Optimizer

2020-07-13 Thread Martin Loncaric (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157128#comment-17157128 ] Martin Loncaric commented on SPARK-31356: - Actually, there seem to be 3 separate performance

[jira] [Comment Edited] (SPARK-32253) Make readability better in the test result logs

2020-07-13 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157120#comment-17157120 ] L. C. Hsieh edited comment on SPARK-32253 at 7/14/20, 3:19 AM: --- Looks

[jira] [Comment Edited] (SPARK-32253) Make readability better in the test result logs

2020-07-13 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157120#comment-17157120 ] L. C. Hsieh edited comment on SPARK-32253 at 7/14/20, 3:18 AM: --- Will do

[jira] [Commented] (SPARK-32253) Make readability better in the test result logs

2020-07-13 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157120#comment-17157120 ] L. C. Hsieh commented on SPARK-32253: - Will do some tests. > Make readability better in the test

[jira] [Commented] (SPARK-32264) More resources in Github Actions

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157100#comment-17157100 ] Hyukjin Kwon commented on SPARK-32264: -- This is in progress at the private mailing list. > More

[jira] [Resolved] (SPARK-32266) Run smoke tests after a commit is pushed

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32266. -- Assignee: Dongjoon Hyun Resolution: Fixed > Run smoke tests after a commit is pushed >

[jira] [Updated] (SPARK-32266) Run smoke tests after a commit is pushed

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-32266: - Fix Version/s: 3.1.0 > Run smoke tests after a commit is pushed >

[jira] [Commented] (SPARK-32266) Run smoke tests after a commit is pushed

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157098#comment-17157098 ] Hyukjin Kwon commented on SPARK-32266: -- This was fixed in

[jira] [Commented] (SPARK-32253) Make readability better in the test result logs

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157097#comment-17157097 ] Hyukjin Kwon commented on SPARK-32253: -- [~Gengliang.Wang] or probably [~viirya] from the watchers

[jira] [Commented] (SPARK-32296) Flaky Test: submit a barrier ResultStage that requires more slots than current total under local-cluster mode

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157086#comment-17157086 ] Hyukjin Kwon commented on SPARK-32296: -- cc [~jiangxb1987] FYI > Flaky Test: submit a barrier

[jira] [Created] (SPARK-32297) Flaky Test: YarnClusterSuite 4 test cases

2020-07-13 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-32297: Summary: Flaky Test: YarnClusterSuite 4 test cases Key: SPARK-32297 URL: https://issues.apache.org/jira/browse/SPARK-32297 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-32138) Drop Python 2, 3.4 and 3.5 in codes and documentation

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32138. -- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 28957

[jira] [Assigned] (SPARK-32138) Drop Python 2, 3.4 and 3.5 in codes and documentation

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-32138: Assignee: Hyukjin Kwon > Drop Python 2, 3.4 and 3.5 in codes and documentation >

[jira] [Commented] (SPARK-32278) Install PyPy3 on Jenkins to enable PySpark tests with PyPy

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157080#comment-17157080 ] Hyukjin Kwon commented on SPARK-32278: -- Oh, yeah. I noticed this, and forgot to take an action to

[jira] [Resolved] (SPARK-32278) Install PyPy3 on Jenkins to enable PySpark tests with PyPy

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32278. -- Resolution: Not A Problem > Install PyPy3 on Jenkins to enable PySpark tests with PyPy >

[jira] [Commented] (SPARK-32279) Install Sphinx in Python 3 on Jenkins machines

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157078#comment-17157078 ] Hyukjin Kwon commented on SPARK-32279: -- I believe any version is fine. Probably the latest one :-).

[jira] [Resolved] (SPARK-32146) ValueError when loading a PipelineModel on a personal computer

2020-07-13 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-32146. -- Resolution: Invalid Please use user mailing list regarding question. If your issue is bound

[jira] [Updated] (SPARK-32146) ValueError when loading a PipelineModel on a personal computer

2020-07-13 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-32146: - Priority: Major (was: Blocker) > ValueError when loading a PipelineModel on a personal

[jira] [Commented] (SPARK-32259) tmpfs=true, not pointing to SPARK_LOCAL_DIRS in k8s

2020-07-13 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157069#comment-17157069 ] Jungtaek Lim commented on SPARK-32259: -- Lowering the priority, as Critical+ requires committer's

[jira] [Commented] (SPARK-32197) 'Spark driver' stays running even though 'spark application' has FAILED

2020-07-13 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157070#comment-17157070 ] Jungtaek Lim commented on SPARK-32197: -- Lowering the priority, as Critical+ requires committer's

[jira] [Updated] (SPARK-32197) 'Spark driver' stays running even though 'spark application' has FAILED

2020-07-13 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-32197: - Priority: Major (was: Blocker) > 'Spark driver' stays running even though 'spark application'

[jira] [Updated] (SPARK-32259) tmpfs=true, not pointing to SPARK_LOCAL_DIRS in k8s

2020-07-13 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-32259: - Priority: Major (was: Blocker) > tmpfs=true, not pointing to SPARK_LOCAL_DIRS in k8s >

[jira] [Commented] (SPARK-32220) Cartesian Product Hint cause data error

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157066#comment-17157066 ] Apache Spark commented on SPARK-32220: -- User 'AngersZh' has created a pull request for this

[jira] [Updated] (SPARK-32296) Flaky Test: submit a barrier ResultStage that requires more slots than current total under local-cluster mode

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-32296: - Component/s: Spark Core > Flaky Test: submit a barrier ResultStage that requires more slots

[jira] [Commented] (SPARK-32294) GroupedData Pandas UDF 2Gb limit

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157065#comment-17157065 ] Hyukjin Kwon commented on SPARK-32294: -- Thanks for filing the issue, [~Tagar]. > GroupedData

[jira] [Created] (SPARK-32296) Flaky Test: submit a barrier ResultStage that requires more slots than current total under local-cluster mode

2020-07-13 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-32296: Summary: Flaky Test: submit a barrier ResultStage that requires more slots than current total under local-cluster mode Key: SPARK-32296 URL:

[jira] [Assigned] (SPARK-32292) Run only relevant builds in parallel at Github Actions

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-32292: Assignee: Hyukjin Kwon (was: Apache Spark) > Run only relevant builds in parallel at

[jira] [Resolved] (SPARK-32004) Drop references to slave

2020-07-13 Thread Holden Karau (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Holden Karau resolved SPARK-32004. -- Fix Version/s: 3.1.0 Assignee: Holden Karau Resolution: Fixed > Drop

[jira] [Assigned] (SPARK-32295) Add not null and size > 0 filters before inner explode to benefit from predicate pushdown

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32295: Assignee: Apache Spark > Add not null and size > 0 filters before inner explode to

[jira] [Assigned] (SPARK-32295) Add not null and size > 0 filters before inner explode to benefit from predicate pushdown

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32295: Assignee: (was: Apache Spark) > Add not null and size > 0 filters before inner

[jira] [Commented] (SPARK-32295) Add not null and size > 0 filters before inner explode to benefit from predicate pushdown

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156976#comment-17156976 ] Apache Spark commented on SPARK-32295: -- User 'tanelk' has created a pull request for this issue:

[jira] [Commented] (SPARK-32295) Add not null and size > 0 filters before inner explode to benefit from predicate pushdown

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156975#comment-17156975 ] Apache Spark commented on SPARK-32295: -- User 'tanelk' has created a pull request for this issue:

[jira] [Created] (SPARK-32295) Add not null and size > 0 filters before inner explode to benefit from predicate pushdown

2020-07-13 Thread Tanel Kiis (Jira)
Tanel Kiis created SPARK-32295: -- Summary: Add not null and size > 0 filters before inner explode to benefit from predicate pushdown Key: SPARK-32295 URL: https://issues.apache.org/jira/browse/SPARK-32295

[jira] [Updated] (SPARK-32234) Spark sql commands are failing on select Queries for the orc tables

2020-07-13 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-32234: Priority: Blocker (was: Major) > Spark sql commands are failing on select Queries for the orc tables >

[jira] [Updated] (SPARK-32234) Spark sql commands are failing on select Queries for the orc tables

2020-07-13 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-32234: Target Version/s: 3.0.1 > Spark sql commands are failing on select Queries for the orc tables >

[jira] [Commented] (SPARK-32258) NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156939#comment-17156939 ] Apache Spark commented on SPARK-32258: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-32258) NormalizeFloatingNumbers directly normalizes IF/CaseWhen/Coalesce child expressions

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156938#comment-17156938 ] Apache Spark commented on SPARK-32258: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32293) Inconsistent default unit between Spark memory configs and JVM option

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32293: Assignee: (was: Apache Spark) > Inconsistent default unit between Spark memory

[jira] [Assigned] (SPARK-32293) Inconsistent default unit between Spark memory configs and JVM option

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32293: Assignee: Apache Spark > Inconsistent default unit between Spark memory configs and JVM

[jira] [Commented] (SPARK-32293) Inconsistent default unit between Spark memory configs and JVM option

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156917#comment-17156917 ] Apache Spark commented on SPARK-32293: -- User 'attilapiros' has created a pull request for this

[jira] [Updated] (SPARK-32294) GroupedData Pandas UDF 2Gb limit

2020-07-13 Thread Ruslan Dautkhanov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruslan Dautkhanov updated SPARK-32294: -- Description: `spark.sql.execution.arrow.maxRecordsPerBatch` is not respected for

[jira] [Created] (SPARK-32294) GroupedData Pandas UDF 2Gb limit

2020-07-13 Thread Ruslan Dautkhanov (Jira)
Ruslan Dautkhanov created SPARK-32294: - Summary: GroupedData Pandas UDF 2Gb limit Key: SPARK-32294 URL: https://issues.apache.org/jira/browse/SPARK-32294 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-30282) Migrate SHOW TBLPROPERTIES to new framework

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156900#comment-17156900 ] Apache Spark commented on SPARK-30282: -- User 'imback82' has created a pull request for this issue:

[jira] [Updated] (SPARK-32293) Inconsistent default unit between Spark memory configs and JVM option

2020-07-13 Thread Attila Zsolt Piros (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Attila Zsolt Piros updated SPARK-32293: --- Summary: Inconsistent default unit between Spark memory configs and JVM option

[jira] [Commented] (SPARK-32293) Inconsistent default units for configuring Spark memory

2020-07-13 Thread Attila Zsolt Piros (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156884#comment-17156884 ] Attila Zsolt Piros commented on SPARK-32293: I am working on this. > Inconsistent default

[jira] [Created] (SPARK-32293) Inconsistent default units for configuring Spark memory

2020-07-13 Thread Attila Zsolt Piros (Jira)
Attila Zsolt Piros created SPARK-32293: -- Summary: Inconsistent default units for configuring Spark memory Key: SPARK-32293 URL: https://issues.apache.org/jira/browse/SPARK-32293 Project: Spark

[jira] [Commented] (SPARK-32279) Install Sphinx in Python 3 on Jenkins machines

2020-07-13 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156846#comment-17156846 ] Shane Knapp commented on SPARK-32279: - any particular version of sphinx  you want installed? >

[jira] [Commented] (SPARK-32276) Remove redundant sorts before repartition nodes

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156844#comment-17156844 ] Apache Spark commented on SPARK-32276: -- User 'aokolnychyi' has created a pull request for this

[jira] [Comment Edited] (SPARK-32278) Install PyPy3 on Jenkins to enable PySpark tests with PyPy

2020-07-13 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156842#comment-17156842 ] Shane Knapp edited comment on SPARK-32278 at 7/13/20, 5:03 PM: --- which

[jira] [Commented] (SPARK-32276) Remove redundant sorts before repartition nodes

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156843#comment-17156843 ] Apache Spark commented on SPARK-32276: -- User 'aokolnychyi' has created a pull request for this

[jira] [Assigned] (SPARK-32276) Remove redundant sorts before repartition nodes

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32276: Assignee: Apache Spark > Remove redundant sorts before repartition nodes >

[jira] [Commented] (SPARK-32278) Install PyPy3 on Jenkins to enable PySpark tests with PyPy

2020-07-13 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156842#comment-17156842 ] Shane Knapp commented on SPARK-32278: - which version of pypy3 are we interested in?  we currently

[jira] [Assigned] (SPARK-32276) Remove redundant sorts before repartition nodes

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32276: Assignee: (was: Apache Spark) > Remove redundant sorts before repartition nodes >

[jira] [Assigned] (SPARK-32252) Enable doctests in run-tests.py back

2020-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-32252: - Assignee: Hyukjin Kwon > Enable doctests in run-tests.py back >

[jira] [Resolved] (SPARK-32252) Enable doctests in run-tests.py back

2020-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-32252. --- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29086

[jira] [Resolved] (SPARK-32292) Run only relevant builds in parallel at Github Actions

2020-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-32292. --- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29086

[jira] [Assigned] (SPARK-32289) Chinese characters are garbled when opening csv files with Excel

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32289: Assignee: (was: Apache Spark) > Chinese characters are garbled when opening csv

[jira] [Commented] (SPARK-32289) Chinese characters are garbled when opening csv files with Excel

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156780#comment-17156780 ] Apache Spark commented on SPARK-32289: -- User 'wangyum' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32289) Chinese characters are garbled when opening csv files with Excel

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32289: Assignee: Apache Spark > Chinese characters are garbled when opening csv files with

[jira] [Commented] (SPARK-28227) Spark can’t support TRANSFORM with aggregation

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156698#comment-17156698 ] Apache Spark commented on SPARK-28227: -- User 'AngersZh' has created a pull request for this

[jira] [Commented] (SPARK-28227) Spark can’t support TRANSFORM with aggregation

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156699#comment-17156699 ] Apache Spark commented on SPARK-28227: -- User 'AngersZh' has created a pull request for this

[jira] [Commented] (SPARK-32252) Enable doctests in run-tests.py back

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156641#comment-17156641 ] Apache Spark commented on SPARK-32252: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-32252) Enable doctests in run-tests.py back

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156639#comment-17156639 ] Apache Spark commented on SPARK-32252: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-32292) Run only relevant builds in parallel at Github Actions

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32292: Assignee: Apache Spark > Run only relevant builds in parallel at Github Actions >

[jira] [Assigned] (SPARK-32252) Enable doctests in run-tests.py back

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32252: Assignee: Apache Spark > Enable doctests in run-tests.py back >

[jira] [Assigned] (SPARK-32292) Run only relevant builds in parallel at Github Actions

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32292: Assignee: Apache Spark > Run only relevant builds in parallel at Github Actions >

[jira] [Assigned] (SPARK-32252) Enable doctests in run-tests.py back

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32252: Assignee: (was: Apache Spark) > Enable doctests in run-tests.py back >

[jira] [Commented] (SPARK-32292) Run only relevant builds in parallel at Github Actions

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156636#comment-17156636 ] Apache Spark commented on SPARK-32292: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-32292) Run only relevant builds in parallel at Github Actions

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32292: Assignee: (was: Apache Spark) > Run only relevant builds in parallel at Github

[jira] [Assigned] (SPARK-32106) Implement script transform in sql/core

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32106: Assignee: Apache Spark > Implement script transform in sql/core >

[jira] [Commented] (SPARK-32106) Implement script transform in sql/core

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156629#comment-17156629 ] Apache Spark commented on SPARK-32106: -- User 'AngersZh' has created a pull request for this

[jira] [Assigned] (SPARK-32106) Implement script transform in sql/core

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32106: Assignee: (was: Apache Spark) > Implement script transform in sql/core >

[jira] [Commented] (SPARK-32259) tmpfs=true, not pointing to SPARK_LOCAL_DIRS in k8s

2020-07-13 Thread Rob Vesse (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156627#comment-17156627 ] Rob Vesse commented on SPARK-32259: --- bq. We use Spark launcher to do spark submit in k8s. Since it is

[jira] [Comment Edited] (SPARK-32259) tmpfs=true, not pointing to SPARK_LOCAL_DIRS in k8s

2020-07-13 Thread Rob Vesse (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156626#comment-17156626 ] Rob Vesse edited comment on SPARK-32259 at 7/13/20, 10:32 AM: -- [~prakki79]

[jira] [Commented] (SPARK-32259) tmpfs=true, not pointing to SPARK_LOCAL_DIRS in k8s

2020-07-13 Thread Rob Vesse (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156626#comment-17156626 ] Rob Vesse commented on SPARK-32259: --- [~prakki79] Ideally you'd also include the following in your

[jira] [Created] (SPARK-32292) Run only relevant builds in parallel at Github Actions

2020-07-13 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-32292: Summary: Run only relevant builds in parallel at Github Actions Key: SPARK-32292 URL: https://issues.apache.org/jira/browse/SPARK-32292 Project: Spark Issue

[jira] [Commented] (SPARK-32253) Make readability better in the test result logs

2020-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156609#comment-17156609 ] Hyukjin Kwon commented on SPARK-32253: -- See also https://github.com/check-run-reporter/action >

[jira] [Resolved] (SPARK-32105) Refactor current script transform code

2020-07-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-32105. - Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 27983

[jira] [Assigned] (SPARK-32105) Refactor current script transform code

2020-07-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-32105: --- Assignee: angerszhu > Refactor current script transform code >

[jira] [Updated] (SPARK-30985) Propagate SPARK_CONF_DIR files to driver and exec pods.

2020-07-13 Thread Prashant Sharma (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Sharma updated SPARK-30985: Description: SPARK_CONF_DIR hosts configuration files like, 1) spark-defaults.conf -

[jira] [Commented] (SPARK-32289) Chinese characters are garbled when opening csv files with Excel

2020-07-13 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156548#comment-17156548 ] angerszhu commented on SPARK-32289: --- [https://bugs.java.com/bugdatabase/view_bug.do?bug_id=4508058] >

[jira] [Updated] (SPARK-32291) COALESCE should not reduce the child parallelism if it is Join

2020-07-13 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-32291: Attachment: coalesce.png > COALESCE should not reduce the child parallelism if it is Join >

[jira] [Updated] (SPARK-32291) COALESCE should not reduce the child parallelism if it is Join

2020-07-13 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-32291: Description: How to reproduce this issue: {code:scala} spark.range(100).createTempView("t1")

[jira] [Updated] (SPARK-32291) COALESCE should not reduce the child parallelism if it is Join

2020-07-13 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-32291: Attachment: repartition.png > COALESCE should not reduce the child parallelism if it is Join >

[jira] [Updated] (SPARK-32291) COALESCE should not reduce the child parallelism if it is Join

2020-07-13 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-32291: Attachment: COALESCE.png > COALESCE should not reduce the child parallelism if it is Join >

[jira] [Created] (SPARK-32291) COALESCE should not reduce the child parallelism if it is Join

2020-07-13 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-32291: --- Summary: COALESCE should not reduce the child parallelism if it is Join Key: SPARK-32291 URL: https://issues.apache.org/jira/browse/SPARK-32291 Project: Spark

[jira] [Commented] (SPARK-32220) Cartesian Product Hint cause data error

2020-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156540#comment-17156540 ] Apache Spark commented on SPARK-32220: -- User 'AngersZh' has created a pull request for this

[jira] [Updated] (SPARK-30985) Propagate SPARK_CONF_DIR files to driver and exec pods.

2020-07-13 Thread Prashant Sharma (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Sharma updated SPARK-30985: Component/s: (was: Spark Core) > Propagate SPARK_CONF_DIR files to driver and exec

[jira] [Updated] (SPARK-32290) NotInSubquery SingleColumn Optimize

2020-07-13 Thread Leanken.Lin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Leanken.Lin updated SPARK-32290: Fix Version/s: 3.0.1 > NotInSubquery SingleColumn Optimize > ---

[jira] [Created] (SPARK-32290) NotInSubquery SingleColumn Optimize

2020-07-13 Thread Leanken.Lin (Jira)
Leanken.Lin created SPARK-32290: --- Summary: NotInSubquery SingleColumn Optimize Key: SPARK-32290 URL: https://issues.apache.org/jira/browse/SPARK-32290 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-32226) JDBC TimeStamp predicates always append `.0`

2020-07-13 Thread Chen Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17156501#comment-17156501 ] Chen Zhang commented on SPARK-32226: [~thesuperzapper], glad to receive your reply. I don't have an