[jira] [Assigned] (SPARK-32975) [K8S] - executor fails to be restarted after it goes to ERROR/Failure state

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32975: Assignee: Apache Spark > [K8S] - executor fails to be restarted after it goes to ERROR/Fa

[jira] [Assigned] (SPARK-32975) [K8S] - executor fails to be restarted after it goes to ERROR/Failure state

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32975: Assignee: (was: Apache Spark) > [K8S] - executor fails to be restarted after it goes

[jira] [Commented] (SPARK-32975) [K8S] - executor fails to be restarted after it goes to ERROR/Failure state

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355513#comment-17355513 ] Apache Spark commented on SPARK-32975: -- User 'cchriswu' has created a pull request

[jira] [Created] (SPARK-35608) Support AQE optimizer side transformUpWithPruning

2021-06-01 Thread XiDuo You (Jira)
XiDuo You created SPARK-35608: - Summary: Support AQE optimizer side transformUpWithPruning Key: SPARK-35608 URL: https://issues.apache.org/jira/browse/SPARK-35608 Project: Spark Issue Type: Impro

[jira] [Resolved] (SPARK-35604) Fix condition check for FULL OUTER sort merge join

2021-06-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang resolved SPARK-35604. Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32736 [https:

[jira] [Assigned] (SPARK-35604) Fix condition check for FULL OUTER sort merge join

2021-06-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang reassigned SPARK-35604: -- Assignee: Cheng Su > Fix condition check for FULL OUTER sort merge join > ---

[jira] [Resolved] (SPARK-35594) Remove duplicate installations in build_and_test.yml

2021-06-01 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-35594. -- Resolution: Not A Problem > Remove duplicate installations in build_and_test.yml > ---

[jira] [Assigned] (SPARK-35474) Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35474: Assignee: (was: Apache Spark) > Enable disallow_untyped_defs mypy check for pyspark.p

[jira] [Assigned] (SPARK-35474) Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35474: Assignee: Apache Spark > Enable disallow_untyped_defs mypy check for pyspark.pandas.index

[jira] [Commented] (SPARK-35474) Enable disallow_untyped_defs mypy check for pyspark.pandas.indexing.

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355498#comment-17355498 ] Apache Spark commented on SPARK-35474: -- User 'pingsutw' has created a pull request

[jira] [Commented] (SPARK-35579) Fix a bug in janino or work around it in Spark.

2021-06-01 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355497#comment-17355497 ] Takeshi Yamamuro commented on SPARK-35579: -- I'm working on this. See: https://g

[jira] [Resolved] (SPARK-35583) Move JDBC data source options from Python and Scala into a single page

2021-06-01 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-35583. -- Fix Version/s: 3.2.0 Resolution: Fixed Fixed in https://github.com/apache/spark/pull/32

[jira] [Updated] (SPARK-35607) Migrate Spark Arm Job from Jenkins to GitHub Actions

2021-06-01 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yikun Jiang updated SPARK-35607: Description: There were two Arm CI jobs in AMP lab: [https://amplab.cs.berkeley.edu/jenkins/job/s

[jira] [Assigned] (SPARK-35077) Migrate to transformWithPruning for leftover optimizer rules

2021-06-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang reassigned SPARK-35077: -- Assignee: Yingyi Bu (was: Apache Spark) > Migrate to transformWithPruning for leftov

[jira] [Resolved] (SPARK-35077) Migrate to transformWithPruning for leftover optimizer rules

2021-06-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang resolved SPARK-35077. Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32721 [https:

[jira] [Updated] (SPARK-35607) Migrate Spark Arm Job from Jenkins to GitHub Actions

2021-06-01 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yikun Jiang updated SPARK-35607: Description: There were two Arm CI jobs in AMP lab: [https://amplab.cs.berkeley.edu/jenkins/job/s

[jira] [Updated] (SPARK-35607) Migrate Spark Arm Job from Jenkins to GitHub Actions

2021-06-01 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yikun Jiang updated SPARK-35607: Description: There were two Arm CI jobs in AMP lab: [https://amplab.cs.berkeley.edu/jenkins/job/s

[jira] [Created] (SPARK-35607) Migrate Spark Arm Job from Jenkins to GitHub Actions

2021-06-01 Thread Yikun Jiang (Jira)
Yikun Jiang created SPARK-35607: --- Summary: Migrate Spark Arm Job from Jenkins to GitHub Actions Key: SPARK-35607 URL: https://issues.apache.org/jira/browse/SPARK-35607 Project: Spark Issue Type

[jira] [Assigned] (SPARK-35606) List Python 3.9 installed libraries in build_and_test workflow

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35606: Assignee: (was: Apache Spark) > List Python 3.9 installed libraries in build_and_test

[jira] [Commented] (SPARK-35606) List Python 3.9 installed libraries in build_and_test workflow

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355466#comment-17355466 ] Apache Spark commented on SPARK-35606: -- User 'xinrong-databricks' has created a pul

[jira] [Assigned] (SPARK-35606) List Python 3.9 installed libraries in build_and_test workflow

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35606: Assignee: Apache Spark > List Python 3.9 installed libraries in build_and_test workflow >

[jira] [Created] (SPARK-35606) List Python 3.9 installed libraries in build_and_test workflow

2021-06-01 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-35606: Summary: List Python 3.9 installed libraries in build_and_test workflow Key: SPARK-35606 URL: https://issues.apache.org/jira/browse/SPARK-35606 Project: Spark

[jira] [Resolved] (SPARK-35100) Refactor AFT - support virtual centering

2021-06-01 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-35100. -- Resolution: Fixed > Refactor AFT - support virtual centering > ---

[jira] [Resolved] (SPARK-35560) Remove redundant subexpression evaluation in nested subexpressions

2021-06-01 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh resolved SPARK-35560. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32699 [https://gith

[jira] [Resolved] (SPARK-35595) Support multiple loggers in testing method withLogAppender

2021-06-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang resolved SPARK-35595. Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32725 [https:

[jira] [Assigned] (SPARK-35595) Support multiple loggers in testing method withLogAppender

2021-06-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang reassigned SPARK-35595: -- Assignee: Gengliang Wang > Support multiple loggers in testing method withLogAppender

[jira] [Updated] (SPARK-35605) Move pandas_on_spark accessor to the Spark DataFrame.

2021-06-01 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haejoon Lee updated SPARK-35605: Description: Inspired by https://github.com/apache/spark/pull/32729#discussion_r643591322, As Koa

[jira] [Commented] (SPARK-35605) Move pandas_on_spark accessor to the Spark DataFrame.

2021-06-01 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355438#comment-17355438 ] Haejoon Lee commented on SPARK-35605: - I'm working on this > Move pandas_on_spark a

[jira] [Created] (SPARK-35605) Move pandas_on_spark accessor to the Spark DataFrame.

2021-06-01 Thread Haejoon Lee (Jira)
Haejoon Lee created SPARK-35605: --- Summary: Move pandas_on_spark accessor to the Spark DataFrame. Key: SPARK-35605 URL: https://issues.apache.org/jira/browse/SPARK-35605 Project: Spark Issue Typ

[jira] [Assigned] (SPARK-35539) Restore to_koalas to keep the backward compatibility

2021-06-01 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-35539: Assignee: Haejoon Lee > Restore to_koalas to keep the backward compatibility > --

[jira] [Resolved] (SPARK-35539) Restore to_koalas to keep the backward compatibility

2021-06-01 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-35539. -- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32729 [https://gi

[jira] [Resolved] (SPARK-35600) Move Set command related test cases to a single test suite

2021-06-01 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-35600. -- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32732 [https://gi

[jira] [Commented] (SPARK-35604) Fix condition check for FULL OUTER sort merge join

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355428#comment-17355428 ] Apache Spark commented on SPARK-35604: -- User 'c21' has created a pull request for t

[jira] [Assigned] (SPARK-35604) Fix condition check for FULL OUTER sort merge join

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35604: Assignee: Apache Spark > Fix condition check for FULL OUTER sort merge join > ---

[jira] [Assigned] (SPARK-35604) Fix condition check for FULL OUTER sort merge join

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35604: Assignee: (was: Apache Spark) > Fix condition check for FULL OUTER sort merge join >

[jira] [Created] (SPARK-35604) Fix condition check for FULL OUTER sort merge join

2021-06-01 Thread Cheng Su (Jira)
Cheng Su created SPARK-35604: Summary: Fix condition check for FULL OUTER sort merge join Key: SPARK-35604 URL: https://issues.apache.org/jira/browse/SPARK-35604 Project: Spark Issue Type: Docume

[jira] [Resolved] (SPARK-35590) pyspark v3.1.1removed pyspark.streaming.kafka?

2021-06-01 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-35590. -- Resolution: Invalid > pyspark v3.1.1removed pyspark.streaming.kafka? > ---

[jira] [Comment Edited] (SPARK-35590) pyspark v3.1.1removed pyspark.streaming.kafka?

2021-06-01 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355422#comment-17355422 ] Hyukjin Kwon edited comment on SPARK-35590 at 6/2/21, 1:10 AM: ---

[jira] [Commented] (SPARK-35590) pyspark v3.1.1removed pyspark.streaming.kafka?

2021-06-01 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355422#comment-17355422 ] Hyukjin Kwon commented on SPARK-35590: -- I think we haven't added Kafka support for

[jira] [Updated] (SPARK-35603) Move data source options from R into a single page.

2021-06-01 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haejoon Lee updated SPARK-35603: Parent: (was: SPARK-34491) Issue Type: Documentation (was: Sub-task) > Move data sour

[jira] [Updated] (SPARK-35603) Move data source options from R into a single page.

2021-06-01 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haejoon Lee updated SPARK-35603: Description: We should consolidate the data source options from R documentation as well like we di

[jira] [Created] (SPARK-35603) Move data source options from R into a single page.

2021-06-01 Thread Haejoon Lee (Jira)
Haejoon Lee created SPARK-35603: --- Summary: Move data source options from R into a single page. Key: SPARK-35603 URL: https://issues.apache.org/jira/browse/SPARK-35603 Project: Spark Issue Type:

[jira] [Updated] (SPARK-34909) conv() does not convert negative inputs to unsigned correctly

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-34909: -- Labels: correctness (was: ) > conv() does not convert negative inputs to unsigned correctly >

[jira] [Updated] (SPARK-34756) Fix FileScan equality check

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-34756: -- Labels: correctness (was: ) > Fix FileScan equality check > --- > >

[jira] [Updated] (SPARK-34737) Discrepancy between TIMESTAMP_SECONDS and cast from float

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-34737: -- Issue Type: Bug (was: Improvement) > Discrepancy between TIMESTAMP_SECONDS and cast from floa

[jira] [Updated] (SPARK-34840) Fix cases of corruption in merged shuffle blocks that are pushed

2021-06-01 Thread Chandni Singh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chandni Singh updated SPARK-34840: -- Parent: SPARK-30602 Issue Type: Sub-task (was: Bug) > Fix cases of corruption in merg

[jira] [Updated] (SPARK-34727) Difference in results of casting float to timestamp

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-34727: -- Labels: correctness (was: ) > Difference in results of casting float to timestamp > -

[jira] [Closed] (SPARK-34540) Add convert_dtypes to the DataFrameLike protocol

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-34540. - > Add convert_dtypes to the DataFrameLike protocol > ---

[jira] [Assigned] (SPARK-35589) BlockManagerMasterEndpoint should not ignore index-only shuffle file during updating

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-35589: - Assignee: Dongjoon Hyun > BlockManagerMasterEndpoint should not ignore index-only shuff

[jira] [Resolved] (SPARK-35589) BlockManagerMasterEndpoint should not ignore index-only shuffle file during updating

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-35589. --- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32727 [https://

[jira] [Commented] (SPARK-35580) Support subexpression elimination for higher order functions

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355326#comment-17355326 ] Apache Spark commented on SPARK-35580: -- User 'viirya' has created a pull request fo

[jira] [Assigned] (SPARK-35580) Support subexpression elimination for higher order functions

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35580: Assignee: (was: Apache Spark) > Support subexpression elimination for higher order fu

[jira] [Assigned] (SPARK-35580) Support subexpression elimination for higher order functions

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35580: Assignee: Apache Spark > Support subexpression elimination for higher order functions > -

[jira] [Commented] (SPARK-35580) Support subexpression elimination for higher order functions

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355325#comment-17355325 ] Apache Spark commented on SPARK-35580: -- User 'viirya' has created a pull request fo

[jira] [Assigned] (SPARK-35423) The output of PCA is inconsistent

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35423: Assignee: (was: Apache Spark) > The output of PCA is inconsistent > -

[jira] [Commented] (SPARK-35423) The output of PCA is inconsistent

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355293#comment-17355293 ] Apache Spark commented on SPARK-35423: -- User 'shahidki31' has created a pull reques

[jira] [Assigned] (SPARK-35423) The output of PCA is inconsistent

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35423: Assignee: Apache Spark > The output of PCA is inconsistent >

[jira] [Updated] (SPARK-35343) Make conversion from/to pandas data-type-based

2021-06-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-35343: - Description: Make the conversion from/to pandas data-type-based. > Make conversion from/to panda

[jira] [Commented] (SPARK-34731) ConcurrentModificationException in EventLoggingListener when redacting properties

2021-06-01 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355289#comment-17355289 ] Bruce Robbins commented on SPARK-34731: --- I am working from memory, but I remember

[jira] [Created] (SPARK-35602) Job crashes with java.io.UTFDataFormatException: encoded string too long

2021-06-01 Thread dejan miljkovic (Jira)
dejan miljkovic created SPARK-35602: --- Summary: Job crashes with java.io.UTFDataFormatException: encoded string too long Key: SPARK-35602 URL: https://issues.apache.org/jira/browse/SPARK-35602 Projec

[jira] [Resolved] (SPARK-35314) Support arithmetic operations against bool IndexOpsMixin

2021-06-01 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-35314. --- Fix Version/s: 3.2.0 Assignee: Xinrong Meng Resolution: Fixed Issue resolved

[jira] [Created] (SPARK-35601) Support arithmetic operations against bool

2021-06-01 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-35601: Summary: Support arithmetic operations against bool Key: SPARK-35601 URL: https://issues.apache.org/jira/browse/SPARK-35601 Project: Spark Issue Type: Sub-ta

[jira] [Commented] (SPARK-35597) CTE With clause not working when using JDBC connection

2021-06-01 Thread Franck Thang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355258#comment-17355258 ] Franck Thang commented on SPARK-35597: -- What's your query ? It seems not related to

[jira] [Assigned] (SPARK-35402) Increase the max thread pool size of jetty server in HistoryServer UI

2021-06-01 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao reassigned SPARK-35402: Assignee: Kent Yao > Increase the max thread pool size of jetty server in HistoryServer UI > ---

[jira] [Resolved] (SPARK-35402) Increase the max thread pool size of jetty server in HistoryServer UI

2021-06-01 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao resolved SPARK-35402. -- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 32539 [https://github.com

[jira] [Comment Edited] (SPARK-35590) pyspark v3.1.1removed pyspark.streaming.kafka?

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355233#comment-17355233 ] Dongjoon Hyun edited comment on SPARK-35590 at 6/1/21, 4:58 PM: --

[jira] [Commented] (SPARK-35590) pyspark v3.1.1removed pyspark.streaming.kafka?

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355233#comment-17355233 ] Dongjoon Hyun commented on SPARK-35590: --- BTW, since this is reported to PySpark 3.

[jira] [Assigned] (SPARK-35600) Move Set command related test cases to a single test suite

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35600: Assignee: Gengliang Wang (was: Apache Spark) > Move Set command related test cases to a

[jira] [Assigned] (SPARK-35600) Move Set command related test cases to a single test suite

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35600: Assignee: Apache Spark (was: Gengliang Wang) > Move Set command related test cases to a

[jira] [Commented] (SPARK-35600) Move Set command related test cases to a single test suite

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355218#comment-17355218 ] Apache Spark commented on SPARK-35600: -- User 'gengliangwang' has created a pull req

[jira] [Assigned] (SPARK-35600) Move Set command related test cases to a single test suite

2021-06-01 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-35600: - Assignee: Gengliang Wang > Move Set command related test cases to a single test suite >

[jira] [Created] (SPARK-35600) Move Set command related test cases to a single test suite

2021-06-01 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-35600: - Summary: Move Set command related test cases to a single test suite Key: SPARK-35600 URL: https://issues.apache.org/jira/browse/SPARK-35600 Project: Spark

[jira] [Created] (SPARK-35599) Introduce a way to compare series of array for older pandas

2021-06-01 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-35599: Summary: Introduce a way to compare series of array for older pandas Key: SPARK-35599 URL: https://issues.apache.org/jira/browse/SPARK-35599 Project: Spark

[jira] [Created] (SPARK-35598) Improve Spark-ML PCA analysis

2021-06-01 Thread Antonio Zammuto (Jira)
Antonio Zammuto created SPARK-35598: --- Summary: Improve Spark-ML PCA analysis Key: SPARK-35598 URL: https://issues.apache.org/jira/browse/SPARK-35598 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-35402) Increase the max thread pool size of jetty server in HistoryServer UI

2021-06-01 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao updated SPARK-35402: - Summary: Increase the max thread pool size of jetty server in HistoryServer UI (was: Make thread pool

[jira] [Assigned] (SPARK-35596) HighlyCompressedMapStatus should record accurately the size of skewed shuffle blocks

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35596: Assignee: Apache Spark > HighlyCompressedMapStatus should record accurately the size of s

[jira] [Assigned] (SPARK-35596) HighlyCompressedMapStatus should record accurately the size of skewed shuffle blocks

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35596: Assignee: (was: Apache Spark) > HighlyCompressedMapStatus should record accurately th

[jira] [Commented] (SPARK-35596) HighlyCompressedMapStatus should record accurately the size of skewed shuffle blocks

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355183#comment-17355183 ] Apache Spark commented on SPARK-35596: -- User 'exmy' has created a pull request for

[jira] [Commented] (SPARK-35557) Adapt uses of JDK 17 Internal APIs

2021-06-01 Thread Ismaël Mejía (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355182#comment-17355182 ] Ismaël Mejía commented on SPARK-35557: -- Spark probably won't support intermediary v

[jira] [Created] (SPARK-35597) CTE With clause not working when using JDBC connection

2021-06-01 Thread Randall Suárez (Jira)
Randall Suárez created SPARK-35597: -- Summary: CTE With clause not working when using JDBC connection Key: SPARK-35597 URL: https://issues.apache.org/jira/browse/SPARK-35597 Project: Spark Is

[jira] [Comment Edited] (SPARK-35557) Adapt uses of JDK 17 Internal APIs

2021-06-01 Thread Doychin Bondzhev (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355166#comment-17355166 ] Doychin Bondzhev edited comment on SPARK-35557 at 6/1/21, 3:25 PM: ---

[jira] [Commented] (SPARK-35557) Adapt uses of JDK 17 Internal APIs

2021-06-01 Thread Doychin Bondzhev (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355166#comment-17355166 ] Doychin Bondzhev commented on SPARK-35557: -- Same applies to JDK 16 > Adapt use

[jira] [Created] (SPARK-35596) HighlyCompressedMapStatus should record accurately the size of skewed shuffle blocks

2021-06-01 Thread exmy (Jira)
exmy created SPARK-35596: Summary: HighlyCompressedMapStatus should record accurately the size of skewed shuffle blocks Key: SPARK-35596 URL: https://issues.apache.org/jira/browse/SPARK-35596 Project: Spark

[jira] [Resolved] (SPARK-35577) Allow to log container output for docker integration tests

2021-06-01 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro resolved SPARK-35577. -- Fix Version/s: 3.2.0 Resolution: Fixed Resolved by https://github.com/apache/sp

[jira] [Updated] (SPARK-35539) Restore to_koalas to keep the backward compatibility

2021-06-01 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen updated SPARK-35539: - Priority: Minor (was: Major) > Restore to_koalas to keep the backward compatibility > -

[jira] [Commented] (SPARK-35595) Support multiple loggers in testing method withLogAppender

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355101#comment-17355101 ] Apache Spark commented on SPARK-35595: -- User 'gengliangwang' has created a pull req

[jira] [Assigned] (SPARK-35595) Support multiple loggers in testing method withLogAppender

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35595: Assignee: (was: Apache Spark) > Support multiple loggers in testing method withLogApp

[jira] [Commented] (SPARK-35595) Support multiple loggers in testing method withLogAppender

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355100#comment-17355100 ] Apache Spark commented on SPARK-35595: -- User 'gengliangwang' has created a pull req

[jira] [Assigned] (SPARK-35595) Support multiple loggers in testing method withLogAppender

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35595: Assignee: Apache Spark > Support multiple loggers in testing method withLogAppender > ---

[jira] [Created] (SPARK-35595) Support multiple loggers in testing method withLogAppender

2021-06-01 Thread Gengliang Wang (Jira)
Gengliang Wang created SPARK-35595: -- Summary: Support multiple loggers in testing method withLogAppender Key: SPARK-35595 URL: https://issues.apache.org/jira/browse/SPARK-35595 Project: Spark

[jira] [Updated] (SPARK-35563) [SQL] Window operations with over Int.MaxValue + 1 rows can silently drop rows

2021-06-01 Thread Robert Joseph Evans (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Joseph Evans updated SPARK-35563: Labels: data-loss (was: ) > [SQL] Window operations with over Int.MaxValue + 1 ro

[jira] [Updated] (SPARK-35563) [SQL] Window operations with over Int.MaxValue + 1 rows can silently drop rows

2021-06-01 Thread Robert Joseph Evans (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Joseph Evans updated SPARK-35563: Priority: Blocker (was: Major) > [SQL] Window operations with over Int.MaxValue +

[jira] [Commented] (SPARK-35089) non consistent results running count for same dataset after filter and lead window function

2021-06-01 Thread Robert Joseph Evans (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355088#comment-17355088 ] Robert Joseph Evans commented on SPARK-35089: - I should add that the above "

[jira] [Commented] (SPARK-35594) Remove duplicate installations in build_and_test.yml

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355086#comment-17355086 ] Apache Spark commented on SPARK-35594: -- User 'pingsutw' has created a pull request

[jira] [Commented] (SPARK-35594) Remove duplicate installations in build_and_test.yml

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355085#comment-17355085 ] Apache Spark commented on SPARK-35594: -- User 'pingsutw' has created a pull request

[jira] [Assigned] (SPARK-35594) Remove duplicate installations in build_and_test.yml

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35594: Assignee: Apache Spark > Remove duplicate installations in build_and_test.yml > -

[jira] [Assigned] (SPARK-35594) Remove duplicate installations in build_and_test.yml

2021-06-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35594: Assignee: (was: Apache Spark) > Remove duplicate installations in build_and_test.yml

[jira] [Commented] (SPARK-35089) non consistent results running count for same dataset after filter and lead window function

2021-06-01 Thread Robert Joseph Evans (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355083#comment-17355083 ] Robert Joseph Evans commented on SPARK-35089: - [~Tonzetic] to be clear my po

[jira] [Assigned] (SPARK-35516) Storage UI tab Storage Level tool tip correction

2021-06-01 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta reassigned SPARK-35516: -- Assignee: Lidiya Nixon > Storage UI tab Storage Level tool tip correction > -

[jira] [Created] (SPARK-35594) Remove duplicate installations in build_and_test.yml

2021-06-01 Thread Kevin Su (Jira)
Kevin Su created SPARK-35594: Summary: Remove duplicate installations in build_and_test.yml Key: SPARK-35594 URL: https://issues.apache.org/jira/browse/SPARK-35594 Project: Spark Issue Type: Impr
