[jira] [Assigned] (SPARK-38840) Enable spark.sql.parquet.enableNestedColumnVectorizedReader on master branch by default

2022-04-08 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh reassigned SPARK-38840: --- Assignee: Chao Sun > Enable spark.sql.parquet.enableNestedColumnVectorizedReader on master

[jira] [Resolved] (SPARK-38840) Enable spark.sql.parquet.enableNestedColumnVectorizedReader on master branch by default

2022-04-08 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh resolved SPARK-38840. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36124 [https://gith

[jira] [Created] (SPARK-38842) Replace all the ArithmeticException with SparkArithmeticException

2022-04-08 Thread Gengliang Wang (Jira)
Gengliang Wang created SPARK-38842: -- Summary: Replace all the ArithmeticException with SparkArithmeticException Key: SPARK-38842 URL: https://issues.apache.org/jira/browse/SPARK-38842 Project: Spark

[jira] [Commented] (SPARK-37960) A new framework to represent catalyst expressions in DS v2 APIs

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519860#comment-17519860 ] Apache Spark commented on SPARK-37960: -- User 'beliefer' has created a pull request

[jira] [Commented] (SPARK-37960) A new framework to represent catalyst expressions in DS v2 APIs

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519859#comment-17519859 ] Apache Spark commented on SPARK-37960: -- User 'beliefer' has created a pull request

[jira] [Resolved] (SPARK-38841) Enable Bloom filter join by default

2022-04-08 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-38841. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36102 [https://gith

[jira] [Assigned] (SPARK-38841) Enable Bloom filter join by default

2022-04-08 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-38841: --- Assignee: Yingyi Bu > Enable Bloom filter join by default > ---

[jira] [Assigned] (SPARK-38841) Enable Bloom filter join by default

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38841: Assignee: (was: Apache Spark) > Enable Bloom filter join by default > ---

[jira] [Commented] (SPARK-38841) Enable Bloom filter join by default

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519848#comment-17519848 ] Apache Spark commented on SPARK-38841: -- User 'andylam-db' has created a pull reques

[jira] [Assigned] (SPARK-38841) Enable Bloom filter join by default

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38841: Assignee: Apache Spark > Enable Bloom filter join by default > --

[jira] [Commented] (SPARK-38841) Enable Bloom filter join by default

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519849#comment-17519849 ] Apache Spark commented on SPARK-38841: -- User 'andylam-db' has created a pull reques

[jira] [Commented] (SPARK-38840) Enable spark.sql.parquet.enableNestedColumnVectorizedReader on master branch by default

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519847#comment-17519847 ] Apache Spark commented on SPARK-38840: -- User 'sunchao' has created a pull request f

[jira] [Assigned] (SPARK-38840) Enable spark.sql.parquet.enableNestedColumnVectorizedReader on master branch by default

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38840: Assignee: (was: Apache Spark) > Enable spark.sql.parquet.enableNestedColumnVectorized

[jira] [Assigned] (SPARK-38840) Enable spark.sql.parquet.enableNestedColumnVectorizedReader on master branch by default

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38840: Assignee: Apache Spark > Enable spark.sql.parquet.enableNestedColumnVectorizedReader on m

[jira] [Created] (SPARK-38841) Enable Bloom filter join by default

2022-04-08 Thread Yingyi Bu (Jira)
Yingyi Bu created SPARK-38841: - Summary: Enable Bloom filter join by default Key: SPARK-38841 URL: https://issues.apache.org/jira/browse/SPARK-38841 Project: Spark Issue Type: Bug Compo

[jira] [Created] (SPARK-38840) Enable spark.sql.parquet.enableNestedColumnVectorizedReader on master branch by default

2022-04-08 Thread Chao Sun (Jira)
Chao Sun created SPARK-38840: Summary: Enable spark.sql.parquet.enableNestedColumnVectorizedReader on master branch by default Key: SPARK-38840 URL: https://issues.apache.org/jira/browse/SPARK-38840 Proje

[jira] [Commented] (SPARK-34863) Support nested column in Spark Parquet vectorized readers

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519809#comment-17519809 ] Apache Spark commented on SPARK-34863: -- User 'sunchao' has created a pull request f

[jira] [Commented] (SPARK-34863) Support nested column in Spark Parquet vectorized readers

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519810#comment-17519810 ] Apache Spark commented on SPARK-34863: -- User 'sunchao' has created a pull request f

[jira] [Updated] (SPARK-38839) Creating a struct with a float inside

2022-04-08 Thread Daniel deCordoba (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel deCordoba updated SPARK-38839: - Description: When creating a dataframe using createDataFrame that contains a float insid

[jira] [Updated] (SPARK-38839) Creating a struct with a float inside

2022-04-08 Thread Daniel deCordoba (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel deCordoba updated SPARK-38839: - Description: When creating a dataframe using createDataFrame that contains a float insid

[jira] [Comment Edited] (SPARK-38839) Creating a struct with a float inside

2022-04-08 Thread Daniel deCordoba (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519798#comment-17519798 ] Daniel deCordoba edited comment on SPARK-38839 at 4/8/22 9:04 PM:

[jira] [Commented] (SPARK-38839) Creating a struct with a float inside

2022-04-08 Thread Daniel deCordoba (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519798#comment-17519798 ] Daniel deCordoba commented on SPARK-38839: -- The style got messed up, hopefully

[jira] [Updated] (SPARK-38839) Creating a struct with a float inside

2022-04-08 Thread Daniel deCordoba (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel deCordoba updated SPARK-38839: - Description: When creating a dataframe using createDataFrame that contains a float insid

[jira] [Updated] (SPARK-38839) Creating a struct with a float inside

2022-04-08 Thread Daniel deCordoba (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel deCordoba updated SPARK-38839: - Description: When creating a dataframe using createDataFrame that contains a float insid

[jira] [Updated] (SPARK-38839) Creating a struct with a float inside

2022-04-08 Thread Daniel deCordoba (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel deCordoba updated SPARK-38839: - Description: When creating a dataframe using createDataFrame that contains a float insid

[jira] [Created] (SPARK-38839) Creating a struct with a float inside

2022-04-08 Thread Daniel deCordoba (Jira)
Daniel deCordoba created SPARK-38839: Summary: Creating a struct with a float inside Key: SPARK-38839 URL: https://issues.apache.org/jira/browse/SPARK-38839 Project: Spark Issue Type: Bu

[jira] [Commented] (SPARK-38838) Support ALTER TABLE ALTER COLUMN commands with DEFAULT values

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519755#comment-17519755 ] Apache Spark commented on SPARK-38838: -- User 'dtenedor' has created a pull request

[jira] [Commented] (SPARK-38838) Support ALTER TABLE ALTER COLUMN commands with DEFAULT values

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519753#comment-17519753 ] Apache Spark commented on SPARK-38838: -- User 'dtenedor' has created a pull request

[jira] [Assigned] (SPARK-38838) Support ALTER TABLE ALTER COLUMN commands with DEFAULT values

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38838: Assignee: Apache Spark > Support ALTER TABLE ALTER COLUMN commands with DEFAULT values >

[jira] [Assigned] (SPARK-38838) Support ALTER TABLE ALTER COLUMN commands with DEFAULT values

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38838: Assignee: (was: Apache Spark) > Support ALTER TABLE ALTER COLUMN commands with DEFAUL

[jira] [Created] (SPARK-38838) Support ALTER TABLE ALTER COLUMN commands with DEFAULT values

2022-04-08 Thread Daniel (Jira)
Daniel created SPARK-38838: -- Summary: Support ALTER TABLE ALTER COLUMN commands with DEFAULT values Key: SPARK-38838 URL: https://issues.apache.org/jira/browse/SPARK-38838 Project: Spark Issue Type

[jira] [Commented] (SPARK-38811) Support ALTER TABLE ADD COLUMN commands with DEFAULT values

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519751#comment-17519751 ] Apache Spark commented on SPARK-38811: -- User 'dtenedor' has created a pull request

[jira] [Commented] (SPARK-38837) Implement `dropna` parameter of `SeriesGroupBy.value_counts`

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519745#comment-17519745 ] Apache Spark commented on SPARK-38837: -- User 'xinrong-databricks' has created a pul

[jira] [Assigned] (SPARK-38837) Implement `dropna` parameter of `SeriesGroupBy.value_counts`

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38837: Assignee: Apache Spark > Implement `dropna` parameter of `SeriesGroupBy.value_counts` > -

[jira] [Commented] (SPARK-38837) Implement `dropna` parameter of `SeriesGroupBy.value_counts`

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519744#comment-17519744 ] Apache Spark commented on SPARK-38837: -- User 'xinrong-databricks' has created a pul

[jira] [Assigned] (SPARK-38837) Implement `dropna` parameter of `SeriesGroupBy.value_counts`

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38837: Assignee: (was: Apache Spark) > Implement `dropna` parameter of `SeriesGroupBy.value_

[jira] [Created] (SPARK-38837) Implement `dropna` parameter of `SeriesGroupBy.value_counts`

2022-04-08 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-38837: Summary: Implement `dropna` parameter of `SeriesGroupBy.value_counts` Key: SPARK-38837 URL: https://issues.apache.org/jira/browse/SPARK-38837 Project: Spark

[jira] [Assigned] (SPARK-38836) Increase the performance of ExpressionSet

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38836: Assignee: (was: Apache Spark) > Increase the performance of ExpressionSet > -

[jira] [Assigned] (SPARK-38836) Increase the performance of ExpressionSet

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38836: Assignee: Apache Spark > Increase the performance of ExpressionSet >

[jira] [Commented] (SPARK-38836) Increase the performance of ExpressionSet

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519741#comment-17519741 ] Apache Spark commented on SPARK-38836: -- User 'minyyy' has created a pull request fo

[jira] [Created] (SPARK-38836) Increase the performance of ExpressionSet

2022-04-08 Thread Min Yang (Jira)
Min Yang created SPARK-38836: Summary: Increase the performance of ExpressionSet Key: SPARK-38836 URL: https://issues.apache.org/jira/browse/SPARK-38836 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-38833) PySpark applyInPandas should allow to return empty DataFrame without columns

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519713#comment-17519713 ] Apache Spark commented on SPARK-38833: -- User 'EnricoMi' has created a pull request

[jira] [Assigned] (SPARK-38833) PySpark applyInPandas should allow to return empty DataFrame without columns

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38833: Assignee: (was: Apache Spark) > PySpark applyInPandas should allow to return empty Da

[jira] [Assigned] (SPARK-38833) PySpark applyInPandas should allow to return empty DataFrame without columns

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38833: Assignee: Apache Spark > PySpark applyInPandas should allow to return empty DataFrame wit

[jira] [Commented] (SPARK-38833) PySpark applyInPandas should allow to return empty DataFrame without columns

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519712#comment-17519712 ] Apache Spark commented on SPARK-38833: -- User 'EnricoMi' has created a pull request

[jira] [Commented] (SPARK-38835) Refactor FsHistoryProviderSuite to test rocks db

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519684#comment-17519684 ] Apache Spark commented on SPARK-38835: -- User 'LuciferYang' has created a pull reque

[jira] [Assigned] (SPARK-38835) Refactor FsHistoryProviderSuite to test rocks db

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38835: Assignee: Apache Spark > Refactor FsHistoryProviderSuite to test rocks db > -

[jira] [Assigned] (SPARK-38835) Refactor FsHistoryProviderSuite to test rocks db

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38835: Assignee: (was: Apache Spark) > Refactor FsHistoryProviderSuite to test rocks db > --

[jira] [Commented] (SPARK-38835) Refactor FsHistoryProviderSuite to test rocks db

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519685#comment-17519685 ] Apache Spark commented on SPARK-38835: -- User 'LuciferYang' has created a pull reque

[jira] [Created] (SPARK-38835) Refactor FsHistoryProviderSuite to test rocks db

2022-04-08 Thread Yang Jie (Jira)
Yang Jie created SPARK-38835: Summary: Refactor FsHistoryProviderSuite to test rocks db Key: SPARK-38835 URL: https://issues.apache.org/jira/browse/SPARK-38835 Project: Spark Issue Type: Improvem

[jira] [Resolved] (SPARK-38834) Update the version of TimestampNTZ related changes as 3.4.0

2022-04-08 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang resolved SPARK-38834. Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36118 [https:

[jira] [Commented] (SPARK-38834) Update the version of TimestampNTZ related changes as 3.4.0

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519571#comment-17519571 ] Apache Spark commented on SPARK-38834: -- User 'gengliangwang' has created a pull req

[jira] [Assigned] (SPARK-38834) Update the version of TimestampNTZ related changes as 3.4.0

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38834: Assignee: Gengliang Wang (was: Apache Spark) > Update the version of TimestampNTZ relate

[jira] [Commented] (SPARK-38834) Update the version of TimestampNTZ related changes as 3.4.0

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519570#comment-17519570 ] Apache Spark commented on SPARK-38834: -- User 'gengliangwang' has created a pull req

[jira] [Assigned] (SPARK-38834) Update the version of TimestampNTZ related changes as 3.4.0

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38834: Assignee: Apache Spark (was: Gengliang Wang) > Update the version of TimestampNTZ relate

[jira] [Updated] (SPARK-35662) Support Timestamp without time zone data type

2022-04-08 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-35662: --- Affects Version/s: 3.4.0 (was: 3.3.0) > Support Timestamp without

[jira] [Created] (SPARK-38834) Update the version of TimestampNTZ related changes as 3.4.0

2022-04-08 Thread Gengliang Wang (Jira)
Gengliang Wang created SPARK-38834: -- Summary: Update the version of TimestampNTZ related changes as 3.4.0 Key: SPARK-38834 URL: https://issues.apache.org/jira/browse/SPARK-38834 Project: Spark

[jira] [Resolved] (SPARK-38813) Remove TimestampNTZ type support in Spark 3.3

2022-04-08 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang resolved SPARK-38813. Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 36094 [https:

[jira] [Updated] (SPARK-38833) PySpark applyInPandas should allow to return empty DataFrame without columns

2022-04-08 Thread Enrico Minack (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Enrico Minack updated SPARK-38833: -- Summary: PySpark applyInPandas should allow to return empty DataFrame without columns (was: P

[jira] [Created] (SPARK-38833) PySpark allows applyInPandas return empty DataFrame without columns

2022-04-08 Thread Enrico Minack (Jira)
Enrico Minack created SPARK-38833: - Summary: PySpark allows applyInPandas return empty DataFrame without columns Key: SPARK-38833 URL: https://issues.apache.org/jira/browse/SPARK-38833 Project: Spark

[jira] [Commented] (SPARK-38832) Remove unnecessary distinct in aggregate expression by distinctKeys

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519548#comment-17519548 ] Apache Spark commented on SPARK-38832: -- User 'ulysses-you' has created a pull reque

[jira] [Assigned] (SPARK-38832) Remove unnecessary distinct in aggregate expression by distinctKeys

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38832: Assignee: (was: Apache Spark) > Remove unnecessary distinct in aggregate expression b

[jira] [Assigned] (SPARK-38832) Remove unnecessary distinct in aggregate expression by distinctKeys

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38832: Assignee: Apache Spark > Remove unnecessary distinct in aggregate expression by distinctK

[jira] [Commented] (SPARK-38832) Remove unnecessary distinct in aggregate expression by distinctKeys

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519545#comment-17519545 ] Apache Spark commented on SPARK-38832: -- User 'ulysses-you' has created a pull reque

[jira] [Updated] (SPARK-38832) Remove unnecessary distinct in aggregate expression by distinctKeys

2022-04-08 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-38832: -- Description: We can remove the distinct in aggregate expression if the child distinct semantics is gu

[jira] [Created] (SPARK-38832) Remove unnecessary distinct in aggregate expression by distinctKeys

2022-04-08 Thread XiDuo You (Jira)
XiDuo You created SPARK-38832: - Summary: Remove unnecessary distinct in aggregate expression by distinctKeys Key: SPARK-38832 URL: https://issues.apache.org/jira/browse/SPARK-38832 Project: Spark

[jira] [Resolved] (SPARK-38825) Add a test to cover parquet notIn filter

2022-04-08 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-38825. - Fix Version/s: 3.3.0 Assignee: Huaxin Gao Resolution: Fixed > Add a test to cove

[jira] [Created] (SPARK-38831) How to enable encryption for checkpoint data?

2022-04-08 Thread zoli (Jira)
zoli created SPARK-38831: Summary: How to enable encryption for checkpoint data? Key: SPARK-38831 URL: https://issues.apache.org/jira/browse/SPARK-38831 Project: Spark Issue Type: Question

[jira] [Commented] (SPARK-38830) Warn corrupted Netty RPC messages

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519491#comment-17519491 ] Apache Spark commented on SPARK-38830: -- User 'dongjoon-hyun' has created a pull req

[jira] [Assigned] (SPARK-38830) Warn corrupted Netty RPC messages

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38830: Assignee: (was: Apache Spark) > Warn corrupted Netty RPC messages > -

[jira] [Assigned] (SPARK-38830) Warn corrupted Netty RPC messages

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38830: Assignee: Apache Spark > Warn corrupted Netty RPC messages >

[jira] [Commented] (SPARK-38830) Warn corrupted Netty RPC messages

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519489#comment-17519489 ] Apache Spark commented on SPARK-38830: -- User 'dongjoon-hyun' has created a pull req

[jira] [Created] (SPARK-38830) Warn corrupted Netty RPC messages

2022-04-08 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-38830: - Summary: Warn corrupted Netty RPC messages Key: SPARK-38830 URL: https://issues.apache.org/jira/browse/SPARK-38830 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-38822) Raise indexError when insert loc is out of bounds

2022-04-08 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yikun Jiang updated SPARK-38822: Description:   [https://github.com/apache/spark/blob/becda3339381b3975ed567c156260eda036d7a1b/pyt

[jira] [Updated] (SPARK-38829) New configuration for controlling timestamp inference of Parquet

2022-04-08 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-38829: --- Description: A new SQL conf which can fallback to the behavior that reads all the Parquet Ti

[jira] [Assigned] (SPARK-38822) Raise indexError when insert loc is out of bounds

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38822: Assignee: (was: Apache Spark) > Raise indexError when insert loc is out of bounds > -

[jira] [Assigned] (SPARK-38822) Raise indexError when insert loc is out of bounds

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-38822: Assignee: Apache Spark > Raise indexError when insert loc is out of bounds >

[jira] [Created] (SPARK-38829) New configuration for controlling timestamp inference of Parquet

2022-04-08 Thread Gengliang Wang (Jira)
Gengliang Wang created SPARK-38829: -- Summary: New configuration for controlling timestamp inference of Parquet Key: SPARK-38829 URL: https://issues.apache.org/jira/browse/SPARK-38829 Project: Spark

[jira] [Commented] (SPARK-38822) Raise indexError when insert loc is out of bounds

2022-04-08 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519410#comment-17519410 ] Apache Spark commented on SPARK-38822: -- User 'Yikun' has created a pull request for

[jira] [Updated] (SPARK-38813) Remove TimestampNTZ type support in Spark 3.3

2022-04-08 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-38813: --- Description: Note that this one doesn't include the PySpark part. See also: https://issues.

[jira] [Created] (SPARK-38828) Remove TimestampNTZ type Python support in Spark 3.3

2022-04-08 Thread Gengliang Wang (Jira)
Gengliang Wang created SPARK-38828: -- Summary: Remove TimestampNTZ type Python support in Spark 3.3 Key: SPARK-38828 URL: https://issues.apache.org/jira/browse/SPARK-38828 Project: Spark Issu

[jira] [Updated] (SPARK-38822) Raise indexError when insert loc is out of bounds

2022-04-08 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yikun Jiang updated SPARK-38822: Description:   [https://github.com/apache/spark/blob/becda3339381b3975ed567c156260eda036d7a1b/pyt

[jira] [Resolved] (SPARK-38827) Improve the test coverage for pyspark/find_spark_home.py

2022-04-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-38827. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 36112 [https://gi

[jira] [Assigned] (SPARK-38827) Improve the test coverage for pyspark/find_spark_home.py

2022-04-08 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-38827: Assignee: Hyukjin Kwon > Improve the test coverage for pyspark/find_spark_home.py > -

[jira] [Resolved] (SPARK-38803) Set minio cpu to 250m (0.25) in K8s IT

2022-04-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-38803. --- Fix Version/s: 3.4.0 Assignee: Yikun Jiang Resolution: Fixed This is resolve

[jira] [Updated] (SPARK-38803) Set minio cpu to 250m (0.25) in K8s IT

2022-04-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-38803: -- Issue Type: Improvement (was: Bug) > Set minio cpu to 250m (0.25) in K8s IT > ---

[jira] [Updated] (SPARK-38824) Bug in async commit of Kafka offset in DirectKafkaInputDStream

2022-04-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-38824: -- Component/s: DStreams (was: Spark Core) > Bug in async commit of Kafka of

[jira] [Commented] (SPARK-38824) Bug in async commit of Kafka offset in DirectKafkaInputDStream

2022-04-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519363#comment-17519363 ] Dongjoon Hyun commented on SPARK-38824: --- cc [~viirya] > Bug in async commit of Ka

[jira] [Commented] (SPARK-18208) Executor OOM due to a memory leak in BytesToBytesMap

2022-04-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519361#comment-17519361 ] Dongjoon Hyun commented on SPARK-18208: --- [~connectsachit]. Sorry but there is noth

[jira] [Comment Edited] (SPARK-37660) Spark-3.2.0 Fetch Hbase Data not working

2022-04-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519350#comment-17519350 ] Dongjoon Hyun edited comment on SPARK-37660 at 4/8/22 7:00 AM: ---

[jira] [Comment Edited] (SPARK-37660) Spark-3.2.0 Fetch Hbase Data not working

2022-04-08 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519350#comment-17519350 ] Dongjoon Hyun edited comment on SPARK-37660 at 4/8/22 6:59 AM: ---