[jira] [Comment Edited] (SPARK-34210) Cannot create a record reader because of a previous error when spark accesses the hive on HBase table

2022-10-26 Thread zhangzhanchang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624873#comment-17624873 ] zhangzhanchang edited comment on SPARK-34210 at 10/27/22 6:09 AM:

[jira] [Commented] (SPARK-34210) Cannot create a record reader because of a previous error when spark accesses the hive on HBase table

2022-10-26 Thread zhangzhanchang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624873#comment-17624873 ] zhangzhanchang commented on SPARK-34210: The reason for not merging into the mai

[jira] [Resolved] (SPARK-40858) Cleanup github action warning

2022-10-26 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40858. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38408 [https://gi

[jira] [Assigned] (SPARK-40858) Cleanup github action warning

2022-10-26 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40858: Assignee: Yikun Jiang > Cleanup github action warning > - > >

[jira] [Resolved] (SPARK-40929) Add official image dockerfile for Spark v3.3.1

2022-10-26 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-40929. - Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 20 [https://github.

[jira] [Assigned] (SPARK-40929) Add official image dockerfile for Spark v3.3.1

2022-10-26 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-40929: --- Assignee: Yikun Jiang > Add official image dockerfile for Spark v3.3.1 > --

[jira] [Assigned] (SPARK-40919) Bad case of `AnalysisTest#assertAnalysisErrorClass` when `expectedMessageParameters.size between [2, 4]`

2022-10-26 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk reassigned SPARK-40919: Assignee: Yang Jie > Bad case of `AnalysisTest#assertAnalysisErrorClass` when > `expectedMessage

[jira] [Resolved] (SPARK-40919) Bad case of `AnalysisTest#assertAnalysisErrorClass` when `expectedMessageParameters.size between [2, 4]`

2022-10-26 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-40919. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38396 [https://github.com

[jira] [Commented] (SPARK-36392) pandas fixed width file support

2022-10-26 Thread Haejoon Lee (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624856#comment-17624856 ] Haejoon Lee commented on SPARK-36392: - Thanks for the reminder. Let me take a look t

[jira] [Commented] (SPARK-40930) Support Collect() in Python client

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624835#comment-17624835 ] Apache Spark commented on SPARK-40930: -- User 'amaliujia' has created a pull request

[jira] [Assigned] (SPARK-40930) Support Collect() in Python client

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40930: Assignee: Apache Spark > Support Collect() in Python client > ---

[jira] [Commented] (SPARK-40930) Support Collect() in Python client

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624833#comment-17624833 ] Apache Spark commented on SPARK-40930: -- User 'amaliujia' has created a pull request

[jira] [Assigned] (SPARK-40930) Support Collect() in Python client

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40930: Assignee: (was: Apache Spark) > Support Collect() in Python client >

[jira] [Created] (SPARK-40930) Support Collect() in Python client

2022-10-26 Thread Rui Wang (Jira)
Rui Wang created SPARK-40930: Summary: Support Collect() in Python client Key: SPARK-40930 URL: https://issues.apache.org/jira/browse/SPARK-40930 Project: Spark Issue Type: Sub-task Com

[jira] [Created] (SPARK-40929) Add official image dockerfile for Spark v3.3.1

2022-10-26 Thread Yikun Jiang (Jira)
Yikun Jiang created SPARK-40929: --- Summary: Add official image dockerfile for Spark v3.3.1 Key: SPARK-40929 URL: https://issues.apache.org/jira/browse/SPARK-40929 Project: Spark Issue Type: Sub-

[jira] [Commented] (SPARK-40858) Cleanup github action warning

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624823#comment-17624823 ] Apache Spark commented on SPARK-40858: -- User 'Yikun' has created a pull request for

[jira] [Assigned] (SPARK-40858) Cleanup github action warning

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40858: Assignee: (was: Apache Spark) > Cleanup github action warning > -

[jira] [Commented] (SPARK-40858) Cleanup github action warning

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624822#comment-17624822 ] Apache Spark commented on SPARK-40858: -- User 'Yikun' has created a pull request for

[jira] [Assigned] (SPARK-40858) Cleanup github action warning

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40858: Assignee: Apache Spark > Cleanup github action warning > - >

[jira] [Commented] (SPARK-40858) Cleanup github action warning

2022-10-26 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624820#comment-17624820 ] Yikun Jiang commented on SPARK-40858: - [~hyukjin.kwon] According latest https://gi

[jira] [Created] (SPARK-40928) Upgrade actions/setup-python to v4

2022-10-26 Thread Yikun Jiang (Jira)
Yikun Jiang created SPARK-40928: --- Summary: Upgrade actions/setup-python to v4 Key: SPARK-40928 URL: https://issues.apache.org/jira/browse/SPARK-40928 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-34265) Instrument Python UDF execution using SQL Metrics

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624809#comment-17624809 ] Apache Spark commented on SPARK-34265: -- User 'HyukjinKwon' has created a pull reque

[jira] [Commented] (SPARK-34265) Instrument Python UDF execution using SQL Metrics

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624810#comment-17624810 ] Apache Spark commented on SPARK-34265: -- User 'HyukjinKwon' has created a pull reque

[jira] [Commented] (SPARK-36392) pandas fixed width file support

2022-10-26 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624807#comment-17624807 ] Hyukjin Kwon commented on SPARK-36392: -- I think it won'd be super difficult to impl

[jira] [Commented] (SPARK-40858) Cleanup github action warning

2022-10-26 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624806#comment-17624806 ] Hyukjin Kwon commented on SPARK-40858: -- Is it all done [~yikunkero]? (just asking o

[jira] [Resolved] (SPARK-40914) Mark internal API to be private[connect]

2022-10-26 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-40914. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 38392 [https://gi

[jira] [Assigned] (SPARK-40914) Mark internal API to be private[connect]

2022-10-26 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-40914: Assignee: Rui Wang > Mark internal API to be private[connect] > -

[jira] [Commented] (SPARK-40926) Refactor server side tests to only use DataFrame API

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624799#comment-17624799 ] Apache Spark commented on SPARK-40926: -- User 'amaliujia' has created a pull request

[jira] [Commented] (SPARK-40926) Refactor server side tests to only use DataFrame API

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624798#comment-17624798 ] Apache Spark commented on SPARK-40926: -- User 'amaliujia' has created a pull request

[jira] [Assigned] (SPARK-40926) Refactor server side tests to only use DataFrame API

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40926: Assignee: (was: Apache Spark) > Refactor server side tests to only use DataFrame API

[jira] [Assigned] (SPARK-40926) Refactor server side tests to only use DataFrame API

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40926: Assignee: Apache Spark > Refactor server side tests to only use DataFrame API > -

[jira] [Assigned] (SPARK-40925) Fix late record filtering to support chaining of steteful operators

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40925: Assignee: (was: Apache Spark) > Fix late record filtering to support chaining of stet

[jira] [Assigned] (SPARK-40925) Fix late record filtering to support chaining of steteful operators

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40925: Assignee: Apache Spark > Fix late record filtering to support chaining of steteful operat

[jira] [Commented] (SPARK-40925) Fix late record filtering to support chaining of steteful operators

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624775#comment-17624775 ] Apache Spark commented on SPARK-40925: -- User 'alex-balikov' has created a pull requ

[jira] [Updated] (SPARK-40927) Memory issue with Structured streaming

2022-10-26 Thread Mihir Kelkar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mihir Kelkar updated SPARK-40927: - Description: In Pyspark Structured streaming with Kafka as source and sink, the driver as well

[jira] [Updated] (SPARK-40927) Memory issue with Structured streaming

2022-10-26 Thread Mihir Kelkar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mihir Kelkar updated SPARK-40927: - Description: In Pyspark Structured streaming with Kafka as source and sink, the driver as well

[jira] [Created] (SPARK-40927) Memory issue with Structured streaming

2022-10-26 Thread Mihir Kelkar (Jira)
Mihir Kelkar created SPARK-40927: Summary: Memory issue with Structured streaming Key: SPARK-40927 URL: https://issues.apache.org/jira/browse/SPARK-40927 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-39405) NumPy input support in PySpark SQL

2022-10-26 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39405: - Summary: NumPy input support in PySpark SQL (was: NumPy input support in PySpark) > NumPy inpu

[jira] [Updated] (SPARK-39405) NumPy input support in PySpark

2022-10-26 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39405: - Summary: NumPy input support in PySpark (was: NumPy input support in PySpark SQL) > NumPy inpu

[jira] [Updated] (SPARK-39405) NumPy input support in PySpark SQL

2022-10-26 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39405: - Summary: NumPy input support in PySpark SQL (was: NumPy support in SQL) > NumPy input support i

[jira] [Created] (SPARK-40926) Refactor server side tests to only use DataFrame API

2022-10-26 Thread Rui Wang (Jira)
Rui Wang created SPARK-40926: Summary: Refactor server side tests to only use DataFrame API Key: SPARK-40926 URL: https://issues.apache.org/jira/browse/SPARK-40926 Project: Spark Issue Type: Sub-

[jira] [Updated] (SPARK-38697) Extend SparkSessionExtensions to inject rules into AQE Optimizer

2022-10-26 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-38697: -- Fix Version/s: 3.2.3 3.3.2 > Extend SparkSessionExtensions to inject rules

[jira] [Commented] (SPARK-40920) SVD: matrix U has wrong row order

2022-10-26 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624714#comment-17624714 ] Sean R. Owen commented on SPARK-40920: -- So, first, to reproduce the problem more re

[jira] [Updated] (SPARK-40821) Fix late record filtering to support chaining of stateful operators

2022-10-26 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-40821: - Description: Currently chaining of stateful operators is Spark Structured Streaming is not supp

[jira] [Commented] (SPARK-38697) Extend SparkSessionExtensions to inject rules into AQE Optimizer

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624678#comment-17624678 ] Apache Spark commented on SPARK-38697: -- User 'andygrove' has created a pull request

[jira] [Commented] (SPARK-38697) Extend SparkSessionExtensions to inject rules into AQE Optimizer

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624649#comment-17624649 ] Apache Spark commented on SPARK-38697: -- User 'andygrove' has created a pull request

[jira] [Commented] (SPARK-38697) Extend SparkSessionExtensions to inject rules into AQE Optimizer

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624648#comment-17624648 ] Apache Spark commented on SPARK-38697: -- User 'andygrove' has created a pull request

[jira] [Assigned] (SPARK-40924) Unhex function works incorrectly when input has uneven number of symbols

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40924: Assignee: (was: Apache Spark) > Unhex function works incorrectly when input has uneve

[jira] [Assigned] (SPARK-40924) Unhex function works incorrectly when input has uneven number of symbols

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40924: Assignee: Apache Spark > Unhex function works incorrectly when input has uneven number of

[jira] [Commented] (SPARK-40924) Unhex function works incorrectly when input has uneven number of symbols

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624647#comment-17624647 ] Apache Spark commented on SPARK-40924: -- User 'vitaliili-db' has created a pull requ

[jira] [Updated] (SPARK-40821) Fix late record filtering to support chaining of stateful operators

2022-10-26 Thread Alex Balikov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Balikov updated SPARK-40821: - Summary: Fix late record filtering to support chaining of stateful operators (was: [SQL][CORE][

[jira] [Created] (SPARK-40925) Fix late record filtering to support chaining of steteful operators

2022-10-26 Thread Alex Balikov (Jira)
Alex Balikov created SPARK-40925: Summary: Fix late record filtering to support chaining of steteful operators Key: SPARK-40925 URL: https://issues.apache.org/jira/browse/SPARK-40925 Project: Spark

[jira] [Updated] (SPARK-40821) [SQL][CORE][SS]Fix late record filtering to support chaining of stateful operators

2022-10-26 Thread Alex Balikov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Balikov updated SPARK-40821: - Summary: [SQL][CORE][SS]Fix late record filtering to support chaining of stateful operators (wa

[jira] [Created] (SPARK-40924) Unhex function works incorrectly when input has uneven number of symbols

2022-10-26 Thread Vitalii Li (Jira)
Vitalii Li created SPARK-40924: -- Summary: Unhex function works incorrectly when input has uneven number of symbols Key: SPARK-40924 URL: https://issues.apache.org/jira/browse/SPARK-40924 Project: Spark

[jira] [Resolved] (SPARK-40767) Compile Spark example module will hang

2022-10-26 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-40767. -- Resolution: Not A Problem > Compile Spark example module will hang > -

[jira] [Resolved] (SPARK-40814) Exception in thread "main" java.lang.NoClassDefFoundError: io/fabric8/kubernetes/client/KubernetesClient

2022-10-26 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-40814. -- Resolution: Not A Problem This seems to be a problem with the user Dockerfile, not Spark > Ex

[jira] [Updated] (SPARK-40923) QueryStageExec canonical plan won't support columnar

2022-10-26 Thread Filipe Oliveira (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Filipe Oliveira updated SPARK-40923: Description: https://issues.apache.org/jira/browse/SPARK-34168 changed QueryStageExec can

[jira] [Created] (SPARK-40923) QueryStageExec canonical plan won't support columnar

2022-10-26 Thread Filipe Oliveira (Jira)
Filipe Oliveira created SPARK-40923: --- Summary: QueryStageExec canonical plan won't support columnar Key: SPARK-40923 URL: https://issues.apache.org/jira/browse/SPARK-40923 Project: Spark Is

[jira] [Commented] (SPARK-40921) Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO command

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624559#comment-17624559 ] Apache Spark commented on SPARK-40921: -- User 'johanl-db' has created a pull request

[jira] [Commented] (SPARK-40921) Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO command

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624557#comment-17624557 ] Apache Spark commented on SPARK-40921: -- User 'johanl-db' has created a pull request

[jira] [Assigned] (SPARK-40921) Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO command

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40921: Assignee: Apache Spark > Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO command > --

[jira] [Assigned] (SPARK-40921) Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO command

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40921: Assignee: (was: Apache Spark) > Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO c

[jira] [Resolved] (SPARK-40916) udf could not filter null value cause npe

2022-10-26 Thread jingxiong zhong (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jingxiong zhong resolved SPARK-40916. - Resolution: Fixed add --conf spark.sql.subexpressionElimination.enabled=false > udf co

[jira] [Commented] (SPARK-34210) Cannot create a record reader because of a previous error when spark accesses the hive on HBase table

2022-10-26 Thread Mehul Thakkar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624490#comment-17624490 ] Mehul Thakkar commented on SPARK-34210: --- I tried 3.0.1 and even the latest 3.2.2 a

[jira] (SPARK-40922) pyspark.pandas.read_csv supports reading multiple files, but that is undocumented

2022-10-26 Thread Stefaan Lippens (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40922 ] Stefaan Lippens deleted comment on SPARK-40922: - was (Author: soxofaan): I created initial PR at https://github.com/apache/spark/pull/38399 > pyspark.pandas.read_csv supports reading mul

[jira] [Commented] (SPARK-40922) pyspark.pandas.read_csv supports reading multiple files, but that is undocumented

2022-10-26 Thread Stefaan Lippens (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624485#comment-17624485 ] Stefaan Lippens commented on SPARK-40922: - I created initial PR at https://githu

[jira] [Commented] (SPARK-40922) pyspark.pandas.read_csv supports reading multiple files, but that is undocumented

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624483#comment-17624483 ] Apache Spark commented on SPARK-40922: -- User 'soxofaan' has created a pull request

[jira] [Assigned] (SPARK-40922) pyspark.pandas.read_csv supports reading multiple files, but that is undocumented

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40922: Assignee: (was: Apache Spark) > pyspark.pandas.read_csv supports reading multiple fil

[jira] [Commented] (SPARK-40922) pyspark.pandas.read_csv supports reading multiple files, but that is undocumented

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624482#comment-17624482 ] Apache Spark commented on SPARK-40922: -- User 'soxofaan' has created a pull request

[jira] [Assigned] (SPARK-40922) pyspark.pandas.read_csv supports reading multiple files, but that is undocumented

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40922: Assignee: Apache Spark > pyspark.pandas.read_csv supports reading multiple files, but tha

[jira] [Updated] (SPARK-40921) Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO command

2022-10-26 Thread Johan Lasperas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johan Lasperas updated SPARK-40921: --- Target Version/s: (was: 3.4.0) > Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO comma

[jira] [Updated] (SPARK-40922) pyspark.pandas.read_csv supports reading multiple files, but that is undocumented

2022-10-26 Thread Stefaan Lippens (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefaan Lippens updated SPARK-40922: Priority: Minor (was: Major) > pyspark.pandas.read_csv supports reading multiple files, b

[jira] [Created] (SPARK-40922) pyspark.pandas.read_csv supports reading multiple files, but that is undocumented

2022-10-26 Thread Stefaan Lippens (Jira)
Stefaan Lippens created SPARK-40922: --- Summary: pyspark.pandas.read_csv supports reading multiple files, but that is undocumented Key: SPARK-40922 URL: https://issues.apache.org/jira/browse/SPARK-40922

[jira] [Created] (SPARK-40921) Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO command

2022-10-26 Thread Johan Lasperas (Jira)
Johan Lasperas created SPARK-40921: -- Summary: Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO command Key: SPARK-40921 URL: https://issues.apache.org/jira/browse/SPARK-40921 Project: Spark

[jira] [Commented] (SPARK-36392) pandas fixed width file support

2022-10-26 Thread John Ayoub (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624468#comment-17624468 ] John Ayoub commented on SPARK-36392: [~itholic] [~hyukjin.kwon] hello, any update on

[jira] [Assigned] (SPARK-39778) Improve error messages: step 3

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39778: Assignee: Apache Spark (was: Max Gekk) > Improve error messages: step 3 > --

[jira] [Commented] (SPARK-39778) Improve error messages: step 3

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624466#comment-17624466 ] Apache Spark commented on SPARK-39778: -- User 'MaxGekk' has created a pull request f

[jira] [Assigned] (SPARK-39778) Improve error messages: step 3

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-39778: Assignee: Max Gekk (was: Apache Spark) > Improve error messages: step 3 > --

[jira] [Commented] (SPARK-40918) Mismatch between ParquetFileFormat and FileSourceScanExec in # columns for WSCG.isTooManyFields when using _metadata

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624445#comment-17624445 ] Apache Spark commented on SPARK-40918: -- User 'juliuszsompolski' has created a pull

[jira] [Assigned] (SPARK-40918) Mismatch between ParquetFileFormat and FileSourceScanExec in # columns for WSCG.isTooManyFields when using _metadata

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40918: Assignee: (was: Apache Spark) > Mismatch between ParquetFileFormat and FileSourceScan

[jira] [Commented] (SPARK-40918) Mismatch between ParquetFileFormat and FileSourceScanExec in # columns for WSCG.isTooManyFields when using _metadata

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624443#comment-17624443 ] Apache Spark commented on SPARK-40918: -- User 'juliuszsompolski' has created a pull

[jira] [Assigned] (SPARK-40918) Mismatch between ParquetFileFormat and FileSourceScanExec in # columns for WSCG.isTooManyFields when using _metadata

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40918: Assignee: Apache Spark > Mismatch between ParquetFileFormat and FileSourceScanExec in # c

[jira] [Updated] (SPARK-40920) SVD: matrix U has wrong row order

2022-10-26 Thread Leonard Papenmeier (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Leonard Papenmeier updated SPARK-40920: --- Summary: SVD: matrix U has wrong row order (was: SVD: matrix U has wrong column ord

[jira] [Commented] (SPARK-40919) Bad case of `AnalysisTest#assertAnalysisErrorClass` when `expectedMessageParameters.size between [2, 4]`

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624404#comment-17624404 ] Apache Spark commented on SPARK-40919: -- User 'LuciferYang' has created a pull reque

[jira] [Commented] (SPARK-40919) Bad case of `AnalysisTest#assertAnalysisErrorClass` when `expectedMessageParameters.size between [2, 4]`

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624402#comment-17624402 ] Apache Spark commented on SPARK-40919: -- User 'LuciferYang' has created a pull reque

[jira] [Assigned] (SPARK-40919) Bad case of `AnalysisTest#assertAnalysisErrorClass` when `expectedMessageParameters.size between [2, 4]`

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40919: Assignee: (was: Apache Spark) > Bad case of `AnalysisTest#assertAnalysisErrorClass` w

[jira] [Assigned] (SPARK-40919) Bad case of `AnalysisTest#assertAnalysisErrorClass` when `expectedMessageParameters.size between [2, 4]`

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40919: Assignee: Apache Spark > Bad case of `AnalysisTest#assertAnalysisErrorClass` when > `exp

[jira] [Updated] (SPARK-40920) SVD: matrix U has wrong column order

2022-10-26 Thread Leonard Papenmeier (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Leonard Papenmeier updated SPARK-40920: --- Attachment: image-2022-10-26-13-59-13-425.png > SVD: matrix U has wrong column order

[jira] [Updated] (SPARK-40920) SVD: matrix U has wrong column order

2022-10-26 Thread Leonard Papenmeier (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Leonard Papenmeier updated SPARK-40920: --- Description: When performing SVD on a RowMatrix, the matrix U has the wrong row orde

[jira] [Updated] (SPARK-40920) SVD: matrix U has wrong column order

2022-10-26 Thread Leonard Papenmeier (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Leonard Papenmeier updated SPARK-40920: --- Attachment: image-2022-10-26-13-59-04-608.png > SVD: matrix U has wrong column order

[jira] [Updated] (SPARK-40920) SVD: matrix U has wrong column order

2022-10-26 Thread Leonard Papenmeier (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Leonard Papenmeier updated SPARK-40920: --- Attachment: image-2022-10-26-13-58-52-998.png > SVD: matrix U has wrong column order

[jira] [Created] (SPARK-40920) SVD: matrix U has wrong column order

2022-10-26 Thread Leonard Papenmeier (Jira)
Leonard Papenmeier created SPARK-40920: -- Summary: SVD: matrix U has wrong column order Key: SPARK-40920 URL: https://issues.apache.org/jira/browse/SPARK-40920 Project: Spark Issue Type:

[jira] [Updated] (SPARK-40919) Bad case of `AnalysisTest#assertAnalysisErrorClass` when `expectedMessageParameters.size between [2, 4]`

2022-10-26 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-40919: - Description: If the size of the input parameter `expectedMessageParameters` of the `AnalysisTest#assert

[jira] [Created] (SPARK-40919) Bad case of `AnalysisTest#assertAnalysisErrorClass` when `expectedMessageParameters.size between [2, 4]`

2022-10-26 Thread Yang Jie (Jira)
Yang Jie created SPARK-40919: Summary: Bad case of `AnalysisTest#assertAnalysisErrorClass` when `expectedMessageParameters.size between [2, 4]` Key: SPARK-40919 URL: https://issues.apache.org/jira/browse/SPARK-40919

[jira] [Created] (SPARK-40918) Mismatch between ParquetFileFormat and FileSourceScanExec in # columns for WSCG.isTooManyFields when using _metadata

2022-10-26 Thread Juliusz Sompolski (Jira)
Juliusz Sompolski created SPARK-40918: - Summary: Mismatch between ParquetFileFormat and FileSourceScanExec in # columns for WSCG.isTooManyFields when using _metadata Key: SPARK-40918 URL: https://issues.apache

[jira] [Assigned] (SPARK-40917) Add a dedicated logical plan for `Summary`

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40917: Assignee: Apache Spark > Add a dedicated logical plan for `Summary` > ---

[jira] [Commented] (SPARK-40917) Add a dedicated logical plan for `Summary`

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17624368#comment-17624368 ] Apache Spark commented on SPARK-40917: -- User 'zhengruifeng' has created a pull requ

[jira] [Assigned] (SPARK-40917) Add a dedicated logical plan for `Summary`

2022-10-26 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40917: Assignee: (was: Apache Spark) > Add a dedicated logical plan for `Summary` >

[jira] [Created] (SPARK-40917) Add a dedicated logical plan for `Summary`

2022-10-26 Thread Ruifeng Zheng (Jira)
Ruifeng Zheng created SPARK-40917: - Summary: Add a dedicated logical plan for `Summary` Key: SPARK-40917 URL: https://issues.apache.org/jira/browse/SPARK-40917 Project: Spark Issue Type: Sub-

[jira] [Updated] (SPARK-40880) Reimplement `summary` with dataframe operations

2022-10-26 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng updated SPARK-40880: -- Parent: SPARK-39375 Issue Type: Sub-task (was: Improvement) > Reimplement `summary` w

  1   2   >