[jira] [Commented] (SPARK-44795) CodeGenCache should be ClassLoader specific

2023-08-15 Thread GridGain Integration (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754852#comment-17754852 ] GridGain Integration commented on SPARK-44795: -- User 'LuciferYang' has created a pull

[jira] [Created] (SPARK-44824) There is content overlap in `ammoniteOut` used in ReplE2ESuite.

2023-08-15 Thread Yang Jie (Jira)
Yang Jie created SPARK-44824: Summary: There is content overlap in `ammoniteOut` used in ReplE2ESuite. Key: SPARK-44824 URL: https://issues.apache.org/jira/browse/SPARK-44824 Project: Spark

[jira] [Created] (SPARK-44823) Update black to 23.7.0 and fix erroneous check

2023-08-15 Thread BingKun Pan (Jira)
BingKun Pan created SPARK-44823: --- Summary: Update black to 23.7.0 and fix erroneous check Key: SPARK-44823 URL: https://issues.apache.org/jira/browse/SPARK-44823 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-44809) Remove unused custom metrics for RocksDB state store provider

2023-08-15 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-44809. -- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 42491

[jira] [Updated] (SPARK-44564) Refine the documents with LLM

2023-08-15 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng updated SPARK-44564: -- Attachment: docstr_prompt_only.py > Refine the documents with LLM >

[jira] [Updated] (SPARK-44564) Refine the documents with LLM

2023-08-15 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng updated SPARK-44564: -- Attachment: (was: docstr_prompt.py) > Refine the documents with LLM >

[jira] [Created] (SPARK-44822) Make Python UDTFs by default non-deterministic

2023-08-15 Thread Allison Wang (Jira)
Allison Wang created SPARK-44822: Summary: Make Python UDTFs by default non-deterministic Key: SPARK-44822 URL: https://issues.apache.org/jira/browse/SPARK-44822 Project: Spark Issue Type:

[jira] [Created] (SPARK-44821) Upgrade `kubernetes-client` to 6.8.1

2023-08-15 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-44821: - Summary: Upgrade `kubernetes-client` to 6.8.1 Key: SPARK-44821 URL: https://issues.apache.org/jira/browse/SPARK-44821 Project: Spark Issue Type:

[jira] [Updated] (SPARK-44820) Switch languages consistently across docs for all code snippets

2023-08-15 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44820: - Description: When a user chooses a different language for a code snippet, all code snippets on

[jira] [Updated] (SPARK-44819) Make Python the first language in all Spark code snippet

2023-08-15 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44819: - Attachment: Screenshot 2023-08-15 at 11.59.11.png > Make Python the first language in all Spark

[jira] [Updated] (SPARK-44819) Make Python the first language in all Spark code snippet

2023-08-15 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44819: - Description: Currently, the first and default language for all code snippets is Scala. For

[jira] [Updated] (SPARK-44819) Make Python the first language in all Spark code snippet

2023-08-15 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44819: - Description: Currently, the first and default language for all code snippets is Scala. We

[jira] [Created] (SPARK-44820) Switch languages consistently across docs for all code snippets

2023-08-15 Thread Allison Wang (Jira)
Allison Wang created SPARK-44820: Summary: Switch languages consistently across docs for all code snippets Key: SPARK-44820 URL: https://issues.apache.org/jira/browse/SPARK-44820 Project: Spark

[jira] [Created] (SPARK-44819) Make Python the first language in all Spark code snippet

2023-08-15 Thread Allison Wang (Jira)
Allison Wang created SPARK-44819: Summary: Make Python the first language in all Spark code snippet Key: SPARK-44819 URL: https://issues.apache.org/jira/browse/SPARK-44819 Project: Spark

[jira] [Commented] (SPARK-44818) Fix race for pending interrupt issued before taskThread is initialized

2023-08-15 Thread Anish Shrigondekar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754729#comment-17754729 ] Anish Shrigondekar commented on SPARK-44818: PR here -

[jira] [Created] (SPARK-44818) Fix race for pending interrupt issued before taskThread is initialized

2023-08-15 Thread Anish Shrigondekar (Jira)
Anish Shrigondekar created SPARK-44818: -- Summary: Fix race for pending interrupt issued before taskThread is initialized Key: SPARK-44818 URL: https://issues.apache.org/jira/browse/SPARK-44818

[jira] [Resolved] (SPARK-42664) Support bloomFilter for DataFrameStatFunctions

2023-08-15 Thread Herman van Hövell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell resolved SPARK-42664. --- Fix Version/s: 3.5.0 Assignee: Yang Jie Resolution: Fixed > Support

[jira] [Resolved] (SPARK-44794) Propagate ArtifactSet to stream execution thread

2023-08-15 Thread Herman van Hövell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hövell resolved SPARK-44794. --- Fix Version/s: 3.5.0 Resolution: Fixed > Propagate ArtifactSet to stream

[jira] [Resolved] (SPARK-44803) Replace `publish` with `publishOrSkip` in SparkBuild to eliminate warnings

2023-08-15 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-44803. --- Fix Version/s: 4.0.0 Assignee: BingKun Pan Resolution: Fixed > Replace

[jira] [Commented] (SPARK-44124) Upgrade AWS SDK to v2

2023-08-15 Thread Steve Loughran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754698#comment-17754698 ] Steve Loughran commented on SPARK-44124: +will need to make sure any classloaders set up to pass

[jira] [Commented] (SPARK-44817) Incremental Stats Collection

2023-08-15 Thread Rakesh Raushan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754694#comment-17754694 ] Rakesh Raushan commented on SPARK-44817: [~cloud_fan] [~gurwls223] [~maxgekk] What are your

[jira] [Updated] (SPARK-44817) Incremental Stats Collection

2023-08-15 Thread Rakesh Raushan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rakesh Raushan updated SPARK-44817: --- Description: Spark's Cost Based Optimizer is dependent on the table and column statistics.

[jira] [Created] (SPARK-44817) Incremental Stats Collection

2023-08-15 Thread Rakesh Raushan (Jira)
Rakesh Raushan created SPARK-44817: -- Summary: Incremental Stats Collection Key: SPARK-44817 URL: https://issues.apache.org/jira/browse/SPARK-44817 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-44806) Separate connect-client-jvm-internal

2023-08-15 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754684#comment-17754684 ] Hudson commented on SPARK-44806: User 'juliuszsompolski' has created a pull request for this issue:

[jira] [Created] (SPARK-44816) Cryptic error message when UDF associated class is not found

2023-08-15 Thread Niranjan Jayakar (Jira)
Niranjan Jayakar created SPARK-44816: Summary: Cryptic error message when UDF associated class is not found Key: SPARK-44816 URL: https://issues.apache.org/jira/browse/SPARK-44816 Project: Spark

[jira] [Updated] (SPARK-44803) Replace `publish` with `publishOrSkip` in SparkBuild to eliminate warnings

2023-08-15 Thread BingKun Pan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] BingKun Pan updated SPARK-44803: Summary: Replace `publish` with `publishOrSkip` in SparkBuild to eliminate warnings (was:

[jira] [Created] (SPARK-44815) Cache Schema of DF

2023-08-15 Thread Martin Grund (Jira)
Martin Grund created SPARK-44815: Summary: Cache Schema of DF Key: SPARK-44815 URL: https://issues.apache.org/jira/browse/SPARK-44815 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-44814) Test to trigger protobuf 4.23.3 crash

2023-08-15 Thread Martin Grund (Jira)
Martin Grund created SPARK-44814: Summary: Test to trigger protobuf 4.23.3 crash Key: SPARK-44814 URL: https://issues.apache.org/jira/browse/SPARK-44814 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-44718) High On-heap memory usage is detected while doing parquet-file reading with Off-Heap memory mode enabled on spark

2023-08-15 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-44718: --- Assignee: Zamil Majdy > High On-heap memory usage is detected while doing parquet-file

[jira] [Resolved] (SPARK-44718) High On-heap memory usage is detected while doing parquet-file reading with Off-Heap memory mode enabled on spark

2023-08-15 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-44718. - Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 42394

[jira] [Updated] (SPARK-44564) Refine the documents with LLM

2023-08-15 Thread Ruifeng Zheng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruifeng Zheng updated SPARK-44564: -- Description: Let's first focus on the Documents of *PySpark DataFrame APIs*. *1*, Choose a

[jira] [Commented] (SPARK-44782) Adjust Pull Request Template to incorporate the ASF Generative Tooling Guidance recommendations

2023-08-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754516#comment-17754516 ] ASF GitHub Bot commented on SPARK-44782: User 'zero323' has created a pull request for this

[jira] [Commented] (SPARK-44806) Separate connect-client-jvm-internal

2023-08-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754500#comment-17754500 ] ASF GitHub Bot commented on SPARK-44806: User 'juliuszsompolski' has created a pull request for

[jira] [Commented] (SPARK-44782) Adjust Pull Request Template to incorporate the ASF Generative Tooling Guidance recommendations

2023-08-15 Thread Maciej Szymkiewicz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754482#comment-17754482 ] Maciej Szymkiewicz commented on SPARK-44782: Created a pull request for this issue:

[jira] [Created] (SPARK-44813) The JIRA Python misses our assignee when it searches user again

2023-08-15 Thread Kent Yao (Jira)
Kent Yao created SPARK-44813: Summary: The JIRA Python misses our assignee when it searches user again Key: SPARK-44813 URL: https://issues.apache.org/jira/browse/SPARK-44813 Project: Spark

[jira] [Assigned] (SPARK-44801) SQL Page does not capture failed queries in analyzer

2023-08-15 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao reassigned SPARK-44801: Assignee: Kent Yao (was: Kent Yao 2) > SQL Page does not capture failed queries in analyzer >

[jira] [Assigned] (SPARK-44801) SQL Page does not capture failed queries in analyzer

2023-08-15 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao reassigned SPARK-44801: Assignee: (was: Kent Yao 2) > SQL Page does not capture failed queries in analyzer >

[jira] [Assigned] (SPARK-44801) SQL Page does not capture failed queries in analyzer

2023-08-15 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao reassigned SPARK-44801: Assignee: Kent Yao 2 > SQL Page does not capture failed queries in analyzer >

[jira] [Commented] (SPARK-44782) Adjust Pull Request Template to incorporate the ASF Generative Tooling Guidance recommendations

2023-08-15 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17754429#comment-17754429 ] Xiao Li commented on SPARK-44782: - +1 We should update the PR template. > Adjust Pull Request Template