[jira] [Updated] (SPARK-36255) FileNotFound exceptions from the shuffle push can cause the executor to terminate

2021-07-21 Thread Chandni Singh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chandni Singh updated SPARK-36255: -- Summary: FileNotFound exceptions from the shuffle push can cause the executor to terminate

[jira] [Updated] (SPARK-36255) FileNotFound exceptions in the Shuffle-push-thread can cause the executor to fail

2021-07-21 Thread Chandni Singh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chandni Singh updated SPARK-36255: -- Description: When the shuffle files are cleaned up by the executors once a job in a Spark

[jira] [Updated] (SPARK-36255) FileNotFound exceptions in the Shuffle-push-thread can cause the executor to fail

2021-07-21 Thread Chandni Singh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chandni Singh updated SPARK-36255: -- Description: When the shuffle files are cleaned up by the executors once a job in a Spark

[jira] [Resolved] (SPARK-36214) Add add_categories to CategoricalAccessor and CategoricalIndex.

2021-07-21 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-36214. --- Fix Version/s: 3.2.0 Assignee: Takuya Ueshin Resolution: Fixed Issue

[jira] [Updated] (SPARK-36255) FileNotFound exceptions in the Shuffle-push-thread can cause the executor to fail

2021-07-21 Thread Chandni Singh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chandni Singh updated SPARK-36255: -- Parent: SPARK-30602 Issue Type: Sub-task (was: Bug) > FileNotFound exceptions in the

[jira] [Created] (SPARK-36255) FileNotFound exceptions in the Shuffle-push-thread can cause the executor to fail

2021-07-21 Thread Chandni Singh (Jira)
Chandni Singh created SPARK-36255: - Summary: FileNotFound exceptions in the Shuffle-push-thread can cause the executor to fail Key: SPARK-36255 URL: https://issues.apache.org/jira/browse/SPARK-36255

[jira] [Updated] (SPARK-36253) Document added version of pandas-on-Spark support

2021-07-21 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-36253: - Fix Version/s: 3.2.0 > Document added version of pandas-on-Spark support >

[jira] [Resolved] (SPARK-36253) Document added version of pandas-on-Spark support

2021-07-21 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-36253. -- Resolution: Fixed > Document added version of pandas-on-Spark support >

[jira] [Commented] (SPARK-36253) Document added version of pandas-on-Spark support

2021-07-21 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385263#comment-17385263 ] Hyukjin Kwon commented on SPARK-36253: -- Fixed in https://github.com/apache/spark/pull/33473 >

[jira] [Created] (SPARK-36254) Install mlflow and delta in Github Actions CI

2021-07-21 Thread Haejoon Lee (Jira)
Haejoon Lee created SPARK-36254: --- Summary: Install mlflow and delta in Github Actions CI Key: SPARK-36254 URL: https://issues.apache.org/jira/browse/SPARK-36254 Project: Spark Issue Type:

[jira] [Commented] (SPARK-36253) Document added version of pandas-on-Spark support

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385245#comment-17385245 ] Apache Spark commented on SPARK-36253: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-36253) Document added version of pandas-on-Spark support

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36253: Assignee: Hyukjin Kwon (was: Apache Spark) > Document added version of pandas-on-Spark

[jira] [Assigned] (SPARK-36253) Document added version of pandas-on-Spark support

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36253: Assignee: Apache Spark (was: Hyukjin Kwon) > Document added version of pandas-on-Spark

[jira] [Commented] (SPARK-36253) Document added version of pandas-on-Spark support

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385243#comment-17385243 ] Apache Spark commented on SPARK-36253: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Created] (SPARK-36253) Document added version of pandas-on-Spark support

2021-07-21 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-36253: Summary: Document added version of pandas-on-Spark support Key: SPARK-36253 URL: https://issues.apache.org/jira/browse/SPARK-36253 Project: Spark Issue

[jira] [Updated] (SPARK-36252) Add log files rolling policy for driver running in cluster mode with spark standalone cluster

2021-07-21 Thread Jack Hu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jack Hu updated SPARK-36252: Description: For a long running driver in cluster mode, there is no rolling policy, the log

[jira] [Created] (SPARK-36252) Add log files rolling policy for driver running in cluster mode with spark standalone cluster

2021-07-21 Thread Jack Hu (Jira)
Jack Hu created SPARK-36252: --- Summary: Add log files rolling policy for driver running in cluster mode with spark standalone cluster Key: SPARK-36252 URL: https://issues.apache.org/jira/browse/SPARK-36252

[jira] [Resolved] (SPARK-36063) Optimize OneRowRelation subqueries

2021-07-21 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-36063. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 33284

[jira] [Assigned] (SPARK-36063) Optimize OneRowRelation subqueries

2021-07-21 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-36063: --- Assignee: Allison Wang > Optimize OneRowRelation subqueries >

[jira] [Resolved] (SPARK-36244) Upgrade zstd-jni to 1.5.0-3 to avoid a bug about buffer size calculation

2021-07-21 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-36244. --- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 33464

[jira] [Resolved] (SPARK-35912) [SQL] JSON read behavior is different depending on the cache setting when nullable is false.

2021-07-21 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-35912. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 33436

[jira] [Assigned] (SPARK-35912) [SQL] JSON read behavior is different depending on the cache setting when nullable is false.

2021-07-21 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-35912: Assignee: Fu Chen > [SQL] JSON read behavior is different depending on the cache setting

[jira] [Commented] (SPARK-32666) Install ipython and nbsphinx in Jenkins for Binder integration

2021-07-21 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385176#comment-17385176 ] Hyukjin Kwon commented on SPARK-32666: -- Thanks [~shaneknapp]!!! > Install ipython and nbsphinx in

[jira] [Commented] (SPARK-36251) Cover GitHub Actions runs without SHA in testing script

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385174#comment-17385174 ] Apache Spark commented on SPARK-36251: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-36251) Cover GitHub Actions runs without SHA in testing script

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36251: Assignee: (was: Apache Spark) > Cover GitHub Actions runs without SHA in testing

[jira] [Commented] (SPARK-36251) Cover GitHub Actions runs without SHA in testing script

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385173#comment-17385173 ] Apache Spark commented on SPARK-36251: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-36251) Cover GitHub Actions runs without SHA in testing script

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36251: Assignee: Apache Spark > Cover GitHub Actions runs without SHA in testing script >

[jira] [Created] (SPARK-36251) Cover GitHub Actions runs without SHA in testing script

2021-07-21 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-36251: Summary: Cover GitHub Actions runs without SHA in testing script Key: SPARK-36251 URL: https://issues.apache.org/jira/browse/SPARK-36251 Project: Spark

[jira] [Created] (SPARK-36250) Add support for running make-distribution without a "clean"

2021-07-21 Thread Holden Karau (Jira)
Holden Karau created SPARK-36250: Summary: Add support for running make-distribution without a "clean" Key: SPARK-36250 URL: https://issues.apache.org/jira/browse/SPARK-36250 Project: Spark

[jira] [Assigned] (SPARK-36248) Add rename_categories to CategoricalAccessor and CategoricalIndex

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36248: Assignee: Apache Spark > Add rename_categories to CategoricalAccessor and

[jira] [Commented] (SPARK-36248) Add rename_categories to CategoricalAccessor and CategoricalIndex

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385161#comment-17385161 ] Apache Spark commented on SPARK-36248: -- User 'xinrong-databricks' has created a pull request for

[jira] [Assigned] (SPARK-36248) Add rename_categories to CategoricalAccessor and CategoricalIndex

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36248: Assignee: (was: Apache Spark) > Add rename_categories to CategoricalAccessor and

[jira] [Resolved] (SPARK-33242) Install numpydoc in Jenkins machines

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane Knapp resolved SPARK-33242. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 33469

[jira] [Resolved] (SPARK-32391) Install pydata_sphinx_theme in Jenkins machines

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane Knapp resolved SPARK-32391. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 33469

[jira] [Resolved] (SPARK-32666) Install ipython and nbsphinx in Jenkins for Binder integration

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane Knapp resolved SPARK-32666. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 33469

[jira] [Resolved] (SPARK-32797) Install mypy on the Jenkins CI workers

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane Knapp resolved SPARK-32797. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 33469

[jira] [Commented] (SPARK-32666) Install ipython and nbsphinx in Jenkins for Binder integration

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385132#comment-17385132 ] Apache Spark commented on SPARK-32666: -- User 'shaneknapp' has created a pull request for this

[jira] [Commented] (SPARK-32666) Install ipython and nbsphinx in Jenkins for Binder integration

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385133#comment-17385133 ] Apache Spark commented on SPARK-32666: -- User 'shaneknapp' has created a pull request for this

[jira] [Assigned] (SPARK-32666) Install ipython and nbsphinx in Jenkins for Binder integration

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32666: Assignee: Shane Knapp (was: Apache Spark) > Install ipython and nbsphinx in Jenkins for

[jira] [Commented] (SPARK-32666) Install ipython and nbsphinx in Jenkins for Binder integration

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385131#comment-17385131 ] Apache Spark commented on SPARK-32666: -- User 'shaneknapp' has created a pull request for this

[jira] [Commented] (SPARK-33242) Install numpydoc in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385130#comment-17385130 ] Apache Spark commented on SPARK-33242: -- User 'shaneknapp' has created a pull request for this

[jira] [Assigned] (SPARK-32666) Install ipython and nbsphinx in Jenkins for Binder integration

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32666: Assignee: Apache Spark (was: Shane Knapp) > Install ipython and nbsphinx in Jenkins for

[jira] [Assigned] (SPARK-32666) Install ipython and nbsphinx in Jenkins for Binder integration

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32666: Assignee: Shane Knapp (was: Apache Spark) > Install ipython and nbsphinx in Jenkins for

[jira] [Commented] (SPARK-33242) Install numpydoc in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385129#comment-17385129 ] Apache Spark commented on SPARK-33242: -- User 'shaneknapp' has created a pull request for this

[jira] [Assigned] (SPARK-33242) Install numpydoc in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-33242: Assignee: Shane Knapp (was: Apache Spark) > Install numpydoc in Jenkins machines >

[jira] [Commented] (SPARK-33242) Install numpydoc in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385128#comment-17385128 ] Apache Spark commented on SPARK-33242: -- User 'shaneknapp' has created a pull request for this

[jira] [Assigned] (SPARK-33242) Install numpydoc in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-33242: Assignee: Apache Spark (was: Shane Knapp) > Install numpydoc in Jenkins machines >

[jira] [Commented] (SPARK-32391) Install pydata_sphinx_theme in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385127#comment-17385127 ] Apache Spark commented on SPARK-32391: -- User 'shaneknapp' has created a pull request for this

[jira] [Assigned] (SPARK-32391) Install pydata_sphinx_theme in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32391: Assignee: Apache Spark (was: Shane Knapp) > Install pydata_sphinx_theme in Jenkins

[jira] [Assigned] (SPARK-32391) Install pydata_sphinx_theme in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32391: Assignee: Shane Knapp (was: Apache Spark) > Install pydata_sphinx_theme in Jenkins

[jira] [Commented] (SPARK-32797) Install mypy on the Jenkins CI workers

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385124#comment-17385124 ] Apache Spark commented on SPARK-32797: -- User 'shaneknapp' has created a pull request for this

[jira] [Commented] (SPARK-32391) Install pydata_sphinx_theme in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385125#comment-17385125 ] Apache Spark commented on SPARK-32391: -- User 'shaneknapp' has created a pull request for this

[jira] [Commented] (SPARK-32391) Install pydata_sphinx_theme in Jenkins machines

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385126#comment-17385126 ] Apache Spark commented on SPARK-32391: -- User 'shaneknapp' has created a pull request for this

[jira] [Assigned] (SPARK-32797) Install mypy on the Jenkins CI workers

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32797: Assignee: Apache Spark (was: Shane Knapp) > Install mypy on the Jenkins CI workers >

[jira] [Assigned] (SPARK-32797) Install mypy on the Jenkins CI workers

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32797: Assignee: Shane Knapp (was: Apache Spark) > Install mypy on the Jenkins CI workers >

[jira] [Commented] (SPARK-32797) Install mypy on the Jenkins CI workers

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385123#comment-17385123 ] Apache Spark commented on SPARK-32797: -- User 'shaneknapp' has created a pull request for this

[jira] [Commented] (SPARK-31162) Provide Configuration Parameter to select/enforce the Hive Hash for Bucketing

2021-07-21 Thread Ashish Singh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385071#comment-17385071 ] Ashish Singh commented on SPARK-31162: -- This is needed for reasons other than supporting hive

[jira] [Commented] (SPARK-36249) Add remove_categories to CategoricalAccessor and CategoricalIndex

2021-07-21 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385069#comment-17385069 ] Takuya Ueshin commented on SPARK-36249: --- I'm working on this. > Add remove_categories to

[jira] [Created] (SPARK-36249) Add remove_categories to CategoricalAccessor and CategoricalIndex

2021-07-21 Thread Takuya Ueshin (Jira)
Takuya Ueshin created SPARK-36249: - Summary: Add remove_categories to CategoricalAccessor and CategoricalIndex Key: SPARK-36249 URL: https://issues.apache.org/jira/browse/SPARK-36249 Project: Spark

[jira] [Commented] (SPARK-36214) Add add_categories to CategoricalAccessor and CategoricalIndex.

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385067#comment-17385067 ] Apache Spark commented on SPARK-36214: -- User 'ueshin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-36214) Add add_categories to CategoricalAccessor and CategoricalIndex.

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36214: Assignee: (was: Apache Spark) > Add add_categories to CategoricalAccessor and

[jira] [Assigned] (SPARK-36214) Add add_categories to CategoricalAccessor and CategoricalIndex.

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36214: Assignee: Apache Spark > Add add_categories to CategoricalAccessor and CategoricalIndex.

[jira] [Assigned] (SPARK-35546) Enable push-based shuffle when multiple app attempts are enabled and manage concurrent access to the state in a better way

2021-07-21 Thread Mridul Muralidharan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan reassigned SPARK-35546: --- Assignee: Ye Zhou > Enable push-based shuffle when multiple app attempts

[jira] [Commented] (SPARK-36248) Add rename_categories to CategoricalAccessor and CategoricalIndex

2021-07-21 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385063#comment-17385063 ] Xinrong Meng commented on SPARK-36248: -- I'm working on this. > Add rename_categories to

[jira] [Created] (SPARK-36248) Add rename_categories to CategoricalAccessor and CategoricalIndex

2021-07-21 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-36248: Summary: Add rename_categories to CategoricalAccessor and CategoricalIndex Key: SPARK-36248 URL: https://issues.apache.org/jira/browse/SPARK-36248 Project: Spark

[jira] [Resolved] (SPARK-36188) Add categories setter to CategoricalAccessor and CategoricalIndex.

2021-07-21 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-36188. --- Fix Version/s: 3.2.0 Assignee: Takuya Ueshin Resolution: Fixed Issue

[jira] [Assigned] (SPARK-36247) check string length for char/varchar in UPDATE/MERGE command

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36247: Assignee: (was: Apache Spark) > check string length for char/varchar in UPDATE/MERGE

[jira] [Assigned] (SPARK-36247) check string length for char/varchar in UPDATE/MERGE command

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36247: Assignee: Apache Spark > check string length for char/varchar in UPDATE/MERGE command >

[jira] [Commented] (SPARK-36247) check string length for char/varchar in UPDATE/MERGE command

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385045#comment-17385045 ] Apache Spark commented on SPARK-36247: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Updated] (SPARK-36142) Adjust exponentiation between Series with missing values and bool literal to follow pandas

2021-07-21 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36142: - Summary: Adjust exponentiation between Series with missing values and bool literal to follow

[jira] [Commented] (SPARK-32391) Install pydata_sphinx_theme in Jenkins machines

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385044#comment-17385044 ] Shane Knapp commented on SPARK-32391: - anyways, i installed this via conda and will roll out to all

[jira] [Created] (SPARK-36247) check string length for char/varchar in UPDATE/MERGE command

2021-07-21 Thread Wenchen Fan (Jira)
Wenchen Fan created SPARK-36247: --- Summary: check string length for char/varchar in UPDATE/MERGE command Key: SPARK-36247 URL: https://issues.apache.org/jira/browse/SPARK-36247 Project: Spark

[jira] [Commented] (SPARK-34930) Install PyArrow and pandas on Jenkins

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385040#comment-17385040 ] Shane Knapp commented on SPARK-34930: - oh yeah, a LOT of those skipped tests are for pypy3, not

[jira] [Commented] (SPARK-32797) Install mypy on the Jenkins CI workers

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385038#comment-17385038 ] Shane Knapp commented on SPARK-32797: - ill roll this out (and other python package updates) later

[jira] [Assigned] (SPARK-36246) WorkerDecommissionExtendedSuite flakes with GHA

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36246: Assignee: Holden Karau (was: Apache Spark) > WorkerDecommissionExtendedSuite flakes

[jira] [Commented] (SPARK-36246) WorkerDecommissionExtendedSuite flakes with GHA

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385035#comment-17385035 ] Apache Spark commented on SPARK-36246: -- User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-36246) WorkerDecommissionExtendedSuite flakes with GHA

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36246: Assignee: Apache Spark (was: Holden Karau) > WorkerDecommissionExtendedSuite flakes

[jira] [Resolved] (SPARK-29183) Upgrade JDK 11 Installation to 11.0.6

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane Knapp resolved SPARK-29183. - Resolution: Fixed this is done and all java11 installs are at 11.0.10 > Upgrade JDK 11

[jira] [Commented] (SPARK-34930) Install PyArrow and pandas on Jenkins

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385030#comment-17385030 ] Shane Knapp commented on SPARK-34930: - pandas is installed, so i'm a little curious as to why the

[jira] [Commented] (SPARK-32391) Install pydata_sphinx_theme in Jenkins machines

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385026#comment-17385026 ] Shane Knapp commented on SPARK-32391: - [~hyukjin.kwon] i am able to install this via conda... any

[jira] [Commented] (SPARK-32666) Install ipython and nbsphinx in Jenkins for Binder integration

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385021#comment-17385021 ] Shane Knapp commented on SPARK-32666: - ill roll this out (and other python package updates) later

[jira] [Commented] (SPARK-33242) Install numpydoc in Jenkins machines

2021-07-21 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385022#comment-17385022 ] Shane Knapp commented on SPARK-33242: - ill roll this out (and other python package updates) later

[jira] [Created] (SPARK-36246) WorkerDecommissionExtendedSuite flakes with GHA

2021-07-21 Thread Holden Karau (Jira)
Holden Karau created SPARK-36246: Summary: WorkerDecommissionExtendedSuite flakes with GHA Key: SPARK-36246 URL: https://issues.apache.org/jira/browse/SPARK-36246 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-36143) Adjust astype of Series with missing values to follow pandas

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36143: Assignee: (was: Apache Spark) > Adjust astype of Series with missing values to

[jira] [Assigned] (SPARK-36143) Adjust astype of Series with missing values to follow pandas

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36143: Assignee: Apache Spark > Adjust astype of Series with missing values to follow pandas >

[jira] [Commented] (SPARK-36143) Adjust astype of Series with missing values to follow pandas

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384999#comment-17384999 ] Apache Spark commented on SPARK-36143: -- User 'xinrong-databricks' has created a pull request for

[jira] [Resolved] (SPARK-36213) Normalize PartitionSpec for DescTable with PartitionSpec

2021-07-21 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao resolved SPARK-36213. -- Fix Version/s: 3.1.3 3.2.0 3.0.4 Resolution: Fixed Issue

[jira] [Assigned] (SPARK-36213) Normalize PartitionSpec for DescTable with PartitionSpec

2021-07-21 Thread Kent Yao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kent Yao reassigned SPARK-36213: Assignee: Kent Yao > Normalize PartitionSpec for DescTable with PartitionSpec >

[jira] [Resolved] (SPARK-36227) Remove TimestampNTZ type support in Spark 3.2

2021-07-21 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-36227. --- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 33444

[jira] [Updated] (SPARK-36143) Adjust astype of Series with missing values to follow pandas

2021-07-21 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36143: - Description: {code:java} >>> pser = pd.Series([1, 2, np.nan], dtype=float) >>> psser =

[jira] [Commented] (SPARK-33865) When HiveDDL, we need check avro schema too like parquet & orc

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384951#comment-17384951 ] Apache Spark commented on SPARK-33865: -- User 'AngersZh' has created a pull request for this

[jira] [Updated] (SPARK-36243) pyspark catalog.tableExists doesn't work for temporary views

2021-07-21 Thread Dominik Gehl (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dominik Gehl updated SPARK-36243: - Component/s: (was: Java API) PySpark Description: Documentation in

[jira] [Assigned] (SPARK-36245) Deduplicate the right side of left semi/anti join

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36245: Assignee: Apache Spark > Deduplicate the right side of left semi/anti join >

[jira] [Assigned] (SPARK-36245) Deduplicate the right side of left semi/anti join

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36245: Assignee: (was: Apache Spark) > Deduplicate the right side of left semi/anti join >

[jira] [Commented] (SPARK-36245) Deduplicate the right side of left semi/anti join

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384939#comment-17384939 ] Apache Spark commented on SPARK-36245: -- User 'wangyum' has created a pull request for this issue:

[jira] [Commented] (SPARK-25075) Build and test Spark against Scala 2.13

2021-07-21 Thread Eric Richardson (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384934#comment-17384934 ] Eric Richardson commented on SPARK-25075: - Great news that you will have 2.12 and 2.13

[jira] [Created] (SPARK-36245) Deduplicate the right side of left semi/anti join

2021-07-21 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-36245: --- Summary: Deduplicate the right side of left semi/anti join Key: SPARK-36245 URL: https://issues.apache.org/jira/browse/SPARK-36245 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-28266) data duplication when `path` serde property is present

2021-07-21 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-28266: --- Assignee: Shardul Mahadik > data duplication when `path` serde property is present >

[jira] [Resolved] (SPARK-28266) data duplication when `path` serde property is present

2021-07-21 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-28266. - Fix Version/s: 3.1.3 3.2.0 3.0.4 Resolution: Fixed

[jira] [Commented] (SPARK-36244) Upgrade zstd-jni to 1.5.0-3 to avoid a bug about buffer size calculation

2021-07-21 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384923#comment-17384923 ] Apache Spark commented on SPARK-36244: -- User 'sarutak' has created a pull request for this issue:
