[jira] [Created] (SPARK-41150) Document PySpark memory profiler

2022-11-15 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-41150: Summary: Document PySpark memory profiler Key: SPARK-41150 URL: https://issues.apache.org/jira/browse/SPARK-41150 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-39405) NumPy Input Support in PySpark

2022-11-11 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39405: - Description: NumPy is the fundamental package for scientific computing with Python. It is very

[jira] [Updated] (SPARK-39405) NumPy input support in PySpark

2022-11-11 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39405: - Summary: NumPy input support in PySpark (was: NumPy input support in PySpark SQL) > NumPy

[jira] [Updated] (SPARK-39405) NumPy Input Support in PySpark

2022-11-11 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39405: - Summary: NumPy Input Support in PySpark (was: NumPy input support in PySpark) > NumPy Input

[jira] [Updated] (SPARK-41107) Install memory-profiler in the CI

2022-11-10 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-41107: - Description: PySpark memory profiler depends on

[jira] [Updated] (SPARK-41107) Install memory-profiler in the CI

2022-11-10 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-41107: - Description: PySpark memory profiler depends on

[jira] [Updated] (SPARK-41107) Install memory-profiler in the CI

2022-11-10 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-41107: - Description: PySpark memory profiler depends on

[jira] [Updated] (SPARK-41107) Install memory-profiler in the CI

2022-11-10 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-41107: - Summary: Install memory-profiler in the CI (was: Install memory-profiler in CI) > Install

[jira] [Updated] (SPARK-41107) Install memory-profiler in CI

2022-11-10 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-41107: - Summary: Install memory-profiler in CI (was: Install the Memory Profiler in CI) > Install

[jira] [Updated] (SPARK-41107) Install the Memory Profiler in CI

2022-11-10 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-41107: - Component/s: Tests > Install the Memory Profiler in CI > - > >

[jira] [Created] (SPARK-41107) Install the Memory Profiler in CI

2022-11-10 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-41107: Summary: Install the Memory Profiler in CI Key: SPARK-41107 URL: https://issues.apache.org/jira/browse/SPARK-41107 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-40281) Memory Profiler on Executors

2022-11-10 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40281: - Description: The ticket proposes to implement PySpark memory profiling on executors. See more

[jira] [Updated] (SPARK-40281) Memory Profiler on Executors

2022-11-09 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40281: - Description: The ticket proposes to implement PySpark memory profiling on executors. See more

[jira] [Updated] (SPARK-40281) Memory Profiler on Executors

2022-11-09 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40281: - Description: The ticket proposes to implement PySpark memory profiling on executors. See more

[jira] [Updated] (SPARK-40281) Memory Profiler on Executors

2022-11-09 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40281: - Description: The ticket proposes to implement a PySpark memory profiling on executors. See

[jira] [Updated] (SPARK-40281) Memory Profiler on Executors

2022-11-09 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40281: - Description: Profiling is critical to performance engineering. Memory consumption is a key

[jira] [Commented] (SPARK-40281) Memory Profiler on Executors

2022-11-07 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630120#comment-17630120 ] Xinrong Meng commented on SPARK-40281: -- Thanks [~alfiewdavidson] for the feedback! I am currently

[jira] [Comment Edited] (SPARK-39405) NumPy input support in PySpark SQL

2022-11-02 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627984#comment-17627984 ] Xinrong Meng edited comment on SPARK-39405 at 11/2/22 9:10 PM: --- Hi

[jira] [Commented] (SPARK-37697) Make it easier to convert numpy arrays to Spark Dataframes

2022-11-02 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627985#comment-17627985 ] Xinrong Meng commented on SPARK-37697: -- The commit is in. > Make it easier to convert numpy arrays

[jira] [Commented] (SPARK-40990) DataFrame creation from 2d NumPy array with arbitrary columns

2022-11-02 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627983#comment-17627983 ] Xinrong Meng commented on SPARK-40990: -- Hi [~douglas.mo...@databricks.com] Any size of the 2d

[jira] [Commented] (SPARK-39405) NumPy input support in PySpark SQL

2022-11-02 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627984#comment-17627984 ] Xinrong Meng commented on SPARK-39405: -- Hi [~douglas.mo...@databricks.com] the commit is in. >

[jira] [Commented] (SPARK-39405) NumPy input support in PySpark SQL

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627359#comment-17627359 ] Xinrong Meng commented on SPARK-39405: -- Thanks [~douglas.mo...@databricks.com] , your queries

[jira] [Commented] (SPARK-37697) Make it easier to convert numpy arrays to Spark Dataframes

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627357#comment-17627357 ] Xinrong Meng commented on SPARK-37697: -- Thanks [~douglas.mo...@databricks.com] , your queries

[jira] [Updated] (SPARK-40990) DataFrame creation from 2d NumPy array with arbitrary columns

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40990: - Summary: DataFrame creation from 2d NumPy array with arbitrary columns (was: Complete support

[jira] [Updated] (SPARK-40990) Complete support for DataFrame creation from 2d NumPy array

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40990: - Description: Currently, DataFrame creation from 2d ndarray works only with 2 columns. We

[jira] [Updated] (SPARK-40990) Complete support for DataFrame creation from 2d NumPy array

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40990: - Summary: Complete support for DataFrame creation from 2d NumPy array (was: Support DataFrame

[jira] [Updated] (SPARK-40990) Support DataFrame creation from ndarray with >2 columns

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40990: - Description: Currently, DataFrame creation from ndarray works only with <= 2 columns. We

[jira] [Updated] (SPARK-40990) Support DataFrame creation from ndarray with >2 columns

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40990: - Description: Currently, DataFrame creation from ndarray works only with <= 2 columns. We

[jira] [Updated] (SPARK-40990) Support DataFrame creation from ndarray with >2 columns

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40990: - Description: Currently, DataFrame creation from ndarray works only with <= 2 columns. We

[jira] [Commented] (SPARK-40990) Support DataFrame creation from ndarray with >2 columns

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627337#comment-17627337 ] Xinrong Meng commented on SPARK-40990: -- I am working on that. > Support DataFrame creation from

[jira] [Commented] (SPARK-39405) NumPy input support in PySpark SQL

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627336#comment-17627336 ] Xinrong Meng commented on SPARK-39405: -- Hi [~douglas.mo...@databricks.com] thanks for the bug

[jira] [Assigned] (SPARK-40990) Support DataFrame creation from ndarray with >2 columns

2022-11-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reassigned SPARK-40990: Assignee: (was: Xinrong Meng) > Support DataFrame creation from ndarray with >2

[jira] [Created] (SPARK-40990) Support DataFrame creation from ndarray with >2 columns

2022-11-01 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40990: Summary: Support DataFrame creation from ndarray with >2 columns Key: SPARK-40990 URL: https://issues.apache.org/jira/browse/SPARK-40990 Project: Spark

[jira] [Commented] (SPARK-37697) Make it easier to convert numpy arrays to Spark Dataframes

2022-10-31 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17626850#comment-17626850 ] Xinrong Meng commented on SPARK-37697: -- Hi, we have NumPy input support 

[jira] [Commented] (SPARK-6857) Python SQL schema inference should support numpy types

2022-10-31 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-6857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17626847#comment-17626847 ] Xinrong Meng commented on SPARK-6857: - Hi, we have NumPy input support

[jira] [Commented] (SPARK-31776) Literal lit() supports lists and numpy arrays

2022-10-31 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17626844#comment-17626844 ] Xinrong Meng commented on SPARK-31776: -- `lit` supports Python list and NumPy arrays in

[jira] [Updated] (SPARK-39405) NumPy input support in PySpark SQL

2022-10-26 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39405: - Summary: NumPy input support in PySpark SQL (was: NumPy input support in PySpark) > NumPy

[jira] [Updated] (SPARK-39405) NumPy input support in PySpark

2022-10-26 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39405: - Summary: NumPy input support in PySpark (was: NumPy input support in PySpark SQL) > NumPy

[jira] [Updated] (SPARK-39405) NumPy input support in PySpark SQL

2022-10-26 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39405: - Summary: NumPy input support in PySpark SQL (was: NumPy support in SQL) > NumPy input support

[jira] [Resolved] (SPARK-39199) Implement pandas API missing parameters

2022-10-10 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-39199. -- Resolution: Resolved > Implement pandas API missing parameters >

[jira] [Updated] (SPARK-40281) Memory Profiler on Executors

2022-10-06 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40281: - Description: Profiling is critical to performance engineering. Memory consumption is a key

[jira] [Updated] (SPARK-40281) Memory Profiler on Executors

2022-10-06 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40281: - Issue Type: New Feature (was: Umbrella) > Memory Profiler on Executors >

[jira] [Commented] (SPARK-40598) Fix plotting features work properly with pandas 1.5.0.

2022-10-06 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613843#comment-17613843 ] Xinrong Meng commented on SPARK-40598: -- Resolved by [https://github.com/apache/spark/pull/38033].

[jira] [Assigned] (SPARK-40598) Fix plotting features work properly with pandas 1.5.0.

2022-10-06 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reassigned SPARK-40598: Assignee: Haejoon Lee > Fix plotting features work properly with pandas 1.5.0. >

[jira] [Resolved] (SPARK-40598) Fix plotting features work properly with pandas 1.5.0.

2022-10-06 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-40598. -- Resolution: Resolved > Fix plotting features work properly with pandas 1.5.0. >

[jira] [Updated] (SPARK-40281) Memory Profiler on Executors

2022-10-04 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40281: - Description: Profiling is critical to performance engineering. Memory consumption is a key

[jira] [Reopened] (SPARK-39494) Support `createDataFrame` from a list of scalars when schema is not provided

2022-09-19 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reopened SPARK-39494: -- > Support `createDataFrame` from a list of scalars when schema is not provided >

[jira] [Resolved] (SPARK-39494) Support `createDataFrame` from a list of scalars when schema is not provided

2022-09-19 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-39494. -- Resolution: Won't Do > Support `createDataFrame` from a list of scalars when schema is not

[jira] [Resolved] (SPARK-40084) Upgrade Py4J from 0.10.9.5 to 0.10.9.7

2022-09-19 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-40084. -- Resolution: Resolved > Upgrade Py4J from 0.10.9.5 to 0.10.9.7 >

[jira] [Commented] (SPARK-40084) Upgrade Py4J from 0.10.9.5 to 0.10.9.7

2022-09-19 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17606813#comment-17606813 ] Xinrong Meng commented on SPARK-40084: -- Resolved by https://github.com/apache/spark/pull/37523. >

[jira] [Assigned] (SPARK-40084) Upgrade Py4J from 0.10.9.5 to 0.10.9.7

2022-09-19 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reassigned SPARK-40084: Assignee: BingKun Pan > Upgrade Py4J from 0.10.9.5 to 0.10.9.7 >

[jira] [Resolved] (SPARK-39405) NumPy support in SQL

2022-09-19 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-39405. -- Resolution: Resolved > NumPy support in SQL > > > Key:

[jira] [Assigned] (SPARK-39405) NumPy support in SQL

2022-09-19 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reassigned SPARK-39405: Assignee: Xinrong Meng > NumPy support in SQL > > >

[jira] [Resolved] (SPARK-39745) Accept a list that contains NumPy scalars in `createDataFrame`

2022-09-19 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-39745. -- Resolution: Won't Do > Accept a list that contains NumPy scalars in `createDataFrame` >

[jira] [Updated] (SPARK-40196) Consolidate `lit` function with NumPy scalar in sql and pandas module

2022-09-14 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40196: - Description: Per [https://github.com/apache/spark/pull/37560#discussion_r952882996,] function

[jira] [Updated] (SPARK-40196) Consolidate `lit` function with NumPy scalar in sql and pandas module

2022-09-14 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40196: - Summary: Consolidate `lit` function with NumPy scalar in sql and pandas module (was:

[jira] [Deleted] (SPARK-40309) Introduce sql_conf context manager for pyspark.sql

2022-09-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng deleted SPARK-40309: - > Introduce sql_conf context manager for pyspark.sql >

[jira] [Resolved] (SPARK-40131) Support NumPy ndarray in built-in functions

2022-09-12 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-40131. -- Resolution: Resolved > Support NumPy ndarray in built-in functions >

[jira] [Commented] (SPARK-40131) Support NumPy ndarray in built-in functions

2022-09-12 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17603201#comment-17603201 ] Xinrong Meng commented on SPARK-40131: -- Resolved by [https://github.com/apache/spark/pull/37635.]

[jira] [Assigned] (SPARK-40131) Support NumPy ndarray in built-in functions

2022-09-12 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reassigned SPARK-40131: Assignee: Xinrong Meng > Support NumPy ndarray in built-in functions >

[jira] [Updated] (SPARK-40309) Introduce sql_conf context manager for pyspark.sql

2022-09-02 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40309: - Description: That would simplify the control of Spark SQL configuration as below from

[jira] [Updated] (SPARK-40309) Introduce sql_conf context manager for pyspark.sql

2022-09-02 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40309: - Description:

[jira] [Updated] (SPARK-40309) Introduce sql_conf context manager for pyspark.sql

2022-09-01 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40309: - Description: [https://github.com/apache/spark/blob/master/python/pyspark/pandas/utils.py#L490]

[jira] [Created] (SPARK-40309) Introduce sql_conf context manager for pyspark.sql

2022-09-01 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40309: Summary: Introduce sql_conf context manager for pyspark.sql Key: SPARK-40309 URL: https://issues.apache.org/jira/browse/SPARK-40309 Project: Spark Issue

[jira] [Created] (SPARK-40307) Optimize (De)Serialization of Python UDF

2022-09-01 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40307: Summary: Optimize (De)Serialization of Python UDF Key: SPARK-40307 URL: https://issues.apache.org/jira/browse/SPARK-40307 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-40281) Memory Profiler on Executors

2022-08-30 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reassigned SPARK-40281: Assignee: (was: Xinrong Meng) > Memory Profiler on Executors >

[jira] [Created] (SPARK-40281) Memory Profiler on Executors

2022-08-30 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40281: Summary: Memory Profiler on Executors Key: SPARK-40281 URL: https://issues.apache.org/jira/browse/SPARK-40281 Project: Spark Issue Type: Umbrella

[jira] [Assigned] (SPARK-40281) Memory Profiler on Executors

2022-08-30 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reassigned SPARK-40281: Assignee: Xinrong Meng > Memory Profiler on Executors > > >

[jira] [Updated] (SPARK-40131) Support NumPy ndarray in built-in functions

2022-08-25 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40131: - Description: Support NumPy ndarray in built-in functions(`pyspark.sql.functions`) by

[jira] [Resolved] (SPARK-40130) Support NumPy scalars in built-in functions

2022-08-25 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-40130. -- Assignee: Xinrong Meng Resolution: Fixed > Support NumPy scalars in built-in functions

[jira] [Assigned] (SPARK-39483) Construct the schema from `np.dtype` when `createDataFrame` from a NumPy array

2022-08-25 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reassigned SPARK-39483: Assignee: Xinrong Meng > Construct the schema from `np.dtype` when `createDataFrame`

[jira] [Commented] (SPARK-40130) Support NumPy scalars in built-in functions

2022-08-25 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17584975#comment-17584975 ] Xinrong Meng commented on SPARK-40130: -- Resolved by https://github.com/apache/spark/pull/37560 >

[jira] [Updated] (SPARK-40196) Consolidate `lit` function with NumPy input in sql and pandas module

2022-08-23 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-40196: - Description: Per [https://github.com/apache/spark/pull/37560#discussion_r952882996,] function

[jira] [Created] (SPARK-40196) Consolidate `lit` function with NumPy input in sql and pandas module

2022-08-23 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40196: Summary: Consolidate `lit` function with NumPy input in sql and pandas module Key: SPARK-40196 URL: https://issues.apache.org/jira/browse/SPARK-40196 Project: Spark

[jira] [Created] (SPARK-40131) Support NumPy ndarray in built-in functions

2022-08-17 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40131: Summary: Support NumPy ndarray in built-in functions Key: SPARK-40131 URL: https://issues.apache.org/jira/browse/SPARK-40131 Project: Spark Issue Type:

[jira] [Created] (SPARK-40130) Support NumPy scalars in built-in functions

2022-08-17 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40130: Summary: Support NumPy scalars in built-in functions Key: SPARK-40130 URL: https://issues.apache.org/jira/browse/SPARK-40130 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-40090) Upgrade to Py4J 0.10.9.7

2022-08-15 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-40090. -- Resolution: Duplicate > Upgrade to Py4J 0.10.9.7 > > >

[jira] [Created] (SPARK-40090) Upgrade to Py4J 0.10.9.7

2022-08-15 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-40090: Summary: Upgrade to Py4J 0.10.9.7 Key: SPARK-40090 URL: https://issues.apache.org/jira/browse/SPARK-40090 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-39986) Better example for Co-grouped Map

2022-08-04 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-39986: Summary: Better example for Co-grouped Map Key: SPARK-39986 URL: https://issues.apache.org/jira/browse/SPARK-39986 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-39822) Provides a good error during create Index with different dtype elements

2022-07-21 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39822: - Parent: SPARK-39581 Issue Type: Sub-task (was: Bug) > Provides a good error during

[jira] [Commented] (SPARK-39794) Introduce parametric singleton for DataType

2022-07-18 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568139#comment-17568139 ] Xinrong Meng commented on SPARK-39794: -- I am working on that. > Introduce parametric singleton for

[jira] [Updated] (SPARK-39794) Introduce parametric singleton for DataType

2022-07-15 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39794: - Description: As per

[jira] [Created] (SPARK-39794) Introduce parametric singleton for DataType

2022-07-15 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-39794: Summary: Introduce parametric singleton for DataType Key: SPARK-39794 URL: https://issues.apache.org/jira/browse/SPARK-39794 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-39761) Add Apache Spark images info in running-on-kubernetes doc

2022-07-14 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-39761. -- Fix Version/s: 3.4.0 Resolution: Fixed > Add Apache Spark images info in

[jira] [Assigned] (SPARK-39761) Add Apache Spark images info in running-on-kubernetes doc

2022-07-14 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reassigned SPARK-39761: Assignee: Yikun Jiang > Add Apache Spark images info in running-on-kubernetes doc >

[jira] [Commented] (SPARK-39761) Add Apache Spark images info in running-on-kubernetes doc

2022-07-14 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567037#comment-17567037 ] Xinrong Meng commented on SPARK-39761: -- Fixed in [https://github.com/apache/spark/pull/37174] >

[jira] [Commented] (SPARK-39732) pyspark.pandas.DataFrame.drop drops dataframe if axis not specified

2022-07-14 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566928#comment-17566928 ] Xinrong Meng commented on SPARK-39732: -- Thanks [~itsmeandy] for raising that! Previously, the

[jira] [Commented] (SPARK-39732) pyspark.pandas.DataFrame.drop drops dataframe if axis not specified

2022-07-14 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566924#comment-17566924 ] Xinrong Meng commented on SPARK-39732: -- How about we match pandas behavior/results and utilize

[jira] [Created] (SPARK-39756) Better error messages for missing pandas scalars

2022-07-12 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-39756: Summary: Better error messages for missing pandas scalars Key: SPARK-39756 URL: https://issues.apache.org/jira/browse/SPARK-39756 Project: Spark Issue Type:

[jira] [Created] (SPARK-39745) Accept a list that contains NumPy scalars in `createDataFrame`

2022-07-11 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-39745: Summary: Accept a list that contains NumPy scalars in `createDataFrame` Key: SPARK-39745 URL: https://issues.apache.org/jira/browse/SPARK-39745 Project: Spark

[jira] [Updated] (SPARK-39494) Support `createDataFrame` from a list of scalars when schema is not provided

2022-07-11 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39494: - Description: Currently, DataFrame creation from a list of native Python scalars is unsupported

[jira] [Updated] (SPARK-39494) Support `createDataFrame` from a list of scalars when schema is not provided

2022-07-11 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39494: - Description: {{>>> spark.createDataFrame([1, 2]).collect()}} {{Traceback (most recent call

[jira] [Updated] (SPARK-39494) Support `createDataFrame` from a list of scalars when schema is not provided

2022-07-11 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39494: - Description: {{Currently, DataFrame creation from a list of native Python scalars is

[jira] [Updated] (SPARK-39494) Support `createDataFrame` from a list of scalars when schema is not provided

2022-07-08 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39494: - Description: {{Currently, DataFrame creation from a list of scalars is unsupported in PySpark,

[jira] [Updated] (SPARK-39494) Support `createDataFrame` from a list of scalars when schema is not provided

2022-07-08 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39494: - Description: Currently, DataFrame creation from a list of scalars is unsupported in PySpark,

[jira] [Updated] (SPARK-39494) Support `createDataFrame` from a list of scalars when schema is not provided

2022-07-08 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39494: - Summary: Support `createDataFrame` from a list of scalars when schema is not provided (was:

[jira] [Updated] (SPARK-38953) Document PySpark common exceptions / errors

2022-07-06 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-38953: - Description: There are common exceptions/errors in PySpark SQL, pandas API on Spark, and Py4J.

[jira] [Resolved] (SPARK-39076) Standardize Statistical Functions of pandas API on Spark

2022-07-06 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-39076. -- Resolution: Done > Standardize Statistical Functions of pandas API on Spark >

[jira] [Updated] (SPARK-39227) Reach parity with pandas boolean cast

2022-07-06 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-39227: - Parent: (was: SPARK-39076) Issue Type: Improvement (was: Sub-task) > Reach parity

[jira] [Created] (SPARK-39648) Fix type hints of `like`, `rlike`, `ilike` of Column

2022-06-30 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-39648: Summary: Fix type hints of `like`, `rlike`, `ilike` of Column Key: SPARK-39648 URL: https://issues.apache.org/jira/browse/SPARK-39648 Project: Spark Issue
