[jira] [Created] (SPARK-48205) Remove the private[sql] modifier for Python data sources

2024-05-08 Thread Allison Wang (Jira)
Allison Wang created SPARK-48205: Summary: Remove the private[sql] modifier for Python data sources Key: SPARK-48205 URL: https://issues.apache.org/jira/browse/SPARK-48205 Project: Spark

[jira] [Created] (SPARK-48064) Improve error messages for routine related errors

2024-04-30 Thread Allison Wang (Jira)
Allison Wang created SPARK-48064: Summary: Improve error messages for routine related errors Key: SPARK-48064 URL: https://issues.apache.org/jira/browse/SPARK-48064 Project: Spark Issue

[jira] [Created] (SPARK-48014) Change the makeFromJava error in EvaluatePython to a user-facing error

2024-04-26 Thread Allison Wang (Jira)
Allison Wang created SPARK-48014: Summary: Change the makeFromJava error in EvaluatePython to a user-facing error Key: SPARK-48014 URL: https://issues.apache.org/jira/browse/SPARK-48014 Project:

[jira] [Created] (SPARK-47921) Fix ExecuteJobTag creation in ExecuteHolder

2024-04-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-47921: Summary: Fix ExecuteJobTag creation in ExecuteHolder Key: SPARK-47921 URL: https://issues.apache.org/jira/browse/SPARK-47921 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-47367) Support Python data source API with Spark Connect

2024-03-12 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-47367: - Summary: Support Python data source API with Spark Connect (was: Support Python data source

[jira] [Updated] (SPARK-47367) Support Python data source API in Spark Connect

2024-03-12 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-47367: - Summary: Support Python data source API in Spark Connect (was: Support Python data source API

[jira] [Created] (SPARK-47367) Support Python data source API with Spark Connect

2024-03-12 Thread Allison Wang (Jira)
Allison Wang created SPARK-47367: Summary: Support Python data source API with Spark Connect Key: SPARK-47367 URL: https://issues.apache.org/jira/browse/SPARK-47367 Project: Spark Issue

[jira] [Created] (SPARK-47346) Make daemon mode configurable when creating Python workers

2024-03-11 Thread Allison Wang (Jira)
Allison Wang created SPARK-47346: Summary: Make daemon mode configurable when creating Python workers Key: SPARK-47346 URL: https://issues.apache.org/jira/browse/SPARK-47346 Project: Spark

[jira] [Updated] (SPARK-46973) Skip V2 table lookup when a table is in V1 table cache

2024-03-02 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-46973: - Description: Improve v2 table lookup performance when a table is already in the v1 table cache.

[jira] [Updated] (SPARK-46973) Skip V2 table lookup when a table is in V1 table cache

2024-03-02 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-46973: - Summary: Skip V2 table lookup when a table is in V1 table cache (was: Add table cache for V2

[jira] [Created] (SPARK-46973) Add table cache for V2 tables

2024-02-04 Thread Allison Wang (Jira)
Allison Wang created SPARK-46973: Summary: Add table cache for V2 tables Key: SPARK-46973 URL: https://issues.apache.org/jira/browse/SPARK-46973 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-46818) Improve error messages for range with non-foldable input

2024-01-23 Thread Allison Wang (Jira)
Allison Wang created SPARK-46818: Summary: Improve error messages for range with non-foldable input Key: SPARK-46818 URL: https://issues.apache.org/jira/browse/SPARK-46818 Project: Spark

[jira] [Created] (SPARK-46618) Improve error messages for DATA_SOURCE_NOT_FOUND error

2024-01-08 Thread Allison Wang (Jira)
Allison Wang created SPARK-46618: Summary: Improve error messages for DATA_SOURCE_NOT_FOUND error Key: SPARK-46618 URL: https://issues.apache.org/jira/browse/SPARK-46618 Project: Spark Issue

[jira] [Created] (SPARK-46616) Disallow re-registration of statically registered data sources

2024-01-07 Thread Allison Wang (Jira)
Allison Wang created SPARK-46616: Summary: Disallow re-registration of statically registered data sources Key: SPARK-46616 URL: https://issues.apache.org/jira/browse/SPARK-46616 Project: Spark

[jira] [Created] (SPARK-46568) Python data source options should be a case insensitive dictionary

2024-01-02 Thread Allison Wang (Jira)
Allison Wang created SPARK-46568: Summary: Python data source options should be a case insensitive dictionary Key: SPARK-46568 URL: https://issues.apache.org/jira/browse/SPARK-46568 Project: Spark

[jira] [Created] (SPARK-46565) Improve Python data source error classes and messages

2024-01-02 Thread Allison Wang (Jira)
Allison Wang created SPARK-46565: Summary: Improve Python data source error classes and messages Key: SPARK-46565 URL: https://issues.apache.org/jira/browse/SPARK-46565 Project: Spark Issue

[jira] [Updated] (SPARK-46540) Respect column names when Python data source read function outputs named Row objects

2023-12-28 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-46540: - Summary: Respect column names when Python data source read function outputs named Row objects

[jira] [Created] (SPARK-46540) Respects named arguments when Python data source read function outputs Row objects

2023-12-28 Thread Allison Wang (Jira)
Allison Wang created SPARK-46540: Summary: Respects named arguments when Python data source read function outputs Row objects Key: SPARK-46540 URL: https://issues.apache.org/jira/browse/SPARK-46540

[jira] [Updated] (SPARK-46540) Respect named arguments when Python data source read function outputs Row objects

2023-12-28 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-46540: - Summary: Respect named arguments when Python data source read function outputs Row objects

[jira] [Created] (SPARK-46522) Block Python data source registration with name conflicts

2023-12-26 Thread Allison Wang (Jira)
Allison Wang created SPARK-46522: Summary: Block Python data source registration with name conflicts Key: SPARK-46522 URL: https://issues.apache.org/jira/browse/SPARK-46522 Project: Spark

[jira] [Created] (SPARK-46520) Support overwrite mode for Python data source write

2023-12-26 Thread Allison Wang (Jira)
Allison Wang created SPARK-46520: Summary: Support overwrite mode for Python data source write Key: SPARK-46520 URL: https://issues.apache.org/jira/browse/SPARK-46520 Project: Spark Issue

[jira] [Created] (SPARK-46452) Add a new API in DSv2 DataWriter to write an iterator of records

2023-12-18 Thread Allison Wang (Jira)
Allison Wang created SPARK-46452: Summary: Add a new API in DSv2 DataWriter to write an iterator of records Key: SPARK-46452 URL: https://issues.apache.org/jira/browse/SPARK-46452 Project: Spark

[jira] [Created] (SPARK-46375) Add documentation for Python data source API

2023-12-11 Thread Allison Wang (Jira)
Allison Wang created SPARK-46375: Summary: Add documentation for Python data source API Key: SPARK-46375 URL: https://issues.apache.org/jira/browse/SPARK-46375 Project: Spark Issue Type:

[jira] [Created] (SPARK-46290) Change saveMode to overwrite for DataSourceWriter constructor

2023-12-06 Thread Allison Wang (Jira)
Allison Wang created SPARK-46290: Summary: Change saveMode to overwrite for DataSourceWriter constructor Key: SPARK-46290 URL: https://issues.apache.org/jira/browse/SPARK-46290 Project: Spark

[jira] [Created] (SPARK-46273) Support INSERT INTO/OVERWRITE using DSv2 sources

2023-12-05 Thread Allison Wang (Jira)
Allison Wang created SPARK-46273: Summary: Support INSERT INTO/OVERWRITE using DSv2 sources Key: SPARK-46273 URL: https://issues.apache.org/jira/browse/SPARK-46273 Project: Spark Issue Type:

[jira] [Created] (SPARK-46272) Support CTAS using DSv2 sources

2023-12-05 Thread Allison Wang (Jira)
Allison Wang created SPARK-46272: Summary: Support CTAS using DSv2 sources Key: SPARK-46272 URL: https://issues.apache.org/jira/browse/SPARK-46272 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-46253) Plan Python data source read using mapInArrow

2023-12-04 Thread Allison Wang (Jira)
Allison Wang created SPARK-46253: Summary: Plan Python data source read using mapInArrow Key: SPARK-46253 URL: https://issues.apache.org/jira/browse/SPARK-46253 Project: Spark Issue Type:

[jira] [Updated] (SPARK-46057) Support SQL user-defined functions

2023-11-22 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-46057: - Description: This is an umbrella ticket to support SQL user-defined functions. (was: This is

[jira] [Created] (SPARK-46057) Support SQL user-defined functions

2023-11-22 Thread Allison Wang (Jira)
Allison Wang created SPARK-46057: Summary: Support SQL user-defined functions Key: SPARK-46057 URL: https://issues.apache.org/jira/browse/SPARK-46057 Project: Spark Issue Type: Umbrella

[jira] [Created] (SPARK-46043) Support create table using DSv2 sources

2023-11-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-46043: Summary: Support create table using DSv2 sources Key: SPARK-46043 URL: https://issues.apache.org/jira/browse/SPARK-46043 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-46013) Improve basic datasource examples

2023-11-20 Thread Allison Wang (Jira)
Allison Wang created SPARK-46013: Summary: Improve basic datasource examples Key: SPARK-46013 URL: https://issues.apache.org/jira/browse/SPARK-46013 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45940) Add InputPartition to DataSourceReader interface

2023-11-15 Thread Allison Wang (Jira)
Allison Wang created SPARK-45940: Summary: Add InputPartition to DataSourceReader interface Key: SPARK-45940 URL: https://issues.apache.org/jira/browse/SPARK-45940 Project: Spark Issue Type:

[jira] [Commented] (SPARK-45861) Add user guide for dataframe creation

2023-11-15 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17786487#comment-17786487 ] Allison Wang commented on SPARK-45861: -- [~panbingkun] again, thanks for working on this. Let me

[jira] [Created] (SPARK-45931) Refine docstring of `mapInPandas`

2023-11-14 Thread Allison Wang (Jira)
Allison Wang created SPARK-45931: Summary: Refine docstring of `mapInPandas` Key: SPARK-45931 URL: https://issues.apache.org/jira/browse/SPARK-45931 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45930) Allow non-deterministic Python UDFs in MapInPandas/MapInArrow

2023-11-14 Thread Allison Wang (Jira)
Allison Wang created SPARK-45930: Summary: Allow non-deterministic Python UDFs in MapInPandas/MapInArrow Key: SPARK-45930 URL: https://issues.apache.org/jira/browse/SPARK-45930 Project: Spark

[jira] [Updated] (SPARK-45927) Update `path` handling in Python data source

2023-11-14 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45927: - Summary: Update `path` handling in Python data source (was: Remove `path` from data source

[jira] [Created] (SPARK-45927) Remove `path` from data source constructor

2023-11-14 Thread Allison Wang (Jira)
Allison Wang created SPARK-45927: Summary: Remove `path` from data source constructor Key: SPARK-45927 URL: https://issues.apache.org/jira/browse/SPARK-45927 Project: Spark Issue Type:

[jira] [Created] (SPARK-45914) Support `commit` and `abort` API for Python data source write

2023-11-13 Thread Allison Wang (Jira)
Allison Wang created SPARK-45914: Summary: Support `commit` and `abort` API for Python data source write Key: SPARK-45914 URL: https://issues.apache.org/jira/browse/SPARK-45914 Project: Spark

[jira] [Updated] (SPARK-45525) Initial support for Python data source write API

2023-11-13 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45525: - Description: Add a new command and logical rules (similar to V1Writes and V2Writes) to support

[jira] [Updated] (SPARK-45600) Make Python data source registration session level

2023-11-09 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45600: - Description: Currently, registered data sources are stored in `sharedState` and can be accessed

[jira] [Updated] (SPARK-45600) Make data source registration session level

2023-11-09 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45600: - Summary: Make data source registration session level (was: Separate the Python data source

[jira] [Updated] (SPARK-45600) Make Python data source registration session level

2023-11-09 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45600: - Summary: Make Python data source registration session level (was: Make data source

[jira] [Created] (SPARK-45865) Add user guide for window operations

2023-11-09 Thread Allison Wang (Jira)
Allison Wang created SPARK-45865: Summary: Add user guide for window operations Key: SPARK-45865 URL: https://issues.apache.org/jira/browse/SPARK-45865 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45864) Add user guide for groupby and aggregate

2023-11-09 Thread Allison Wang (Jira)
Allison Wang created SPARK-45864: Summary: Add user guide for groupby and aggregate Key: SPARK-45864 URL: https://issues.apache.org/jira/browse/SPARK-45864 Project: Spark Issue Type:

[jira] [Created] (SPARK-45863) Add user guide for column selections

2023-11-09 Thread Allison Wang (Jira)
Allison Wang created SPARK-45863: Summary: Add user guide for column selections Key: SPARK-45863 URL: https://issues.apache.org/jira/browse/SPARK-45863 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45862) Add user guide for basic dataframe operations

2023-11-09 Thread Allison Wang (Jira)
Allison Wang created SPARK-45862: Summary: Add user guide for basic dataframe operations Key: SPARK-45862 URL: https://issues.apache.org/jira/browse/SPARK-45862 Project: Spark Issue Type:

[jira] [Created] (SPARK-45861) Add user guide for dataframe creation

2023-11-09 Thread Allison Wang (Jira)
Allison Wang created SPARK-45861: Summary: Add user guide for dataframe creation Key: SPARK-45861 URL: https://issues.apache.org/jira/browse/SPARK-45861 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45783) Improve exception message when no remote url is set

2023-11-03 Thread Allison Wang (Jira)
Allison Wang created SPARK-45783: Summary: Improve exception message when no remote url is set Key: SPARK-45783 URL: https://issues.apache.org/jira/browse/SPARK-45783 Project: Spark Issue

[jira] [Created] (SPARK-45773) Refine docstring of `SparkSession.builder.config`

2023-11-02 Thread Allison Wang (Jira)
Allison Wang created SPARK-45773: Summary: Refine docstring of `SparkSession.builder.config` Key: SPARK-45773 URL: https://issues.apache.org/jira/browse/SPARK-45773 Project: Spark Issue

[jira] [Updated] (SPARK-45765) Improve error messages when loading multiple paths in PySpark

2023-11-01 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45765: - Description: Currently, the error message is super confusing when a user tries to load

[jira] [Resolved] (SPARK-45765) Improve error messages when loading multiple paths in PySpark

2023-11-01 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang resolved SPARK-45765. -- Resolution: Invalid > Improve error messages when loading multiple paths in PySpark >

[jira] [Created] (SPARK-45765) Improve error messages when loading multiple paths in PySpark

2023-11-01 Thread Allison Wang (Jira)
Allison Wang created SPARK-45765: Summary: Improve error messages when loading multiple paths in PySpark Key: SPARK-45765 URL: https://issues.apache.org/jira/browse/SPARK-45765 Project: Spark

[jira] [Updated] (SPARK-45764) Make code block copyable

2023-11-01 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45764: - Description: We should consider adding a copy button next to the pyspark code blocks. For

[jira] [Commented] (SPARK-45764) Make code block copyable

2023-11-01 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17781887#comment-17781887 ] Allison Wang commented on SPARK-45764: -- cc [~podongfeng] WDYT? > Make code block copyable >

[jira] [Created] (SPARK-45764) Make code block copyable

2023-11-01 Thread Allison Wang (Jira)
Allison Wang created SPARK-45764: Summary: Make code block copyable Key: SPARK-45764 URL: https://issues.apache.org/jira/browse/SPARK-45764 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-45713) Support registering Python data sources

2023-10-27 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45713: - Description: Support registering Python data sources. Users can register a Python data source

[jira] [Updated] (SPARK-45713) Support registering Python data sources

2023-10-27 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45713: - Description: Support registering Python data sources. Users can register a Python data source

[jira] [Created] (SPARK-45713) Support registering Python data sources

2023-10-27 Thread Allison Wang (Jira)
Allison Wang created SPARK-45713: Summary: Support registering Python data sources Key: SPARK-45713 URL: https://issues.apache.org/jira/browse/SPARK-45713 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-45639) Support loading Python data sources in DataFrameReader

2023-10-27 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45639: - Description: Allow users to read from a Python data source using

[jira] [Updated] (SPARK-45639) Support loading Python data sources in DataFrameReader

2023-10-27 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45639: - Summary: Support loading Python data sources in DataFrameReader (was: Support Python data

[jira] [Created] (SPARK-45654) Add Python data source write API

2023-10-24 Thread Allison Wang (Jira)
Allison Wang created SPARK-45654: Summary: Add Python data source write API Key: SPARK-45654 URL: https://issues.apache.org/jira/browse/SPARK-45654 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-45639) Support Python data source in DataFrameReader

2023-10-23 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45639: - Description: Allow users to read from a Python data source using

[jira] [Updated] (SPARK-45639) Support Python data source in DataFrameReader

2023-10-23 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45639: - Description: Allow users to read from a Python data source using

[jira] [Updated] (SPARK-45639) Support Python data source in DataFrameReader

2023-10-23 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45639: - Description: Allow users to read from a Python data source using

[jira] [Created] (SPARK-45639) Support Python data source in DataFrameReader

2023-10-23 Thread Allison Wang (Jira)
Allison Wang created SPARK-45639: Summary: Support Python data source in DataFrameReader Key: SPARK-45639 URL: https://issues.apache.org/jira/browse/SPARK-45639 Project: Spark Issue Type:

[jira] [Updated] (SPARK-45524) Initial support for Python data source read API

2023-10-23 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45524: - Description: Add API for data source and data source reader and add Catalyst + execution

[jira] [Commented] (SPARK-45023) SPIP: Python Stored Procedures

2023-10-20 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1954#comment-1954 ] Allison Wang commented on SPARK-45023: -- [~abhinavofficial] this proposal is on hold, given the

[jira] [Resolved] (SPARK-45023) SPIP: Python Stored Procedures

2023-10-20 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang resolved SPARK-45023. -- Resolution: Won't Do > SPIP: Python Stored Procedures > -- > >

[jira] [Created] (SPARK-45600) Separate the Python data source logic from DataFrameReader

2023-10-18 Thread Allison Wang (Jira)
Allison Wang created SPARK-45600: Summary: Separate the Python data source logic from DataFrameReader Key: SPARK-45600 URL: https://issues.apache.org/jira/browse/SPARK-45600 Project: Spark

[jira] [Updated] (SPARK-45559) Support spark.read.schema(...) for Python data source API

2023-10-18 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45559: - Description: Support `spark.read.schema(...)` for Python data source read. Add test cases

[jira] [Created] (SPARK-45597) Support creating table using a Python data source in SQL

2023-10-18 Thread Allison Wang (Jira)
Allison Wang created SPARK-45597: Summary: Support creating table using a Python data source in SQL Key: SPARK-45597 URL: https://issues.apache.org/jira/browse/SPARK-45597 Project: Spark

[jira] [Updated] (SPARK-45584) Execution fails when there are subqueries in TakeOrderedAndProjectExec

2023-10-17 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45584: - Description: When there are subqueries in TakeOrderedAndProjectExec, the query can throw this

[jira] [Created] (SPARK-45584) Execution fails when there are subqueries in TakeOrderedAndProjectExec

2023-10-17 Thread Allison Wang (Jira)
Allison Wang created SPARK-45584: Summary: Execution fails when there are subqueries in TakeOrderedAndProjectExec Key: SPARK-45584 URL: https://issues.apache.org/jira/browse/SPARK-45584 Project:

[jira] [Updated] (SPARK-45560) Support spark.read.load() with non-empty path for Python data source API

2023-10-16 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45560: - Summary: Support spark.read.load() with non-empty path for Python data source API (was:

[jira] [Updated] (SPARK-45559) Support spark.read.schema(...) for Python data source API

2023-10-16 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45559: - Summary: Support spark.read.schema(...) for Python data source API (was: Support

[jira] [Created] (SPARK-45560) Support spark.read.load() with paths for Python data source API

2023-10-16 Thread Allison Wang (Jira)
Allison Wang created SPARK-45560: Summary: Support spark.read.load() with paths for Python data source API Key: SPARK-45560 URL: https://issues.apache.org/jira/browse/SPARK-45560 Project: Spark

[jira] [Created] (SPARK-45559) Support df.read.schema(...) for Python data source API

2023-10-16 Thread Allison Wang (Jira)
Allison Wang created SPARK-45559: Summary: Support df.read.schema(...) for Python data source API Key: SPARK-45559 URL: https://issues.apache.org/jira/browse/SPARK-45559 Project: Spark Issue

[jira] [Created] (SPARK-45526) Refine docstring of `options` for dataframe reader and writer

2023-10-12 Thread Allison Wang (Jira)
Allison Wang created SPARK-45526: Summary: Refine docstring of `options` for dataframe reader and writer Key: SPARK-45526 URL: https://issues.apache.org/jira/browse/SPARK-45526 Project: Spark

[jira] [Created] (SPARK-45525) Initial support for Python data source write API

2023-10-12 Thread Allison Wang (Jira)
Allison Wang created SPARK-45525: Summary: Initial support for Python data source write API Key: SPARK-45525 URL: https://issues.apache.org/jira/browse/SPARK-45525 Project: Spark Issue Type:

[jira] [Created] (SPARK-45524) Initial support for Python data source read API

2023-10-12 Thread Allison Wang (Jira)
Allison Wang created SPARK-45524: Summary: Initial support for Python data source read API Key: SPARK-45524 URL: https://issues.apache.org/jira/browse/SPARK-45524 Project: Spark Issue Type:

[jira] [Updated] (SPARK-45509) Investigate the behavior difference in self-join

2023-10-11 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45509: - Description: SPARK-45220 discovers a behavior difference for a self-join scenario between

[jira] [Updated] (SPARK-45509) Investigate the behavior difference in self-join

2023-10-11 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45509: - Description: SPARK-45220 discovers a behavior difference for a self-join scenario between

[jira] [Updated] (SPARK-45509) Investigate the behavior difference in self-join

2023-10-11 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45509: - Description: SPARK-45220 discovers a behavior difference for a self-join scenario between

[jira] [Created] (SPARK-45509) Investigate the behavior difference in self-join

2023-10-11 Thread Allison Wang (Jira)
Allison Wang created SPARK-45509: Summary: Investigate the behavior difference in self-join Key: SPARK-45509 URL: https://issues.apache.org/jira/browse/SPARK-45509 Project: Spark Issue Type:

[jira] [Created] (SPARK-45505) Refactor analyzeInPython function to make it reusable

2023-10-11 Thread Allison Wang (Jira)
Allison Wang created SPARK-45505: Summary: Refactor analyzeInPython function to make it reusable Key: SPARK-45505 URL: https://issues.apache.org/jira/browse/SPARK-45505 Project: Spark Issue

[jira] [Updated] (SPARK-44076) SPIP: Python Data Source API

2023-10-10 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44076: - Affects Version/s: 4.0.0 (was: 3.5.0) > SPIP: Python Data Source API

[jira] [Created] (SPARK-45442) Refine docstring of `DataFrame.show`

2023-10-06 Thread Allison Wang (Jira)
Allison Wang created SPARK-45442: Summary: Refine docstring of `DataFrame.show` Key: SPARK-45442 URL: https://issues.apache.org/jira/browse/SPARK-45442 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45428) Add Matomo analytics to all released docs pages

2023-10-05 Thread Allison Wang (Jira)
Allison Wang created SPARK-45428: Summary: Add Matomo analytics to all released docs pages Key: SPARK-45428 URL: https://issues.apache.org/jira/browse/SPARK-45428 Project: Spark Issue Type:

[jira] [Commented] (SPARK-45428) Add Matomo analytics to all released docs pages

2023-10-05 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17772395#comment-17772395 ] Allison Wang commented on SPARK-45428: -- cc [~podongfeng]  > Add Matomo analytics to all released

[jira] [Updated] (SPARK-44729) Add canonical links to the PySpark docs page

2023-10-05 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44729: - Description: We should add the canonical link to the PySpark docs page

[jira] [Commented] (SPARK-45264) Configurable error when generating Python docs

2023-09-21 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17767743#comment-17767743 ] Allison Wang commented on SPARK-45264: -- [~podongfeng] do we have ways to bypass such pandas version

[jira] [Created] (SPARK-45264) Configurable error when generating Python docs

2023-09-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-45264: Summary: Configurable error when generating Python docs Key: SPARK-45264 URL: https://issues.apache.org/jira/browse/SPARK-45264 Project: Spark Issue Type:

[jira] [Created] (SPARK-45260) Refine docstring of count_distinct

2023-09-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-45260: Summary: Refine docstring of count_distinct Key: SPARK-45260 URL: https://issues.apache.org/jira/browse/SPARK-45260 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45259) Refine docstring of `count`

2023-09-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-45259: Summary: Refine docstring of `count` Key: SPARK-45259 URL: https://issues.apache.org/jira/browse/SPARK-45259 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45258) Refine docstring of `sum`

2023-09-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-45258: Summary: Refine docstring of `sum` Key: SPARK-45258 URL: https://issues.apache.org/jira/browse/SPARK-45258 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-45220) Refine docstring of `DataFrame.join`

2023-09-19 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45220: - Description: Refine the docstring of `DataFrame.join`. The examples should also include: left

[jira] [Created] (SPARK-45223) Refine docstring of `Column.when`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45223: Summary: Refine docstring of `Column.when` Key: SPARK-45223 URL: https://issues.apache.org/jira/browse/SPARK-45223 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45222) Refine docstring of `DataFrameReader.json`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45222: Summary: Refine docstring of `DataFrameReader.json` Key: SPARK-45222 URL: https://issues.apache.org/jira/browse/SPARK-45222 Project: Spark Issue Type:

[jira] [Created] (SPARK-45221) Refine docstring of `DataFrameReader.parquet`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45221: Summary: Refine docstring of `DataFrameReader.parquet` Key: SPARK-45221 URL: https://issues.apache.org/jira/browse/SPARK-45221 Project: Spark Issue Type:

[jira] [Created] (SPARK-45220) Refine docstring of `DataFrame.join`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45220: Summary: Refine docstring of `DataFrame.join` Key: SPARK-45220 URL: https://issues.apache.org/jira/browse/SPARK-45220 Project: Spark Issue Type: Sub-task
