[jira] [Updated] (SPARK-48302) Preserve nulls in map columns in PyArrow Tables

2024-06-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Target Version/s: 4.0.0 > Preserve nulls in map columns in PyArrow Tables >

[jira] [Updated] (SPARK-48302) Preserve nulls in map columns in PyArrow Tables

2024-06-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Parent: SPARK-44111 Issue Type: Sub-task (was: Bug) > Preserve nulls in map columns in PyArrow

[jira] [Updated] (SPARK-48302) Preserve nulls in map columns in PyArrow Tables

2024-06-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Affects Version/s: 3.5.1 > Preserve nulls in map columns in PyArrow Tables > ---

[jira] [Updated] (SPARK-48302) Preserve nulls in map columns in PyArrow Tables

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Parent: SPARK-44111 Issue Type: Sub-task (was: Bug) > Preserve nulls in map columns in PyArrow

[jira] [Updated] (SPARK-48302) Preserve nulls in map columns in PyArrow Tables

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Parent: (was: SPARK-44111) Issue Type: Bug (was: Sub-task) > Preserve nulls in map columns

[jira] [Updated] (SPARK-48302) Preserve nulls in map columns in PyArrow Tables

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Summary: Preserve nulls in map columns in PyArrow Tables (was: Null values in map columns of PyArrow ta

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables containing MapArray columns with n

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables are passed to {{{}spark.createData

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables are passed to {{{}spark.createData

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables are passed to {{{}spark.createData

[jira] [Updated] (SPARK-47466) Add PySpark DataFrame method to return iterator of PyArrow RecordBatches

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47466: - Component/s: Connect Input/Output SQL > Add PySpark DataFrame method t

[jira] [Updated] (SPARK-47466) Add PySpark DataFrame method to return iterator of PyArrow RecordBatches

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47466: - Affects Version/s: 4.0.0 > Add PySpark DataFrame method to return iterator of PyArrow RecordBatches > --

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-06-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Language: Python > Null values in map columns of PyArrow tables are replaced with empty lists >

[jira] [Commented] (SPARK-48478) Allow passing iterator of PyArrow RecordBatches to createDataFrame()

2024-05-30 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17850865#comment-17850865 ] Ian Cook commented on SPARK-48478: -- For Connect, see class {{LocalRelation}} in {{{}py

[jira] [Created] (SPARK-48478) Allow passing iterator of PyArrow RecordBatches to createDataFrame()

2024-05-30 Thread Ian Cook (Jira)
Ian Cook created SPARK-48478: Summary: Allow passing iterator of PyArrow RecordBatches to createDataFrame() Key: SPARK-48478 URL: https://issues.apache.org/jira/browse/SPARK-48478 Project: Spark

[jira] [Updated] (SPARK-48478) Allow passing iterator of PyArrow RecordBatches to createDataFrame()

2024-05-30 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48478: - Language: Python > Allow passing iterator of PyArrow RecordBatches to createDataFrame() > --

[jira] [Commented] (SPARK-47466) Add PySpark DataFrame method to return iterator of PyArrow RecordBatches

2024-05-30 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17850855#comment-17850855 ] Ian Cook commented on SPARK-47466: -- For Connect, see the function {{to_table_as_iterato

[jira] [Resolved] (SPARK-48373) Allow schema parameter of createDataFrame() to be length-1 list or tuple of StructType

2024-05-22 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook resolved SPARK-48373. -- Resolution: Won't Fix > Allow schema parameter of createDataFrame() to be length-1 list or tuple of >

[jira] [Closed] (SPARK-48373) Allow schema parameter of createDataFrame() to be length-1 list or tuple of StructType

2024-05-22 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook closed SPARK-48373. > Allow schema parameter of createDataFrame() to be length-1 list or tuple of > StructType >

[jira] [Updated] (SPARK-48220) Allow passing PyArrow Table to createDataFrame()

2024-05-21 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48220: - Fix Version/s: (was: 4.0.0) Target Version/s: 4.0.0 > Allow passing PyArrow Table to createDa

[jira] [Created] (SPARK-48374) Support additional PyArrow Table column types

2024-05-21 Thread Ian Cook (Jira)
Ian Cook created SPARK-48374: Summary: Support additional PyArrow Table column types Key: SPARK-48374 URL: https://issues.apache.org/jira/browse/SPARK-48374 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-48374) Support additional PyArrow Table column types

2024-05-21 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48374: - Parent: SPARK-44111 Issue Type: Sub-task (was: Improvement) > Support additional PyArrow Table

[jira] [Created] (SPARK-48373) Allow schema parameter of createDataFrame() to be length-1 list or tuple of StructType

2024-05-21 Thread Ian Cook (Jira)
Ian Cook created SPARK-48373: Summary: Allow schema parameter of createDataFrame() to be length-1 list or tuple of StructType Key: SPARK-48373 URL: https://issues.apache.org/jira/browse/SPARK-48373 Projec

[jira] [Commented] (SPARK-48220) Allow passing PyArrow Table to createDataFrame()

2024-05-17 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847448#comment-17847448 ] Ian Cook commented on SPARK-48220: -- [~gurwls223] the PR for this is ready for review:

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-05-16 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables are passed to {{{}spark.createData

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-05-16 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables are passed to {{{}spark.createData

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-05-16 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables are passed to {{{}spark.createData

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-05-16 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables are passed to {{{}spark.createData

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-05-16 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables are passed to {{spark.createDataFr

[jira] [Updated] (SPARK-48220) Allow passing PyArrow Table to createDataFrame()

2024-05-16 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48220: - Fix Version/s: 4.0.0 > Allow passing PyArrow Table to createDataFrame() > --

[jira] [Updated] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-05-16 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48302: - Description: Because of a limitation in PyArrow, when PyArrow Tables are passed to spark.createDataFram

[jira] [Created] (SPARK-48302) Null values in map columns of PyArrow tables are replaced with empty lists

2024-05-16 Thread Ian Cook (Jira)
Ian Cook created SPARK-48302: Summary: Null values in map columns of PyArrow tables are replaced with empty lists Key: SPARK-48302 URL: https://issues.apache.org/jira/browse/SPARK-48302 Project: Spark

[jira] [Updated] (SPARK-48220) Allow passing PyArrow Table to createDataFrame()

2024-05-14 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48220: - Parent: SPARK-44111 Issue Type: Sub-task (was: Improvement) > Allow passing PyArrow Table to cr

[jira] [Updated] (SPARK-47465) Remove experimental tag from toArrow() PySpark DataFrame method

2024-05-09 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47465: - Fix Version/s: 4.0.0 > Remove experimental tag from toArrow() PySpark DataFrame method > ---

[jira] [Updated] (SPARK-48220) Allow passing PyArrow Table to createDataFrame()

2024-05-09 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-48220: - Description: SPARK-47365 added support for returning a Spark DataFrame as a PyArrow Table. It would be

[jira] [Updated] (SPARK-47466) Add PySpark DataFrame method to return iterator of PyArrow RecordBatches

2024-05-09 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47466: - Description: As a follow-up to SPARK-47365: {{toArrow()}} is useful when the data is relatively small.

[jira] [Created] (SPARK-48220) Allow passing PyArrow Table to createDataFrame()

2024-05-09 Thread Ian Cook (Jira)
Ian Cook created SPARK-48220: Summary: Allow passing PyArrow Table to createDataFrame() Key: SPARK-48220 URL: https://issues.apache.org/jira/browse/SPARK-48220 Project: Spark Issue Type: Improvem

[jira] [Updated] (SPARK-47465) Remove experimental tag from toArrow() PySpark DataFrame method

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47465: - Description: As a follow-up to SPARK-47365: What is needed to consider making the *toArrow()* PySpark D

[jira] [Updated] (SPARK-47465) Remove experimental tag from toArrow() PySpark DataFrame method

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47465: - Summary: Remove experimental tag from toArrow() PySpark DataFrame method (was: Remove experimental tag

[jira] [Updated] (SPARK-47466) Add PySpark DataFrame method to return iterator of PyArrow RecordBatches

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47466: - Description: As a follow-up to SPARK-47365: *toArrow()* is useful when the data is relatively small. Fo

[jira] [Updated] (SPARK-47365) Add toArrow() DataFrame method to PySpark

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Summary: Add toArrow() DataFrame method to PySpark (was: Add toArrowTable() DataFrame method to PySpark

[jira] [Updated] (SPARK-47365) Add toArrowTable() DataFrame method to PySpark

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Description: Over in the Apache Arrow community, we hear from a lot of users who want to return the con

[jira] [Updated] (SPARK-47365) Add toArrowTable() DataFrame method to PySpark

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Affects Version/s: 4.0.0 > Add toArrowTable() DataFrame method to PySpark >

[jira] [Updated] (SPARK-47365) Add toArrowTable() DataFrame method to PySpark

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Description: Over in the Apache Arrow community, we hear from a lot of users who want to return the con

[jira] [Resolved] (SPARK-47465) Remove experimental tag from toArrowTable() PySpark DataFrame method

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook resolved SPARK-47465. -- Resolution: Duplicate This is now part of SPARK-47365. > Remove experimental tag from toArrowTable()

[jira] [Updated] (SPARK-47365) Add toArrowTable() DataFrame method to PySpark

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Parent: SPARK-44111 Issue Type: Sub-task (was: Improvement) > Add toArrowTable() DataFrame meth

[jira] [Updated] (SPARK-47365) Add toArrowTable() DataFrame method to PySpark

2024-05-08 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Fix Version/s: 4.0.0 > Add toArrowTable() DataFrame method to PySpark >

[jira] [Updated] (SPARK-47365) Add toArrowTable() DataFrame method to PySpark

2024-05-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Description: Over in the Apache Arrow community, we hear from a lot of users who want to return the con

[jira] [Updated] (SPARK-47365) Add toArrowTable() DataFrame method to PySpark

2024-05-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Summary: Add toArrowTable() DataFrame method to PySpark (was: Add _toArrowTable() DataFrame method to P

[jira] [Updated] (SPARK-47465) Remove experimental tag from toArrowTable() PySpark DataFrame method

2024-05-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47465: - Description: As a follow-up to SPARK-47365: What is needed to consider making the *toArrowTable()* PySp

[jira] [Updated] (SPARK-47466) Add PySpark DataFrame method to return iterator of PyArrow RecordBatches

2024-05-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47466: - Description: As a follow-up to SPARK-47365: *toArrowTable()* is useful when the data is relatively smal

[jira] [Updated] (SPARK-47465) Remove experimental tag from toArrowTable() PySpark DataFrame method

2024-05-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47465: - Summary: Remove experimental tag from toArrowTable() PySpark DataFrame method (was: Remove experimental

[jira] [Updated] (SPARK-47365) Add _toArrowTable() DataFrame method to PySpark

2024-05-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Summary: Add _toArrowTable() DataFrame method to PySpark (was: Add _toArrow() DataFrame method to PySpa

[jira] [Updated] (SPARK-47365) Add _toArrowTable() DataFrame method to PySpark

2024-05-07 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Description: Over in the Apache Arrow community, we hear from a lot of users who want to return the con

[jira] [Created] (SPARK-47466) Add PySpark DataFrame method to return iterator of PyArrow RecordBatches

2024-03-19 Thread Ian Cook (Jira)
Ian Cook created SPARK-47466: Summary: Add PySpark DataFrame method to return iterator of PyArrow RecordBatches Key: SPARK-47466 URL: https://issues.apache.org/jira/browse/SPARK-47466 Project: Spark

[jira] [Updated] (SPARK-47365) Add _toArrow() DataFrame method to PySpark

2024-03-19 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Component/s: Connect > Add _toArrow() DataFrame method to PySpark >

[jira] [Created] (SPARK-47465) Remove experimental tag from toArrow() PySpark DataFrame method

2024-03-19 Thread Ian Cook (Jira)
Ian Cook created SPARK-47465: Summary: Remove experimental tag from toArrow() PySpark DataFrame method Key: SPARK-47465 URL: https://issues.apache.org/jira/browse/SPARK-47465 Project: Spark Issu

[jira] [Updated] (SPARK-47365) Add _toArrow() DataFrame method to PySpark

2024-03-19 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Description: Over in the Apache Arrow community, we hear from a lot of users who want to return the con

[jira] [Updated] (SPARK-47365) Add _toArrow() DataFrame method to PySpark

2024-03-19 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Summary: Add _toArrow() DataFrame method to PySpark (was: Add toArrow() DataFrame method to PySpark) >

[jira] [Updated] (SPARK-47365) Add toArrow() DataFrame method to PySpark

2024-03-12 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Component/s: SQL > Add toArrow() DataFrame method to PySpark > -

[jira] [Updated] (SPARK-47365) Add toArrow() DataFrame method to PySpark

2024-03-12 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Description: Over in the Apache Arrow community, we hear from a lot of users who want to return the con

[jira] [Updated] (SPARK-47365) Add toArrow() DataFrame method to PySpark

2024-03-12 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated SPARK-47365: - Summary: Add toArrow() DataFrame method to PySpark (was: Add toArrow() DataFrame method) > Add toArrow

[jira] [Comment Edited] (SPARK-47365) Add toArrow() DataFrame method

2024-03-12 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17825769#comment-17825769 ] Ian Cook edited comment on SPARK-47365 at 3/12/24 8:04 PM: --- It

[jira] [Commented] (SPARK-47365) Add toArrow() DataFrame method

2024-03-12 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17825769#comment-17825769 ] Ian Cook commented on SPARK-47365: -- It looks like all the pieces required to enable thi

[jira] [Created] (SPARK-47365) Add toArrow() DataFrame method

2024-03-12 Thread Ian Cook (Jira)
Ian Cook created SPARK-47365: Summary: Add toArrow() DataFrame method Key: SPARK-47365 URL: https://issues.apache.org/jira/browse/SPARK-47365 Project: Spark Issue Type: Improvement Comp

[jira] [Commented] (SPARK-27335) cannot collect() from Correlation.corr

2020-07-29 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167475#comment-17167475 ] Ian Cook commented on SPARK-27335: -- Regarding the workaround code that [~natalinobusa]