Ian Cook created SPARK-48302:
--------------------------------

             Summary: Null values in map columns of PyArrow tables are replaced with empty lists
                 Key: SPARK-48302
                 URL: https://issues.apache.org/jira/browse/SPARK-48302
             Project: Spark
          Issue Type: Bug
          Components: PySpark
    Affects Versions: 4.0.0
            Reporter: Ian Cook


Because of a limitation in PyArrow, when PyArrow Tables are passed to 
spark.createDataFrame(), null values in MapArray columns are replaced with 
empty lists.

The PySpark function where this happens is
pyspark.sql.pandas.types._check_arrow_array_timestamps_localize.
Also see https://github.com/apache/arrow/issues/41684.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

