Ian Cook created SPARK-48302:
--------------------------------

             Summary: Null values in map columns of PyArrow tables are replaced with empty lists
                 Key: SPARK-48302
                 URL: https://issues.apache.org/jira/browse/SPARK-48302
             Project: Spark
          Issue Type: Bug
          Components: PySpark
    Affects Versions: 4.0.0
            Reporter: Ian Cook

Because of a limitation in PyArrow, when PyArrow Tables are passed to spark.createDataFrame(), null values in MapArray columns are replaced with empty lists.

The PySpark function where this happens is pyspark.sql.pandas.types._check_arrow_array_timestamps_localize.

Also see [https://github.com/apache/arrow/issues/41684].
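The report does not spell out the mechanism, so the following is only a sketch of one plausible reading: a map column is rebuilt from its offsets and child entries while the per-row validity information is left behind. It is a pure-Python model of the Arrow map layout (flat entries, offsets, validity), not PySpark or PyArrow code, and the names `decode`, `entries`, `offsets`, and `validity` are hypothetical, chosen for illustration.

```python
# Assumption: this models Arrow's map layout in plain Python. It does not
# call PyArrow or PySpark, and it is not the code path named in the report.

# A map column is stored as a flat list of (key, value) entries, an offsets
# list marking where each row's entries start and end, and a separate
# validity list marking which rows are null.
entries = [("a", 1), ("b", 2), ("c", 3)]
offsets = [0, 2, 2, 3]          # row i covers entries[offsets[i]:offsets[i + 1]]
validity = [True, False, True]  # row 1 is null


def decode(entries, offsets, validity=None):
    """Materialize rows; without a validity list every row is treated as non-null."""
    rows = []
    for i in range(len(offsets) - 1):
        if validity is not None and not validity[i]:
            rows.append(None)
        else:
            rows.append(entries[offsets[i]:offsets[i + 1]])
    return rows


original = decode(entries, offsets, validity)
rebuilt = decode(entries, offsets)  # validity dropped during the rebuild

print(original)
print(rebuilt)
```

In this model the null row has equal start and end offsets, so once the validity list is gone it can only be read back as an empty list of entries, which matches the symptom described above.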