Bryan Cutler created SPARK-30640:
------------------------------------

             Summary: Prevent unnessary copies of data in Arrow to Pandas 
conversion with Timestamps
                 Key: SPARK-30640
                 URL: https://issues.apache.org/jira/browse/SPARK-30640
             Project: Spark
          Issue Type: Improvement
          Components: PySpark, SQL
    Affects Versions: 2.4.4
            Reporter: Bryan Cutler


During conversion of Arrow to Pandas, timestamp columns are modified to 
localize for the current timezone. If there are no timestamp columns, this can 
sometimes result in unnecessary copies of the data. See 
[https://www.mail-archive.com/dev@arrow.apache.org/msg17008.html] for 
discussion.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to