Adam Ling created ARROW-18298: --------------------------------- Summary: [Python] datetime shifted when using pyarrow.Table.from_pandas to load a pandas DateFrame containing datetime with timezone Key: ARROW-18298 URL: https://issues.apache.org/jira/browse/ARROW-18298 Project: Apache Arrow Issue Type: Bug Components: Python Affects Versions: 9.0.0 Environment: MacOS M1, Python 3.8.13 Reporter: Adam Ling
Problem: When using pyarrow.Table.from_pandas to load a pandas DataFrame which contains a timestamp object with timezone information, the created Table object will shift the datetime, while still keeping the timezone information. Please see my scripts. Reproduce scripts: {code:java} import pandas as pd import pyarrow ts = pd.Timestamp("2022-10-21 22:46:17", tz="America/Los_Angeles") df = pd.DataFrame({"TS": [ts]}) table = pyarrow.Table.from_pandas(df) print(df) """ TS 0 2022-10-21 22:46:17-07:00 """ print(table) """ pyarrow.Table TS: timestamp[ns, tz=America/Los_Angeles] ---- TS: [[2022-10-22 05:46:17.000000000]]""" {code} Expected results: The table should not shift the datetime when timezone information is provided. -- This message was sent by Atlassian Jira (v8.20.10#820010)