Yevgeni Litvin created ARROW-7214: ------------------------------------- Summary: [Python] unpickling a pyarrow table with dictionary fields crashes Key: ARROW-7214 URL: https://issues.apache.org/jira/browse/ARROW-7214 Project: Apache Arrow Issue Type: Bug Components: Python Affects Versions: 0.15.1, 0.15.0, 0.14.1, 0.14.0 Reporter: Yevgeni Litvin
The following code crashes on this check: {code:java} F1120 07:51:37.523720 12432 array.cc:773] Check failed: (data->dictionary) != (nullptr) {code} Used pandas 0.24.2. {code:java} import cPickle as pickle import pandas as pd import pyarrow as pa df = pd.DataFrame([{"cat": "a", "val":1},{"cat": "b", "val":2} ]) df["cat"] = df["cat"].astype('category')index_table = pa.Table.from_pandas(df, preserve_index=False) with open('/tmp/zz.pickle', 'wb') as f: pickle.dump(index_table, f, protocol=2) with open('/tmp/zz.pickle', 'rb') as f: index_table = pickle.load(f) {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)