Adam Hooper created ARROW-6568: ---------------------------------- Summary: pyarrow.parquet crash writing zero-chunk dictionary-type column Key: ARROW-6568 URL: https://issues.apache.org/jira/browse/ARROW-6568 Project: Apache Arrow Issue Type: Bug Components: Python Affects Versions: 0.14.1 Environment: Pyarrow v0.14.1, manylinux1 Reporter: Adam Hooper
Trying to write a zero-RecordBatch file to parquet: {code:python} import pyarrow import pyarrow.parquet table = pyarrow.Table.from_batches([], pyarrow.schema([('A', pyarrow.dictionary(pyarrow.int32(), pyarrow.string()))])) pyarrow.parquet.write_table(table, 'x.parquet') {code} ... I receive an error and Python exits with exit code {{139}}: {noformat} WARNING: Logging before InitGoogleLogging() is written to STDERR F0915 18:37:23.099939 1 table.cc:64] Check failed: (chunks.size()) > (0) cannot construct ChunkedArray from empty vector and omitted type *** Check failure stack trace: *** {noformat} -- This message was sent by Atlassian Jira (v8.3.2#803003)