[
https://issues.apache.org/jira/browse/PARQUET-215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356436#comment-14356436
]
Alex Levenson commented on PARQUET-215:
---------------------------------------
https://github.com/apache/incubator-parquet-mr/pull/146
> Parquet Thrift should discard records with unrecognized union members
> ---------------------------------------------------------------------
>
> Key: PARQUET-215
> URL: https://issues.apache.org/jira/browse/PARQUET-215
> Project: Parquet
> Issue Type: Bug
> Components: parquet-mr
> Reporter: Alex Levenson
>
> When writing parquet-thrift files, when a thrift record with an unknown union
> member is encountered, it should be considered a bad record and discarded.
> Currently, because unions are treated as structs with one optional field per
> union member, parquet-thrift happily writes the empty struct, but then
> crashes in the read path when trying to read this record.
> We should discard these records in the write path, just as we discard other
> unparseable records.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)