[
https://issues.apache.org/jira/browse/PARQUET-918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15943156#comment-15943156
]
Itai Incze commented on PARQUET-918:
------------------------------------
Submitted pull request - https://github.com/apache/parquet-cpp/pull/275
> FromParquetSchema API crashes on nested schemas
> -----------------------------------------------
>
> Key: PARQUET-918
> URL: https://issues.apache.org/jira/browse/PARQUET-918
> Project: Parquet
> Issue Type: Bug
> Components: parquet-cpp
> Affects Versions: cpp-1.0.0
> Reporter: Itai Incze
>
> {{FromParquetSchema@src/parquet/arrow/schema.cc:276}} misbehaves by using its
> column_indices parameter in the second version of the function as indices to
> the direct schema root fields.
> This is problematic with nested schema parquet files - the bug crashes the
> process by accessing the fields vector out of bounds.
> This bug is masked by another bug in the first version of the
> {{FromParquetSchema}} function which constructs a complete indices list the
> size of the number of schema fields (instead of the # of columns).
> The bug is triggered in many significant use-cases, for example when using
> the {{arrow::ReadTable}} API.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)