Ying Zhou created ARROW-11117: --------------------------------- Summary: [C++] ORC Reader uses wrong types Key: ARROW-11117 URL: https://issues.apache.org/jira/browse/ARROW-11117 Project: Apache Arrow Issue Type: Bug Components: C++ Reporter: Ying Zhou
The Arrow C++ ORC reader does not process types correctly. In particular it does the following: 1. It converts the ORC STRING type to the Arrow STRING type despite the fact that all ORC STRINGs are large. 2. It converts the ORC LIST type to the Arrow LIST type despite the fact that all ORC LISTs are large. 3. It converts the ORC MAP type to LISTS of STRUCTS with hardcoded field names while an actual MAP type exists in Arrow (note that the ORC MAPs are large so we need to filter out large ones when converting). These issues need to be fixed. -- This message was sent by Atlassian Jira (v8.3.4#803005)