noirello created ORC-1047: ----------------------------- Summary: [C++] Handle quoted field names during string schema parsing Key: ORC-1047 URL: https://issues.apache.org/jira/browse/ORC-1047 Project: ORC Issue Type: Bug Components: C++ Affects Versions: 1.6.11, 1.7.0, 1.8.0 Reporter: noirello Assignee: noirello
The current implementation of _Type::buildTypeFromString_ cannot handle string schemas with quoted field names. The following code will raise a logic error of "Unrecognized character.": {code:java} auto schema = Type::buildTypeFromString("struct<`quoted.field`:string>"); {code} Besides that, two other limitations have been encountered: * Cannot parse a string schema that only has a _timestamp with local time zone_ type in root. {code:java} Type::buildTypeFromString("timestamp with local time zone"); {code} * It allows to create struct types without setting a field name, which (based on the Java implementation) should not be a valid ORC schema. {code:java} Type::buildTypeFromString("struct<struct<bigint>>"); {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)