[ https://issues.apache.org/jira/browse/ARROW-17066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ian Cook updated ARROW-17066:
-----------------------------
    Fix Version/s: 9.0.0

> [C++][Python][Substrait] "ignore_unknown_fields" should be specified when converting JSON to binary
> ---------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-17066
>                 URL: https://issues.apache.org/jira/browse/ARROW-17066
>             Project: Apache Arrow
>          Issue Type: Bug
>            Reporter: Richard Tia
>            Assignee: Vibhatha Lakmal Abeykoon
>            Priority: Critical
>              Labels: pull-request-available
>             Fix For: 9.0.0
>
>          Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> [https://developers.google.com/protocol-buffers/docs/reference/cpp/google.protobuf.util.json_util#JsonParseOptions]
>
> When converting a substrait JSON to binary, there are many unknown fields that may exist since substrait is being built every week. ignore_unknown_fields should be specified when doing this conversion.
>
> This is resulting in frequent errors similar to this:
> {code:java}
> E   pyarrow.lib.ArrowInvalid: JsonToBinaryStream returned INVALID_ARGUMENT:(relations[0].root.input.sort.input.aggregate.measures[0].measure) arguments: Cannot find field.
> pyarrow/error.pxi:100: ArrowInvalid
> {code}

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
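
For illustration only, the difference between strict parsing and parsing with unknown fields ignored can be sketched with a toy, standard-library-only model. This is not Arrow or protobuf code: the SCHEMA dictionary, the parse() and _filter() helpers, and the sample plan are all hypothetical, and the error text merely imitates the message quoted above. In protobuf itself, the relevant switch is the ignore_unknown_fields member of JsonParseOptions linked in the issue.

```python
import json

# Hypothetical schema known to an older reader: a dict value is a nested
# message, None is a scalar field. "arguments" is deliberately absent.
SCHEMA = {"measure": {"function_reference": None}}


def _filter(obj, schema, path, ignore_unknown_fields):
    """Copy only the fields of obj that the schema knows about."""
    out = {}
    for key, value in obj.items():
        if key not in schema:
            if ignore_unknown_fields:
                continue  # lenient mode: silently drop the unknown field
            # strict mode: fail, imitating the error text from the report
            raise ValueError(
                f"INVALID_ARGUMENT:({path}) {key}: Cannot find field."
            )
        sub_schema = schema[key]
        if sub_schema is not None and isinstance(value, dict):
            child_path = f"{path}.{key}" if path else key
            out[key] = _filter(value, sub_schema, child_path,
                               ignore_unknown_fields)
        else:
            out[key] = value
    return out


def parse(text, schema, ignore_unknown_fields=False):
    """Parse JSON text against schema (toy stand-in for JSON-to-binary)."""
    return _filter(json.loads(text), schema, "", ignore_unknown_fields)


# A plan written by a newer producer that already emits "arguments".
plan = '{"measure": {"function_reference": 1, "arguments": []}}'

try:
    parse(plan, SCHEMA)  # strict: the unknown field is an error
except ValueError as err:
    print("strict:", err)

print("lenient:", parse(plan, SCHEMA, ignore_unknown_fields=True))
```

In this sketch the strict call fails on the "arguments" field, while the lenient call drops it and returns the fields the reader understands. Dropping a field silently is the trade-off of the lenient mode: the conversion succeeds, but whatever the newer field carried is lost.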