veronica-m-ef opened a new pull request, #7954:
URL: https://github.com/apache/arrow-rs/pull/7954

   # Which issue does this PR close?
   
   - Part of https://github.com/apache/arrow-rs/issues/4886
   - Related to https://github.com/apache/arrow-rs/pull/6965
   
   # Rationale for this change
   
   This change introduces support for Avro files generated by systems like 
Impala, which have a specific convention for representing nullable fields. In 
Avro, nullability is typically represented by a union of a type and a type. 
This PR updates the Avro reader to correctly interpret these schemas, ensuring 
proper handling of nullable data and improving interoperability with 
Impala-generated data. `null`
   
   # What changes are included in this PR?
   
   This pull request introduces several changes to support Impala-style 
nullability in the Avro reader:
   - The Avro schema parser has been updated to recognize unions where is the 
second type (e.g., `['type', 'null']`) as a nullable field. `null`
   - Logic has been added to handle this nullability convention during Avro 
decoding.
   - New tests are included to verify that Avro files using this nullability 
format are read correctly while ensuring that strict mode properly identifies 
them.
   
   # Are these changes tested?
   
   Yes, I added new test cases covering these changes to the tests named: 
`test_nonnullable_impala`, `test_nonnullable_impala_strict`, 
`test_nullable_impala` and `test_nullable_impala_strict`.
   
   # Are there any user-facing changes?
   
   N/A
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to