Igosuki opened a new pull request #910:
URL: https://github.com/apache/arrow-datafusion/pull/910
# Which issue does this PR close?
Closes #903.
# Rationale for this change
Enables loading Avro data files through DataFusion.
# What changes are included in this PR?
Avro is added as a table provider and as a supported file format.
Avro schemas can be translated into Arrow schemas.
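To illustrate the schema translation idea, here is a minimal sketch of mapping Avro types to Arrow types. `AvroType`, `ArrowType`, `ArrowField`, and `avro_to_arrow` are hypothetical, simplified stand-ins invented for this example; they are not the types or functions used in this PR, and the real conversion covers more cases (records, maps, enums, logical types). The sketch assumes that a `["null", T]` union becomes a nullable field of type `T`.

```rust
// Hypothetical, simplified stand-ins for Avro and Arrow types.
// These are NOT the types from the avro or arrow crates used in the PR.
#[derive(Debug, Clone, PartialEq)]
enum AvroType {
    Null,
    Boolean,
    Int,
    Long,
    Float,
    Double,
    Bytes,
    String,
    Array(Box<AvroType>),
    Union(Vec<AvroType>),
}

#[derive(Debug, Clone, PartialEq)]
enum ArrowType {
    Null,
    Boolean,
    Int32,
    Int64,
    Float32,
    Float64,
    Binary,
    Utf8,
    List(Box<ArrowField>),
}

#[derive(Debug, Clone, PartialEq)]
struct ArrowField {
    data_type: ArrowType,
    nullable: bool,
}

/// Translate a (simplified) Avro type into a (simplified) Arrow field.
/// Returns `None` for shapes this sketch does not handle.
fn avro_to_arrow(avro: &AvroType) -> Option<ArrowField> {
    // Helper for the common case of a non-nullable field.
    let non_null = |t: ArrowType| Some(ArrowField { data_type: t, nullable: false });
    match avro {
        AvroType::Null => Some(ArrowField { data_type: ArrowType::Null, nullable: true }),
        AvroType::Boolean => non_null(ArrowType::Boolean),
        AvroType::Int => non_null(ArrowType::Int32),
        AvroType::Long => non_null(ArrowType::Int64),
        AvroType::Float => non_null(ArrowType::Float32),
        AvroType::Double => non_null(ArrowType::Float64),
        AvroType::Bytes => non_null(ArrowType::Binary),
        AvroType::String => non_null(ArrowType::Utf8),
        AvroType::Array(item) => {
            // An Avro array maps to an Arrow list of the translated item type.
            let item_field = avro_to_arrow(item)?;
            non_null(ArrowType::List(Box::new(item_field)))
        }
        AvroType::Union(variants) => {
            // Only unions with exactly one non-null branch are handled here.
            let non_nulls: Vec<&AvroType> =
                variants.iter().filter(|v| **v != AvroType::Null).collect();
            let has_null = non_nulls.len() < variants.len();
            match non_nulls.as_slice() {
                [single] => {
                    let mut field = avro_to_arrow(single)?;
                    field.nullable = field.nullable || has_null;
                    Some(field)
                }
                _ => None,
            }
        }
    }
}
```

In this sketch, a union such as `["null", "string"]` yields a nullable `Utf8` field, while a union with several non-null branches returns `None` rather than guessing at a representation.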
# Are there any user-facing changes?
Yes. Users can now call register_avro in DataFusion and use 'STORED AS AVRO' in SQL.
# N.B.:
- Need to add tests in avro_to_arrow/arrow_array_reader.rs
# Missing:
- Writing back to avro
These additions introduce some duplication between modules, so I should
probably do some refactoring.