adamreeve opened a new pull request, #6953:
URL: https://github.com/apache/arrow-rs/pull/6953
# Which issue does this PR close?
Closes #6952
# Rationale for this change
This treats NaNs as equal to other NaNs of the same type for the purpose of
dictionary encoding them when writing f32 or f64 Parquet physical values.
# What changes are included in this PR?
* Introduces a new `Intern` trait to define equality behaviour for
interning, replacing the use of `PartialEq`.
* Adds a benchmark for writing floating point values with NaNs to Parquet.
# Are there any user-facing changes?
* Users should see improved performance when writing floating point data
with many NaNs to Parquet
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]