valkum commented on issue #8404: URL: https://github.com/apache/arrow-rs/issues/8404#issuecomment-3378439532
I had a bit of time to spend on this again and figured out that this was actually a regression introduced by 55.2.0. Bisecting showed that e814b97415d2df78c273302dab934dfe2a0a4a64 regressed. Both the market and the type leaf columns report 0 records and values in the `GenericRecordReader`. I see two calls to `ByteArrayDictionaryReader::consume_batch` for each column (etld, type, market); in the second call the values are already taken by `GenericRecordReader::consume_record_data` One note: It looks like Polars created a shared `LargeStringArray` Dict for all cols (etld, type, market). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
