pepijnve commented on issue #18411: URL: https://github.com/apache/datafusion/issues/18411#issuecomment-3490975739
> 🤔 figuring out how to avoid re-hashing the same short values over and over again (StringView can be thought of as a type of dictionary encoding) is a very interesting possibility

For values with length <= 12 (which is the case in Q1, since the columns involved are both `char(1)`), views lose their dictionary advantage since each entry will be an inline value. I think you would have to solve this at the source and make them actual dictionary-encoded values instead (if that's not already the case).
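To illustrate the inline-value point, here is a rough sketch in plain Python that models the 16-byte view layout from the Arrow columnar format. It is not DataFusion or arrow-rs code: `make_view` and `is_inline` are hypothetical helpers, and the row values are made-up single-character strings standing in for a `char(1)` column.

```python
import struct

INLINE_MAX = 12  # views store values of up to 12 bytes inline


def make_view(value: bytes, buffer_index: int = 0, offset: int = 0) -> bytes:
    """Pack a 16-byte little-endian view for `value` (hypothetical helper)."""
    n = len(value)
    if n <= INLINE_MAX:
        # 4-byte length, then the value itself zero-padded to 12 bytes
        return struct.pack("<i12s", n, value)
    # 4-byte length, 4-byte prefix, buffer index, offset into that buffer
    return struct.pack("<i4sii", n, value[:4], buffer_index, offset)


def is_inline(view: bytes) -> bool:
    """True when the view carries the value itself rather than a buffer reference."""
    (n,) = struct.unpack_from("<i", view)
    return n <= INLINE_MAX


# Made-up char(1)-style column: every row repeats the value inside its own view,
# so there is no shared buffer entry that could serve as a dictionary key.
rows = [b"A", b"N", b"R", b"A", b"N"]
views = [make_view(v) for v in rows]
all_inline = all(is_inline(v) for v in views)

# Actual dictionary encoding: distinct values stored once, rows hold small keys.
values = sorted(set(rows))
keys = [values.index(v) for v in rows]
```

With a real dictionary, each distinct value could in principle be hashed once and the result looked up by key, which is the advantage the inline views do not offer for such short strings.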
