alamb commented on PR #7650: URL: https://github.com/apache/arrow-rs/pull/7650#issuecomment-2976197147
I have been thinking about this PR. My proposal is going to be to merge it as is (even though it shows a performance regression for many large strings). The thinking is that even though this particular change will slow down for large strings, we are still better off overall due to - XXX Also, I think when I incorporate actual filtering as well we'll further make up in performance - https://github.com/apache/arrow-rs/pull/7652 So I plan to: 1. Do some more testing 2. Make a POC with adding native filtering on top of this PR to see if we can get the performance back down -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org