pitrou commented on PR #13442:
URL: https://github.com/apache/arrow/pull/13442#issuecomment-1171419265
I still disagree. First I'll note that the error message is quite explicit
about the workaround ("straddling object straddles two block boundaries (try to
increase block size?)"). Second, it's not about "feeling better about cache
locality", but about potentially decreasing performance for actual users (who,
unlike the users getting an error, will have no clue why performance may have
decreased).
If we didn't care about performance or resource consumption at all, we could
just read the entire file in one block and that would solve the issue.
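
To make the trade-off concrete, here is a toy model (not Arrow's actual chunker; the function name and the sample data are made up for illustration). It assumes newline-delimited records read in fixed-size blocks, and flags any record that crosses two or more block boundaries, which is the situation the error message describes:

```python
def straddles_two_boundaries(data: bytes, block_size: int) -> bool:
    """Return True if any newline-delimited record spans three or more blocks.

    Hypothetical helper: a simplified model of fixed-size block reading,
    not the logic Arrow itself uses.
    """
    offset = 0
    for record in data.splitlines(keepends=True):
        # The record occupies bytes [offset, offset + len(record)).
        first_block = offset // block_size
        last_block = (offset + len(record) - 1) // block_size
        # Crossing two boundaries means touching at least three blocks.
        if last_block - first_block >= 2:
            return True
        offset += len(record)
    return False


small = b'{"a": 1}\n'
large = b'{"b": "' + b"x" * 40 + b'"}\n'
data = small + large

print(straddles_two_boundaries(data, block_size=16))         # True
print(straddles_two_boundaries(data, block_size=64))         # False
print(straddles_two_boundaries(data, block_size=len(data)))  # False
```

A larger block size makes the problem go away, and a single block the size of the file trivially does too, at the cost of holding everything in memory at once. In PyArrow the corresponding knob should be `block_size` on `pyarrow.json.ReadOptions`, assuming this is the JSON reader.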