snvijaya commented on PR #5133: URL: https://github.com/apache/hadoop/pull/5133#issuecomment-1315456701
> Having spent the last couple of weeks staring at this code, I think what I would like from the module is
>
> * per file system instance
> * stats reporting to that fs (gauges on queues; actual fetch/hit/miss numbers could come via abfs input stream)
> * demand creation of the buffer pool. Maybe when the last active stream is closed it could even free them all, though that adds the reference counting problem to the mix.
> * when an abfs input stream is created for vector/random IO, prefetch being automatically disabled.
> * anything we can do to improve testing, especially simulation of heavy load from multiple tasks.

Thanks for your input, @steveloughran. I agree that all of these apply to the long-term transformation of the prefetch handling. I am going to take a day to see which of them I can get into this change as well, but I want to prioritize a hotfix and remove the buggy bit quickly.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org

For queries about this service, please contact Infrastructure at: us...@infra.apache.org