[GitHub] [hadoop] snvijaya commented on pull request #5133: HADOOP-18521. Draft change - ABFS Prefetch corruption

GitBox Tue, 15 Nov 2022 07:16:35 -0800


snvijaya commented on PR #5133:
URL: https://github.com/apache/hadoop/pull/5133#issuecomment-1315456701


   > Having spent the last couple of weeks staring at this code, I think what I would like from the module is
   > 
   > * per file system instance
   > * stats reporting to that fs (gauges on queues; actual fetch/hit/miss numbers could come via abfs input stream)
   > * demand creation of the buffer pool. Maybe when the last active stream is closed it could even free them all, though that adds the reference counting problem to the mix.
   > * when an abfs input stream is created for vector/random IO, prefetch being automatically disabled.
   > * anything we can do to improve testing, especially simulation of heavy load from multiple tasks.
   
   Thanks for your input, @steveloughran. I agree that all of these apply to the long-term rework of the prefetch handling. I am going to take a day to see which of these I can get into this change as well, but I want to prioritize a hotfix and remove the buggy bit quickly.
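
   To make the "demand creation plus free on last close" item from the list above concrete, here is a rough sketch of what a reference-counted pool owned by one file system instance might look like. The class `RefCountedBufferPool` and all of its method names are hypothetical and are not part of the existing ABFS code; it also ignores buffers still in flight when the last stream closes, which is exactly the reference counting problem mentioned above.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/**
 * Hypothetical sketch only: NOT the ABFS ReadBufferManager API.
 * One instance would be owned by one file system instance.
 */
public class RefCountedBufferPool {
    private final int bufferCount;
    private final int bufferSize;
    private Deque<byte[]> freeBuffers;   // null until the first stream opens
    private int activeStreams;

    public RefCountedBufferPool(int bufferCount, int bufferSize) {
        this.bufferCount = bufferCount;
        this.bufferSize = bufferSize;
    }

    /** Called when an input stream opens; allocates buffers on first use. */
    public synchronized void streamOpened() {
        if (activeStreams++ == 0) {
            freeBuffers = new ArrayDeque<>(bufferCount);
            for (int i = 0; i < bufferCount; i++) {
                freeBuffers.push(new byte[bufferSize]);
            }
        }
    }

    /** Called when an input stream closes; frees buffers after the last one. */
    public synchronized void streamClosed() {
        if (activeStreams == 0) {
            throw new IllegalStateException("no active streams");
        }
        if (--activeStreams == 0) {
            freeBuffers = null;   // let the GC reclaim the buffers
        }
    }

    /** Returns a free buffer, or null if none is free or the pool is unallocated. */
    public synchronized byte[] acquire() {
        return freeBuffers == null ? null : freeBuffers.poll();
    }

    /** Returns a buffer to the pool; dropped if the pool is freed or already full. */
    public synchronized void release(byte[] buffer) {
        if (freeBuffers != null && freeBuffers.size() < bufferCount) {
            freeBuffers.push(buffer);
        }
    }

    public synchronized boolean isAllocated() {
        return freeBuffers != null;
    }
}
```

   In this sketch a stream would call `streamOpened()` when it is created and `streamClosed()` from its `close()`, so an idle file system holds no prefetch memory.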


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
For additional commands, e-mail: common-issues-h...@hadoop.apache.org
