[ 
https://issues.apache.org/jira/browse/HADOOP-19596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18006609#comment-18006609
 ] 

Steve Loughran commented on HADOOP-19596:
-----------------------------------------

OK, look at the filesystem spec for openFile() and don't do this if the read 
policy is random, vectored, parquet, or orc; do it only for whole-file, 
sequential, and adaptive (plus whatever default you want).

Parquet now opens files with the read policy "parquet, vector, random, 
adaptive", listed in order of preference.
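
As a rough illustration of the gating suggested above, here is a minimal sketch 
in plain Java. It assumes the read policy arrives as a comma-separated string 
in order of preference and that the first recognized entry decides the outcome. 
The class name PrefetchPolicyGate, the method shouldPrefetchAggressively, and 
the fallback parameter are all hypothetical; none of them is part of the ABFS 
code base or the Hadoop API.

```java
import java.util.Locale;
import java.util.Set;

/** Hypothetical helper for illustration only; not an ABFS or Hadoop class. */
final class PrefetchPolicyGate {

  // Policies the comment lists as suitable for aggressive prefetching.
  private static final Set<String> PREFETCH_FRIENDLY =
      Set.of("whole-file", "sequential", "adaptive");

  // Policies the comment lists as unsuitable. Both "vector" and "vectored"
  // are accepted here because the comment uses both spellings.
  private static final Set<String> PREFETCH_HOSTILE =
      Set.of("random", "vector", "vectored", "parquet", "orc");

  /**
   * Decide whether to prefetch aggressively.
   *
   * @param readPolicies comma-separated policies in order of preference
   * @param fallback decision when no entry is recognized, or when the first
   *                 recognized entry is "default" (an assumption of this sketch)
   */
  static boolean shouldPrefetchAggressively(String readPolicies, boolean fallback) {
    if (readPolicies == null) {
      return fallback;
    }
    for (String raw : readPolicies.split(",")) {
      String policy = raw.trim().toLowerCase(Locale.ROOT);
      if (PREFETCH_HOSTILE.contains(policy)) {
        return false;
      }
      if (PREFETCH_FRIENDLY.contains(policy)) {
        return true;
      }
      if (policy.equals("default")) {
        return fallback;
      }
      // Unrecognized entries are skipped so the next preference is tried.
    }
    return fallback;
  }

  public static void main(String[] args) {
    String parquetPolicy = "parquet, vector, random, adaptive";
    System.out.println(parquetPolicy + " -> "
        + shouldPrefetchAggressively(parquetPolicy, true));
    System.out.println("sequential -> "
        + shouldPrefetchAggressively("sequential", true));
  }
}
```

With this ordering rule, the Parquet policy list resolves on its first entry, 
so aggressive prefetching stays off even though "adaptive" appears later in 
the list.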

> ABFS: [ReadAheadV2] Increase Prefetch Aggressiveness to improve sequential 
> read performance
> -------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-19596
>                 URL: https://issues.apache.org/jira/browse/HADOOP-19596
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/azure
>    Affects Versions: 3.5.0, 3.4.1
>            Reporter: Anuj Modi
>            Assignee: Anuj Modi
>            Priority: Major
>
> Various analyses done in the past have shown a need for significant 
> improvement in the performance of sequential reads. The current 
> implementation clearly shows the lack of parallelism that is needed to cater 
> to high throughput sequential read workloads. 
> More details on updated design and results of POC benchmarking will be added 
> here soon.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org
