[ https://issues.apache.org/jira/browse/HIVE-26432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ádám Szita reassigned HIVE-26432: --------------------------------- > Improve LlapCacheAwareFs by caching file status information > ----------------------------------------------------------- > > Key: HIVE-26432 > URL: https://issues.apache.org/jira/browse/HIVE-26432 > Project: Hive > Issue Type: Improvement > Reporter: Ádám Szita > Assignee: Ádám Szita > Priority: Major > > The current implementation of LlapCacheAwareFs is used to wrap InputStreams > of non-ORC file formatted file reads, if set up to utilize LLAP caching. > File content is cached by the calculated file ID and the required offsets > within the file. This is later served from cache, however LlapCacheAwareFs > acting as a FileSystem sometimes receives listStatus / getFileStatus calls > too, which is only proxied to the original FS. If such operation on the > original FS is slow, e.g. listing on S3, performance will be impacted. (This > is not the case with how ORC is integrated into LLAP cache as it's not acting > as a FS) > I propose we cache the file status information too besides the content. -- This message was sent by Atlassian Jira (v8.20.10#820010)