vineetgarg02 opened a new pull request #552: Hive 21279 URL: https://github.com/apache/hive/pull/552 This patch avoids rename/move (to tmpPath) during File Sink operation and creates a list of file to pass it over to Fetch operator to fetch from. In context of cloud file system file I/Os are expensive so avoiding even a single operation provides sufficient boost. Internal experiments show more than 50% boost in fetch result performance.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services