[ https://issues.apache.org/jira/browse/FLINK-19121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jingsong Lee updated FLINK-19121: --------------------------------- Summary: Avoid accessing HDFS frequently in HiveBulkWriterFactory (was: Avoid accessing HDFS in HiveBulkWriterFactory) > Avoid accessing HDFS frequently in HiveBulkWriterFactory > -------------------------------------------------------- > > Key: FLINK-19121 > URL: https://issues.apache.org/jira/browse/FLINK-19121 > Project: Flink > Issue Type: Bug > Components: Connectors / Hive > Affects Versions: 1.12.0, 1.11.1 > Reporter: Jingsong Lee > Priority: Blocker > > In HadoopPathBasedBulkWriter, getSize will invoke `FileSystem.exists` and > `FileSystem.getFileStatus`, but it is invoked per record. > There will be lots of visits to HDFS, may make HDFS pressure too high. -- This message was sent by Atlassian Jira (v8.3.4#803005)