[ https://issues.apache.org/jira/browse/HDFS-15811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated HDFS-15811:
----------------------------------
    Labels: pull-request-available  (was: )

> completeFile should log final file size
> ---------------------------------------
>
>                 Key: HDFS-15811
>                 URL: https://issues.apache.org/jira/browse/HDFS-15811
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Zehao Chen
>            Assignee: Zehao Chen
>            Priority: Minor
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Jobs, particularly hive queries by non-headless users, can create an
> excessive number of files (many hundreds of thousands). A single user's query
> can generate a sustained burst of 60-80% of all creates for tens of minutes
> or more and impact overall cluster performance. Adding the file size to the
> logline allows us to identify excessive tiny or large files.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)