Hemanth Yamijala created AMBARI-17834:
-----------------------------------------

             Summary: HDFS copy thread in OutputHDFSFile is not removing files 
that are processed 
                 Key: AMBARI-17834
                 URL: https://issues.apache.org/jira/browse/AMBARI-17834
             Project: Ambari
          Issue Type: Bug
          Components: ambari-logsearch
            Reporter: Hemanth Yamijala


Consider the HDFS copy thread here: 
https://github.com/apache/ambari/blob/trunk/ambari-logsearch/ambari-logsearch-logfeeder/src/main/java/org/apache/ambari/logfeeder/output/OutputHDFSFile.java#L177

It uses a data structure, localReadyFiles, that holds the list of files to copy 
to HDFS. But AFAICS, processed files are never removed from this data structure. 
This could result in multiple incorrect behaviours:

* Over a period of time, this structure could keep growing indefinitely. While 
the check for existence of files will still allow progress, each pass over the 
list will take longer and longer to run.
* The check to wait when there are no files to process will never pass, because 
the size of the data structure will never return to 0 once a file has been 
added. This could lead to tight-loop checks and unnecessary CPU utilization.
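
To illustrate the kind of change that would address both points, here is a 
minimal sketch, not the actual OutputHDFSFile code. Only the name 
localReadyFiles comes from the source; the class ReadyFileQueue, the Copier 
interface (a stand-in for the real HDFS copy call), and the methods drainOnce 
and awaitWork are hypothetical names made up for this example. The idea is 
that an entry is dropped from the list once it has been copied or is found to 
be missing, so the list can become empty again and the wait check can pass.

```java
import java.io.File;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; not the real OutputHDFSFile implementation.
class ReadyFileQueue {

    /** Hypothetical stand-in for whatever performs the copy to HDFS. */
    public interface Copier {
        void copy(File file) throws IOException;
    }

    // Same name as in the issue; here it doubles as the lock and wait monitor.
    private final List<File> localReadyFiles = new ArrayList<>();

    /** Registers a file as ready and wakes up a waiting copy thread. */
    public void add(File file) {
        synchronized (localReadyFiles) {
            localReadyFiles.add(file);
            localReadyFiles.notifyAll();
        }
    }

    public int size() {
        synchronized (localReadyFiles) {
            return localReadyFiles.size();
        }
    }

    /**
     * One pass of the copy loop. Files that were copied, or that no longer
     * exist locally, are removed from localReadyFiles. Files whose copy
     * failed stay in the list so a later pass can retry them.
     * Returns the number of files copied in this pass.
     */
    public int drainOnce(Copier copier) {
        List<File> batch;
        synchronized (localReadyFiles) {
            batch = new ArrayList<>(localReadyFiles); // snapshot; copy outside the lock
        }
        int copied = 0;
        for (File file : batch) {
            boolean processed;
            if (!file.exists()) {
                processed = true; // nothing left to copy, drop the entry
            } else {
                try {
                    copier.copy(file);
                    copied++;
                    processed = true;
                } catch (IOException e) {
                    processed = false; // keep the entry for a retry
                }
            }
            if (processed) {
                synchronized (localReadyFiles) {
                    localReadyFiles.remove(file);
                }
            }
        }
        return copied;
    }

    /** Blocks (up to timeoutMillis) while there is nothing to process. */
    public void awaitWork(long timeoutMillis) throws InterruptedException {
        synchronized (localReadyFiles) {
            if (localReadyFiles.isEmpty()) {
                localReadyFiles.wait(timeoutMillis);
            }
        }
    }
}
```

With this shape, a copy thread would alternate between awaitWork(...) and 
drainOnce(...). Whether failed copies should be retried or dropped is a design 
choice this sketch leaves open; it retries them.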



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
