GitHub user pvillard31 commented on the issue:

https://github.com/apache/nifi/pull/2021

Hi @ijokarumawak. I'm not certain, but I think that 'if' statement was meant to cover the case where the file rolls over and the new file has exactly the same content as the rolled-over one. In that case (admittedly a very strange edge case) we would have:

``timestamp < file.lastModified() && length == file.length()``

The checksum would then be identical, so the boolean ``rolloverOccurred`` could be false. Again, that would be a really weird situation, and I'd say this condition is not needed. Your change makes sense to me, especially given the issue you raised about duplicated data.
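To make the edge case concrete, here is a minimal Java sketch. It is not the actual processor code: the class and method names are hypothetical, CRC32 is only assumed as the checksum, and the file is modelled as a byte array rather than a real ``File``. It shows that when the new file has the same bytes as the rolled-over one, a checksum comparison reports no rollover even though the quoted timestamp/length condition holds.

```java
import java.nio.charset.StandardCharsets;
import java.util.zip.CRC32;

// Hypothetical illustration only; names do not come from the NiFi code base.
class RolloverEdgeCase {

    // CRC32 over the whole array (assumed stand-in for the stored checksum).
    static long checksum(byte[] data) {
        CRC32 crc = new CRC32();
        crc.update(data, 0, data.length);
        return crc.getValue();
    }

    // Checksum-based detection: report a rollover only if the first
    // storedLength bytes of the current file no longer match the stored checksum.
    static boolean checksumSaysRolledOver(long storedChecksum, int storedLength, byte[] current) {
        if (current.length < storedLength) {
            return true; // file is shorter than what was already read
        }
        CRC32 crc = new CRC32();
        crc.update(current, 0, storedLength);
        return crc.getValue() != storedChecksum;
    }

    // The condition quoted in the comment: modified later, but same length.
    static boolean modifiedWithSameLength(long timestamp, long length,
                                          long lastModified, long fileLength) {
        return timestamp < lastModified && length == fileLength;
    }

    public static void main(String[] args) {
        byte[] oldFile = "line1\nline2\n".getBytes(StandardCharsets.UTF_8);
        byte[] newFile = "line1\nline2\n".getBytes(StandardCharsets.UTF_8); // identical content

        long storedChecksum = checksum(oldFile);
        long storedTimestamp = 1_000L; // last-modified time recorded in state
        long newLastModified = 2_000L; // the new file was written later

        System.out.println("same length, newer timestamp: "
                + modifiedWithSameLength(storedTimestamp, oldFile.length,
                                         newLastModified, newFile.length));
        System.out.println("checksum detects rollover: "
                + checksumSaysRolledOver(storedChecksum, oldFile.length, newFile));
    }
}
```

Under these assumptions the first check is true and the second is false, which is the only situation the extra condition would catch.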