hkeebler commented on a change in pull request #1287: Fix #1273 Fix Bulk 
Messages
URL: https://github.com/apache/accumulo/pull/1287#discussion_r306901038
 
 

 ##########
 File path: 
core/src/main/java/org/apache/accumulo/core/clientImpl/bulk/BulkImport.java
 ##########
 @@ -449,31 +450,31 @@ private Text toText(byte[] row) {
 
   }
 
-  private static List<FileStatus> filterInvalid(FileStatus[] files) {
-    ArrayList<FileStatus> fileList = new ArrayList<>(files.length);
+  public static List<FileStatus> filterInvalid(FileStatus[] files) {
 
+    ArrayList<FileStatus> fileList = new ArrayList<>(files.length);
     for (FileStatus fileStatus : files) {
 
       String fname = fileStatus.getPath().getName();
 
+      if (fileStatus.isDirectory()) {
+        log.debug("{} is a directory, ignoring.", fileStatus.getPath());
+        continue;
+      }
+
       if (fname.equals("_SUCCESS") || fname.equals("_logs")) {
 
 Review comment:
   Agree, they are hadoop working files that occur during the Bulk processing. 
Also, found a hadoop constant ...FileOutputCommitter.SUCCEEDED but can not 
locate a constant for "_logs".

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to