steveloughran commented on PR #5563:
URL: https://github.com/apache/hadoop/pull/5563#issuecomment-1517764238

   Looks good; just two issues to worry about.
   Minor: checkstyle is unhappy about line length. Please keep lines to 100 chars or 
less.
   
   One bigger issue, which you already mentioned: excessively long filenames. 
S3 supports object keys of up to 1024 bytes, so long paths should work through the 
other block buffers, and MUST work here too.
   
   Looking at a table of filename length limits, on most filesystems there are 
255 chars to play with for the name, including block ID, span ID, etc.:
   https://www.baeldung.com/linux/bash-filename-limit
   
   How about adding a new test case, or modifying testRegularUpload(), to create 
a file with a name > 256 chars, just to see what happens?
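
   As a quick local check before wiring this into the S3A tests, a standalone 
probe along these lines would show what the local filesystem does with long 
names. It's only a sketch: `LongNameProbe` and `canCreate` are made-up names, 
not part of the PR or of Hadoop, and the result is platform dependent.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.InvalidPathException;
import java.nio.file.Path;

/** Hypothetical helper: checks whether a file name of a given length can be created. */
final class LongNameProbe {
    private LongNameProbe() {
    }

    /**
     * Try to create, then delete, an empty file whose name is exactly
     * nameLength chars long inside dir.
     *
     * @return true if the file was created and removed, false if the
     *         filesystem (or path parsing) rejected it
     */
    static boolean canCreate(Path dir, int nameLength) {
        // Build an ASCII-only name so that chars == bytes on the local FS.
        StringBuilder name = new StringBuilder("probe-");
        while (name.length() < nameLength) {
            name.append('x');
        }
        name.setLength(nameLength);
        try {
            Path file = dir.resolve(name.toString());
            Files.createFile(file);
            Files.delete(file);
            return true;
        } catch (IOException | InvalidPathException e) {
            // Typically "File name too long" on Linux; other failures
            // (permissions, an existing file) also end up here.
            return false;
        }
    }
}
```

   I'd expect a 255 char name to succeed and a 300 char one to fail on most 
Linux filesystems, but that is exactly the thing to verify.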
   
   Oh, and we have to remember about Windows too, though as the Java APIs go 
through the Unicode ones, its 255 char limit doesn't always hold.
   
   Maybe the solution is to cut paths down such that the first few and final 
chars are always preserved. Along with the span ID, that should be good, 
though it does depend on the filenames generated. Does Accumulo generate 
sufficiently unique ones that the last, say, 128 chars will be something you 
can map to an upload?
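
   Roughly what I have in mind for the cutting down, as a sketch only. The 
class name, the 255 limit constant and the head length are all assumptions for 
illustration, not existing S3A code, and it counts Java chars rather than 
encoded bytes, so non-ASCII names would need a tighter limit.

```java
/** Hypothetical helper for shortening local buffer file names. */
final class BufferFileNames {
    /** Assumed per-name limit on most local filesystems. */
    static final int MAX_NAME_LENGTH = 255;

    private BufferFileNames() {
    }

    /**
     * Shorten name to at most maxLength chars by keeping the first
     * headLength chars and as many trailing chars as still fit.
     * Names that already fit are returned unchanged.
     */
    static String shorten(String name, int maxLength, int headLength) {
        if (headLength < 0 || headLength >= maxLength) {
            throw new IllegalArgumentException(
                "headLength must be in [0, maxLength)");
        }
        if (name.length() <= maxLength) {
            return name;
        }
        // Drop the middle: the head keeps prefixes such as block/span IDs,
        // the tail keeps the end of the original filename.
        int tailLength = maxLength - headLength;
        return name.substring(0, headLength)
            + name.substring(name.length() - tailLength);
    }
}
```

   Whether the surviving tail is unique enough to map back to an upload still 
depends on the filenames the application generates.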


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

