On Sat, Jan 24, 2009 at 10:03 AM, Mark Kerzner <markkerz...@gmail.com> wrote:
> Hi,
>
> there is a performance penalty in Windows (pardon the expression) if you put
> too many files in the same directory. The OS becomes very slow, stops seeing
> them, and lies about their status to my Java requests. I do not know if this
> is also a problem in Linux, but in HDFS - do I need to balance a directory
> tree if I want to store millions of files, or can I put them all in the same
> directory?
>

From my old Windows days... There is a registry setting to turn off a feature whereby Windows keeps a mapping of 8.3 filenames to the full filenames. I can't recall it exactly, but it is worth looking for.

Also try naming your files so that the 'unique' part of the filename comes first, e.g. 123_inventoryid.ext is 'better' than inventoryid_123.ext
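
If it helps, the renaming could look roughly like the sketch below. The function name `unique_part_first` and the underscore separator are just my assumptions for illustration, and it only handles bare file names (no directory part), so treat it as a starting point rather than a finished tool.

```python
from pathlib import PurePath


def unique_part_first(filename: str, sep: str = "_") -> str:
    """Move the trailing unique token of 'label_123.ext' to the front.

    Hypothetical helper: assumes a bare file name whose unique part is
    the last token before the extension, separated by `sep`.
    """
    path = PurePath(filename)
    stem, suffix = path.stem, path.suffix

    # Split on the LAST separator: everything before it is the shared
    # label, everything after it is the unique part.
    label, found, unique = stem.rpartition(sep)
    if not found:
        # No separator in the name, so there is nothing to reorder.
        return filename

    return f"{unique}{sep}{label}{suffix}"


print(unique_part_first("inventoryid_123.ext"))
```

Names without the separator are returned unchanged, so it should be safe to run over a mixed directory listing.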

HTH Mark

> Thank you,
> Mark
>