Hi all, I use hadoop-0.21.0 distribution. I have a large number of small files (KB). Is there any efficient way of handling it in hadoop?
I have heard that solution for that problem is using: 1. HAR (hadoop archives) 2. cat on files I would like to know if there are any other solutions for processing large number of small files. Regards, Naveen Mahale