>So you have multiple xmls in a single file and you have many such files.. >In that case, the best answer is the StreamXmlRecordReader.
>Or you can write your own InputFormat to create splits such that each split >in an xml file in itself, or each record in a split is a complete xml >message. Thanks Amandeep! I want to use the split efficiently. If I use one split per xml file then it wont be very efficient since size of xml files is small ~ 1meg. What I want to do is to have multiple files in one split. I will update you once I am done with this. -- Vipul Sharma sharmavipul AT gmail DOT com