On Wed, 24 Jun 2009 12:45:59 +0200, Usman Waheed wrote:
>Hi All,
>
>Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3?
>I tried but interestingly the output was not what i expected versus
>what i got when my data was in uncompressed format.
>
>Thanks,
>Usman

Not AFAIK, but we have added bzip2 support as of 0.19 (see JIRA HADOOP-3646), and have splitting support working (see JIRA HADOOP-4012) as a patch.

Getting HADOOP-4012 committed has been painful, but it seems close.

-John Heidemann
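
For context on why splitting a .bz2 file is feasible at all: every compressed block in a bzip2 stream starts with the same 48-bit marker (0x314159265359), so a reader dropped into the middle of a file can scan forward to the next marker and start decoding there. The sketch below is only an illustration of that idea in Python using the standard bz2 module; it is not the code from HADOOP-3646 or HADOOP-4012, and the function name find_block_offsets is made up for this example. Markers are bit-aligned rather than byte-aligned, so the scan works one bit at a time.

```python
import bz2

# 48-bit marker that begins every compressed block in a bzip2 stream.
BLOCK_MAGIC = 0x314159265359
MASK_48 = (1 << 48) - 1


def find_block_offsets(data: bytes) -> list[int]:
    """Return the bit offsets at which the block marker occurs.

    Hypothetical helper for illustration only. A chance match inside
    compressed data is possible in principle but extremely unlikely.
    """
    offsets = []
    window = 0
    bits_seen = 0
    for byte in data:
        # Feed bits most-significant first, as bzip2 writes them.
        for shift in range(7, -1, -1):
            window = ((window << 1) | ((byte >> shift) & 1)) & MASK_48
            bits_seen += 1
            if bits_seen >= 48 and window == BLOCK_MAGIC:
                offsets.append(bits_seen - 48)
    return offsets


# Roughly 512 KB of input at compresslevel=1 (100 KB blocks),
# so the stream contains more than one block.
raw = bytes(range(256)) * 2000
compressed = bz2.compress(raw, compresslevel=1)
offsets = find_block_offsets(compressed)
```

The first marker sits right after the 4-byte "BZh1" file header, i.e. at bit offset 32; later markers can fall at any bit position, which is part of what makes a splitting reader fiddly to get right.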