Very cool, we are using Debian and I checked Cloudera's website. You have packages for the Debian platform.
Will check it out and install on a test cluster.

Thanks much,
Usman

This is correct - thanks for the note Jason. You can see the current
patch list for Cloudera's Distribution (based on 18.3) at:
http://www.cloudera.com/hadoop-manifest

In addition to Bzip2, we have patched in: DBInputFormat, the fair
scheduler, job level task limiting, "soft" fd leak fix, a fix for HDFS
under-replication, shuffle improvements, EC2/S3 improvements, and
Sqoop - database import for Hadoop.

You can download RPMs and Ubuntu packages as well as preconfigured EC2
images from: http://www.cloudera.com/hadoop

Cheers,
Christophe

On Wed, Jun 24, 2009 at 6:47 AM, jason hadoop<jason.had...@gmail.com> wrote:
I believe the cloudera 18.3 supports bzip2

On Wed, Jun 24, 2009 at 3:45 AM, Usman Waheed <usm...@opera.com> wrote:

Hi All,

Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3?
I tried but interestingly the output was not what i expected versus what i
got when my data was in uncompressed format.

Thanks,
Usman




--
Pro Hadoop, a book to guide you from beginner to hadoop mastery,
http://www.amazon.com/dp/1430219424?tag=jewlerymall
www.prohadoopbook.com a community for Hadoop Professionals







--
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/

Reply via email to