Very cool, we are using Debian and I checked Cloudera's website. You have
packages for the Debian platform.
Will check it out and install on a test cluster.
Thanks much,
Usman
This is correct - thanks for the note Jason. You can see the current
patch list for Cloudera's Distribution (based on 18.3) at:
http://www.cloudera.com/hadoop-manifest
In addition to Bzip2, we have patched in: DBInputFormat, the fair
scheduler, job level task limiting, "soft" fd leak fix, a fix for HDFS
under-replication, shuffle improvements, EC2/S3 improvements, and
Sqoop - database import for Hadoop.
You can download RPMs and Ubuntu packages as well as preconfigured EC2
images from: http://www.cloudera.com/hadoop
Cheers,
Christophe
On Wed, Jun 24, 2009 at 6:47 AM, jason hadoop<jason.had...@gmail.com>
wrote:
I believe the cloudera 18.3 supports bzip2
On Wed, Jun 24, 2009 at 3:45 AM, Usman Waheed <usm...@opera.com> wrote:
Hi All,
Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3?
I tried but interestingly the output was not what i expected versus
what i
got when my data was in uncompressed format.
Thanks,
Usman
--
Pro Hadoop, a book to guide you from beginner to hadoop mastery,
http://www.amazon.com/dp/1430219424?tag=jewlerymall
www.prohadoopbook.com a community for Hadoop Professionals
--
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/