Re: Project ideas !

2009-10-17 Thread Amund Tveit
2009/10/14 Patterson, Josh > Siddu, > If this is for an undergraduate class, I would suggest something that > allows you to get some work in with basic data structures such as > building an inverted index over a few million documents (maybe Wikipedia > pages?). You will also need to get a general

Re: type mismatch, newbie help

2009-10-17 Thread Aaron
Yep that was it. I changed the method signature to LongWriteable for the 1st parameter (also annotated it w/ Override to see the compile error) thanks for your help! On Oct 15, 2009, at 6:40 PM, Ahad Rana wrote: Hi, Unfortunately Mapper is now a class and from your call stack it seems t

Re: Cascading jobs in hadoop

2009-10-17 Thread Kevin Weil
Bharath, The mapred package is largely deprecated, as hadoop is moving towards the mapreduce package. Use mapreduce for any new jobs you write, because mapred will go away in some future release. For now, both are there to give developers time to rewrite existing older jobs. Kevin On Sat, Oct 3

Re: Project ideas !

2009-10-17 Thread sudha sadhasivam
Has any publications been done in this area? G Sudha Sadasivam --- On Sat, 10/17/09, Siddu wrote: From: Siddu Subject: Re: Project ideas ! To: common-user@hadoop.apache.org Date: Saturday, October 17, 2009, 3:29 PM On Wed, Oct 14, 2009 at 5:05 PM, tim robertson wrote: > I am interested to s

Re: Project ideas !

2009-10-17 Thread Siddu
On Wed, Oct 14, 2009 at 5:05 PM, tim robertson wrote: > I am interested to see more spatial processing carried out on hadoop. > I have done basic spatial joins intersecting 100s millions of points with 100s thousands of polygons but this is all. It's something I'd like to spend time researching,

Re: How can I run such a mapreduce program?

2009-10-17 Thread Aaron Kimball
If you're working with the Cloudera distribution, you can install CDH1 (0.18.3) and CDH2 (0.20.1) side-by-side on your development machine. They'll install to /usr/lib/hadoop-0.18 and /usr/lib/hadoop-0.20; use /usr/bin/hadoop-0.18 and /usr/bin/hadoop-0.20 to execute, etc. See http://archive.cloud