Re: how to use hadoop apis with cloudera distribution ?

2011-03-08 Thread Alejandro Abdelnur
If you write your code within a Maven project (which you can open from Eclipse), then you should add the following to your pom.xml: * Define the Cloudera repository: ... cdh.repo https://repository.cloudera.com/content/groups/cloudera-repos Cloudera Repositories
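The repository definition in the message above appears to have lost its XML tags in the archive. A minimal sketch of how those three values would likely sit in a pom.xml follows, assuming the standard Maven `<repositories>` layout. Only the id, URL, and name come from the message; the element structure around them is an assumption.

```xml
<!-- Sketch only: element layout is the standard Maven one (assumed);
     the id, url, and name values are taken from the message. -->
<repositories>
  <repository>
    <id>cdh.repo</id>
    <url>https://repository.cloudera.com/content/groups/cloudera-repos</url>
    <name>Cloudera Repositories</name>
  </repository>
</repositories>
```

The message is cut off before any dependency declaration, so the Hadoop artifact coordinates and version are not shown here.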

Re: how to use hadoop apis with cloudera distribution ?

2011-03-08 Thread Marcos Ortiz Valmaseda
You can check the Cloudera training videos, which include a screencast explaining how to develop for Hadoop using Eclipse: http://www.cloudera.com/presentations http://vimeo.com/cloudera Now, to work with the Hadoop APIs in Eclipse and develop applications based on Hadoop, you can use the Kamasphe

Re: how to use hadoop apis with cloudera distribution ?

2011-03-08 Thread Mapred Learn
Thanks, Marcos! I was trying to use CDH3 with Eclipse and could not work out why Eclipse complains about the import statements for the Hadoop APIs when Cloudera already includes them. I did not understand how CDH3 works with Eclipse: does it download the Hadoop APIs when we add SVN URLs? On Tue, Mar 8, 201

Re: Dataset comparison and ranking - views

2011-03-08 Thread Chase Bradford
How much smaller is the smaller dataset? If you can use the DistributedCache (DC) to precompute bigrams, locations, etc., and hold all the results in memory during setup before mapping over the large dataset, then I would suggest that approach. Another trick I've seen for similar problems where the final score is a

Re: Dataset comparison and ranking - views

2011-03-08 Thread Marcos Ortiz
On Tue, 2011-03-08 at 10:51 +0530, Sonal Goyal wrote:
> Hi Marcos,
>
> Thanks for replying. I think I was not very clear in my last post. Let me describe my use case in detail.
>
> I have two datasets coming from different sources, let's call them dataset1 and dataset2. Both of them contain re

how to use hadoop apis with cloudera distribution ?

2011-03-08 Thread Marcos Ortiz
On Tue, 2011-03-08 at 07:16 -0800, Mapred Learn wrote:
> Hi,
> I downloaded the CDH3 VM for Hadoop, but if I want to use something like:
>
> import org.apache.hadoop.conf.Configuration;
>
> in my Java code, what else do I need to do?
Can you see all the tutorials that Cloudera has on it

Re: how to use hadoop apis with cloudera distribution ?

2011-03-08 Thread Mapred Learn
> Hi,
> I downloaded the CDH3 VM for Hadoop, but if I want to use something like:
>
> import org.apache.hadoop.conf.Configuration;
>
> in my Java code, what else do I need to do?
>
> Do I need to download Hadoop from Apache?
> If yes, then what does CDH3 do?
> If not, then where