MapReduce: Two Reduce Tasks

2008-04-15 Thread Chaman
Hello, I am developing some applications in which I can use the output of Map to 3-4 different Reduce tasks ? What is the best way to accomplish such task ? Thanks. With regards, csv -- View this message in context: http://www.nabble.com/MapReduce%3A-Two-Reduce-Tasks-tp16703412p16703412.html

More MapReduce Examples

2008-03-22 Thread Chaman Singh Verma
Hello, Where can I find more examples for MapReduce ? The simplicity of MapReduce raises many questions in my mind and I am looking for some complex applications. Thanks. csv - Be a better friend, newshound, and know-it-all with Yahoo! Mobile. Try it no

One Simple Question About Hadoop DFS

2008-03-23 Thread Chaman Singh Verma
Hello, I am exploring Hadoop and MapReduce and I have one very simple question. I have 500GB dataset on my local disk and I have written both Map-Reduce functions. Now how should I start ? 1. I copy the data from local disk to DFS. I have configured DFS with 100 machines. I hope that it will

Archive

2008-04-15 Thread Chaman Singh Verma
Hello How can I see browse through the Archive of Hadoop users ? Everytime I try I get the following message: Not Found The requested URL /mail/core-user/ was not found on this server. - Apache/2.2.8 (Unix) Server at hadoop.apache.org Port 80 between -00-

Large Weblink Graph

2008-04-15 Thread Chaman Singh Verma
Hello, Does anyone have large Weblink graph ? I want to experiment and benchmark MapReduce with some real dataset. Thanks, With regards, Chaman Singh Verma, Poona, India between -00-00 and -99-99

Re: Large Weblink Graph

2008-04-15 Thread Chaman Singh Verma
Thanks a lot Andrzej. csv Andrzej Bialecki <[EMAIL PROTECTED]> wrote: Ted Dunning wrote: > Please include the Mahout sub-project when you report what you find. This > kind of dataset would be very helpful for that project as well. > > And you might find something helpful there as well. The go

Page Ranking, Hadoop And MPI.

2008-04-15 Thread Chaman Singh Verma
ndard Message Passing (MPI) may not be suitable for this purpose(Fault tolerance issue) Suppose we implement ranking algorithm on top of Hadoop, what could be the best way/best distributed algorithm/library etc ? With regards. Chaman Singh Verma Poona, India between -00-00 and -99-99

Thanks

2008-04-15 Thread Chaman Singh Verma
Thanks Paco for the information. csv Paco NATHAN <[EMAIL PROTECTED]> wrote: Another site which has data sets available for study is UCI Machine Learning Repository: http://archive.ics.uci.edu/ml/ On Tue, Apr 15, 2008 at 8:29 AM, Chaman Singh Verma wrote: > Does anyone h

Re: MapReduce: Two Reduce Tasks

2008-04-15 Thread Chaman Singh Verma
the examples, which I have come across contains one Map Function and One Reduce Function. I want one Map Function and 3-4 Reduce Functions which can utilize the output of Map Function. Thanks, With Regards, Chaman Singh Chaman Singh Verma wrote: > > Hello, > > I am dev

EigenValue Calculations, Hadoop and MPI.

2008-04-15 Thread Chaman Singh Verma
y etc ? Some hint from Google implementation could be extremely useful. With regards. csv - Chaman Singh Verma Poona, India -- View this message in context: http://www.nabble.com/EigenValue-Calculations%2C-Hadoop-and-MPI.-tp16703423p16703423.html Sent from the Hadoop core-user mailing

Re: Page Ranking, Hadoop And MPI.

2008-04-15 Thread Chaman Singh Verma
Hello, It was wonderful explanation about the role and beauty of eigenvalues in ranking. But I am still far from the real answer/hint. How Google handle such a large matrix and solve it ? Do they use MapReduce framework for these process or adopt standard and reliable Message Passing Interface

Re: EigenValue Calculations, Hadoop and MPI.

2008-04-15 Thread Chaman Singh Verma
ments on the web as detail as possible. Thanks. On Wed, Apr 16, 2008 at 3:17 AM, Chaman Singh Verma wrote: > > Hello, > > After googling for many days, I couldn't get one answer from many of the > published reports on > Ranking algorithm done by Google. Since Google uses GF

Aborting Map Function

2008-04-16 Thread Chaman Singh Verma
? Thanks. With Regards - Chaman Singh Verma Poona, India -- View this message in context: http://www.nabble.com/Aborting-Map-Function-tp16722552p16722552.html Sent from the Hadoop core-user mailing list archive at Nabble.com.

Backup Tasks in Hadoop MapReduce.

2008-04-16 Thread Chaman Singh Verma
Hello, I am curious to know whether Hadoop MapReduce has the feature of "Backup Tasks" as described in the seminal paper "MapReduce:Simplified Data Processing in Large Cluster" ( Dean and Ghemawat ). Any implementation detail will be extremely valuable. Thanks. With R