Re: Custom Distributed crawl - NDFS?

2006-03-16 Thread Grégory Debord
Thanks for your help Marko, I'll have a look a this project soon. This will probably help me a lot. -- greg Le 16 mars 06 à 14:13, Marko Bauhardt a écrit : Am 16.03.2006 um 12:50 schrieb Grégory Debord: Hi all, I would like to implement a distributed crawl which would be something like

Re: Custom Distributed crawl - NDFS?

2006-03-16 Thread Marko Bauhardt
Am 16.03.2006 um 12:50 schrieb Grégory Debord: Hi all, I would like to implement a distributed crawl which would be something like this : The hadoop project is used for working with a dfs. In hadoop exists one master (namenode, jobtracker) and n slaves (datanodes and tasktrackers).

Custom Distributed crawl - NDFS?

2006-03-16 Thread Grégory Debord
Hi all, I would like to implement a distributed crawl which would be something like this : - A main machine that would store all nutch database (1) - n machines that would only be used for fetching (because I use specific computations in the fetcher process which are time consuming). (2) After f