Hello Mike,
For distributed database, do we have install nutch on all servers or only one.
does the database automatically distribute the database or we have to let it know where to store. Let us say i want to store 100 million urls, and i am using 2 machines to do this. how long will it take to store 100 million urls and is 500 gigs of hard-drive enough. i need to test this as soon as possible if you can help me out and let me know what to do.


thank you
pierre

From: Michael Cafarella <[EMAIL PROTECTED]>
Reply-To: [EMAIL PROTECTED]
To: [EMAIL PROTECTED]
Subject: Re: [Nutch-dev] Distributed Proposal
Date: 08 Feb 2004 22:26:49 -0800


Hi Stefan,


  I can finally start to answer your question.  I've written
a long document about how the WebDB works.  In fact, I haven't
yet gotten to the distributed part!  But everything I've written
so far is a prereq to understanding how the distributed code works.
I think the distributed element should be comparatively small.

Let me know if it makes sense.

--Mike


<< webdb.txt >>

_________________________________________________________________
The new MSN 8: advanced junk mail protection and 2 months FREE* http://join.msn.com/?page=dept/bcomm&pgmarket=en-ca&RU=http%3a%2f%2fjoin.msn.com%2f%3fpage%3dmisc%2fspecialoffers%26pgmarket%3den-ca




-------------------------------------------------------
SF.Net is sponsored by: Speed Start Your Linux Apps Now.
Build and deploy apps & Web services for Linux with
a free DVD software kit from IBM. Click Now!
http://ads.osdn.com/?ad_id=1356&alloc_id=3438&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to