Lourival Júnior wrote:

> I am certain that Nutch is what you are looking for. Take a look at
> Nutch's documentation for more details and you will see :).


An alternative is WebSPHINX, but it's not really maintained anymore.
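
If the three steps in the quoted question are all that is needed, a small
script may also be enough. The following is only a rough sketch in Python
(standard library only), not how Nutch or websphinx work. The names
AnchorParser, fetch, crawl and save_links are made up for illustration, "same
domain" is assumed to mean "same host name", and step 3 is read as "write the
collected links to a file, one per line". It ignores robots.txt, politeness
delays and non-HTML content, which a real crawler has to handle.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class AnchorParser(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def fetch(url):
    """Hypothetical helper: download one page as text (no retries, no robots.txt)."""
    with urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl(seed, max_depth=3, fetch_page=fetch):
    """Breadth-first crawl: fetch pages for max_depth rounds, starting at seed,
    and record every link that stays on the seed's host."""
    host = urlparse(seed).netloc
    seen = {seed}
    frontier = [seed]
    for _ in range(max_depth):
        next_frontier = []
        for url in frontier:
            try:
                html = fetch_page(url)
            except OSError:
                continue  # unreachable page: skip it and carry on
            parser = AnchorParser()
            parser.feed(html)
            for href in parser.hrefs:
                # Resolve relative links and drop "#fragment" parts.
                link = urldefrag(urljoin(url, href)).url
                if urlparse(link).netloc == host and link not in seen:
                    seen.add(link)
                    next_frontier.append(link)
        frontier = next_frontier
    return sorted(seen)


def save_links(links, path):
    """Write the links to a text file, one per line."""
    with open(path, "w", encoding="utf-8") as out:
        for link in links:
            out.write(link + "\n")
```

For a list of sites, one would call crawl() once per seed URL and pass each
result to save_links() with a different file name. Links found in the last
round are recorded but not fetched themselves.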

HTH

Michael

>
> On 4/3/07, Meryl Silverburgh <[EMAIL PROTECTED]> wrote:
>
>>
>> Hi,
>>
>> I would like to know if it is a good idea to use the Nutch web
>> crawler.
>> Basically, this is what I need:
>> 1. I have a list of web sites.
>> 2. I want the web crawler to go through each site and parse the anchors. If
>> a link is in the same domain, repeat the same step, up to 3 levels deep.
>> 3. For each link, write to a new file.
>>
>> Is Nutch a good solution, or is there a better open source
>> alternative for my purpose?
>>
>> Thank you.
>>
>
>
>


-- 
Michael Wechner
Wyona      -   Open Source Content Management   -    Apache Lenya
http://www.wyona.com                      http://lenya.apache.org
[EMAIL PROTECTED]                        [EMAIL PROTECTED]
+41 44 272 91 61


_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general