Greetings all,

I have just been handed the administration of our nutch implementation,
we are currently using nutch 0.7 and it very badly needs updating.
However we are evaluating several options, and I wanted to know about
where nutch is going as a project. I have not been able to find anything
in the wiki or in the mailing list archives with this information
(forgive me if I have missed it).

The central issue is that our needs are for our crawling our own
website with about 200,000 pages and documents with a single machine
containing nutch, not for crawling the web with a massively scalar
architecture. I have heard nutch is moving towards the latter and that
the former usage is becoming very slow in 0.8 compared to 0.7, is this
correct?

Thank you for helping me out.

Regards,


Anthony May
Web Developer
NZQA

********************************************************************************
This email may contain legally privileged information and is intended only for 
the addressee. It is not necessarily the official view or 
communication of the New Zealand Qualifications Authority. If you are not the 
intended recipient you must not use, disclose, copy or distribute this email or 
information in it. If you have received this email in error, please contact the 
sender immediately. NZQA does not accept any liability for changes made to this 
email or attachments after sending by NZQA. 

All emails have been scanned for viruses and content by MailMarshal. 
NZQA reserves the right to monitor all email communications through its network.

********************************************************************************

-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to