2006/10/14, Toufeeq Hussain <[EMAIL PROTECTED]>:

> From internal tests with ntlmaps + Nutch the conclusion we came to was
> that though it "kinda-works" it puts a huge load on the Nutch server
> as ntlmaps is a major memory-hog and the mixture of the two leads to
> performance issues. For a PoC this will do but for
> production-deployments I would not suggest one goes the ntlmaps way.
>
> An alternate would be to have a separate ntlmaps-server ,a dedicated
> machine acting as the NTLM proxy for the Nutch-box which sits behind
> it.

I haven't noticed the added resource drain, but then again, I haven't
really tested all that much: the constraints on the partical project I
implemented the approach weren't very strict.
I'll keep my eye on the cpu usage.

> The right way would be to use the in-built authentication features of
> Nutch for Auth based crawling.

Nutch supports ntlm authentication? I see I've got some reading to
catch up on...

t.n.a.

-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to