Doug Cutting wrote:
Perhaps we could enhance the logic of the loop at Fetcher.java:320. Currently this exits the fetcher when all threads exceed a timeout. Instead it could kill any thread that exceeds the timeout, and start a new thread to replace it. So instead of just keeping a count of fetcher threads, we could maintain a table of all running fetcher threads, each with a lastRequestStart time, rather than a global lastRequestStart. Then, in this loop, we can check whether any thread has exceeded the maximum timeout, and, if it has, kill it and start a new thread. When no urls remain, threads will exit and remove themselves from the set of threads, so the loop can exit as it does now, when there are no more running fetcher threads. Does this make sense? It would prevent all sorts of thread hangs, not just in regexes.
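A minimal sketch of the proposed watchdog, outside the actual Nutch code: the class and method names (FetcherWatchdog, touch, isDone) are illustrative, not Fetcher's real internals. Each fetcher thread records its own lastRequestStart in a shared table; the monitor loop replaces any thread that exceeds the timeout and exits once the table is empty.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch of the per-thread timeout table described above.
public class FetcherWatchdog {
    // One entry per running fetcher thread: when it last started a request.
    private final Map<Thread, Long> lastRequestStart = new ConcurrentHashMap<>();
    private final long timeoutMs;
    private final Runnable fetchTask; // stand-in for the fetcher thread body

    public FetcherWatchdog(long timeoutMs, Runnable fetchTask) {
        this.timeoutMs = timeoutMs;
        this.fetchTask = fetchTask;
    }

    private void startFetcherThread() {
        Thread t = new Thread(() -> {
            try {
                fetchTask.run();
            } finally {
                // When no urls remain the thread exits and removes itself,
                // so the monitor loop below can terminate.
                lastRequestStart.remove(Thread.currentThread());
            }
        });
        lastRequestStart.put(t, System.currentTimeMillis());
        t.start();
    }

    /** Called by a fetcher thread just before it issues each request. */
    public void touch() {
        lastRequestStart.put(Thread.currentThread(), System.currentTimeMillis());
    }

    public boolean isDone() {
        return lastRequestStart.isEmpty();
    }

    /** Monitor loop: replaces hung threads, exits when none are running. */
    public void run(int threadCount) throws InterruptedException {
        for (int i = 0; i < threadCount; i++) {
            startFetcherThread();
        }
        while (!lastRequestStart.isEmpty()) {
            long now = System.currentTimeMillis();
            for (Map.Entry<Thread, Long> e : lastRequestStart.entrySet()) {
                if (now - e.getValue() > timeoutMs) {
                    // Drop the hung thread from the table and replace it.
                    // interrupt() is used here; a thread truly stuck in
                    // regex backtracking never checks interrupt status, so
                    // "killing" it for real would need something stronger
                    // (e.g. the deprecated Thread.stop).
                    lastRequestStart.remove(e.getKey());
                    e.getKey().interrupt();
                    startFetcherThread();
                }
            }
            Thread.sleep(100);
        }
    }
}
```

Note the remaining caveat with this approach: Thread.interrupt only works if the hung code ever checks its interrupt status, which a runaway regex does not, so a replaced-but-not-dead thread still occupies memory until the JVM exits.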
+1, sounds like a good solution to this.

--
Best regards,
Andrzej Bialecki
http://www.sigram.com

Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
