> Beside that, we may should add a kind of timeout to the url filter in > general. > Since it can happen that a user configure a regex for his nutch setup > that run in the same problem as we had run right now. > Something like below attached. > Would you agree? I can create a serious patch and test it if we are > interested to add this as a fail back into the sources.
+1 as a short term solution. In the long term, I think we should try to reproduce it and analyze what really happen. (I will commit some minimal unit test in the next few days). Regards Jérôme -- http://motrech.free.fr/ http://www.frutch.org/