A defect in org.apache.nutch.analysis.NutchAnalysis?

2005-11-13 Thread Game Now
Hi all, I read the code of org.apache.nutch.analysis.NutchAnalysis and found the following code at line 126: && field == Clause.DEFAULT_FIELD Both field and Clause.DEFAULT_FIELD are of type String, so I think it should be written as: && field.equals(Clause.DEFAULT_FI
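
A minimal sketch of the distinction the message raises, not the Nutch source itself: in Java, `==` on two String references tests whether they are the same object, while `equals` compares their characters. `StringCompareSketch`, its `DEFAULT_FIELD` value and both helper methods are hypothetical stand-ins for illustration. Whether the identity check at line 126 is intended depends on how the parser assigns `field`, which the excerpt does not show.

```java
public class StringCompareSketch {
    // Hypothetical stand-in for Clause.DEFAULT_FIELD; the real value is not shown in the message.
    static final String DEFAULT_FIELD = "DEFAULT";

    // Reference comparison: true only when field is the very same object as the constant.
    static boolean sameReference(String field) {
        return field == DEFAULT_FIELD;
    }

    // Value comparison: true whenever the characters match; also safe when field is null.
    static boolean sameValue(String field) {
        return DEFAULT_FIELD.equals(field);
    }

    public static void main(String[] args) {
        String constant = DEFAULT_FIELD;        // the constant itself
        String copy = new String("DEFAULT");    // equal content, distinct object

        System.out.println("constant, ==     : " + sameReference(constant));
        System.out.println("copy, ==         : " + sameReference(copy));
        System.out.println("copy, equals     : " + sameValue(copy));
    }
}
```

If every code path assigns the constant itself to `field`, both checks behave the same; they diverge only when an equal string arrives as a separate object, for example one built at run time from user input.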

Question of Range search

2005-11-13 Thread Game Now
Hi all, I want Nutch to do a range search, such as price:{1000 TO 2000} or date[20050101 TO 2005]. But the org.apache.nutch.searcher.Query#parse() method parses them into "price 1000 2000" and "date 20050101 2005" when I pass them in. Can anybody help me complete a range search
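
As an illustration of what recognizing such a clause might involve, here is a rough sketch that pulls the field, bounds and bracket type out of the two example strings with a regular expression. `RangeClauseSketch` and everything in it are hypothetical and not part of Nutch; it assumes the Lucene query-syntax convention that square brackets are inclusive and curly braces exclusive, and it compares terms as plain strings.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Nutch: recognizes a single range clause.
public class RangeClauseSketch {
    // field, optional colon, opening bracket, lower bound, TO, upper bound, closing bracket
    private static final Pattern RANGE = Pattern.compile(
        "(\\w+):?([\\[{])\\s*(\\S+)\\s+TO\\s+(\\S+?)\\s*([\\]}])");

    final String field;
    final String lower;
    final String upper;
    final boolean inclusive;

    RangeClauseSketch(String field, String lower, String upper, boolean inclusive) {
        this.field = field;
        this.lower = lower;
        this.upper = upper;
        this.inclusive = inclusive;
    }

    // Returns null when the text is not a range clause or the brackets do not match.
    static RangeClauseSketch parse(String text) {
        Matcher m = RANGE.matcher(text.trim());
        if (!m.matches()) {
            return null;
        }
        boolean squareOpen = m.group(2).equals("[");
        boolean squareClose = m.group(5).equals("]");
        if (squareOpen != squareClose) {
            return null;
        }
        return new RangeClauseSketch(m.group(1), m.group(3), m.group(4), squareOpen);
    }

    // Compares terms as strings, so numeric values need equal width to order correctly.
    boolean contains(String value) {
        int lo = value.compareTo(lower);
        int hi = value.compareTo(upper);
        return inclusive ? (lo >= 0 && hi <= 0) : (lo > 0 && hi < 0);
    }

    public static void main(String[] args) {
        RangeClauseSketch price = parse("price:{1000 TO 2000}");
        System.out.println(price.field + " " + price.lower + " " + price.upper
            + " inclusive=" + price.inclusive);
        System.out.println("1500 in range: " + price.contains("1500"));
    }
}
```

Because the comparison is lexicographic, bounds of different lengths such as 20050101 and 2005 do not order the way their numeric values would; padding values to a fixed width is one common way around that.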

InterruptedException from ControllerThreadSocketFactory.SocketTask

2005-11-13 Thread Chris Schneider
Gang, I also sent the following to <[EMAIL PROTECTED]>, but I imagine some of you might not be on that list (or check it as closely): My crawl died with an InterruptedException the other day, and I'm wondering whether any of you have run into the same problem. Reviewing the code, it seems l

Re: suspicious outlink count

2005-11-13 Thread Piotr Kosiorowski
EM wrote: 202443 Pages consumed: 13 (at index 13). Links fetched: 233386. 202443 Suspicious outlink count = 30442 for [http://www.dmoz.org/]. 202444 Pages consumed: 135000 (at index 135000). Links fetched: 272315. If there is maxoutlinks already specified in the xml config, why does nut

mapper & Exceptions

2005-11-13 Thread Stefan Groschupf
Hi Doug, a very small improvement suggestion. Currently the map method in the mapper interface can throw an IOException. I would find it better if it just threw a general Exception, since a map task can fail for other reasons as well, e.g. in the map search server scenario you suggested.
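
A small sketch of the trade-off behind this suggestion, using simplified, hypothetical interfaces rather than the actual Nutch mapper signatures. With `throws IOException`, an implementation has to wrap any other checked failure; with `throws Exception`, the failure propagates unchanged. `IoMapper`, `GeneralMapper`, `SearchFailedException` and `search` are all invented for illustration.

```java
import java.io.IOException;

public class MapperExceptionSketch {
    // Hypothetical, simplified interfaces; not the actual Nutch signatures.
    interface IoMapper {
        String map(String key) throws IOException;
    }

    interface GeneralMapper {
        String map(String key) throws Exception;
    }

    // Hypothetical checked failure that has nothing to do with I/O.
    static class SearchFailedException extends Exception {
        SearchFailedException(String message) {
            super(message);
        }
    }

    // Stand-in for work a map task might do, such as querying a search server.
    static String search(String key) throws SearchFailedException {
        if (key.isEmpty()) {
            throw new SearchFailedException("empty query");
        }
        return "hit:" + key;
    }

    // Under the narrower signature the failure must be wrapped in an IOException.
    static final IoMapper WRAPPING = key -> {
        try {
            return search(key);
        } catch (SearchFailedException e) {
            throw new IOException("map failed", e);
        }
    };

    // Under the broader signature the original exception passes through as-is.
    static final GeneralMapper DIRECT = key -> search(key);

    public static void main(String[] args) throws Exception {
        System.out.println(WRAPPING.map("nutch"));
        System.out.println(DIRECT.map("nutch"));
    }
}
```

The broader signature is more convenient for implementers, at the cost that callers can no longer tell from the declaration which kinds of failure to expect.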