[Nutch-dev] [jira] Created: (NUTCH-289) CrawlDatum should store IP address

2006-05-26 Thread Doug Cutting (JIRA)
CrawlDatum should store IP address -- Key: NUTCH-289 URL: http://issues.apache.org/jira/browse/NUTCH-289 Project: Nutch Type: Bug Components: fetcher Versions: 0.8-dev Reporter: Doug Cutting If the CrawlDatum stored

[Nutch-dev] [jira] Commented: (NUTCH-273) When a page is redirected, the original url is NOT updated.

2006-05-26 Thread Doug Cutting (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-273?page=comments#action_12413528 ] Doug Cutting commented on NUTCH-273: Redirects should really not be followed immediately anyway. We should instead note that it was redirected and to which URL in the fetc

[Nutch-dev] Re: Where exactly nutch scoring takes place ?

2006-05-26 Thread Andrzej Bialecki
Gal Nitzan wrote: Hi, The scoring in Nutch-08 is done in a plugin: scoring-opic. It is called from Indexr.java ... plus in 6 other places ... -- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Se

[Nutch-dev] RE: Where exactly nutch scoring takes place ?

2006-05-26 Thread Gal Nitzan
Hi, The scoring in Nutch-08 is done in a plugin: scoring-opic. It is called from Indexr.java HTH -Original Message- From: ahmed ghouzia [mailto:[EMAIL PROTECTED] Sent: Friday, May 26, 2006 3:16 PM To: nutch-user@lucene.apache.org; nutch-dev@incubator.apache.org Subject: Where exactly

[Nutch-dev] Where exactly nutch scoring takes place ?

2006-05-26 Thread ahmed ghouzia
I want to use nutch as an environment to test my proposed algorithm for web mining 1- Where exactly does the nutch score take place ? in which packages or files? 2- Can the LinkAnalysisTool be run at the intranet level?, some documents mentioned that it can take place only at the whole web craw

[Nutch-dev] 出售出口核销单

2006-05-26 Thread [EMAIL PROTECTED]
尊敬的老板您好: 我公司长期有出口核销单出售,每份250元。 任报,全国各口岸通关。如有需要请来电联系。 此致 祝商祺! 深圳市华美报关公司 联系人曾先生 手机013602645951