Re: Why URLNormalizer doesn't implement the Pluggable?

2011-08-26 Thread Kaiwii Ho
Thank you a lot. But anyway, I am still curious why the Nutch developers do not treat UrlNormalizer the same way as the others. Is it a bug or a deliberate design choice? On Thu, Aug 25, 2011 at 5:09 PM, lewis john mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Kaiwii, If you look at

Are there any tutorial for writing regex-normalize.xml?

2011-08-26 Thread Kaiwii Ho
I'm going to write my own regex-normalize.xml. Is there a tutorial for writing regex-normalize.xml? Waiting for your help, and thank you.

Re: Are there any tutorial for writing regex-normalize.xml?

2011-08-26 Thread lewis john mcgibbney
Apart from looking through the list archives, as far as I am aware nothing has been specifically documented on this topic. In the meantime you may find this helpful: http://geekswithblogs.net/brcraju/articles/235.aspx On Fri, Aug 26, 2011 at 9:22 AM, Kaiwii Ho kaiwi...@gmail.com wrote: I'm gonna
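
For readers landing on this thread, a minimal sketch of what a regex-normalize.xml file can look like. Each rule is a Java regular expression in a pattern element paired with a replacement in a substitution element, and rules are applied in file order. The two rules below are illustrative assumptions, not taken from this thread; check them against the regex-normalize.xml shipped in your Nutch conf directory before relying on them.

```xml
<?xml version="1.0"?>
<!-- Sketch only: rules are examples, adapt them to your own crawl. -->
<regex-normalize>

  <!-- Example rule: turn a double-escaped "&amp;amp;" in a URL back into "&amp;" -->
  <regex>
    <pattern>&amp;amp;</pattern>
    <substitution>&amp;</substitution>
  </regex>

  <!-- Example rule: collapse "/./" path segments into "/" -->
  <regex>
    <pattern>/\./</pattern>
    <substitution>/</substitution>
  </regex>

</regex-normalize>
```

Because the file is XML, a literal ampersand inside a pattern or substitution has to be written as an entity, which is why the first rule looks doubly escaped.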

subscription for nutch

2011-08-26 Thread Samata Sirsikar
Hello, I would like to subscribe to nutch as a user.

Re: Are there any tutorial for writing regex-normalize.xml?

2011-08-26 Thread Kaiwii Ho
Thank you a lot. On Fri, Aug 26, 2011 at 9:31 PM, lewis john mcgibbney lewis.mcgibb...@gmail.com wrote: Apart from looking through the list archives, as far as I am aware nothing has been specifically documented on this topic. In the meantime you may find this helpful

Re: keeping index up to date

2011-08-26 Thread Radim Kolar
On 26.7.2011 21:55, Markus Jelsma wrote: We have the injector for that ;) What will the injector do if the injected URL is already in the database? Will it be injected with priority 1.0 and re-scheduled for immediate fetch?