[Nutch-dev] RE: extend java.net.URL?

2005-08-10 Thread Nick Lothian
> > Hi, All, > > I need to extend java.net.URL, in order to handle newer/more schemes. > The problem is that java.net.URL is final, not meant to be subclassed. > Since URL is extensively used in nutch, I am also reluctant > to replace every one with my own MyURL. It seems I will have > to hack

[Nutch-dev] extend java.net.URL?

2005-08-10 Thread John X
Hi, All, I need to extend java.net.URL, in order to handle newer/more schemes. The problem is that java.net.URL is final, not meant to be subclassed. Since URL is extensively used in nutch, I am also reluctant to replace every one with my own MyURL. It seems I will have to hack java.net.URL source

[Nutch-dev] Re: How to extend Nutch?

2005-08-10 Thread Michael Ji
hi Fuad: I am probably doing the same thing. I think plug-in is the right place to put my own code. But not sure, why we need to touch other config files. Regards, Michael Ji --- Fuad Efendi <[EMAIL PROTECTED]> wrote: > > I need some pre-processing, to add additional fields > to Document, a

[Nutch-dev] Amin GH's invitation

2005-08-10 Thread AminGH
Did you know that you have 1 friend(s) that invited you to join their mobile friends network. Friend(s) waiting for you to accept include: Amin GH * Send text messages (SMS) to over 400 wireless networks worldwide. * Meet new friends by searching over 30 million members on the world's larges

[Nutch-dev] Re: Nutch versions - Was: [Nutch-cvs] svn commit: r230887 - /lucene/nutch/trunk/conf/nutch-default.xml

2005-08-10 Thread Doug Cutting
Piotr Kosiorowski wrote: I think we all refer to 0.7 as next number (and 0.6 as current) so nutch-default.xml contains wrong format. In fact it should still contain -dev suffix. To make undocumented comvention documented I would also like to suggest naming releases with X.Y format and naming

[Nutch-dev] Nutch versions - Was: [Nutch-cvs] svn commit: r230887 - /lucene/nutch/trunk/conf/nutch-default.xml

2005-08-10 Thread Piotr Kosiorowski
Hello, I think a lot of people will wait before moving to mapreduce implementation for some time so we will have a 0.7 version to support. I was a heavy CVS branch user in my previous job taking care about common library so I fully agree that such branch would be needed for bug fixing. I would

[Nutch-dev] How to extend Nutch?

2005-08-10 Thread Fuad Efendi
I need some pre-processing, to add additional fields to Document, and to show it on a web-page I probably need to work with plugins, and to modify config files... nutch-conf.xsl nutch-default.xml nutch-site.xml Am I right? Thanks -Original Message- From: Fuad Efendi [mailto:[EMAIL PR

Re: [Nutch-dev] Re: regex-url filter

2005-08-10 Thread Hasan Diwan
On Aug 9, 2005, at 6:23 PM, Zhou LiBing wrote: 1) If I want to limit multi-domains,what should I do Change: +^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/ to: +^http://([a-z0-9]*\.)*(domain1|domain2|...|domainN)/ 2) if I have a mirror web site at local disk, How can I use Nutch to search the conte