if (matcher.find()) ....

On Thu, 2006-02-23 at 18:10 +0800, Jack Tang wrote:
> Hi
> 
> I think in the url-filter it uses "contain" rather than "match".
> 
> /Jack
> 
> On 2/23/06, Elwin <[EMAIL PROTECTED]> wrote:
> > # accept hosts in MY.DOMAIN.NAME
> > +^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/
> >
> > Will this pattern accept url like this 
> > http://MY.DOMAIN.NAME/([a-z0-9]*\.)*/?
> > I think it's not, but in fact nutch can crawl and get urls like that in
> > intranet crawl. Why?
> >
> >
> 
> 
> --
> Keep Discovering ... ...
> http://www.jroller.com/page/jmars
> 




-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to