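Hi,

I think the url-filter uses "contain" rather than "match": a rule applies as soon as its pattern is found in the URL; the pattern does not have to cover the whole URL.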
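
A small sketch of the difference, using plain java.util.regex rather than Nutch's own filter code. The class name, method names, and sample URL are made up for illustration, and the leading "+" (accept) marker of the rule is left out:

```java
import java.util.regex.Pattern;

public class UrlFilterDemo {
    // The rule from the quoted config, without the leading '+' marker.
    static final Pattern RULE =
        Pattern.compile("^http://([a-z0-9]*\\.)*MY.DOMAIN.NAME/");

    // "contain": the pattern only has to be found somewhere in the URL.
    static boolean contains(String url) {
        return RULE.matcher(url).find();
    }

    // "match": the pattern has to cover the entire URL.
    static boolean matchesWhole(String url) {
        return RULE.matcher(url).matches();
    }

    public static void main(String[] args) {
        // Hypothetical URL with extra path segments after the host.
        String url = "http://MY.DOMAIN.NAME/some.dir/";
        System.out.println(contains(url));      // true
        System.out.println(matchesWhole(url));  // false
    }
}
```

With "contain" semantics anything after the trailing "/" of the rule is ignored, so URLs with further path segments under the host are accepted. If that is not wanted, the rule would probably need a "$" at the end, or a "-" rule placed before it.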
/Jack

On 2/23/06, Elwin <[EMAIL PROTECTED]> wrote:
> # accept hosts in MY.DOMAIN.NAME
> +^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/
>
> Will this pattern accept url like this http://MY.DOMAIN.NAME/([a-z0-9]*\.)*/?
> I think it's not, but in fact nutch can crawl and get urls like that in
> intranet crawl. Why?
>

--
Keep Discovering ... ...
http://www.jroller.com/page/jmars
