Re: Limiting outlink tags.

2007-09-07 Thread Doğacan Güney
Hi Marcin, On 9/7/07, Marcin Okraszewski [EMAIL PROTECTED] wrote: Hi, I have noticed that Nutch considers img/@src as an outlink. I suppose in many cases people do not want to threat image as an outlink. At least I don't want. The same case is with script/@src. But, it seems there is no way

Limiting outlink tags.

2007-09-06 Thread Marcin Okraszewski
Hi, I have noticed that Nutch considers img/@src as an outlink. I suppose in many cases people do not want to threat image as an outlink. At least I don't want. The same case is with script/@src. But, it seems there is no way to limit outlink tags. The DOMContentUtils.getOutlinks() takes links