Hi guys:

I am using nutch in a project. But I found that nutch repeat fetching some
pages. For example:

http://www.me.washington.edu//people/faculty/wang/

this is a page fetched. But also, there are some urls like this in
commandline output:

http://www.me.washington.edu//people/faculty/wang/
http://www.me.washington.edu///people/faculty/wang/
http://www.me.washington.edu////people/faculty/wang/
......
http://www.me.washington.edu////////////people/faculty/wang/

it seems nutch will repeat this process for ever. Why is that?

any help is appreciated!

yanky

Reply via email to