I am trying to crawl a single site and have set db.ignore.external.links=true. It does not seem to work as expected, though: the crawl also follows links to the same site name under a different country-code extension. For example, if the seed is mysite.com, it crawls mysite.com, mysite.es and mysite.it.
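To make the expected behaviour concrete, here is a rough Python sketch of the host comparison I assumed the flag would perform. This is not Nutch code; the function name is hypothetical and it only illustrates "same host as the seed" versus "different host".

```python
from urllib.parse import urlparse

def is_internal(seed_url: str, link_url: str) -> bool:
    """Return True only when link_url has exactly the same host as seed_url."""
    # urlparse(...).hostname is already lower-cased; it is None for URLs without a host.
    seed_host = urlparse(seed_url).hostname or ""
    link_host = urlparse(link_url).hostname or ""
    return seed_host != "" and seed_host == link_host

seed = "http://mysite.com/"
for link in ("http://mysite.com/about", "http://mysite.es/", "http://mysite.it/"):
    # Under an exact host match, only the mysite.com link should count as internal.
    print(link, is_internal(seed, link))
```

With a check like this, mysite.es and mysite.it would be treated as external and skipped, which is the behaviour I was expecting from the flag.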
I don't want to use a regex to exclude them because I have multiple URLs and don't want to maintain a long list. Is this a known bug?

Thanks,
Jean Vence