Hey guys, I try to crawl a website generated with a Mediawiki-extension and always get the message:
"[WebcrawlerConnector.java:1312] - WEB: Decided not to ingest 'http://wiki.<host>/index.php?title=Spezial%3AAlle+Seiten&from=p&to=s&namespace=0' because it did not match ingestability criteria" Seed-url: 'http://wiki.<host>/index.php?title=Spezial%3AAlle+Seiten&from=p&to=s&namespace=0 Inclusions (crawl and index): .* Exclusions: none Other sites are crawled without problems, so I'm wondering what those ingestability criteria exactly are. Best regards, Tobias