> But other than that, your analysis is correct, probably there should be
> an "application/xml" added to the list of handled content types. But
> this is further complicated by the fact, that Nutch doesn't do the right
> thing now if you have more than one plugin handling the same mime type...

I have created a Jira issue concerning this problem that was recently
discussed in
http://www.mail-archive.com/nutch-user%40lucene.apache.org/msg00744.html

The Issue: http://issues.apache.org/jira/browse/NUTCH-88

Regards

Jérôme

-- 
http://motrech.free.fr/
http://www.frutch.org/