How can I exlude certain mime-types from crawling, for example Word-documents?
If I have parse-tika in plugin.includes it will parse them. Do I have
to change parse-plugins.xml?

I can't exclude them in regex-urlfilter as the .doc extension is not
present in the urls.

Thanks
Matthias

Reply via email to