How can I exlude certain mime-types from crawling, for example Word-documents? If I have parse-tika in plugin.includes it will parse them. Do I have to change parse-plugins.xml?
I can't exclude them in regex-urlfilter as the .doc extension is not present in the urls. Thanks Matthias