[ https://issues.apache.org/jira/browse/JCR-4551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Julian Reschke updated JCR-4551: -------------------------------- Labels: candidate_jcr_2_18 (was: ) > Use the normalized MediaType to check if the given MediaType should be indexed > ------------------------------------------------------------------------------ > > Key: JCR-4551 > URL: https://issues.apache.org/jira/browse/JCR-4551 > Project: Jackrabbit Content Repository > Issue Type: Bug > Components: jackrabbit-core > Reporter: Claus Köll > Assignee: Claus Köll > Priority: Major > Labels: candidate_jcr_2_18 > Fix For: 2.20.1, 2.22, 2.21.1 > > > We have configured some MimeTypes in the tika.config File as following ... > {code:java} > <parser name="parse-office" > class="org.apache.tika.parser.microsoft.OfficeParser"> > <mime>application/msword</mime> > <mime>application/vnd.ms-excel</mime> > <mime>application/msexcel</mime> > <mime>application/vnd.ms-powerpoint</mime> > </parser> > {code} > As we have old Excel Files with Mimetype (application/msexcel) it should be > parsed with the OfficeParser. Tika internally converts (normalizse) this > MimeType with the MediaTypeRegistry to application/vnd.ms-excel. > The NodeIndexer should also use the normalized MediaType in > #isSupportedMediaType(String type) > Otherwise the old MimeTypes will not be indexed anymore. -- This message was sent by Atlassian Jira (v8.3.4#803005)