Add human content-type ---------------------- Key: SOLR-1645 URL: https://issues.apache.org/jira/browse/SOLR-1645 Project: Solr Issue Type: Improvement Components: contrib - Solr Cell (Tika extraction) Affects Versions: 1.4 Reporter: Khalid Yagoubi Fix For: 1.4
Idea is to allow Solr-Cell to "calculate" the human content-type from the extracted content-type and map it to a field in the schema. So the user can search on "media: image" or "media:video" Idea : 1) Hardcode a hashmap in somewhere in extraction classes and get human content-type from extracted content-type. I Think to SolrContentHandler.java 2) Write an xml file where we can put a mapping like in tika-config.xml for parsers 3) Use tika-config.xml to get all supported mime-types -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.