Add human content-type
----------------------
Key: SOLR-1645
URL: https://issues.apache.org/jira/browse/SOLR-1645
Project: Solr
Issue Type: Improvement
Components: contrib - Solr Cell (Tika extraction)
Affects Versions: 1.4
Reporter: Khalid Yagoubi
Fix For: 1.4
Idea is to allow Solr-Cell to "calculate" the human content-type from the
extracted content-type and map it to a field in the schema.
So the user can search on "media: image" or "media:video"
Idea :
1) Hardcode a hashmap in somewhere in extraction classes and get human
content-type from extracted content-type. I Think to SolrContentHandler.java
2) Write an xml file where we can put a mapping like in tika-config.xml for
parsers
3) Use tika-config.xml to get all supported mime-types
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.