Maciej Lizewski created SOLR-5276: ------------------------------------- Summary: highlighter working using stemmed tokens from another field and text from another Key: SOLR-5276 URL: https://issues.apache.org/jira/browse/SOLR-5276 Project: Solr Issue Type: Improvement Components: highlighter Reporter: Maciej Lizewski Priority: Minor
The case is like this: I have 'content' field with content extracted with tika and several fields for language versions (like content_pl, content_en, content_es, content_ru, etc). I also use custom langid component which can copy 'content' to serveral content_* fields if it detects more than one language (so those parts are properly stemmed in every language present in text). Now to use highlighter in such scenario I need to store all those language fields even if their contents is always same as the one in 'content' field. Would be nice if I could configure language specific fields to be not stored, and configure highlighter to take tokens positions from those fields and apply them to text in 'content' field... In other words - to say: take tokens from 'content_pl', and prepare highlight based on text in 'content' field. It could be administrator responsibility to guarantee that mapped fields have same content. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org