[ https://issues.apache.org/jira/browse/LUCENE-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Erick Erickson updated LUCENE-4817:
-----------------------------------

    Attachment: docs.patch

On a quick look, it looks like Porter, KStem, Snowball and Hunspell all respect 
the keyword attribute. So I'll make the *docs only* change in the attached 
patch unless I've misrepresented things (have to run precommit, don'tcha know).

I've included Varun's suggestion as well, thanks!

It always amazes me how simple some solutions are in the hands of an expert. 
"Why didn't I think of that?"
                
> Add KeywordRepeaterFilter to emit tokens twice, once as keyword and once not 
> as keyword
> --------------------------------------------------------------------------------------
>
>                 Key: LUCENE-4817
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4817
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 4.1
>            Reporter: Simon Willnauer
>            Priority: Minor
>             Fix For: 5.0, 4.3
>
>         Attachments: docs.patch, LUCENE-4817.patch, LUCENE-4817.patch
>
>
> If you want both a stemmed and an unstemmed version of a token, one for 
> recall and one for precision, you have to use two fields today in most 
> cases. Yet most of the stemmers respect the keyword attribute, so we could 
> add a token filter that emits the same token twice, once as keyword and once 
> plain. Folks would most likely need to combine this with 
> RemoveDuplicatesTokenFilter, but that way we can have the stemmed and 
> unstemmed versions in the same field.
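
The idea described above can be sketched roughly as follows. This is plain Java and deliberately does not use the Lucene API: the Token class, the method names and the toy suffix-stripping "stemmer" are all hypothetical stand-ins, chosen only to show the repeat / stem / de-duplicate chain. A real chain would use the filter from the patch, one of the actual stemmers, and RemoveDuplicatesTokenFilter.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class KeywordRepeatSketch {

    /** Toy token (hypothetical, not a Lucene class): term, keyword flag, position increment. */
    static final class Token {
        final String term;
        final boolean keyword;
        final int posInc;

        Token(String term, boolean keyword, int posInc) {
            this.term = term;
            this.keyword = keyword;
            this.posInc = posInc;
        }
    }

    /** Step 1: emit every term twice, first as keyword, then plain at the same position. */
    static List<Token> repeat(List<String> terms) {
        List<Token> out = new ArrayList<>();
        for (String term : terms) {
            out.add(new Token(term, true, 1));   // keyword copy, advances the position
            out.add(new Token(term, false, 0));  // plain copy, stacked on the same position
        }
        return out;
    }

    /** Toy stemmer (assumption: NOT Porter/KStem, just strips "ing" or a trailing "s"). */
    static String toyStem(String term) {
        if (term.endsWith("ing") && term.length() > 5) {
            return term.substring(0, term.length() - 3);
        }
        if (term.endsWith("s") && term.length() > 3) {
            return term.substring(0, term.length() - 1);
        }
        return term;
    }

    /** Step 2: stem only the tokens that are not marked as keyword. */
    static List<Token> stem(List<Token> in) {
        List<Token> out = new ArrayList<>();
        for (Token tok : in) {
            out.add(tok.keyword ? tok : new Token(toyStem(tok.term), false, tok.posInc));
        }
        return out;
    }

    /** Step 3: drop a token whose term was already seen at the same position. */
    static List<Token> removeDuplicates(List<Token> in) {
        List<Token> out = new ArrayList<>();
        Set<String> seenAtPosition = new HashSet<>();
        for (Token tok : in) {
            if (tok.posInc > 0) {
                seenAtPosition.clear();          // a new position starts here
            }
            if (seenAtPosition.add(tok.term)) {
                out.add(tok);
            }
        }
        return out;
    }

    /** Runs the whole chain and returns the surviving terms in order. */
    static List<String> analyze(List<String> terms) {
        List<String> out = new ArrayList<>();
        for (Token tok : removeDuplicates(stem(repeat(terms)))) {
            out.add(tok.term);
        }
        return out;
    }

    public static void main(String[] args) {
        // prints [jumping, jump, dogs, dog, run]
        System.out.println(analyze(Arrays.asList("jumping", "dogs", "run")));
    }
}
```

Note that "run" comes out only once: the toy stemmer leaves it unchanged, so the plain copy is identical to the keyword copy at the same position and the de-duplication step drops it. That is the case the description has in mind when it suggests combining the new filter with RemoveDuplicatesTokenFilter.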
