[ 
https://issues.apache.org/jira/browse/SOLR-3099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl updated SOLR-3099:
------------------------------

         Fix Version/s: 4.0
    Remaining Estimate:     (was: 4h)
     Original Estimate:     (was: 4h)

This is wanted. Scheduling for 4.x. I think for this to work we need some 
better metadata support in analysis? Currently you can tag a token with a 
TOKENTYPE, so the stemmer could add the stemmed token on same position with 
tokentype=stem. However, we'd need a way to convey from the query that [=foo] 
should NOT match the "stem" token types?

Also, could we not simply adopt Google's syntax, i.e. if a single token is 
quoted, it is searched verbatim, e.g. foo "bar".
                
> Add query operator, index structure, and analyzer for "exact match" searching
> -----------------------------------------------------------------------------
>
>                 Key: SOLR-3099
>                 URL: https://issues.apache.org/jira/browse/SOLR-3099
>             Project: Solr
>          Issue Type: Sub-task
>          Components: Schema and Analysis
>            Reporter: Mike
>             Fix For: 4.0
>
>
> A project I'm working on requires *exact match* searching with stemming 
> turned off. The users are accostomed to Sphinx search, and thus expect a 
> query like [ =runs ] to return only documents that contain the exact term, 
> "runs", and not the stemmed word "run".
> In SOLR-2866, there is similar work, but I believe it is different because it 
> uses a huge-synonym file rather than storing the original terms directly in 
> the index. 
> What I'd like instead is two things:
> 1. An analyzer that says, "store the original form of all words in the index 
> along with the stemmed variations." If necessary, it's fine if this is simply 
> an unstemmed field, but that seems cumbersome schema-wise and 
> performance-wise.
> 2. An operator in edismax that allows users to query the exact form of the 
> word. Sphinx uses the equals sign (=), and that makes sense logically to me.
> This issue is part of a meta issue, SOLR-3028, that is requesting two other 
> operators in edismax (quorum search and word order).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to