[jira] [Created] (SOLR-7193) Concatenate words from token stream

abhishek bafna (JIRA) Wed, 04 Mar 2015 21:48:59 -0800

abhishek bafna created SOLR-7193:
------------------------------------

             Summary: Concatenate words from token stream
                 Key: SOLR-7193
                 URL: https://issues.apache.org/jira/browse/SOLR-7193
             Project: Solr
          Issue Type: New Feature
          Components: Schema and Analysis
            Reporter: abhishek bafna



The user entered data often don't have proper spacing between words and words 
spelling and format also varies from data like business names, address etc. 
After tokenizing data, we might perform pattern replacement, stop word 
filtering etc. Later we want to concatenate all the tokens and generate n-grams 
token for indexing business name and perform the fuzzy match.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Created] (SOLR-7193) Concatenate words from token stream

Reply via email to