[ https://issues.apache.org/jira/browse/SOLR-7193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16508806#comment-16508806 ]
Alexandre Rafalovitch commented on SOLR-7193: --------------------------------------------- This seems to be satisfied by LUCENE-8332 and SOLR-12376, both coming in 7.4. > Concatenate words from token stream > ----------------------------------- > > Key: SOLR-7193 > URL: https://issues.apache.org/jira/browse/SOLR-7193 > Project: Solr > Issue Type: New Feature > Components: Schema and Analysis > Reporter: Abhishek Bafna > Priority: Major > Attachments: concatenate_words.patch > > > The user entered data often don't have proper spacing between words and words > spelling and format also varies from data like business names, address etc. > After tokenizing data, we might perform pattern replacement, stop word > filtering etc. Later we want to concatenate all the tokens and generate > n-grams token for indexing business name and perform the fuzzy match. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org