[ 
https://issues.apache.org/jira/browse/LUCENE-8265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16448339#comment-16448339
 ] 

David Smiley commented on LUCENE-8265:
--------------------------------------

bq.  code up a TokenFilter that wraps another TokenFilter, and bypasses the 
wrapped filter if a certain condition is met?

Yes; this has been on my very long Lucene/Solr idea TODO list.  Alternatively, 
some TokenFilters could extend a new TokenFilter subclass that checks a 
condition.  By default the condition could be a Predicate that simply returns 
true.  This would address Mike Sokolov's concern about propagating the 
lifecycle calls... I've had to delegate to a tokenizer/filter before and it 
was a bit annoying to get right.
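
To make the subclass idea concrete, here is a rough sketch in plain Java.  It 
is only a model of the shape, not Lucene code: Token, ConditionalFilter and 
SplittingFilter are hypothetical names, the token stream is simplified to a 
List, and the toy splitter is far cruder than WordDelimiterFilter.  The point 
is that the base class owns the predicate check, so a subclass only writes its 
per-token logic and there is no wrapped filter to forward lifecycle calls to.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Predicate;

// Plain-Java model of the idea; none of these types are Lucene classes.
final class ConditionalFilterSketch {

    /** Hypothetical token with a keyword flag (Lucene tracks this via KeywordAttribute). */
    static final class Token {
        final String text;
        final boolean keyword;

        Token(String text, boolean keyword) {
            this.text = text;
            this.keyword = keyword;
        }
    }

    /** Hypothetical base class: the predicate decides whether process() sees a token. */
    abstract static class ConditionalFilter {
        private final Predicate<Token> shouldFilter;

        /** Default condition always returns true, so every token is processed. */
        protected ConditionalFilter() {
            this(token -> true);
        }

        protected ConditionalFilter(Predicate<Token> shouldFilter) {
            this.shouldFilter = shouldFilter;
        }

        /** Per-token logic supplied by the concrete filter. */
        protected abstract List<Token> process(Token token);

        /** Tokens that fail the condition are passed through unchanged. */
        final List<Token> apply(List<Token> input) {
            List<Token> out = new ArrayList<>();
            for (Token token : input) {
                if (shouldFilter.test(token)) {
                    out.addAll(process(token));
                } else {
                    out.add(token);
                }
            }
            return out;
        }
    }

    /** Toy splitter on '-' and '/', standing in for a word-delimiter style filter. */
    static final class SplittingFilter extends ConditionalFilter {
        SplittingFilter() {
            super();
        }

        SplittingFilter(Predicate<Token> shouldFilter) {
            super(shouldFilter);
        }

        @Override
        protected List<Token> process(Token token) {
            List<Token> parts = new ArrayList<>();
            for (String part : token.text.split("[-/]")) {
                if (!part.isEmpty()) {
                    parts.add(new Token(part, false));
                }
            }
            return parts;
        }
    }

    /** Runs the two example terms from the issue description through the toy splitter. */
    static List<String> run(boolean skipKeywords) {
        List<Token> input = Arrays.asList(
                new Token("1/2", true),            // assumed to be marked as a keyword upstream
                new Token("sn-999123/1", false));
        ConditionalFilter filter = skipKeywords
                ? new SplittingFilter(token -> !token.keyword)
                : new SplittingFilter();
        List<String> texts = new ArrayList<>();
        for (Token token : filter.apply(input)) {
            texts.add(token.text);
        }
        return texts;
    }

    public static void main(String[] args) {
        System.out.println(run(false));
        System.out.println(run(true));
    }
}
```

With the default predicate both terms are split; with a predicate that skips 
keywords, "1/2" passes through intact while "sn-999123/1" is still split.  A 
real version would of course work on the attribute-based TokenStream API 
rather than a List.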

> WordDelimiterFilter should pass through terms marked as keywords
> ----------------------------------------------------------------
>
>                 Key: LUCENE-8265
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8265
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Mike Sokolov
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> This will help in cases where some terms containing separator characters 
> should be split, but others should not.  For example, this will enable a 
> filter that identifies things that look like fractions and identifies them as 
> keywords so that 1/2 does not become 12, while doing splitting and joining on 
> terms that look like part numbers containing slashes, eg something like 
> "sn-999123/1" might sometimes be written "sn-999123-1".



