[ https://issues.apache.org/jira/browse/LUCENE-8265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16448339#comment-16448339 ]
David Smiley commented on LUCENE-8265: -------------------------------------- bq. code up a TokenFilter that wraps another TokenFilter, and bypasses the wrapped filter if a certain condition is met? Yes; this has been my very long Lucene/Solr idea TODO list. Or perhaps alternatively, some TokenFilters could extend a new TokenFilter subclass that checks a condition. By default it could be a Predicate that simply returns true. This would address Mike Sokolov's concern on propagating the lifecycle calls... I've had to delegate a tokenizer/filter before and it was a bit annoying to get right. > WordDelimiterFilter should pass through terms marked as keywords > ---------------------------------------------------------------- > > Key: LUCENE-8265 > URL: https://issues.apache.org/jira/browse/LUCENE-8265 > Project: Lucene - Core > Issue Type: Improvement > Reporter: Mike Sokolov > Priority: Major > Time Spent: 10m > Remaining Estimate: 0h > > This will help in cases where some terms containing separator characters > should be split, but others should not. For example, this will enable a > filter that identifies things that look like fractions and identifies them as > keywords so that 1/2 does not become 12, while doing splitting and joining on > terms that look like part numbers containing slashes, eg something like > "sn-999123/1" might sometimes be written "sn-999123-1". -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org