Re: Payload doesn't apply to WordDelimiterFilterFactory-generated tokens

Chris Hostetter Mon, 18 Jul 2011 15:39:07 -0700

: It seems that the payloads are applied only to the original word that I
: index and the WordDelimiterFilter doesn't apply the payloads to the tokens
: it generates.


I believe you are correct.  I think the general rule for most TokenFilters 
that you will find in Lucene/Solr is that they don't typically "clone" 
attributes (like payloads) when generating new Tokens -- it may be what 
you want in your use case, but there's no hard & fast rule that it would 
always make sense to do so.

If you'd like to opne a jira (or submit a patch) i suspect a new 
"clonePayload" attribute could be added to the WDF Factory to drive this 
kind of behavior so people with use cases where it made sense could enable 
this -- but i haven't looked at that code (or the current TokenStream API) 
enough to have any idea how hard it would be.



-Hoss

Re: Payload doesn't apply to WordDelimiterFilterFactory-generated tokens

Reply via email to