Sounds like a job for org.apache.lucene.analysis.miscellaneous.PerFieldAnalyzerWrapper.
-- Ian. On Tue, Feb 17, 2015 at 8:51 AM, Ravikumar Govindarajan <[email protected]> wrote: > We have a requirement in that E-mail addresses need to be added in a > tokenized form to one field while untokenized form is added to another field > > Ex: > > "I have mailed [email protected]" . It should tokenize as below > > body = {"I", "have", "mailed", "abc", "xyz", "com"}; > > I also have a body-addr field. Tokenizer needs to extract e-mail addresses > from body field and add them as below > > body-addr = {"[email protected]"} > > How to achieve this via tokenizer chain? > > -- > Ravi --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
