We have a requirement that e-mail addresses be added in tokenized form to one field and in untokenized form to another field.
Ex: "I have mailed [email protected]". It should be tokenized as below:

body = {"I", "have", "mailed", "abc", "xyz", "com"};

I also have a body-addr field. The tokenizer needs to extract e-mail addresses from the body field and add them as below:

body-addr = {"[email protected]"}

How can I achieve this via a tokenizer chain?

-- Ravi
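
To make the expected output concrete, here is a minimal sketch in plain Python of the split described above. It only illustrates the desired result and is not a tokenizer chain for any particular search library. The function name split_fields, both regular expressions, and the address user@example.com are hypothetical stand-ins, and the e-mail pattern is deliberately simplistic.

```python
import re

# Simplistic e-mail pattern (an assumption for illustration; not RFC 5322 complete).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+")

# Runs of letters/digits become tokens; everything else acts as a separator,
# so an address is broken into its parts for the tokenized field.
WORD_RE = re.compile(r"[A-Za-z0-9]+")


def split_fields(text):
    """Return (body_tokens, body_addr_tokens) for one input string.

    body_tokens      -> fully tokenized text, addresses split into parts
    body_addr_tokens -> each e-mail address kept whole, nothing else
    """
    body = WORD_RE.findall(text)
    body_addr = EMAIL_RE.findall(text)
    return body, body_addr


# Hypothetical input; the address is a stand-in, not the one from the post.
body, body_addr = split_fields("I have mailed user@example.com")
print(body)
print(body_addr)
```

The point of the sketch is that both fields are derived from the same input text: one pass splits on every non-alphanumeric character, while the other keeps only whole addresses.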
