If a regex based approach would work for you, it is super easy to create a TokenFilter that executes this regex on tokens (assuming proper tokenization was done). However, the real-world use-cases are probably way more complex than that, so for any non-trivial use case I'd go with Steve's answer
-- Itamar Syn-Hershko http://code972.com | @synhershko <https://twitter.com/synhershko> Freelance Developer & Consultant Author of RavenDB in Action <http://manning.com/synhershko/> On Tue, Aug 12, 2014 at 7:09 PM, Kenneth Glidden <[email protected]> wrote: > Thanks. I'm familiar with the regex approach, but am wondering if someone > has cooked up a char or token filter that auto-magically takes care of > this. > > > On Tue, Aug 12, 2014 at 11:57 AM, Mark Bennett < > [email protected]> > wrote: > > > I'd think a Regex based filter could do this. In Solr it'd be something > > you'd put in the analyzer chain; are you using Solr or Lucene? > > > > -- > > Mark Bennett / LucidWorks: Search & Big Data / > [email protected] > > Office: 408-898-4201 / Telecommute: 408-733-0387 / Cell: 408-829-6513 > > > > On Aug 12, 2014, at 8:45 AM, Kenneth Glidden <[email protected]> > wrote: > > > > > Is there a filter (or other option that I'm missing) that can convert > > > Japanese numerals to Arabic numerals? > > > > > > e.g. Convert *二五〇* to *250*. > > > > >
