If a regex based approach would work for you, it is super easy to create a
TokenFilter that executes this regex on tokens (assuming proper
tokenization was done). However, the real-world use-cases are probably way
more complex than that, so for any non-trivial use case I'd go with Steve's
answer

--

Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>


On Tue, Aug 12, 2014 at 7:09 PM, Kenneth Glidden <[email protected]>
wrote:

> Thanks.  I'm familiar with the regex approach, but am wondering if someone
> has cooked up a char or token filter that auto-magically takes care of
> this.
>
>
> On Tue, Aug 12, 2014 at 11:57 AM, Mark Bennett <
> [email protected]>
> wrote:
>
> > I'd think a Regex based filter could do this.  In Solr it'd be something
> > you'd put in the analyzer chain; are you using Solr or Lucene?
> >
> > --
> > Mark Bennett / LucidWorks: Search & Big Data /
> [email protected]
> > Office: 408-898-4201 / Telecommute: 408-733-0387 / Cell: 408-829-6513
> >
> > On Aug 12, 2014, at 8:45 AM, Kenneth Glidden <[email protected]>
> wrote:
> >
> > > Is there a filter (or other option that I'm missing) that can convert
> > > Japanese numerals to Arabic numerals?
> > >
> > > e.g. Convert *二五〇* to *250*.
> >
> >
>

Reply via email to