Agreed
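
For the simple positional case in the original question (二五〇 to 250), the per-token rewrite a regex-based filter would perform could look roughly like the sketch below. It is plain Python that shows only the mapping logic, not a Lucene TokenFilter. The names `KANJI_DIGITS`, `KANJI_RUN` and `kanji_digits_to_arabic` are made up for illustration, and the sketch assumes tokenization has already happened.

```python
import re

# Assumption: only positional kanji digits are handled.
# Multipliers such as 十, 百, 千 and 万 are deliberately left untouched.
KANJI_DIGITS = {
    "〇": "0", "一": "1", "二": "2", "三": "3", "四": "4",
    "五": "5", "六": "6", "七": "7", "八": "8", "九": "9",
}

# Matches one or more consecutive kanji digits inside a token.
KANJI_RUN = re.compile("[" + "".join(KANJI_DIGITS) + "]+")


def kanji_digits_to_arabic(token: str) -> str:
    """Replace each run of kanji digits in the token with Arabic digits."""
    return KANJI_RUN.sub(
        lambda m: "".join(KANJI_DIGITS[c] for c in m.group()),
        token,
    )


if __name__ == "__main__":
    print(kanji_digits_to_arabic("二五〇"))
```

Note that a digit-by-digit mapping like this leaves a number written with multipliers (二百五十) only half converted, and it will also rewrite kanji digits that are part of ordinary words. That is the kind of real-world complexity that makes a plain regex insufficient for non-trivial cases.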

On Tue, Aug 12, 2014 at 12:13 PM, Itamar Syn-Hershko <[email protected]> wrote:

> If a regex based approach would work for you, it is super easy to create a
> TokenFilter that executes this regex on tokens (assuming proper
> tokenization was done). However, the real-world use-cases are probably way
> more complex than that, so for any non-trivial use case I'd go with Steve's
> answer
>
> --
>
> Itamar Syn-Hershko
> http://code972.com | @synhershko <https://twitter.com/synhershko>
> Freelance Developer & Consultant
> Author of RavenDB in Action <http://manning.com/synhershko/>
>
>
> On Tue, Aug 12, 2014 at 7:09 PM, Kenneth Glidden <[email protected]>
> wrote:
>
> > Thanks. I'm familiar with the regex approach, but am wondering if someone
> > has cooked up a char or token filter that auto-magically takes care of
> > this.
> >
> >
> > On Tue, Aug 12, 2014 at 11:57 AM, Mark Bennett <[email protected]>
> > wrote:
> >
> > > I'd think a Regex based filter could do this. In Solr it'd be something
> > > you'd put in the analyzer chain; are you using Solr or Lucene?
> > >
> > > --
> > > Mark Bennett / LucidWorks: Search & Big Data / [email protected]
> > > Office: 408-898-4201 / Telecommute: 408-733-0387 / Cell: 408-829-6513
> > >
> > > On Aug 12, 2014, at 8:45 AM, Kenneth Glidden <[email protected]>
> > > wrote:
> > >
> > > > Is there a filter (or other option that I'm missing) that can convert
> > > > Japanese numerals to Arabic numerals?
> > > >
> > > > e.g. Convert *二五〇* to *250*.
