Hi Irfan, LUCENE-6672 is not really an issue, because the maxDeterminizedStates limit is still enforced, just at an earlier stage (determinize) that that issue expected (after UTF8 conversion of the determinized automaton).
Mike McCandless http://blog.mikemccandless.com On Tue, Dec 1, 2015 at 12:39 PM, Irfan Hamid <[email protected]> wrote: > Hi Michael, > > Is the functionality you're mentioning the same as the one pointed out by > David Causse in LUCENE-6672? If so he is claiming that maxDeterminizedStates > is not respected by UTF32ToUTF8 and can thus cause a problem. I'm looking at > lucene trunk to try and figure it out. However, input from you would be much > appreciated. > > TIA, > Irfan. > > On Tue, Dec 1, 2015 at 3:19 AM, Dawid Weiss <[email protected]> wrote: >>> >>> >>> I think it would be interesting to explore an NFA implementation for >>> Lucene! >>> >> >> It would be interesting and valuable to have an optimized non-DFA graph >> engine in general. I'm thinking of something like re2. >> https://github.com/google/re2 >> >> Dawid >> > --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
