There are actually two different automata implementations inside
Lucene - one is a somewhat modified fork of Brics
(https://www.brics.dk/automaton/), which is a great general-purpose
library for handling automata. The other is an algorithm for building
automata (transducers) from sorted input-output pairs based on the
Daciuk-Mihov algorithm. This implementation handles large data very
well because it is compact and fast but requires sorted input. Jan
Daciuk designed an algorithm capable of implementing the construction
from non-sorted input but it's more complex and in practice sorting
typically wins (http://www.jandaciuk.pl/thesis/node9.html).

Dawid

On Thu, May 5, 2022 at 7:47 PM Petar Dambovaliev
<petar.atanasov.1...@gmail.com> wrote:
>
> Hello,
>
> I was looking for a way to update Non/Deterministic finite automatons with 
> new data.
> Some people pointed out that "Lucene is doing that. You should ask them how 
> they do it.".
> So here i am. I would appreciate it if someone can point me in the right 
> direction so i can see how you are doing this.
> Thank you.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to