Bruno Roustant created LUCENE-9245:
--------------------------------------

             Summary: Optimize AutomatonTermsEnum and Operations.getCommonPrefixBytesRef
                 Key: LUCENE-9245
                 URL: https://issues.apache.org/jira/browse/LUCENE-9245
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Bruno Roustant
            Assignee: Bruno Roustant


Reduce AutomatonTermsEnum memory usage. It uses a long[] whose size equals the 
number of automaton states, which may be large. We can use a short[] instead, 
with slightly faster performance. (I tried a FixedBitSet, but it has to be 
cleared, and this hurts performance.)
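
To illustrate why a short[] can avoid the clearing cost of a bit set, here is a 
minimal sketch of generation-stamped "visited" marks. It is not the actual 
AutomatonTermsEnum code: the class name ShortVisitedMarks and its methods are 
hypothetical, and the sketch assumes the marks only need to be valid for the 
current pass. Each state stores the stamp of the pass that last visited it, so 
starting a new pass is a single increment; the array is wiped only when the 
16-bit counter wraps around.

```java
import java.util.Arrays;

/** Hypothetical helper (not a Lucene class): per-state marks without per-pass clearing. */
final class ShortVisitedMarks {
  private final short[] stamps; // one 16-bit stamp per automaton state (2 bytes vs 8 for long[])
  private short generation = 1; // stamp meaning "visited in the current pass"; 0 means "never"

  ShortVisitedMarks(int numStates) {
    stamps = new short[numStates]; // zero-filled, so no state starts out marked
  }

  /** Starts a new pass. The array is cleared only when the counter wraps to 0. */
  void nextPass() {
    generation++;
    if (generation == 0) {
      // Wrap-around after 65535 passes: old stamps could collide, so reset them once.
      Arrays.fill(stamps, (short) 0);
      generation = 1;
    }
  }

  /** Marks a state as visited in the current pass. */
  void mark(int state) {
    stamps[state] = generation;
  }

  /** Returns true if the state was marked during the current pass. */
  boolean isMarked(int state) {
    return stamps[state] == generation;
  }
}
```

A bit set would need a full clear on every pass, whereas this layout pays for a 
full Arrays.fill only once per 65535 passes, at the cost of 16 bits per state 
instead of 1.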

Operations.getCommonPrefixBytesRef() is used by CompiledAutomaton to compute 
the common suffix for all automata with an infinite language. It can be 
optimized with lazy structure creation and fewer operations. This gave some 
percentage improvement in my benchmarks (LUCENE-9237).
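
As a rough illustration of what a common-prefix walk with lazy structure 
creation can look like, here is a sketch over a toy deterministic automaton. 
It is not Lucene's implementation: ToyAutomaton, CommonPrefix, and the 
{min, max, dest} transition layout are hypothetical, and state 0 is assumed to 
be the initial state. The walk follows states that are non-accepting and have 
exactly one single-label transition, and it allocates the visited set only 
once the walk actually advances.

```java
import java.io.ByteArrayOutputStream;
import java.util.BitSet;

/** Toy deterministic byte automaton; a hypothetical stand-in, not Lucene's Automaton. */
final class ToyAutomaton {
  final boolean[] accept;      // accept[s] is true if state s is accepting
  final int[][][] transitions; // transitions[s] lists {min, max, dest} label ranges

  ToyAutomaton(boolean[] accept, int[][][] transitions) {
    this.accept = accept;
    this.transitions = transitions;
  }
}

final class CommonPrefix {
  /** Returns the bytes shared as a prefix by every string the automaton accepts. */
  static byte[] of(ToyAutomaton a) {
    ByteArrayOutputStream prefix = new ByteArrayOutputStream();
    BitSet visited = null; // lazily created: many automata stop at the first state
    int state = 0;         // assumption: state 0 is the initial state
    while (!a.accept[state] && a.transitions[state].length == 1) {
      int[] t = a.transitions[state][0];
      if (t[0] != t[1]) {
        break; // a label range: accepted strings diverge here
      }
      if (visited == null) {
        visited = new BitSet(a.accept.length);
      }
      visited.set(state);
      if (visited.get(t[2])) {
        break; // destination already seen: stop instead of looping forever
      }
      prefix.write(t[0]); // single label: every accepted string shares this byte
      state = t[2];
    }
    return prefix.toByteArray();
  }
}
```

For an automaton accepting "abc" and "abd", the walk stops at the state with 
two outgoing transitions, so the result is the two bytes of "ab".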



