Bruno Roustant created LUCENE-9245:
--------------------------------------

Summary: Optimize AutomatonTermsEnum and Operations.getCommonPrefixBytesRef
Key: LUCENE-9245
URL: https://issues.apache.org/jira/browse/LUCENE-9245
Project: Lucene - Core
Issue Type: Improvement
Reporter: Bruno Roustant
Assignee: Bruno Roustant

Reduce AutomatonTermsEnum memory usage. It uses a long[] whose size equals the number of automaton states, which may be large. We can use a short[] instead, with slightly faster performance. (I tried a FixedBitSet, but it has to be cleared, and this hurts performance.)

Operations.getCommonPrefixBytesRef() is used by CompiledAutomaton to compute the common suffix for all automata with an infinite language. It can be optimized with lazy structure creation and fewer operations.

This gave some % improvement in my benchmarks (LUCENE-9237).
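
To illustrate the long[] versus short[] trade-off described above, here is a minimal sketch of a per-state "visited" marker that stamps each state with a generation number instead of clearing a bit set before every traversal. This is not the code from AutomatonTermsEnum or from the patch; the class name GenerationVisitedSet and all of its methods are hypothetical, and the sketch assumes the array is used only to answer "was this state seen in the current traversal?".

```java
import java.util.Arrays;

/**
 * Hypothetical helper (not Lucene's actual code): tracks visited automaton
 * states with one short stamp per state instead of one long per state.
 */
final class GenerationVisitedSet {
    private final short[] visited; // stamp of the last traversal that saw each state
    private short curGen = 1;      // current stamp; 0 is reserved for "never visited"

    GenerationVisitedSet(int numStates) {
        visited = new short[numStates];
    }

    /** Starts a new traversal. O(1), except when the short counter wraps. */
    void nextGeneration() {
        curGen++; // short arithmetic wraps from 32767 to -32768
        if (curGen == 0) {
            // All 65535 non-zero stamps have been used: reset the stale stamps once.
            Arrays.fill(visited, (short) 0);
            curGen = 1;
        }
    }

    boolean isVisited(int state) {
        return visited[state] == curGen;
    }

    void markVisited(int state) {
        visited[state] = curGen;
    }
}
```

With this layout the array takes 2 bytes per state rather than 8, and the full clear that a FixedBitSet would need on every traversal happens only once per 65535 generations.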