Greg Miller created LUCENE-9850:
-----------------------------------
Summary: Explore PFOR for Doc ID delta encoding (instead of FOR)
Key: LUCENE-9850
URL: https://issues.apache.org/jira/browse/LUCENE-9850
Project: Lucene - Core
Issue Type: Task
Components: core/codecs
Affects Versions: main (9.0)
Reporter: Greg Miller
It'd be interesting to explore using PFOR instead of FOR for doc ID encoding.
Right now PFOR is used for positions, frequencies and payloads, but FOR is used
for doc ID deltas. From a recent
[conversation|http://mail-archives.apache.org/mod_mbox/lucene-dev/202103.mbox/%3CCAPsWd%2BOp7d_GxNosB5r%3DQMPA-v0SteHWjXUmG3gwQot4gkubWw%40mail.gmail.com%3E]
on the dev mailing list, it sounds like this decision was made based on the
optimization possible when expanding the deltas.
I'd be interesting in measuring the index size reduction possible with
switching to PFOR compared to the performance reduction we might see by no
longer being able to apply the deltas in as optimal a way.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]