Michael McCandless created LUCENE-4602:
------------------------------------------
Summary: Use DocValues to store per-doc facet ord
Key: LUCENE-4602
URL: https://issues.apache.org/jira/browse/LUCENE-4602
Project: Lucene - Core
Issue Type: Improvement
Reporter: Michael McCandless
Spinoff from LUCENE-4600
DocValues can be used to hold the byte[] encoding all facet ords for
the document, instead of payloads. I made a hacked up approximation
of in-RAM DV (see CachedCountingFacetsCollector in the patch) and the
gains were somewhat surprisingly large:
{noformat}
Task QPS base StdDev QPS comp StdDev
Pct diff
HighTerm 0.53 (0.9%) 1.00 (2.5%)
87.3% ( 83% - 91%)
LowTerm 7.59 (0.6%) 26.75 (12.9%)
252.6% ( 237% - 267%)
MedTerm 3.35 (0.7%) 12.71 (9.0%)
279.8% ( 268% - 291%)
{noformat}
I didn't think payloads were THAT slow; I think it must be the advance
implementation?
We need to separately test on-disk DV to make sure it's at least
on-par with payloads (but hopefully faster) and if so ... we should
cutover facets to using DV.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]