[
https://issues.apache.org/jira/browse/SOLR-153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12987003#action_12987003
]
Robert Muir commented on SOLR-153:
----------------------------------
Hi Yonik, I've been thinking of some possibilities here with regards to our
terms index itself.
Currently the terms index is an FST with an output going to the terms, but what
if instead this
was a PairOutputs<existingOutput, max(docFreq)> or whatever faceting needs.
Somehow via termsenum we would have to expose its FSTEnum or whatever so
faceting
could use it, but you could then make use of our existing "tree" structure
(terms index + terms dict)
for this case.
Sorry i'm not up to speed on really how fast termsenum faceting is versus say
using the fieldcache
or the various faceting techniques we have today, but it seems like we could do
something here, e.g.
if you know up-front you will facet on this field a lot you specify the
appropriate codec so that
your terms index accelerates faceting.
> Facet Index
> -----------
>
> Key: SOLR-153
> URL: https://issues.apache.org/jira/browse/SOLR-153
> Project: Solr
> Issue Type: New Feature
> Reporter: Yonik Seeley
> Attachments: facettree.patch, facettree.patch
>
>
> A facet index, initially for non-hierarchical facets.
> Start with all terms, and a set of documents for each term. Group lower
> level nodes by taking the union of the sets, but keep track of the largest
> set going back all the way to the leaves (the max doc-freq for that node).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]