[ 
https://issues.apache.org/jira/browse/SOLR-153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12987003#action_12987003
 ] 

Robert Muir commented on SOLR-153:
----------------------------------

Hi Yonik, I've been thinking of some possibilities here with regards to our 
terms index itself.

Currently the terms index is an FST with an output going to the terms, but what 
if instead this
was a PairOutputs<existingOutput, max(docFreq)> or whatever faceting needs.

Somehow via termsenum we would have to expose its FSTEnum or whatever so 
faceting 
could use it, but you could then make use of our existing "tree" structure 
(terms index + terms dict)
for this case.

Sorry i'm not up to speed on really how fast termsenum faceting is versus say 
using the fieldcache
or the various faceting techniques we have today, but it seems like we could do 
something here, e.g.
if you know up-front you will facet on this field a lot you specify the 
appropriate codec so that
your terms index accelerates faceting.


> Facet Index
> -----------
>
>                 Key: SOLR-153
>                 URL: https://issues.apache.org/jira/browse/SOLR-153
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Yonik Seeley
>         Attachments: facettree.patch, facettree.patch
>
>
> A facet index, initially for non-hierarchical facets.
> Start with all terms, and a set of documents for each term.  Group lower 
> level nodes by taking the union of the sets, but keep track of the largest 
> set going back all the way to the leaves (the max doc-freq for that node).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to