Markus Jelsma wrote:
Here's a very recent thread on the matter:
http://lucene.472066.n3.nabble.com/facet-method-enum-vs-fc-td1681277.html


Thanks, that's helpful, but still leaves me with questions.

Yonik suggests that with only ~25 unique facet values, facet.method=enum is probably the way to go.

What about 100? 200? It probably depends on the number of documents too: I've got about 3 million.
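
For reference, the kind of request I'm experimenting with looks roughly like this (the field name "category" and the host are placeholders, not my real schema):

  http://localhost:8983/solr/select?q=*:*&rows=0&facet=true&facet.field=category&facet.method=enum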

I know I can just try it and see, but since the penalty for picking wrong is excessive memory use rather than poor performance, it's very hard for me, with my limited JVM knowledge, to tell whether I've picked wrong. The only signal I know of that I got it wrong is an OutOfMemoryError. But maybe I don't get one right away; maybe I get one a couple of weeks later, perhaps under a different usage pattern. Was it caused by facet.method=enum? Or by something else I changed in the interim? Or by something that was always there but that the different usage pattern triggered? It's confusing, you know?
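
One aside on diagnosis, in case it helps anyone in the same spot: I gather (this is an assumption on my part, not something from the thread) that running the JVM with HotSpot's heap-dump flags at least leaves evidence behind if an OOM does show up weeks later:

  java -XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/tmp ...

Then the dump can be inspected afterwards to see whether the filterCache and its bitsets are what filled the heap, or something else entirely.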

That thread Markus references says:

"The enum method creates a bitset for #each# unique facet value. The bit set is (maxdocs / 8) bytes in size (I'm ignoring
some overhead here)."

Is that maxdocs the number of docs in your index, or the number of docs assigned to a given unique facet value? (And in the current result set, or in the index as a whole?) It makes a pretty big difference in overall memory use if you've got, say, 3 million docs and 100 unique facet values, with the documents relatively evenly distributed among them.

I _think_, from the math that follows, that Erick means "maxdocs" in that simple equation to be the number of documents assigned to a given unique facet value, in the index as a whole. But that would seem to mean that the amount of memory used is solely a function of the number of documents in your index, not of the number of unique facet values. And that doesn't seem to square with the other advice we get on the subject.
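
To make the ambiguity concrete, here's the back-of-envelope math under each reading, as a throwaway Java sketch (the 100-value cardinality and the even distribution are my assumptions, and like Erick I'm ignoring per-bitset overhead):

  public class FacetEnumMath {
      public static void main(String[] args) {
          long totalDocs = 3000000L;                    // roughly my index size
          long uniqueValues = 100;                      // hypothetical cardinality
          long docsPerValue = totalDocs / uniqueValues; // 30,000 with an even spread

          // Reading 1: "maxdocs" = docs in the whole index.
          long perBitset1 = totalDocs / 8;               // 375,000 bytes per value
          System.out.println(perBitset1 * uniqueValues); // 37,500,000 bytes, ~36 MB total

          // Reading 2: "maxdocs" = docs assigned to that one value.
          long perBitset2 = docsPerValue / 8;            // 3,750 bytes per value
          System.out.println(perBitset2 * uniqueValues); // 375,000 bytes, ~0.4 MB total
          // ...which is just totalDocs / 8, no matter how many unique values there are.
      }
  }

Reading 1 makes memory scale with the number of unique values; reading 2 makes it a fixed fraction of index size regardless of cardinality. The two lead to very different conclusions about whether 100 or 200 values is safe.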

So... I am confused.
