On Wed, 2013-07-03 at 21:58 +0300, Shai Erera wrote:
> What's maxCount? What I mean is that if you create a FacetRequest with
> numResults = 5*K (for example), then you get the top-5K categories and can
> choose the best top-K of those, by their label. Yes, this will hurt top-K
> computation the least, but is not guaranteed to return the correct top-K.
maxCount = numResults

> 
> The other alternative, which you should test, is to create a
> FacetResultsHandler which labels every ordinal and compares by their label.
> While it will be slower, perhaps it's acceptable for your app.
I will try it!

> 
> Regarding the ranges, I assume you're not talking about numeric ranges
> (cause we have a RangeFacetRequest for that), but something else?
> E.g. maybe show Year/1981-1990 and Year/1991-2000? Is that the case?
when lucene 5.x will be released or these api ported to 4.x?

> 
> If so, how would you decide which buckets to create? And can the buckets
> pre-created at indexing already? E.g.
> Year/1981-1990/1982/Jan-Mar/Feb/1-10/8?
> This can definitely turn into an interesting usecase and feature to have in
> Lucene, but I'd need to understand better what sort of ranges do u have in
> mind.
> Maybe we can discuss that on a separate thread?
> 
> Shai

Nicola
> 
> 
> On Wed, Jul 3, 2013 at 7:34 PM, Nicola Buso <nb...@ebi.ac.uk> wrote:
> 
> > Hi Shai,
> >
> > if I'm correctly understanding, you are suggesting to use maxCount in a
> > new FacetResultHandler to subset the whole facet hierarchy so that the
> > counting still has good performances; this is not guaranteeing it will
> > obtain maxcCount values but some approximations to it.
> >
> > The second solution, get all children and than sort by label, is not
> > feasible because the facet values are spread into a big range.
> >
> > Another thing that should be nice to introduce is grouping the result
> > facet values; like for date, but for many other things that are
> > measurable, the values should be spread into big ranges and giving a
> > representation to the user is difficult and probably not useful. If you
> > can group values you should give the user a good approximation.
> > Do you think is feasible to create a new facet hierarchy at runtime
> > composed by groups of values and fill the values in the original
> > hierarchy in the new created one? Too expensive?
> >
> >
> > Nicola.
> >
> >
> >
> > On Tue, 2013-07-02 at 20:49 +0300, Shai Erera wrote:
> > > Well, in general it can be done, but it won't be cheap. You can
> > > implement a FacetResultsHandler which instead of sorting by value will
> > > sort by the category label. But that means you're going to label
> > > *every single category* in order to sort by it.
> > >
> > >
> > > Maybe if you can do away with approximations, you can ask for top-50
> > > and return the "top by label" from that list.
> > > Or, if the number of children is bounded, maybe ask to return all of
> > > them, and then just sort by label. I think that sorting is cheaper
> > > than updating a heap.
> > >
> > > Shai
> > >
> > >
> > >
> > > On Tue, Jul 2, 2013 at 6:54 PM, Nicola Buso <nb...@ebi.ac.uk> wrote:
> > >         Hi,
> > >
> > >         I was thinking about it, what is needed is the 1st, than,
> > >         supposing
> > >         FacetRequest maxCount is setted to 10 I want the latest 10
> > >         years with
> > >         respective counts, also if on year 2000 there are more counts
> > >         than in
> > >         2013.
> > >
> > >
> > >         Nicola.
> > >
> > >         On Tue, 2013-07-02 at 18:40 +0300, Shai Erera wrote:
> > >         > Do you want your top-K to be computed by label too? Or first
> > >         deduce
> > >         > the top-K facets, then sort them otherwise?
> > >         >
> > >         > Shai
> > >         >
> > >         >
> > >         >
> > >         > On Tue, Jul 2, 2013 at 6:36 PM, Nicola Buso
> > >         <nb...@ebi.ac.uk> wrote:
> > >         >         Hi,
> > >         >
> > >         >         I was looking to change the order of the facet
> > >         results; in
> > >         >         this case I
> > >         >         would like to order by the facet label instead of
> > >         the facet
> > >         >         value
> > >         >         (count).
> > >         >
> > >         >         An example is a facet on dates; suppose the facet is
> > >         saved as
> > >         >         YYYY/MM/dd, I would like obtain values for this date
> > >         ordered
> > >         >         by the
> > >         >         date; i.e. with depth 1:
> > >         >
> > >         >         2013 (2,403,222)
> > >         >         2012 (3,632.098)
> > >         >         2011 (1,213,990)
> > >         >         ....
> > >         >
> > >         >         is this possible?
> > >         >
> > >         >
> > >         >         Nicola
> > >         >
> > >         >
> > >         >
> > >
> > ---------------------------------------------------------------------
> > >         >         To unsubscribe, e-mail:
> > >         >         java-user-unsubscr...@lucene.apache.org
> > >         >         For additional commands, e-mail:
> > >         >         java-user-h...@lucene.apache.org
> > >         >
> > >         >
> > >         >
> > >
> > >
> > >
> > >
> >
> >
> >



---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to