Hi Mikhail,

Thanks for the reply.

I think coord works at the document level. I was thinking of having
something that works at the field level, against a 'principal/primary'
field.

I'm using edismax with tie=1 (a.k.a. Disjunction Sum) and several fields,
but I'd like docs with greater query overlap on the primary field to score
higher, if you see what I mean.
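
To make the idea concrete, here is a minimal Python sketch of the per-field
overlap I have in mind. It is only an illustration, not a Solr function
query: the name `overlap` and the plain token lists are assumptions of mine.
It counts the distinct query terms found in the field and divides by the
field's token count, which is how I arrived at the numbers in my original
example.

```python
def overlap(query_terms, field_tokens):
    """Hypothetical helper: fraction of the field covered by the query.

    Counts each distinct query term that occurs in the field once, then
    divides by the total number of tokens in the field.
    """
    if not field_tokens:
        # An empty field cannot overlap with anything.
        return 0.0
    matched = set(query_terms) & set(field_tokens)
    return len(matched) / len(field_tokens)


# The three example docs from my original mail, as token lists.
docs = {
    1: ["A", "B", "B"],
    2: ["A", "B"],
    3: ["A"],
}

if __name__ == "__main__":
    for query in (["A"], ["A", "B"], ["A", "B", "C"]):
        scores = {doc_id: overlap(query, tokens) for doc_id, tokens in docs.items()}
        print(" ".join(query), scores)
```

Note that the ratio alone ties docs 2 and 3 at 1.0 for "A B", so it would
be a boost on top of the normal score rather than a replacement for it.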

Cheers,
Dan

On Tue, Jan 29, 2013 at 7:14 PM, Mikhail Khludnev <
mkhlud...@griddynamics.com> wrote:

> Daniel,
>
> You can start from here
>
> http://lucene.apache.org/core/4_0_0-BETA/core/org/apache/lucene/search/similarities/Similarity.html#coord%28int,%20int%29
> but it requires deep understanding of Lucene internals
>
>
>
> On Tue, Jan 29, 2013 at 2:12 PM, Daniel Rosher <rosh...@gmail.com> wrote:
>
> > Hi,
> >
> > I'm wondering if there exists or if someone has implemented something
> > like the following as a function query:
> >
> > overlap(query,field) = number of matching terms in field / number of
> > terms in field
> >
> > e.g. with three docs having these tokens (e.g. A B C) in a field D:
> > 1:A B B
> > 2:A B
> > 3:A
> >
> > The overlap would be for these queries (-- highlights possibly highest
> > scoring doc):
> >
> > Q:A
> > 1:1/3
> > 2:1/2
> > 3:1/1 --
> >
> > Q:A B
> > 1:2/3
> > 2:2/2 --
> > 3:1/1
> >
> > Q:A B C
> > 1:2/3
> > 2:2/2 --
> > 3:1/1
> >
> > The objective is to pick the most likely doc, using the overlap to boost
> > the score.
> >
> > Cheers,
> > Dan
> >
>
>
>
> --
> Sincerely yours
> Mikhail Khludnev
> Principal Engineer,
> Grid Dynamics
>
> <http://www.griddynamics.com>
>  <mkhlud...@griddynamics.com>
>
