Lucene’s scoring function (which I believe is Okapi BM25, https://en.m.wikipedia.org/wiki/Okapi_BM25) can be seen as a kind of nearest-neighbor search over TF-IDF-style vector representations of the documents and the query. Are you interested in applying ANN to a different kind of vector representation, for example Doc2Vec?
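
To make the nearest-neighbor view above concrete, here is a minimal sketch. It is not Lucene's implementation: it assumes a toy pre-tokenized corpus, a plain TF-IDF weighting, and cosine similarity in place of BM25, and it does an exact brute-force scan rather than ANN. The function names (tfidf_vectors, cosine, nearest) are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build sparse TF-IDF vectors (term -> weight) for pre-tokenized docs.

    Assumption: idf = log(1 + N / df). This is a simple textbook variant,
    not the formula BM25 uses.
    """
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    idf = {term: math.log(1 + n / count) for term, count in df.items()}
    vectors = [
        {term: tf * idf[term] for term, tf in Counter(doc).items()}
        for doc in docs
    ]
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def nearest(query_tokens, doc_vectors, idf, k=1):
    """Return indices of up to k documents most similar to the query.

    Query terms unseen in the corpus are ignored, and documents with
    zero similarity are dropped from the result.
    """
    query = {
        term: tf * idf[term]
        for term, tf in Counter(query_tokens).items()
        if term in idf
    }
    scored = sorted(
        ((cosine(query, vec), i) for i, vec in enumerate(doc_vectors)),
        reverse=True,
    )
    return [i for score, i in scored[:k] if score > 0.0]


if __name__ == "__main__":
    docs = [
        ["lucene", "search", "index"],
        ["vector", "nearest", "neighbor", "search"],
        ["cooking", "recipes"],
    ]
    vectors, idf = tfidf_vectors(docs)
    print(nearest(["nearest", "neighbor"], vectors, idf))  # prints [1]
```

The same nearest() scan would work unchanged for dense vectors such as Doc2Vec embeddings stored as dicts, but it compares the query against every document, which is exactly the cost that ANN algorithms try to avoid.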

On Thu, Feb 28, 2019 at 5:59 AM Adrien Grand <[email protected]> wrote:

> Hi Pedram,
>
> We don't have much in this area, but I'm hearing increasing interest
> so it'd be nice to get better there! The closest that we have is this
> class that can search for nearest neighbors for a vector of up to 8
> dimensions:
> https://github.com/apache/lucene-solr/blob/master/lucene/sandbox/src/java/org/apache/lucene/document/FloatPointNearestNeighbor.java
>
> On Wed, Feb 27, 2019 at 1:44 AM Pedram Rezaei <[email protected]> wrote:
> >
> > Hi there,
> >
> > Is there a way to store numerical vectors (vector based index) and
> > perform search based on Approximate Nearest Neighbor class of
> > algorithms in Lucene?
> >
> > If not, has there been any interests in the topic so far?
> >
> > Thanks,
> >
> > Pedram
>
> --
> Adrien
