On Tue, Mar 16, 2010 at 1:17 AM, Marvin Humphrey <[email protected]> wrote:
> What I'd like to do is identify the cluster that best represents the document,
> and exclude any terms outside of that cluster when building the
> MoreLikeThisQuery.
>
> What kind of a data structure would we need to achieve that?
>
> Marvin Humphrey
>
Marvin, I use this for query expansion purposes, so if you have any ideas
(even very slow ones) you want to test, I'd be happy to help with some
relevance-testing gruntwork.

-- 
Robert Muir
[email protected]
