On Tue, Mar 16, 2010 at 1:17 AM, Marvin Humphrey <[email protected]> wrote:
> What I'd like to do is identify the cluster that best represents the document,
> and exclude any terms outside of that cluster when building the
> MoreLikeThisQuery.
>
> What kind of a data structure would we need to achieve that?
>
> Marvin Humphrey

Marvin, I use this for query expansion purposes, so if you have any
ideas (even very slow ones) you want to test, I'd be happy to help
with some relevance-testing gruntwork.

-- 
Robert Muir
[email protected]
