Re: document diversity

Tricia Williams Thu, 01 Oct 2009 10:19:10 -0700

Hi Mike,

The first thing that comes to mind is to run a query for each document type (assuming that you have a field that stores the type) and qualify the document type: for example, type:pdf. Then you would have to write something to combine the query results, drawing an equal number of hits from each query result. I think that the score would still give you an appropriate measure of relevance if you wanted to maintain relevance order across all the hits from all document types, but my logic could be faulty there.
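
A minimal sketch of that combining step, in plain Python rather than Lucene code. It assumes each per-type query has already been run and its hits collected as (document id, score) pairs sorted by descending score; the name merge_evenly, the Hit shape, and the sample data are all hypothetical. It also assumes scores from the separate queries are comparable, which is the part I am least sure of.

```python
from typing import Dict, List, Tuple

# Hypothetical hit shape: (document id, relevance score).
Hit = Tuple[str, float]


def merge_evenly(hits_by_type: Dict[str, List[Hit]], total: int) -> List[Hit]:
    """Take an equal share of top hits from each type, then order by score."""
    types = sorted(hits_by_type)
    if not types or total <= 0:
        return []
    quota, remainder = divmod(total, len(types))
    picked: List[Hit] = []
    for i, doc_type in enumerate(types):
        # The first `remainder` types get one extra slot so the slots sum to `total`.
        take = quota + (1 if i < remainder else 0)
        # Each per-type list is assumed to be sorted by descending score already.
        picked.extend(hits_by_type[doc_type][:take])
    # Re-sort so the merged list keeps relevance order across all types.
    picked.sort(key=lambda hit: hit[1], reverse=True)
    return picked


# Made-up sample data standing in for the results of type:pdf, type:txt, type:html.
sample = {
    "pdf": [("a.pdf", 0.9), ("b.pdf", 0.5), ("c.pdf", 0.4)],
    "txt": [("a.txt", 0.8), ("b.txt", 0.3)],
    "html": [("a.html", 0.7), ("b.html", 0.6)],
}
top_six = merge_evenly(sample, 6)
```

If a type has fewer hits than its share, the merged list simply comes back shorter than requested; topping it up from the other types would be a separate design choice.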


Tricia


Michael Masters wrote:

I was wondering if there is any way to control what kind of documents
are returned from a search. For example, let's say we have an index
built from different types of documents (pdf, txt, html, etc.). Is
there a way to have the first x results have a specified distribution
of document types? It would be nice to have an even number of results
that are from pdfs, txt files, and html files.


Any help would be greatly appreciated.


-Mike

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]



