Re: document diversity

Grant Ingersoll Sat, 03 Oct 2009 15:25:58 -0700

I'm curious, can you elaborate more on the deeper use case for this?

Perhaps just implementing faceting on doc type would be sufficient? That way users can drill in on doc type. Alternatively, I suppose you could implement a hit collector that accesses a field cache on the doc type field and promotes lesser-seen doc types until they are evenly represented. Could also likely write a Function query that does a similar thing. I'd imagine you need to be careful to control your memory.
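
As a rough illustration of the "promote lesser-seen doc types" idea, here is a minimal sketch in plain Python. It does not use any Lucene or Solr API; it assumes the hits have already been collected in score order as (doc id, doc type) pairs, and the function name `diversify` and the sample file names are hypothetical. It simply takes one hit per doc type in turn, so types that appear less often are pulled up the list.

```python
from collections import OrderedDict, deque


def diversify(ranked_hits, top_n):
    """Round-robin over doc types, keeping score order within each type.

    ranked_hits: iterable of (doc_id, doc_type), best hit first (assumed).
    top_n: maximum number of hits to return.
    """
    # Group hits by type; types are visited in order of first appearance.
    buckets = OrderedDict()
    for doc_id, doc_type in ranked_hits:
        buckets.setdefault(doc_type, deque()).append(doc_id)

    result = []
    while buckets and len(result) < top_n:
        # list(...) copies the keys so exhausted types can be deleted safely.
        for doc_type in list(buckets):
            queue = buckets[doc_type]
            result.append((queue.popleft(), doc_type))
            if not queue:
                del buckets[doc_type]
            if len(result) == top_n:
                break
    return result


# Hypothetical sample data: pdfs dominate the raw ranking.
hits = [
    ("a.pdf", "pdf"), ("b.pdf", "pdf"), ("c.pdf", "pdf"),
    ("d.txt", "txt"), ("e.html", "html"), ("f.txt", "txt"),
]
top = diversify(hits, 4)
```

With this sample input the first four results cover all three types before a second pdf appears. Note that the sketch holds every hit in memory before interleaving, which is exactly the memory concern mentioned above; a real collector would likely cap how many hits it keeps per type.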


-Grant

On Oct 1, 2009, at 12:56 PM, Michael Masters wrote:

I was wondering if there is any way to control what kind of documents
are returned from a search. For example, let's say we have an index
built from different types of documents (pdf, txt, html, etc.). Is
there a way to have the first x results have a specified distribution
of document types? It would be nice to have an even number of results
that are from pdfs, txt files, and html files.


Any help would be greatly appreciated.


-Mike

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]


--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene:

http://www.lucidimagination.com/search


