Sounds a lot like multi-tenancy, where you don't want the document
frequencies of one tenant to influence the query relevancy scores for other
tenants.
No ready solution.
Although, I have thought about a simplified document scoring that uses just tf
and leaves out df/idf. Not as good as tf*idf or BM25 s
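To make the tf-only idea concrete, here is a minimal sketch in plain Python. It is not Solr or Lucene code, and the function names, the idf smoothing formula, and the toy tenant data are all assumptions made for illustration. It only shows that a tf-only score for a tenant's document does not move when another tenant's documents are added to the shared index, while a tf*idf score does.

```python
import math
from collections import Counter

def tf_score(query_terms, doc_tokens):
    """Score a document by raw term frequency only (no df/idf)."""
    counts = Counter(doc_tokens)
    return sum(counts[t] for t in query_terms)

def tfidf_score(query_terms, doc_tokens, corpus):
    """tf*idf with idf computed over the given corpus.

    The smoothed idf used here is an illustrative choice, not the
    exact formula of any particular Lucene similarity.
    """
    counts = Counter(doc_tokens)
    n = len(corpus)
    score = 0.0
    for t in query_terms:
        df = sum(1 for d in corpus if t in d)
        idf = math.log((n + 1) / (df + 1)) + 1.0
        score += counts[t] * idf
    return score

# Hypothetical toy data: two tenants sharing one index.
tenant_a = [["solr", "index", "solr"], ["lucene", "index"]]
tenant_b = [["solr", "cloud"], ["solr", "shard"], ["solr", "replica"]]

doc = tenant_a[0]
query = ["solr"]

tf_alone = tf_score(query, doc)                              # ignores the corpus entirely
tfidf_a_only = tfidf_score(query, doc, tenant_a)             # idf from tenant A only
tfidf_shared = tfidf_score(query, doc, tenant_a + tenant_b)  # idf from the shared index

print(tf_alone, tfidf_a_only, tfidf_shared)
```

In this toy data tenant B uses "solr" in every document, so the shared-index idf for that term drops and tenant A's tf*idf score drops with it. The tf-only score has no corpus input, so it cannot be affected.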
Hey Solr people:
Suppose that we did not want to break up our document set into separate
indexes, but had certain cases where many versions of a document were not
relevant for certain searches.
I guess this could be thought of as an "authorization" class of problem,
however it is not that