In a word, “yes”. For time routed alias, you also have to be aware of the 
nature of your data. Take the canonical example of news stories for instance, 
and let’s assume that every day a new collection is created.

Now a hot news story breaks and the news is flooded with the latest story, 
“Hurricane hits Florida" for instance. The recent news will contain many more 
mentions of Florida .vs. older collections. So the TF/IDF statistics for recent 
collections will be much different than old collections.

In the normal SolrCloud case where routing is done by hashing the <uniqueKey>, 
the assumption is that the close-to-random distribution of stories will make 
the stats on individual shards “close enough”.

Best,
Erick

> On May 17, 2019, at 11:14 PM, SOLR4189 <klin892...@yandex.ru> wrote:
> 
> I ask my question due to I want to use TRA (Time Routed Aliases). Let's say
> SOLR will open new collection every month. In the beginning of month a new
> collection will be empty almost. 
> So IDF will be different between new collection and collection of previous
> month? 
> 
> 
> 
> --
> Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Reply via email to