Also if you are expecting indexing of 2 billion docs as NRT or if it will be offline (during off hours etc). For more accurate sizing you may also want to index say 10 million documents which may give you idea how much is your index size and then use that for extrapolation to come up with memory requirements.
Thanks, Susheel On Mon, Feb 8, 2016 at 11:00 AM, Emir Arnautovic < emir.arnauto...@sematext.com> wrote: > Hi Mark, > Can you give us bit more details: size of docs, query types, are docs > grouped somehow, are they time sensitive, will they update or it is rebuild > every time, etc. > > Thanks, > Emir > > > On 08.02.2016 16:56, Mark Robinson wrote: > >> Hi, >> We have a requirement where we would need to index around 2 Billion docs >> in >> a day. >> The queries against this indexed data set can be around 80K queries per >> second during peak time and during non peak hours around 12K queries per >> second. >> >> Can Solr realize this huge volumes. >> >> If so, assuming we have no constraints for budget what would be a >> recommended Solr set up (number of shards, number of Solr instances >> etc...) >> >> Thanks! >> Mark >> >> > -- > Monitoring * Alerting * Anomaly Detection * Centralized Log Management > Solr & Elasticsearch Support * http://sematext.com/ > >