Official numbers? There are none. If for no other reason than that performance is completely dependent on your specific hardware and your specific data and your specific data model. The standard recommendation is that you should do a proof of concept implementation with a reasonable subset of your data and judge for yourself whether the throughput and latency are sufficient for your own specific requirements. Not everyone has extreme throughput and latency requirements. If your requirements are extreme then virtualization will likely not to work out for you, but if your requirements are reasonably mild and you adequately provision your cluster with enough shards and enough replicas, then virtualization may actually work out well for you. Either way, adequately provisioning the cluster (not overloading individual nodes with either too many documents or too many requests) is always essential unless you are working with a very small collection of data with a very light load.
The standard recommendation is to avoid the use of a load balancer between the app and Solr - since the server client API in SolrJ automatically does smart routing and round-robin load balancing: https://lucene.apache.org/solr/5_4_0/solr-solrj/org/apache/solr/client/solrj/impl/CloudSolrClient.html https://lucene.apache.org/solr/5_4_0/solr-solrj/org/apache/solr/client/solrj/impl/CloudSolrServer.html You may want a load balancer in front of multiple instances of your app, but that's not a question or issue for Solr. The only issue there is assuring that you have enough Solr shards and replicas to handle the aggregate request load. -- Jack Krupansky On Thu, Jan 21, 2016 at 6:37 AM, Gian Maria Ricci - aka Alkampfer < alkamp...@nablasoft.com> wrote: > Hi, > > > > I’ve a couple of quick question about production setup. > > > > The first one is about virtualization, I’d like to know if there are any > official test on loss of performance in virtualization environment. I think > that the loss of performance is negligible, and quick question on test > infrastructure is confirming this, but I’d like to know if there is some > official numbers on this. > > > > The second question is about Load Balancer: any clue on how to > automatically change the configuration on the load balancer if some of the > node goes down? I’m looking to advices on what to monitor, the simplest > solution could be issuing some test query and verify if the node is able to > answer, but it would be nice to know if there are some standard metrics to > monitor to proactively alert. (Es. Heap size almost full, so it would be > probably better to remove the node from the balancer and alert a human to > have a look at the status of the node). > > > > Many thanks. > > > > -- > Gian Maria Ricci > Cell: +39 320 0136949 > > [image: > https://ci5.googleusercontent.com/proxy/5oNMOYAeFXZ_LDKanNfoLRHC37mAZkVVhkPN7QxMdA0K5JW2m0bm8azJe7oWZMNt8fKHNX1bzrUTd-kIyE40CmwT2Mlf8OI=s0-d-e1-ft#http://www.codewrecks.com/files/signature/mvp.png] > <http://mvp.microsoft.com/en-us/mvp/Gian%20Maria%20Ricci-4025635> [image: > https://ci3.googleusercontent.com/proxy/f-unQbmk6NtkHFspO5Y6x4jlIf_xrmGLUT3fU9y_7VUHSFUjLs7aUIMdZQYTh3eWIA0sBnvNX3WGXCU59chKXLuAHi2ArWdAcBclKA=s0-d-e1-ft#http://www.codewrecks.com/files/signature/linkedin.jpg] > <http://www.linkedin.com/in/gianmariaricci> [image: > https://ci3.googleusercontent.com/proxy/gjapMzu3KEakBQUstx_-cN7gHJ_GpcIZNEPjCzOYMrPl-r1DViPE378qNAQyEWbXMTj6mcduIAGaApe9qHG1KN_hyFxQAIkdNSVT=s0-d-e1-ft#http://www.codewrecks.com/files/signature/twitter.jpg] > <https://twitter.com/alkampfer> [image: > https://ci5.googleusercontent.com/proxy/iuDOD2sdaxRDvTwS8MO7-CcXchpNJX96uaWuvagoVLcjpAPsJi88XeOonE4vHT6udVimo7yL9ZtdrYueEfH7jXnudmi_Vvw=s0-d-e1-ft#http://www.codewrecks.com/files/signature/rss.jpg] > <http://feeds.feedburner.com/AlkampferEng> [image: > https://ci6.googleusercontent.com/proxy/EBJjfkBzcsSlAzlyR88y86YXcwaKfn3x7ydAObL1vtjJYclQr_l5TvrFx4PQ5qLNYW3yp7Ig66DJ-0tPJCDbDmYAFcamPQehwg=s0-d-e1-ft#http://www.codewrecks.com/files/signature/skype.jpg] > > >