Understanding solrcloud/multi-server loadbalancing

2013-08-13 Thread Torsten Albrecht
I have a Solr 3.6 infrastructure with 4 servers, 24 cores/128 GB (~15 shards on 
every server), and 70 million documents.

Now I am setting up a new Solr 4 infrastructure on the same hardware. I have 
reduced the number of shards to 6.

But I don't understand the difference between SolrCloud and multi-server 
load balancing. Is SolrCloud the better way (more performance)?


LoadBalancer - solrcloud (4 Nodes)

LoadBalancer - 4 solr server with the same shards


Is there a benefit to using SolrCloud in this case?
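
For context on the second setup: in classic (non-SolrCloud) distributed search, 
the request itself has to list every shard in the `shards` parameter, whereas 
SolrCloud reads the cluster layout from ZooKeeper. A minimal Python sketch of 
building such a request URL follows. The host names and the six-shard layout 
are hypothetical placeholders, not the configuration described above.

```python
from urllib.parse import urlencode

# Hypothetical shard addresses; replace with the real hosts and cores.
SHARDS = [f"solr{i}.example.com:8983/solr" for i in range(1, 7)]

def distributed_query_url(entry_host, query, shards=SHARDS):
    """Build a select URL that asks entry_host to fan the query out to all shards."""
    params = {"q": query, "shards": ",".join(shards)}
    return f"http://{entry_host}/select?{urlencode(params)}"

print(distributed_query_url("solr1.example.com:8983/solr", "title:solr"))
```

With a plain load balancer, every client (or a request handler default) has to 
carry this shard list, and it must be updated by hand when the layout changes.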


Regards,

Torsten


Internal shard communication - performance?

2013-08-07 Thread Torsten Albrecht
Hi,

I use a system with Solr 3 and 20 shards (3 million docs per shard).

On a test system with one shard (60 million docs) I get 750 requests per second. 
On my live system (20 shards) I get 200 requests per second.

Is the internal communication between the 20 shards a performance killer?
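
One way to frame the question is to count shard-level requests. The sketch 
below assumes that every client query is forwarded to all shards at least once. 
Distributed search normally also needs a second round trip to fetch the stored 
fields of the top hits, which this sketch ignores, so the result is a lower 
bound rather than a measurement.

```python
def shard_requests_per_second(client_qps, num_shards):
    """Lower bound on shard-level requests/s when every query hits every shard."""
    return client_qps * num_shards

# Figures taken from the message above.
print(20 * 3_000_000)                      # total docs on the live system: 60000000
print(shard_requests_per_second(200, 20))  # live system: 4000
print(shard_requests_per_second(750, 1))   # single-shard test system: 750
```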

Another question: is a Solr 4 system with SolrCloud and ZooKeeper a 
high-availability system?


Regards,

Torsten


Re: Internal shard communication - performance?

2013-08-07 Thread Torsten Albrecht
Hi Jack,

I would like to run ZooKeeper externally on my old master server.

So I have two ZooKeeper instances to control my cloud. The third and fourth 
ZooKeeper instances will run on virtual machines.


Torsten


From: Jack Krupansky
Sent: Wednesday, 7 August 2013 20:05
To: solr-user@lucene.apache.org

Three zookeepers give you bare minimum high availability - one can go down.

But... I would personally assert that running embedded zookeeper is
inherently not high availability, just by definition (okay, by MY
definition.)

You didn't say whether you were running embedded zookeeper or not.

But if you were, to be HA, your cluster should be able to have all but one
node per shard go down and your cluster should still service both queries
and updates. But with embedded zookeeper on a four-node cluster, taking down
two of the nodes running embedded zookeeper would make zookeeper no longer
usable, and hence your cluster would not be HA.
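
The arithmetic behind this can be sketched in a few lines of Python. ZooKeeper 
stays available only while a strict majority of the ensemble is running. The 
function names below are illustrative and not part of any ZooKeeper API.

```python
def quorum_size(ensemble_size):
    """Smallest strict majority of the ensemble."""
    return ensemble_size // 2 + 1

def tolerated_failures(ensemble_size):
    """How many ZooKeeper servers can fail while a majority remains."""
    return ensemble_size - quorum_size(ensemble_size)

# Columns: ensemble size, quorum size, tolerated failures
for n in (1, 2, 3, 4, 5):
    print(n, quorum_size(n), tolerated_failures(n))
```

An ensemble of three tolerates one failure, and so does an ensemble of four, 
because four servers need three for a majority. A fourth server adds no extra 
fault tolerance over three; five are needed to survive two failures.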

-- Jack Krupansky

-----Original Message-----
From: Torsten Albrecht
Sent: Wednesday, August 07, 2013 1:15 PM
To: solr-user
Subject: Internal shard communication - performance?

Hi,

I use a system with solr 3 and 20 shards (3 million docs per shard).

On a test system with one shard (60 million docs) I get 750 requests per
second. On my live system (20 shards) I get 200 requests per second.

Is the internal communication between the 20 shards a performance killer?

Another question. Is a solr 4 system with solrcloud and Zookeeper a high
availability system?


Regards,

Torsten



solr sizing

2013-07-29 Thread Torsten Albrecht
Hi all,

We have

- 70 million to 100 million documents

and we want

- 800 requests per second


How many servers (Amazon EC2 or real hardware) do we need for this?

Should we use Solr 4.x with SolrCloud, or are shards behind a load balancer 
better?
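
A rough way to approach the first question is to benchmark one full copy of 
the index and divide. The Python sketch below is only a back-of-the-envelope 
estimate: the per-replica throughput of 250 requests per second and the 70% 
headroom factor are hypothetical placeholders that would have to come from a 
load test on the real data and queries.

```python
import math

def replicas_needed(target_qps, measured_qps_per_replica, headroom=0.7):
    """Replicas needed if each runs at no more than `headroom` of its measured peak."""
    usable = measured_qps_per_replica * headroom
    return math.ceil(target_qps / usable)

# 800 requests/s is the target from this message; 250 is a made-up benchmark figure.
print(replicas_needed(800, 250))
```

Here a "replica" means one complete copy of the index, whether that is a single 
server or a group of shards.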

Is there anyone here who can give me some information, or who operates a 
similar system?


Regards,

Torsten


solr master server

2013-05-02 Thread Torsten Albrecht
Hi,

I want to set up a master/slave configuration for Solr 3.6.

Is there a best practice for the RAID configuration and the Linux partitions 
for the master server?

Cheers,

Torsten