Re: Multy-tenancy - level of service garantee

2015-02-13 Thread rondelvictor
Regarding shards RAM allocation :

   - Since each shard comes with a Lucene instance :
   - If I have 2 shards, each belonging to a different index, each index 
   being used by a different application.
   Given that each shard's Lucene highly uses OS cache, how can I certify 
   that each Lucene will have enough OS cache for its magic to perform?
   
Thanks,

Le jeudi 12 février 2015 22:44:29 UTC+1, Mark Walkom a écrit :

 How do you guarantee a level of service provided any other way? Redundancy 
 and smart planning and design.

 It's no different with ES.


-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/5ca10ab5-8e25-4072-828f-7a4fafb3afdf%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: Moving to production considerations - applications with their clusters' infrastructure

2015-02-13 Thread rondelvictor
Hi Charlie,

That's excellent news! Thank you for your slideshare and related github!

Regards,

Victor

Le vendredi 13 février 2015 16:07:11 UTC+1, Charlie Hull a écrit :

 Hi,

 Firstly, thanks for the kind words about the performance study: we're 
 hoping to revisit this soon after the feedback we've had on better tuning 
 for each engine. I agree there's a paucity of studies, but in a month or 
 two we should have one from a project we're working on to build an index of 
 product data - we hope to present this at the London Meetup group but will 
 publish the slides afterwards. Sorry this is in the future!

 Cheers

 Charlie

 On 12 February 2015 at 14:27, rondel...@gmail.com javascript: wrote:

 Hi everyone,

 I am considering moving one or several elasticsearch clusters to 
 production.
 Although Elasticsearch's documentation and community is *great*, I am 
 strongly startled not to find any *complete use-case story* stretching 
 from application(s) needs and data considerations to hardware ones.
 Indeed, I understand why what/how much hardware / configuration / 
 sharding questions are systematically replied with both it depends 
 followed by test.
 But then, what about a few complete descriptions, out of so many 
 elasticsearch users, from data use case to cluster's internals, along with 
 a few performance and nodes stats?

 So here are questions, before moving to production :

 Are there any *complete* use cases around? Could you share some? By 
 complete I mean including *at least some* of the following :

1. *Application needs and scope*
2. *Indexing Data indications* : data volume, documents mapping, 
documents / indexes volume
3. *Searching Data indications* : different applications, queries, 
use of facets - filters - aggregations, concurrent indexing
4. *Cluster Hardware* : machines' hardware (RAM, Disks/SSD - 
DAS-JBOD/SAN/NAS), JVM heap / OS Cache, nb of machines, back office 
 network
5. *Cluster Configuration* : one or several indexes, sharding, 
replication, master nodes, data nodes, use of over-sharding at start-up, 
use of re-indexing
6. *Benchmaks *: queries response times, QPS, with or without 
concurrent indexing, memory heap sweet spot, nodes stats

 For those interested, here are the (not *complete*) best-among-very-few 
 exemples I've stumbled upon so far :

- The very best (perfs with hardware and query description) : 

 http://fr.slideshare.net/charliejuggler/lucene-solrlondonug-meetup28nov2014-solr-es-performance
- Hardware and master nodes heap : 

 https://groups.google.com/forum/?fromgroups#!searchin/elasticsearch/sizing/elasticsearch/V5BtrCGOqoU/l7x6vqMEx5YJ
- *6th slide* - Hardware and storage with number of documents (well, 
without indexes and documents storage volume nor RAM consumption) :


 https://speakerdeck.com/bhaskarvk/scaling-elasticsearch-washington-dc-meetup
With JBOD / SAN storage discussion in To Raid or not to Raid:


 https://groups.google.com/forum/?fromgroups#!searchin/elasticsearch/hardware/elasticsearch/HSj2fZGdU1Y/4mFCBTCb-JcJ
- Usual heap considerations in a real case :


 https://codeascraft.com/2014/12/04/juggling-multiple-elasticsearch-instances-on-a-single-host/

 Do not forget Elasticsearch awesome docs for moving to production 
 considerations :

- 

 http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/administration.html
- 

 http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/deploy.html
- 
 *http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html
  

 http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html*
- 
 *http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/heap-sizing.html
  

 http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/heap-sizing.html*


  -- 
 You received this message because you are subscribed to the Google Groups 
 elasticsearch group.
 To unsubscribe from this group and stop receiving emails from it, send an 
 email to elasticsearc...@googlegroups.com javascript:.
 To view this discussion on the web visit 
 https://groups.google.com/d/msgid/elasticsearch/927f60b1-8ae2-463e-b725-5f4f993905d9%40googlegroups.com
  
 https://groups.google.com/d/msgid/elasticsearch/927f60b1-8ae2-463e-b725-5f4f993905d9%40googlegroups.com?utm_medium=emailutm_source=footer
 .
 For more options, visit https://groups.google.com/d/optout.




-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/34acb9f4-a622-4ed6-b877-d605f0c6382b%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Moving to production considerations - applications with their clusters' infrastructure

2015-02-12 Thread rondelvictor
Hi everyone,

I am considering moving one or several elasticsearch clusters to production.
Although Elasticsearch's documentation and community is *great*, I am 
strongly startled not to find any *complete use-case story* stretching from 
application(s) needs and data considerations to hardware ones.
Indeed, I understand why what/how much hardware / configuration / 
sharding questions are systematically replied with both it depends 
followed by test.
But then, what about a few complete descriptions, out of so many 
elasticsearch users, from data use case to cluster's internals, along with 
a few performance and nodes stats?

So here are questions, before moving to production :

Are there any *complete* use cases around? Could you share some? By 
complete I mean including *at least some* of the following :
   
   1. *Application needs and scope*
   2. *Indexing Data indications* : data volume, documents mapping, 
   documents / indexes volume
   3. *Searching Data indications* : different applications, queries, use 
   of facets - filters - aggregations, concurrent indexing
   4. *Cluster Hardware* : machines' hardware (RAM, Disks/SSD - 
   DAS-JBOD/SAN/NAS), JVM heap / OS Cache, nb of machines, back office network
   5. *Cluster Configuration* : one or several indexes, sharding, 
   replication, master nodes, data nodes, use of over-sharding at start-up, 
   use of re-indexing
   6. *Benchmaks *: queries response times, QPS, with or without concurrent 
   indexing, memory heap sweet spot, nodes stats

For those interested, here are the (not *complete*) best-among-very-few 
exemples I've stumbled upon so far :

   - The very best (perfs with hardware and query description) : 
   
http://fr.slideshare.net/charliejuggler/lucene-solrlondonug-meetup28nov2014-solr-es-performance
   - Hardware and master nodes heap : 
   
https://groups.google.com/forum/?fromgroups#!searchin/elasticsearch/sizing/elasticsearch/V5BtrCGOqoU/l7x6vqMEx5YJ
   - *6th slide* - Hardware and storage with number of documents (well, 
   without indexes and documents storage volume nor RAM consumption) :
   
   https://speakerdeck.com/bhaskarvk/scaling-elasticsearch-washington-dc-meetup
   With JBOD / SAN storage discussion in To Raid or not to Raid:
   
   
https://groups.google.com/forum/?fromgroups#!searchin/elasticsearch/hardware/elasticsearch/HSj2fZGdU1Y/4mFCBTCb-JcJ
   - Usual heap considerations in a real case :
   
   
https://codeascraft.com/2014/12/04/juggling-multiple-elasticsearch-instances-on-a-single-host/

Do not forget Elasticsearch awesome docs for moving to production 
considerations :

   - 
   
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/administration.html
   - 
   http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/deploy.html
   - 
   
*http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html*
   - 
   
*http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/heap-sizing.html*


-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/927f60b1-8ae2-463e-b725-5f4f993905d9%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Use cases - Production examples: datas, queries, cluster hardware and conf, and statistics

2015-02-12 Thread rondelvictor
Hi everyone,

I am considering moving one or several elasticsearch clusters to production.
Although Elasticsearch's documentation and community is *great*, I am 
strongly startled not to find any *complete use-case story* stretching from 
application(s) needs and data considerations to hardware ones.
Indeed, I understand why what/how much hardware / configuration / 
sharding questions are systematically replied with both it depends 
followed by test.
But then, what about a few complete descriptions, out of so many 
elasticsearch users, from data use case to cluster's internals, along with 
a few performance and nodes stats?

So here are questions, before moving to production :

Are there any *complete* use cases around? Could you share some? By 
complete I mean including *at least some* of the following :
   
   1. *Application needs and scope*
   2. *Indexing Data indications* : data volume, documents mapping, 
   documents / indexes volume
   3. *Searching Data indications* : different applications, queries, use 
   of facets - filters - aggregations, concurrent indexing
   4. *Cluster Hardware* : machines' hardware (RAM, Disks/SSD - 
   DAS-JBOD/SAN/NAS), JVM heap / OS Cache, nb of machines, back office network
   5. *Cluster Configuration* : one or several indexes, sharding, 
   replication, master nodes, data nodes, use of over-sharding at start-up, 
   use of re-indexing
   6. *Benchmaks *: queries response times, QPS, with or without concurrent 
   indexing, memory heap sweet spot, nodes stats

For those interested, here are the (not *complete*) best-among-very-few 
exemples I've stumbled upon so far :

   - The very best (perfs with hardware and query description) : 
   
http://fr.slideshare.net/charliejuggler/lucene-solrlondonug-meetup28nov2014-solr-es-performance
   - Hardware and master nodes heap : 
   
https://groups.google.com/forum/?fromgroups#!searchin/elasticsearch/sizing/elasticsearch/V5BtrCGOqoU/l7x6vqMEx5YJ
   - *6th slide* - Hardware and storage with number of documents (well, 
   without indexes and documents storage volume nor RAM consumption) :
   
   https://speakerdeck.com/bhaskarvk/scaling-elasticsearch-washington-dc-meetup
   With JBOD / SAN storage discussion in To Raid or not to Raid:
   
   
https://groups.google.com/forum/?fromgroups#!searchin/elasticsearch/hardware/elasticsearch/HSj2fZGdU1Y/4mFCBTCb-JcJ
   - Usual heap considerations in a real case :
   
   
https://codeascraft.com/2014/12/04/juggling-multiple-elasticsearch-instances-on-a-single-host/

Do not forget Elasticsearch awesome docs for moving to production 
considerations :

   - 
   
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/administration.html
   - 
   http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/deploy.html
   - 
*http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html
 
   
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html*
   - 
*http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/heap-sizing.html
 
   
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/heap-sizing.html*

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/57c967ea-8bf0-4dce-a7ca-4a746ee21250%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Multy-tenancy - level of service garantee

2015-02-12 Thread rondelvictor
Hi everyone,

After my precedent question 
https://groups.google.com/forum/?fromgroups#!topic/elasticsearch/US-BA4R_Qdc 
regarding examples of clusters in production, I am wondering about 
multy-tenancy and garantee of service in Elasticsearch :

*Multy-tenant cluster* : Is there a way to *garantee a level of service* / 
capacity planning for *each tenant* using the cluster (its *own indexes*) ?


Thanks,
 

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/7f99a562-ae18-447f-b227-cd145483dcf3%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Moving to production - Storage and servers

2015-02-12 Thread rondelvictor
Hi everyone,

After my first 
https://groups.google.com/forum/?fromgroups#!topic/elasticsearch/US-BA4R_Qdc 
and second question 
https://groups.google.com/forum/?fromgroups#!topic/elasticsearch/FGKUmzn-WSs 
regarding clusters' examples and multy-tenancy with garantee of service, I 
have one more on my way to production :

Could you describe the *pros* and *cons* of the :
   
   1. Disks : *DAS* (Direct Attached Storage) - *JBOD* (Just a Bunch Of 
  Disks) / *SAN* (Storage Area Network)? I take it that NAS is most of 
  the time to be avoided
  2. Servers : bare metal servers / virtual servers ?
   
For those interested :

   - JBOD / SAN storage discussion in To Raid or not to Raid:
   
   
https://groups.google.com/forum/?fromgroups#!searchin/elasticsearch/hardware/elasticsearch/HSj2fZGdU1Y/4mFCBTCb-JcJ
   - Doc which states about SSDs, RAID0, NAS, networks
   
   
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html

Thanks!

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/3ab36482-e4fb-4dd0-99cd-044d2e4257ea%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.