Recreating lost shard data

2015-04-12 Thread Vladi Feigin
Hi All, Mistakenly we shut downed 3 nodes in the cluster in the same time. They contained both the primary shard and its replica :-( ES probably decided that it has lost the entire shard and created new shard but with zero documents. We've returned those 3 previously shut downed nodes to the clu

Re: ElasticSearch Multi Data Center replication

2015-04-02 Thread Vladi Feigin
try to keep them decoupled > > -- > > Itamar Syn-Hershko > http://code972.com | @synhershko <https://twitter.com/synhershko> > Freelance Developer & Consultant > Lucene.NET committer and PMC member > > On Thu, Apr 2, 2015 at 2:48 PM, Vladi Feigin > wrote: >

ElasticSearch Multi Data Center replication

2015-04-02 Thread Vladi Feigin
Hi, Is there a solution for this ? Could you recommend some production proven? Thank you Vladi -- This message may contain confidential and/or privileged information. If you are not the addressee or authorized to receive this on behalf of the addressee you must not use, copy, disclose or take

Reindex from the existing index

2015-03-31 Thread Vladi Feigin
Hi, Is there a way to build a new index from the existing index ? Thank you, Vladi -- This message may contain confidential and/or privileged information. If you are not the addressee or authorized to receive this on behalf of the addressee you must not use, copy, disclose or take action based o

Is ElasticSearch supposed to run fast when searching the text?

2015-03-30 Thread Vladi Feigin
Hi, Please help me understand what's wrong here We have this query (see below) Without this part, which searches in a text: query": { "has_child": { "type": "ttype", "query": {"match": { "text": "sometext" }}}) it works pretty fast (10

Re: fielddata_breaker - too high tripped value

2015-03-26 Thread Vladi Feigin
Can someone bring some insight? On Wednesday, March 25, 2015 at 8:29:52 PM UTC+2, Vladi Feigin wrote: > > Hi , > > We're observing on the one of our servers the fielddata_breake.tripped > value much higher than on others > On this specific server we have tripped = ~9K

fielddata_breaker - too high tripped value

2015-03-25 Thread Vladi Feigin
Hi , We're observing on the one of our servers the fielddata_breake.tripped value much higher than on others On this specific server we have tripped = ~9K but on all rest servers we have tripped = 0 What's going wrong with this server? Is it kind of hot spot in terms of data distribution betwe

Filter cache and field data caches

2015-03-24 Thread Vladi Feigin
Hi All, We observe the relatively high heap usage : 70-75% per node. Our heap size is 32G. Each node has 2 shards (one primary , one replica). Each shard size is approximately 43G The size of the field data cache is ~22G and there are zero evictions from this cache The size of the filter cache

Number of shards per node

2015-03-22 Thread Vladi Feigin
Hi All, Is there a way to estimate a max number of shards per node (physical)? Is there some formula for this? Let's say our node is a commodity server. What're best practices? Thank you, Vladi -- This message may contain confidential and/or privileged information. If you are not the addres

Re: Multiple indices vs. multiple shards approach

2015-03-21 Thread Vladi Feigin
Thank you everybody for the help! Is there a way to run routing in a debug mode? For example calculate the shard id via API ? Thank you, Vladi On Friday, March 20, 2015 at 6:28:55 PM UTC+2, Vladi Feigin wrote: > > Hello, > > Please share your thoughts > We have one big ES index a

Re: Multiple indices vs. multiple shards approach

2015-03-20 Thread Vladi Feigin
in our case it means having hundreds or maybe thousands indexes >>> In terms of the maintenance is a big overhead >>> Other approach is create many shards >>> Could you, please share your experience and thoughts? >>> What would you recommend in this scenario >&

Multiple indices vs. multiple shards approach

2015-03-20 Thread Vladi Feigin
our case it means having hundreds or maybe thousands indexes In terms of the maintenance is a big overhead Other approach is create many shards Could you, please share your experience and thoughts? What would you recommend in this scenario Thank you in advance, Vladi Feigin -- This messag

Very slow queries for parent-child index

2015-03-18 Thread Vladi Feigin
Hello, I would like to ask for advice about our index that is built on the principle of Parent Child, Now a search word takes a very long time about 3 minutes. Below is the index structure. Our database (shown below the index schema) contains information about purchases and reviews in different

Re: Accessing tf-idf

2014-05-05 Thread Vladi Feigin
Hello, I have a requirement to retrieve the terms frequency (TF) from all recently indexed documents (last 24 hours) So in a query I have to supply the time range and expect to get the TFs of all terms in the given time range Is it possible to do in ES? If yes, please refer me to the documentati