Re: Multi DC cluster or separate cluster per DC?

2014-05-13 Thread Amit Soni
I am just wondering whether elastic search team has any plans to add features for multi-data center deployment (active-active)? -Amit. On Mon, May 12, 2014 at 11:02 AM, Deepak Jha wrote: > Having a separate cluster is definitely a better way to go. OR, you can > control the shard, replica plac

Re: Error after "build from source"

2014-05-13 Thread Adrien Grand
Hi, The build has been broken recently indeed (which is my bad, all apologies) but it should be fixed now, can you update your clone of elasticsearch and try again? On Wed, May 14, 2014 at 5:14 AM, Raine Nguyen wrote: > I'm following the guide to build from the source pulled directly from > gi

Re: Elasticsearch node under utilising file descriptors

2014-05-13 Thread Tamanjit Bindra
Hi, Sorry about the paste thing. I have recently joined the organisation and they have been using ES for some time now. I am not too sure if there is any plan to migrate to 1.X version any time soon. Although yes that is something that we may have to do eventually. But is there any possibility

modify explain string in elasticsearch plugin

2014-05-13 Thread Srinivasan Ramaswamy
Hi I am overriding the scoring function using a plugin as described in https://github.com/imotov/elasticsearch-native-script-example I would like to know how to override the explain string, so that i can add explanation for all my custom scoring. Thanks Srini -- You received this message b

Looking for a regex kind of search from kibana dashboard to ES

2014-05-13 Thread dharmendra pratap singh
Hi Guys, I want to perform a search to Es from kibana which includes few words and then it should find out all the sentnces with my search string present in it. EG- Search String=== 'transit department reverted'* No I am looking for all the records which has a string 'transit department rev

Re: Elasticsearch node under utilising file descriptors

2014-05-13 Thread Mark Walkom
It's better to put that sort of thing in a gist/pastebin :) Any reason you aren't running a more recent version of ES? You'll get much better performance from the new versions. Also can you be specific about your java, what update, OpenJDK or Oracle? Regards, Mark Walkom Infrastructure Engineer

Re: Elasticsearch node under utilising file descriptors

2014-05-13 Thread Tamanjit Bindra
Hi Mark, Thanks for the reply. We are using ES 0.90 on Java 7 The stats are as under : "ok" : true, "_shards" : { "total" : 300, "successful" : 300, "failed" : 0 }, "_all" : { "primaries" : { "docs" : { "count" : 28403397, "deleted" : 1385142 }

unable to take snapshot and restore using repository-hdfs

2014-05-13 Thread Chetana
I am using ES 1.1.1 and Hadoop hortonworks 2.2.2 and have installed the plugin '*repository-hdfs' (https://github.com/elasticsearch/elasticsearch-hadoop/tree/master/repository-hdfs )* in ES. I have created one

Re: Elasticsearch node under utilising file descriptors

2014-05-13 Thread Mark Walkom
ES will use descriptors to open files it needs for general operations and it can have a lot of those open at one time. But the level you've set and the level you're using are not related to your slow queries. How much data do you have in your cluster? What version of ES, what version of java? You

Elasticsearch node under utilising file descriptors

2014-05-13 Thread Tamanjit Bindra
Hi, I have read here, that we should set the soft and hard values of nooffiles for ES to utilise. I have done that, but somehow my BigDesk console shows that even though the upper limit is 65535, ES is only using about 2K+ file descrip

Performance Issue(Java heap memory error) while Indexing 20k records with 10 fields on windows 64 bit machine

2014-05-13 Thread dharmendra pratap singh
Hi Guys, Hope you all are doing good. In my ES, I am facing java heap memory error, while indexing just 20k records. Even I have increasesd the heap memory size in ES bat file. can anyone of you suggest me what all changesges I need to do for this. Or ES has certain restricition on windows mac

Re: Time Date: Giant Index w/Shard Routing VS Small Indices w/Little Shards and Aliasing

2014-05-13 Thread webish
Ok. Makes sense. I'd like to setup an indexing strategy for time data that will hold for some time without needing to reshuffle everything. Advantages I've found of the small indices and shards would be that there is NO finite number of shards. Aliasing strategies have more power than basic

Re: Search in Multiple Index

2014-05-13 Thread dharmendra pratap singh
Ivan, Thanks a lot for your quick response. I am looking to have multiindex search from the kibana dashboard's searchbox, It would be a gr8 if you can suggeset some sample query which searches from multiple index from kibana. (As of now I am not querying from rest api's GET method). @Group memb

Re: Time Date: Giant Index w/Shard Routing VS Small Indices w/Little Shards and Aliasing

2014-05-13 Thread Mark Walkom
Sharding is good for when you have multiple nodes, that way you have a small number of shards per node that can be queried in parallel, rather than one (or a few) done sequentially. However you will get similar results by having many smaller indexes across multiple nodes. The key thing between the

Time Date: Giant Index w/Shard Routing VS Small Indices w/Little Shards and Aliasing

2014-05-13 Thread webish
I am attempting to optimize time based data such as that of a newsfeed. I've been running tests with data broken into indices based on month, week, day. I'm using aliases to query the entire set or smaller ranges such as "last-month", "last-quarter". I'm still trying to figure out what will be

Re: elasticsearch performance issue with load increasing reponse time is increasing

2014-05-13 Thread webish
Did you ever solve this? On Thursday, February 27, 2014 4:32:29 AM UTC-5, kondapal...@gmail.com wrote: > > Hi All > > We had elasticsearch with 4 Nodes the the Cluster each 8gb memory. > when we are calling the elastic search query from java with 25 threads > first thread is taking 400 msec wher

Re: reverse_nested available in 1.1.1?

2014-05-13 Thread Raine Nguyen
Thank you very much for clearing my confusion On Wednesday, May 14, 2014 12:43:24 AM UTC+7, David Pilato wrote: > > No it's not in 1.1.1. It should be documented that this feature is only > available from 1.2.0. > > -- > David ;-) > Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs > > > Le 1

Error after "build from source"

2014-05-13 Thread Raine Nguyen
I'm following the guide to build from the source pulled directly from github for the branch 1.x I'm getting this when I run bin/elasticsearch on Windows 8 from the extracted files from target/releases folder [2014-05-14 08:44:28,871][INFO ][node ] [gen Harada] > version > [

types in segments

2014-05-13 Thread Anand Nalya
Hi, Are documents of different mappings in an index stored in different segments or a single segment can contain documents of different types. Is _type the only way of differentiating the type of an documents? Regards, Anand -- You received this message because you are subscribed to the Googl

Conditional query for geo location lookup

2014-05-13 Thread Utkarsh Sengar
I have a usecase where I have 2 types of locations (i.e. with geo_point type): 1. Location 1: Has a lat/lon with say radius=90 miles (it will vary) and type=outgoing 2. Location 2: Has a lat/lon with no radius and type=incoming Now, when a query comes in with: lat/lon and radius=20, I expect this

Sort Order when relevance is equal

2014-05-13 Thread Erich Lin
Ignoring the bouncing results problem with multiple shards , is the order of results deterministic when sorting by relevance score or any other field. What I mean by this is if two documents have the same score, 1) will they always be in the same order if we set the preference parameter to

Re: query to get latest comment of each user in the month

2014-05-13 Thread 'Binh Ly' via elasticsearch
Not at the moment. However you can do a terms agg on user, then nest a max agg on the timestamp. Then from the results that come back, you will need to run a second _search to retrieve the comment values (using the user + timestamp combo for each bucket from the aggs). -- You received this mes

Re: Out of heap error on machines with 18GB heap and 6GB index

2014-05-13 Thread Justin Zhu
Here's log of GC [2014-05-13 18:09:25,912][WARN ][monitor.jvm ] [elasticsearch-i2-1] [gc][young][404487][53746] duration [1s], collections [1]/[1s], total [1s]/[33.7m], memory [10.4gb]->[10.6gb]/[17.7gb], all_pools {[young] [48.9mb]->[12.9mb]/[266.2mb]}{[survivor] [33.2mb]->[33.2m

Out of heap error on machines with 18GB heap and 6GB index

2014-05-13 Thread Justin Zhu
We have a 3 node cluster, each with 30gb total memory, 18gb allocated to elasticsearch and replicas set at 2. Our largest index is 6GB. After running for a few days, the cluster would go down with Java out of heap errors. We currently have a multi-get aggregation that issues 40 requests on the

Re: Search in Multiple Index

2014-05-13 Thread Ivan Brusic
You cannot do a join between two indices. -- Ivan On Tue, May 13, 2014 at 2:05 PM, Arnaud Boniface wrote: > Hello Ivan, > > I thank you for your link. It confirms that we can search into several > indices or types. > > Do you think is it possible to make a query with criteria on the presence

Re: Filter cache invalidation

2014-05-13 Thread Gabe Gorelick-Feldman
@Jörg It looks like you're using a plain terms filter. I'm trying to do a terms lookup filter. This blog post does a good job describing terms lookup [1]. [1] http://www.elasticsearch.org/blog/terms-filter-lookup/ On Tuesday, May 13, 2014 5:56:19 PM UTC-4, Jörg Prante wrote: > > I modified yo

Re: Filter cache invalidation

2014-05-13 Thread Gabe Gorelick-Feldman
It looks like there was some discussion about this last year: https://github.com/elasticsearch/elasticsearch/issues/3219. The consensus in that issue seems to be to disable terms lookup caching with `cache:false` where you care about consistency. This is probably good enough for me, although it

Re: Filter cache invalidation

2014-05-13 Thread joergpra...@gmail.com
I modified your example a little bit, in this gist https://gist.github.com/jprante/042aaa910e47ebf4536b but I find the filter cache is invalidated with the refresh after overwriting an existing doc. Maybe your example is confused because of the two indices you use? Jörg On Tue, May 13, 2014

query to get latest comment of each user in the month

2014-05-13 Thread Aditya Raut
I have a es index which is of the following structure User: 1, timestamp: `13-5-2014`, comment: "abcded" User: 2, timestamp: `10-5-2014`, comment: "abcded" User: 1, timestamp: `12-5-2014`, comment: "abcded" User: 2, timestamp: `8-5-2014`, comment: "abcded" User: 3, timestamp: `5-5-2014`, comment: "

Re: Elasticsearch primary shards (re)location

2014-05-13 Thread Mark Walkom
Do you have replicas? I'm guessing yes, and thus it's not a major concern, as if the node with the primaries disappears then the secondary shards on the other nodes will be promoted. And as far as searching is concerned, a primary and a secondary as considered the same. You can tweak this behaviou

Re: cluster.routing.allocation.cluster_concurrent_rebalance not respected?

2014-05-13 Thread Mark Walkom
I might be misreading this, but as far as I know spacing matters in the yml config. Try - cluster.routing.allocation.cluster_concurrent_rebalance: 2 Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 14 May 2014 00:27,

Re: time problem

2014-05-13 Thread Mark Walkom
Do you have logstash indexes in the elasticsearch cluster? Also, what's your kibana config look like? Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 13 May 2014 23:55, Quentin Guiot wrote: > hello > > I have a lit

Re: Search in Multiple Index

2014-05-13 Thread Arnaud Boniface
Hello Ivan, I thank you for your link. It confirms that we can search into several indices or types. Do you think is it possible to make a query with criteria on the presence of result into each index ? for example, we must have the same "owner id" into each index specified. Best regards, PM

Re: Filter cache invalidation

2014-05-13 Thread Gabe Gorelick-Feldman
I double-checked, and it looks like the terms lookup filter is not invalidated. Here are some steps to reproduce: First, seed your data: PUT /user/user/1 { "roles": ["admin"] } POST /foo/bar { "a": 1, "role": "admin" } This query will return 1 hit, as expected: POST /foo/bar/_sear

Re: elasticsearch service on windows 64 memory issues

2014-05-13 Thread VB
Can someone please reply to this. On Tuesday, 13 May 2014 11:41:05 UTC-7, VB wrote: > > Hello all, > > We are running 90.11 version on a cluster with master, data and client > nodes. > > Our master and data nodes are on Linux. But client boxes are running on > Windows 64 but boxes as we have .Ne

Re: getting average aggregation in java

2014-05-13 Thread David Pilato
Have a look here https://github.com/dadoonet/legacy-search/blob/06-compute/src/main/java/fr/pilato/demo/legacysearch/dao/ElasticsearchDao.java#L37 Should help. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 13 mai 2014 à 22:21, T Vinod Gupta a écrit : has anyone gotten

getting average aggregation in java

2014-05-13 Thread T Vinod Gupta
has anyone gotten average aggregation in java? how do you use the api to create the aggregation request? it is quite confusing.. i want to do the following in java - "aggregations" : { "avg_grade" : { "avg" : { "field" : "score" }}} thanks -- You received this message because you are subscribed

Re: Filter cache invalidation

2014-05-13 Thread Nikolas Everett
You'd have to post an example for me to be sure what's up but I don't expect terms filters to be different from other filters. On Tue, May 13, 2014 at 3:45 PM, Gabe Gorelick-Feldman < gabegorel...@gmail.com> wrote: > Very cool, but what about terms lookup? The docs mention it uses an LRU > cache

Re: Newbie - memory issues

2014-05-13 Thread Alexandru Ungur
Thank you so much! :) Indeed, on all the others is like 20-40 max (at peak), but on this one is just growing and growing (past 16k now). We do use firewalls on each slice to protect ES from outside access, but they are configured identically, to allow each other access. Just verified and I can

Re: Filter cache invalidation

2014-05-13 Thread Gabe Gorelick-Feldman
Very cool, but what about terms lookup? The docs mention it uses an LRU cache, is that separate from the segment? I ask because in my naive little tests it seems that I'm getting stale results from my terms lookup filters, even with `indices.cache.filter.terms.expire_after_write` set. On Tuesda

Re: History of searches in ElasticSearch

2014-05-13 Thread David Pilato
No. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 13 mai 2014 à 20:48, IronMan2014 a écrit : Can I get results about "What is the most frequently searched" and most recent searches? I understand I can have this info logged in a database, but I am wondering if Elastic

Just get back specific entries of an array?

2014-05-13 Thread Sentient6
Hallo dear all, I have trouble with arrays in ElasticSearch. I want to index a document of this structure: {"trial" : [ { "title": "hallo", "id": "5678", "fulltext": "world" }, { "title": "test", "id": "1234", "fulltext":

Re: Percolator on nested object shard failure

2014-05-13 Thread razafinr
Thanks it was the problem. My query was missing the "doc" field. -- View this message in context: http://elasticsearch-users.115913.n3.nabble.com/Percolator-on-nested-object-shard-failure-tp4055434p4055796.html Sent from the ElasticSearch Users mailing list archive at Nabble.com. -- You rece

History of searches in ElasticSearch

2014-05-13 Thread IronMan2014
Can I get results about "What is the most frequently searched" and most recent searches? I understand I can have this info logged in a database, but I am wondering if ElasticSearch has this info instead? -- You received this message because you are subscribed to the Google Groups "elasticsear

Re: Search in Multiple Index

2014-05-13 Thread Ivan Brusic
The Search API allows you to search in multiple indexes at the same time: http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-search.html Cheers, Ivan On Tue, May 13, 2014 at 1:52 AM, dharmendra pratap singh < dharmendra0...@gmail.com> wrote: > Hi Team, > Hope you all

elasticsearch service on windows 64 memory issues

2014-05-13 Thread VB
Hello all, We are running 90.11 version on a cluster with master, data and client nodes. Our master and data nodes are on Linux. But client boxes are running on Windows 64 but boxes as we have .Net applications. In past we had allocated ES_HEAP_SIZE as 2g to client boxes and we had issues of

Re: Filter cache invalidation

2014-05-13 Thread Nikolas Everett
On Tue, May 13, 2014 at 2:24 PM, Gabe Gorelick-Feldman < gabegorel...@gmail.com> wrote: > Does elasticsearch invalidate your filter caches automatically after a > write? The only documentation I've found on filter cache invalidation > (besides time-based expiry) is on > `indices.cache.filter.terms

Re: Newbie - memory issues

2014-05-13 Thread joergpra...@gmail.com
Your thread count on es3 is ridiculous high. Hint: check your network settings and routing if the es3 node can connect to all others. I assume es3 is caught in a loop desperatly trying to reconnect to something and consuming the hell out of threads... can you take a thread snapshot of es3? Do you

Filter cache invalidation

2014-05-13 Thread Gabe Gorelick-Feldman
Does elasticsearch invalidate your filter caches automatically after a write? The only documentation I've found on filter cache invalidation (besides time-based expiry) is on `indices.cache.filter.terms.expire_after_write`, which is disabled by default. But what about other filters besides term

Best way to sync with MongoDB

2014-05-13 Thread Demetrius Nunes
Hi there, I know, there are a few options for performing a close to real-time integration with MongoDB, such as rivers [1], mongodb-connector [2] or manually. I've been reading about rivers being something that will be deprecated. I've tested [1] and it worked reasonably well, although I had

Re: [hadoop] Getting elasticsearch-hadoop working with Shark

2014-05-13 Thread Costin Leau
Hi Nick, I'm glad to see you are making progress. This week I'm mainly on the road but maybe we can meet on the IRC next week, my invitation still stands :) Timestamp is relatively new type and doesn't handle timezones properly - it is backed by java.sq.Timestamp so it inherits a lot of its iss

Re: reverse_nested available in 1.1.1?

2014-05-13 Thread David Pilato
No it's not in 1.1.1. It should be documented that this feature is only available from 1.2.0. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 13 mai 2014 à 15:54, Raine Nguyen a écrit : Hi all, I'm using ES 1.1.1 (latest version available for download at this time) and

Re: New Elasticsearch Discovery Plugin

2014-05-13 Thread joergpra...@gmail.com
1. Yes, discovered nodes are provided by the DiscoveryNode class 2. Can you please be more specific - what discovery class use what constructors? Maybe the answer is in 4. 3. ClusterState keeps the current state of the cluster regarding nodes, indices, mappings, and most important the elected lea

Plain Highlighter using document-defined analyzer

2014-05-13 Thread Mateusz Kaczynski
We have recently encountered a problem with item highlighting when we changed the way we define analysers. In short, the problem boiled down to PlainHighlighter using analyser for the document type. while we specify analyser on per-document basis. '_analyzer' field is not used (or, from my

Elasticsearch primary shards (re)location

2014-05-13 Thread Georgi Ivanov
Hi, I have the following situation : I have 8 node cluster. Periodically some nodes are restarted, and their primary shards allocated on other nodes. After the node is back, it contains much less primary shards then the rest of the nodes. Now i have a situation, where one node holds many primar

cluster.routing.allocation.cluster_concurrent_rebalance not respected?

2014-05-13 Thread Georgi Ivanov
Hi, In elasticsearch.yml i have : cluster.routing.allocation.cluster_concurrent_rebalance : 6 still i see curl http://localhost:9200/_cat/health?v epoch timestamp cluster status node.total node.data shards pri relo init unassign 131043 16:24:03 mycluster green

time problem

2014-05-13 Thread Quentin Guiot
hello I have a little problem with elasticsearch when using it with kibana and logstash when i lauch my logstash.conf, i get an error on kibana : "*No results* There were no results because no indices were found that match your selected time span" i really don't know why can someone help m

reverse_nested available in 1.1.1?

2014-05-13 Thread Raine Nguyen
Hi all, I'm using ES 1.1.1 (latest version available for download at this time) and I'm getting Parse Failure [Could not find aggregator type [reverse_nested] I was following the example here: https://github.com/elasticsearch/elasticsearch/issues/5485, the issue seems to be tagged for 1.2.0

New Elasticsearch Discovery Plugin

2014-05-13 Thread StarDrek59
I just start using Elasticsearch and analyze its source code. I need to modify its discovery system by registering the nodes on a kind of database. So I suppose that it's easier to develop a plugin instead of modifying the source itself. But after reading some discovery plugins like the basic on

Re: Shard Initialization slow down

2014-05-13 Thread Paul
Thanks Jörg, we've heard of others pre-creating indices, we were seeing it as a work around rather than a regular action but what you say makes it seem like something we should work with. On Tuesday, May 13, 2014 12:13:10 PM UTC+1, Jörg Prante wrote: > > You should create indexes before bulk in

Re: Newbie, basic search help

2014-05-13 Thread Kruti Shukla
I have problem with plurals and want to more relevance in the search. please look at my question here: https://groups.google.com/d/msg/elasticsearch/8yjfx2HLelc/2bEuar6NT9YJ. Thank you. Please help or suggest. Thank you for your time. On Tuesday, September 3, 2013 8:54:41 AM UTC-4, MArk Willia

Re: Return result same score problem, any option to re-sequence return result order?

2014-05-13 Thread Kruti Shukla
one more question I have posted before few weeks but no suggestion worked: https://groups.google.com/forum/#!searchin/elasticsearch/plural/elasticsearch/ET-S3SCD22I/zFyErGO4SRcJ On Tuesday, May 13, 2014 8:22:55 AM UTC-4, Kruti Shukla wrote: > > Hi All, > I want to search document in following or

Return result same score problem, any option to re-sequence return result order?

2014-05-13 Thread Kruti Shukla
Hi All, I want to search document in following order --> 1. men's foil shaver 2. men's foils shaver 3. men's foil advanced shaver I tried multi-field option while mapping the field and mapped it with different filters and analyzers like one index with "not_analyzed",one with "snow ball analyzer

Re: Understanding date format issue

2014-05-13 Thread Karan singh
Any help here please?? On Tuesday, April 29, 2014 7:04:41 PM UTC+5:30, Karan singh wrote: > > Hi, > > Here are my version details: > > ES 1.0.0, MongoDB 2.4.9, mongo-river 2.0.0(with the driver that is > required for the river plugin) > > I have a river with mongo which fetches events from the re

Re: Marvel Indices taking lot of space ? Can we specify automatic delete of marvel indice ?

2014-05-13 Thread deepakas
Hi Ivan/Boaz, Thanks for the responses. I will checkout Curator. I already got the latest version of marvel last week using this command. bin/plugin -i elasticsearch/marvel/latest Thanks, Deepak -- View this message in context: http://elasticsearch-users.115913.n3.nabble.com/Marvel-Indices-

Re: Shard Initialization slow down

2014-05-13 Thread joergpra...@gmail.com
You should create indexes before bulk indexing. First, bulk indexing works much better if all indices and their mappings are already present, the operations will run faster and without conflicts, and the cluster state updates are less frequent which reduces some noise and hiccups. Second, setting t

Re: Shard Initialization slow down

2014-05-13 Thread Paul
This looks very interesting, thanks. On Tuesday, May 13, 2014 11:38:27 AM UTC+1, Mark Harwood wrote: > > This API should give an indication on any backlog in processing the > cluster state: > http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/cluster-pending.html > > > > On T

Function score query does NOT replace boost field well - how to access child's fields?!

2014-05-13 Thread Paweł Młynarczyk
Hello Boost field is deprecated since ES 1.0.0 so I've decided to change it to the function score query as suggested in the docs. So I tried to change this: https://gist.github.com/zwrss/aeaf2828f6dd35a1e888#file-boost-field Into this: https://gist.github.com/zwrss/aeaf2828f6dd35a1e888#file-

Re: Shard Initialization slow down

2014-05-13 Thread Paul
Thanks Mark, we'll have a look at the available metrics. On Tuesday, May 13, 2014 11:34:51 AM UTC+1, Mark Walkom wrote: > > You will want to obtain Marvel ( > http://www.elasticsearch.org/guide/en/marvel/current/) and then wait till > you have a history and start digging. > > Regards, > Mark W

Re: Shard Initialization slow down

2014-05-13 Thread Mark Harwood
This API should give an indication on any backlog in processing the cluster state: http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/cluster-pending.html On Tuesday, May 13, 2014 11:29:20 AM UTC+1, Paul wrote: > > Ok, do you know if there are clear indicators when limits ar

Re: Shard Initialization slow down

2014-05-13 Thread Mark Walkom
You will want to obtain Marvel ( http://www.elasticsearch.org/guide/en/marvel/current/) and then wait till you have a history and start digging. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 13 May 2014 20:29, Paul

Re: Shard Initialization slow down

2014-05-13 Thread Paul
Ok, do you know if there are clear indicators when limits are being reached? We don't see errors in the logs (apart from the 30s timeout) but if there are system or ES provided metrics that we can track to know when we need to scale it would be really useful. Thanks, Paul. On Tuesday, Ma

Re: Shard Initialization slow down

2014-05-13 Thread Mark Walkom
Empty or not, there is still metadata that ES needs to maintain in the cluster state. So the more indexes you have open the bigger that is and the more resources required to track it. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignm

Re: Shard Initialization slow down

2014-05-13 Thread Paul
In testing and replicating the issue, this slow down has been seen occurring with empty indices. The running cluster is at present ~100 GB across 2,200 Indices with a total of 13,500 shards and ~430,000,000 documents. We have 7GB RAM and 5GB heap on the data nodes - haven't looked overly care

context suggester

2014-05-13 Thread Jilles van Gurp
Hi, I could really use the new context suggester, which I understand is coming with 1.2. I'm planning to use it for user specific suggestions on e.g. tags and names. Is there any ETA on when 1.2 is going to happen (roughly)? I saw that Lucene 4.8 landed on master but not yet on the 1.1 branch,

Suggest Query with additional condition

2014-05-13 Thread Raghavendar T S
Hi I am trying to implement auto complete feature with suggest query on a field which I am able to do. Further I need to implement suggest query along with another condition which I am not able to do. Can any one guide me on this? Thank you -- You received this message because you are subscri

Re: Shard Initialization slow down

2014-05-13 Thread Mark Walkom
Sounds like the inevitable "add more nodes" situation. How much RAM on each node, how big is your data set? Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 13 May 2014 19:59, Paul wrote: > We are seeing a slow dow

Shard Initialization slow down

2014-05-13 Thread Paul
We are seeing a slow down in shard initialization speed as the number of shards/indices grows in our cluster. With 0-100's of indices/shards existing in the cluster a new bulk creation of indices up the 100's at a time is fine, we see them pass through the states and get a green cluster in a re

Re: unable to write data to elasticsearch using hadoop PIG

2014-05-13 Thread Costin Leau
I would recommend upgrading to the latest es-hadoop, 2.0 RC1. Also considering reading [1] Hope this helps, [1] http://www.elasticsearch.org/guide/en/elasticsearch/hadoop/current/troubleshooting.html On 5/13/14 1:20 AM, hanine haninne wrote: thank you so much for your quick reply, Here is what

Re: ElasticSearch: restrict results but not facet counts

2014-05-13 Thread David Pilato
Hey Felix, You need to add your second filter as a "post_filter": http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-request-post-filter.html#search-request-post-filter HTH --  David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr Le 13 m

Plsease explain me how cache works in ES

2014-05-13 Thread Marcin Ciurla
I got pretty simple question, but i can'f find and answer rly. After i index document in ES it wil be commited to index in about 1 second. What will happen to my cache filled with filter query ealier. It will be cleared to give me most current response? In solr when i index and commit new docume

Re: Default Routing and Get API performance

2014-05-13 Thread tai khuu
Please some one help :(, any idea? On Sunday, May 11, 2014 5:52:56 PM UTC+7, tai khuu wrote: > > Hi All, > > I'm new to elasticsearch and still on the way to learn more , recently i > have hit a question that can't be found on google :D, the scenario is I > have a list of 1000 document id , the

Search in Multiple Index

2014-05-13 Thread dharmendra pratap singh
Hi Team, Hope you all are doing good. I stuck with a question that If I want to search for a perticular keywotd in many indexes (not from a specififc index), then what I need to do??? Looking for some quick suggessions. Regards Dharmendra -- You received this message because you are subscribe

Re: Improving Elasticsearch performance on a single node by increasing shards

2014-05-13 Thread Rujuta Deshpande
We plan to store data for only about 3- 6 months and hence, we thought this configuration might be okay. A couple of simultaneous Kibana dashboard queries (mainly to generate histograms) resulted in the system load reaching 10. This happened owing to a large number of disk I/O operations.

ElasticSearch: restrict results but not facet counts

2014-05-13 Thread Felix Schwarz
I'm struggling writing an ElasticSearch query with facet counts. This is very likely a simple question but I'm confused by all the facet filters and filter facets. The query itself needs two conditions: - a condition which restricts the document set, excluded documents should not be used for

Re: Improving Elasticsearch performance on a single node by increasing shards

2014-05-13 Thread joergpra...@gmail.com
Elasticsearch is using all cores by default. If you do not see 100% of CPU use, this is no reason to worry. 100% CPU would signal bad programming style (this would be a bug). You should watch the system load. If system load is low, you have either not enough query load, or your configuration preven

Clear cache on demand and Circuit breaker Problem

2014-05-13 Thread Thomas
Hi, I'm trying to get some aggregated information by Querying Elasticsearch via my app. What I notice is that after some time I get a CircuitBreaker exception and my query fails. I can assume that I load too many fielddata and eventually the CircuitBreaker stops my query. Inside my application

Re: Improving Elasticsearch performance on a single node by increasing shards

2014-05-13 Thread Mark Walkom
Given you're only on one server you are limited with what you can do. You'd be better off adding another node if you can, maybe someone else can comment on the rest. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 13