It's normal to see 40-60% deleted docs if you frequently update existing documents. See this recent blog post I wrote for some details: http://www.elasticsearch.org/blog/lucenes-handling-of-deleted-documents/
Mike McCandless http://blog.mikemccandless.com On Sun, Feb 1, 2015 at 3:50 PM, Mark Walkom <markwal...@gmail.com> wrote: > If you're updating single documents often then you should expect high > delete rates, your heap and CPU use seems to be ok though so it's not > anything to be super concerned with (for now). You do have the option of > forcing an optimise (which does a merge and removes deleted docs), but this > is resource intensive so be careful of when you run it, ie it'd be best to > schedule it to run when your users are asleep. > > Your main indices seem to only have 2 shards, with no replicas. You should > really have one shard per node, with one replica to give you some > redundancy for a total of 6 shards, 3 primary, 3 replica. Currently you > will have an imbalance of data, which means some nodes are more overloaded > than others, plus without replicas you cannot survive the loss of a node. > > Without knowing your use, it also looks like something is creating indices > that shouldn't be there - eg phppath, cgi-bin, cfg, phpmyadmin - which > doesn't look like a problem at the moment, but is something you should > check, out if they shouldn't be there that is). > > Finally, if you want better performance, move to Oracle JDK and then > upgrade to the latest release (1.4.3). There is a lot of improvements in > the latest versions, though I cannot comment on the Groovy side of your > question. > > On 2 February 2015 at 06:31, Ricardo Fiel <ricardo.f...@rupeal.com> wrote: > >> Hi, >> >> We've been using ElasticSearch (through qBox) for almost 3 months now, >> but something's very wrong and we need help on this one. I'll try to give >> as much context as possible. >> >> Last month we had a major downtime (almost 24hr). We were using a single >> node with 2GB and after this downtime we upgraded to a 3 node 4GB setup. We >> are online again, but our Documents Deleted values are still way too high >> (40-50% and more) and they never go down, which leaves us uncomfortable. >> We've contacted qBox, which are very heplful, but they always say we need a >> bigger node because of that. It's really not an option, as hosting costs of >> our product skyrocketed with this. >> >> Our product is an invoicing platform. Our text search, filters and all >> invoice listings now get all their data from elastic instead of MySQL, so >> we can't rely (we think) on bulk indexing, as the invoices, items, clients, >> etc.. need to be indexed asap to be shown in listings. Maybe we have a >> design issue and shouldn't be using elastic for so many things, it's >> something we need to understand better. >> >> At the moment, our main concern in making sure we're ok in the long run >> and avoid another huge downtime at all costs. >> >> Can someone help in some way? We'd be very grateful. >> >> Some more info: >> >> - We're still not using round robbin. Could this help? >> - Version 1.2.4. It's on our roadmap to upgrade but we've heard >> stories of worst performance in some cases (groovy-related). >> - I've added some screens with node info from qbox. >> >> Thanks, >> Ricardo >> >> >> >> >> >> >> -- >> You received this message because you are subscribed to the Google Groups >> "elasticsearch" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to elasticsearch+unsubscr...@googlegroups.com. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/elasticsearch/eca7aa05-fda6-4c21-9fff-dd418d6ae2cb%40googlegroups.com >> <https://groups.google.com/d/msgid/elasticsearch/eca7aa05-fda6-4c21-9fff-dd418d6ae2cb%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > > -- > You received this message because you are subscribed to the Google Groups > "elasticsearch" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to elasticsearch+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/elasticsearch/CAEYi1X-L-C%3DSfNjRN6559%3DGsQ90qY_HsugOmV-i2Di3vOf8r1w%40mail.gmail.com > <https://groups.google.com/d/msgid/elasticsearch/CAEYi1X-L-C%3DSfNjRN6559%3DGsQ90qY_HsugOmV-i2Di3vOf8r1w%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAD7smRcJKMYKg-OX2CkpGezF9VDNYpXOE0mvewFU112j1yXSNg%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.