UnsupportedOperationException raised during "DFS query then fetch" with range query in rescorer

2014-10-11 Thread g . dhordain
Hi, When I try to run a query with a range query rescorer, Elasticsearch (version 1.3.2) raises a java.lang.UnsupportedOperationException in org.apache.lucene.search.Query.extractTerms. The exact class raising the exception was NumericRangeQuery in my test. After some investigation, I fo

Indexing stops when too many unassigned shards

2014-10-11 Thread Veselin Kantsev
Hello, I have a test cluster of 2 ES nodes (Elasticsearch 1.1.1-1). I've noticed that whenever the number of unassigned shards increases past a threshold, the cluster stops accepting Write operations. *Example1*: nodes: 2 primary shards: 6 replicas: 1 The cluster works as expected. *Example

Re: Cluster discovery on Amazon EC2 problem - need urgent help

2014-10-11 Thread Zoran Jeremic
Hi Norberto, Thank you for your advice. This is really helpful, since I have never used Elasticsearch in a cluster before, and have never gone live with a number of users. My previous experience was with a single ES node and a very small number of users, so I'm still concerned about how this will work. Th

Re: using a different analyzer for each query and same index

2014-10-11 Thread Aris Alexis
bump On Wednesday, October 8, 2014 4:16:42 PM UTC+2, Aris Alexis wrote: > > Hi, > > first of all I am a noob with elastic so bear with me. > > I am interested to know if this scenario is possible: > > A user posts some text , my app detects the language and index the text > with the spanish analy

skip every n hits

2014-10-11 Thread Ted Smith
Hello: My query would return 1 hits sorted. I'd like the client app to skip 100 hits before retrieving the next hit. For example, retrieve only the 1st, 101st, 201st, 301st, ... 901st. I'd like this to be done on the server side so the server will only return the needed hits, instead of returning all 100

Re: How to extend TermQuery to ''_all" field

2014-10-11 Thread xiehaiwei
Hi, The copy_to parameter can work in my case. Thanks. From the Lucene mailing list: > 'Internally this is indexing every field a second time into the "_all" field.' > This sentence means the second indexing has a totally different analyzer and > indexing

forcing index to be created/realocated in/to specific data dir

2014-10-11 Thread Alexander Piavlo
I have ES nodes in EC2 with both local ephemeral SSD disks and large EBS disks. New indexes are created on a daily basis. I want the current day's index to be created on SSD disks with number_of_replicas = 1. Once the day ends I want to move the index to EBS disks and change the number_of_replicas t

Re: environment variable values .

2014-10-11 Thread Mark Walkom
Only set ES_HEAP_SIZE, there is no point setting all 3 separately. Best practise is to use 50% of system RAM, in your case that'd be 15GB. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 11 October 2014 20:20, Deepak R
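
As a rough illustration of the 50% guideline above, here is a minimal sketch of the arithmetic. The helper name `suggested_heap_gb` and the `fraction` parameter are hypothetical and only for illustration; `ES_HEAP_SIZE` is the variable named in the thread.

```python
def suggested_heap_gb(system_ram_gb: float, fraction: float = 0.5) -> float:
    """Return the heap size in GB as a fraction of system RAM (hypothetical helper)."""
    return system_ram_gb * fraction


# The thread's server has 30 GB of RAM, so half of it goes to the heap.
ram_gb = 30
heap_gb = suggested_heap_gb(ram_gb)

# Print the export line one might place in the environment before starting ES.
print(f"export ES_HEAP_SIZE={int(heap_gb)}g")  # prints: export ES_HEAP_SIZE=15g
```

The remaining RAM is left to the operating system, which is the reasoning usually given for not assigning all memory to the heap.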

environment variable values .

2014-10-11 Thread Deepak R K
Hi All, We are using Elasticsearch in production. We are currently planning to migrate to a bigger (30 GB RAM / 8 core CPU) server. Can anyone suggest the best values for ES_HEAP_SIZE, ES_MIN_MEM, and ES_MAX_MEM if RAM is 30 GB? Thanks, Deepak

Re: Optimal usage of big server (use VMs or not ...)

2014-10-11 Thread Mark Walkom
We use VMs as we knew the tech; we did consider containers (e.g. Docker), but they were still new to us and considered a risk. Running multiple instances on a single node can work, you just need to be able to manage it. The Elasticsearch puppet module lets you do this though. Regards, Mark Walkom In

Re: Cluster Configuration.

2014-10-11 Thread Mark Walkom
You don't store data in memory in ES, it is only pulled into memory when querying. For your queries, check out http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-request-preference.html Not sure what you are thinking about DR here, but if the primary shard disappears (ie

Re: 1 cluster, multiple replicas with different synch settings

2014-10-11 Thread Mark Walkom
A and B should be ok, not sure on C as I believe that a/sync is on an index level and would apply to any and all replicas. http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/docs-index_.html#index-replication Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email:

Re: Performance with an arbitrary number of indicies

2014-10-11 Thread Mark Walkom
Why not use aliases? This way you can move larger customers to their own index if need be. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 11 October 2014 01:54, jnortey wrote: > Lets say that I was providing a ser

Re: Bulk insert vs Single insert

2014-10-11 Thread Mark Walkom
From what I understand (which may not be 100% right), most of the overhead is with generating and dealing with the HTTP request as it's a heavy operation. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 11 October 20

Re: How many shards is to many shards per server on SSD?

2014-10-11 Thread joergpra...@gmail.com
RAM is cheap and the best way to achieve the highest performance in ES. You can try search benchmarks on ES clusters with > 8 GB RAM and > 4 GB heap plus mlockall. With either SSD or HDD you will see only a tiny difference in response times after warmup. If you are only after search performance, the

Re: Email notification on ERROR in log

2014-10-11 Thread joergpra...@gmail.com
Have you tried the log4j SMTP appender? Regarding the JDBC exception, I will add documentation on how to set up log4j2 http://logging.apache.org/log4j/2.0/manual/appenders.html#SMTPAppender in the next version of the JDBC river/plugin. Jörg On Sat, Oct 11, 2014 at 8:49 AM, Prasad Lele wrote: > Hi frnds,

Re: Optimal usage of big server (use VMs or not ...)

2014-10-11 Thread knacktus
A third option has come to my mind: How about starting several nodes on one server? Any known disadvantages for this option compared to virtual machines? Cheers, Jan