How to use scripts in an integration Test

2014-07-25 Thread Nick T
Hello, I've got a native script written in Java and I would like to test this when my end-to-end integration tests run. The problem is that I do not know how to register the script with my test node. In my application a test node is created with a temporary directory in /tmp. It will be

rest api or java client?

2014-07-25 Thread CB
hi all, i'm new to elastic search and would like to ask some basic questions. we are developing a system based on the play framework (non blocking io, event loop, scala) we are currently working with elastic search through the rest api which is working ok in dev. we are concerned about

Re: Deploying ElasticSearch 1.3 to Wildfly 8.1

2014-07-25 Thread joergpra...@gmail.com
You have two choices: - drop ASM and MVEL, switch to Groovy for scripting - use TransportClient to avoid ES-cluster-side services like scripting in Wildfly deployment (this is how I do it currently) Jörg On Fri, Jul 25, 2014 at 3:30 AM, John D. Ament john.d.am...@gmail.com wrote: Hi all I

Re: Deploying ElasticSearch 1.3 to Wildfly 8.1

2014-07-25 Thread joergpra...@gmail.com
As a side note, I run Wildfly 8.1 on Java 8. ASM 3 does not work with Java 8, so I wonder how things can work out at all, even without ES. If possible, ASM 3 should be replaced by ASM 5.0 Jörg On Fri, Jul 25, 2014 at 9:02 AM, joergpra...@gmail.com joergpra...@gmail.com wrote: You have two

I have a doubt at how to avoiding type gotchas

2014-07-25 Thread xu piao
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/mapping.html#_avoiding_type_gotchas at above guidence it say i can use _type + field name to avoid analyzer conflicts. but i have try many times all is no use. i have a index with two type is group and user which all have

Re: Can I change type of current master node and how to increase performance for my Elasticsearch cluster

2014-07-25 Thread Warat Wongmaneekit
Thank you so much if we have data like this how many node we have to provided? On Friday, July 25, 2014 2:18:58 PM UTC+7, Warat Wongmaneekit wrote: We have 3 nodes of elastic search all nodes is mater=true and data=true. When we try to find the best practice almost is recommend to setup 1

Re: rest api or java client?

2014-07-25 Thread joergpra...@gmail.com
1. No. ES is already managing connections, see TransportClient 2. REST API sits on top of native Java client. So, because of HTTP, you have overhead with REST. Async call API with HTTP is a mess. 3. All actions are routed automatically to the relevant shards only, no matter what client. 4.

Default script language

2014-07-25 Thread Jettro Coenradie
Hi, I was playing around with the new scripting support in 1.3. In the documentation is stated that groovy is now the default language. This is not true for all situations. The following query for instance is returning a message about the use of mvel. I have to explicitly set the lang to

4 unassigned noeds how to remove

2014-07-25 Thread Warat Wongmaneekit
My cluster are have 4 unassigned nodes and is alway save nodes how to resolve it I try to optimize, flush, but unassigned shard still 4. do you have any solution for fix it. *Cluster health* { - cluster_name: zocialinc, - status: red, - timed_out: false, - number_of_nodes: 3,

Re: Elasticsearch 1.3 Transform Scripts

2014-07-25 Thread joergpra...@gmail.com
Yeah, I use Groovy for input data transformations for years, so I'm very enthusiastic about ES embracing Groovy as default. I use Groovy scripts before the source is passed to the bulk client, in a key/value stream processing. It's good to see transformation machinery in the ES mapping, but I do

Re: Default script language

2014-07-25 Thread David Pilato
Thanks. Tested. Issue opened here:  https://github.com/elasticsearch/elasticsearch/issues/7029 --  David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr Le 25 juillet 2014 à 10:05:43, Jettro Coenradie (jettro.coenra...@gmail.com) a écrit: Hi, I was playing around

Re: Default script language

2014-07-25 Thread David Pilato
Actually mvel is still the default for 1.3.0. groovy will be the default in 1.4.0. We will fix the doc. --  David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr Le 25 juillet 2014 à 10:27:44, David Pilato (da...@pilato.fr) a écrit: Thanks. Tested. Issue opened

Re: Multimaster architecture

2014-07-25 Thread Vadim Lungu
Thank you Mark for your reply, If I understood write, in case when all my cluster nodes are alive i will have: master1 shard + master2 shards + master3 shard = 7 + 6 + 5 = 18 shard. If master 1 die I will have master1 shard + master2 shards + master3 shard = replica of 7 shards + 6 + 5 = 18

Re: rest api or java client?

2014-07-25 Thread David Pilato
3. All actions are routed automatically to the relevant shards only, no matter what client. Just a comment about this. If you are using a TransportClient, the transport client won't try to reach directly the right shard. It will simply direct the request to a node which is one of the nodes it

Inconsistency between GET and POST searches

2014-07-25 Thread Seb Bacon
Hi, I've got a search query which fails with CircuitBreakingException: Data too large when POSTed, but succeeds when the identical query is sent as a GET (with the json in the query string). The search query itself may be buggy, as far as I can tell (the size parameter is in the wrong place).

How to solve failed to merge org.apache.lucene.store.AlreadyClosedException: this Directory is closed

2014-07-25 Thread Warat Wongmaneekit
I saw a lot of error log and we need to fix this problem but we don't know solution please help us. *Example log* [2014-07-25 06:00:23,366][WARN ][cluster.action.shard ] [Host88] [instagram][26] sending failed shard for [instagram][26], node[sQEMBQGMT1eyDrPwF-xGeA], [P], s[STARTED],

Re: rest api or java client?

2014-07-25 Thread CB
thanks for the answers, here are my thoughts: 1. If using pure REST client - Using a Load Balancer will make sure that the endpoint address goes to any of the live nodes (round robin) so that if one of those nodes dies or if I scale out the cluster (add more nodes) it is transparent to the

Search across types in query body

2014-07-25 Thread Vacelet, Manuel
Hi there, I'm playing with search across types with queries of form: curl -XPOST 'localhost:9200/trk/110,112/_search ... And I'm wondering if there is a way to specify the types I want to search on in the query body. Like: curl -XPOST 'localhost:9200/trk/_search -d '{ ... }' It seems that the

Re: Inconsistency between GET and POST searches

2014-07-25 Thread Seb Bacon
OK, so it turns out the GET version just wasn't getting parsed at all. curl -XPOST -G http://localhost:9200/bork/user/_search -d ' something-nonsense' Always returns everything; the parameters have to be in the form key=val when in the URL. The docs do already say that; I was being

Configuration advice

2014-07-25 Thread Wouter van Atteveldt
Dear elastic users, We use elastic for a document store containing around 100M documents in 200G. We used to run a single elastic node on the same server that functioned as the web server, but this caused a lot of performance issues (and the need of re-indexing all data in case of failure).

Aggregation on parent/child documents

2014-07-25 Thread Thomas
Hi, I wanted to ask whether is possible to perform aggregations combining parent/child documents, something similar with the nested aggregation and the reverse nested aggregation. It would be very helpful to have the ability to create for instance buckets based on parent document fields and

Re: Inconsistency between GET and POST searches

2014-07-25 Thread Seb Bacon
It turns out that computing this facet only takes about 12MB, but the fielddata cache was completely full. Restarting the nodes emptied the cache, and everything started working again. I note that there's a setting: indices.fielddata.cache.expire Which is off by default. I guess I need to

Re: Configuration advice

2014-07-25 Thread Wouter van Atteveldt
Dear Mark, others, Thanks for the advice! I do have some more questions I'm afraid... On Friday, July 25, 2014 1:30:07 PM UTC+2, Mark Walkom wrote: I'd go with option 3, but make all 3 nodes master eligable. That way you prevent complete loss of the cluster and protect against split brain.

Re: Geo bounds aggregation as a sub aggregator?

2014-07-25 Thread Colin Goodheart-Smithe
Could you post the full stack trace from the elasticsearch log file? This will help in working out where the query falls over Thanks On Friday, 25 July 2014 01:18:49 UTC+1, svartalf wrote: It is really odd, but it failed for me on the precision levels 6 and 7 with a

Where is all my memory? Or how to estimate better

2014-07-25 Thread Daniel Schonfeld
Hello, So I've replaced some machines we had with more memory and it seems to have helped but I'm starting to see some high memory usage on one of the machines. What I don't understand and can't figure out is where that memory is being used? We have 26GB RAM, allocating 13.5G for the

Re: Where is all my memory? Or how to estimate better

2014-07-25 Thread joergpra...@gmail.com
What OS / version is this? Jörg On Fri, Jul 25, 2014 at 4:10 PM, Daniel Schonfeld downwindab...@gmail.com wrote: Hello, So I've replaced some machines we had with more memory and it seems to have helped but I'm starting to see some high memory usage on one of the machines. What I don't

[ANN] Elasticsearch Mapper Attachment plugin 2.2.0 released

2014-07-25 Thread Elasticsearch Team
Heya, We are pleased to announce the release of the Elasticsearch Mapper Attachment plugin, version 2.2.0. The mapper attachments plugin adds the attachment type to Elasticsearch using Apache Tika.. https://github.com/elasticsearch/elasticsearch-mapper-attachments/ Release Notes -

Kibana and histograms not based on dates?

2014-07-25 Thread Chris Neal
Hi all, I was trying to build a histogram that shows each day's change based on a numeric X axis rather than a date. From what I can see, the histogram in Kibana is based only on date, so X axis can only be date or time. There are two facets of histogram in ES

Re: Auto completion using completion suggester

2014-07-25 Thread Anand kumar
Hi Chris, Thanks a lot for your reply, but the test scenario which you've mentioned in the post cant be accessed. Could you please help me out? On Thu, Jul 24, 2014 at 2:47 PM, Christian Rütgers christian.ruetg...@grintsch.com wrote: Dear Anand, I really appreciate your question because I

Re: Aggregation on parent/child documents

2014-07-25 Thread Adrien Grand
Hi Thomas, None of the aggregations that we have today can leverage parent/child relations. However, there is a `children` aggregation in the pipeline: https://github.com/elasticsearch/elasticsearch/pull/6936 On Fri, Jul 25, 2014 at 1:54 PM, Thomas thomas.bo...@gmail.com wrote: Hi, I wanted

Re: I have a doubt at how to avoiding type gotchas

2014-07-25 Thread Adrien Grand
Hi, The issue is that the `region` field in `group` is indexed with positional data, while the `region` field in `user` is indexed without. This is something that Elasticsearch cannot deal with. You should try to give the `region` field different names in `group` and `user`. On Fri, Jul 25,

Re: Where is all my memory? Or how to estimate better

2014-07-25 Thread Daniel Schonfeld
CoreOS, 367.1.0. Docker 1.0.1 and ES 1.2.2 I don't know if it matters but we are using this plugin which is not written by elasticsearch: https://github.com/Asquera/elasticsearch-http-basic On Friday, July 25, 2014 11:02:40 AM UTC-4, Jörg Prante wrote: What OS / version is this? Jörg

Re: Where is all my memory? Or how to estimate better

2014-07-25 Thread Daniel Schonfeld
I should also mention that machine which is now low on memory and how gotten worse, is suffering form high CPU usage too which comes right on time with the memory getting low. It's a 4 core machine. Google compute engine - n1-highmem-4. On Friday, July 25, 2014 1:19:24 PM UTC-4, Daniel

Re: Aggregation on parent/child documents

2014-07-25 Thread Thomas
Hi Adrien and thank you for the reply, This is exactly what i had in mind alongside with the reversed search equivalent with the reverse_nested, this is planed for version 1.4.0 onwards as i see, will keep track of any updates on this, thanks Thomas On Friday, 25 July 2014 14:54:50 UTC+3,

cluster.routing.allocation.awareness.attributes not working as documented

2014-07-25 Thread Yongtao You
Hi, I was experimenting with the cluster.routing.allocation.awareness.attributes setting. I started with one data node in the cluster, with allocation awareness attribute set to value1. I expect that all the shards will be allocated to this node. Instead, only primary shards are allocated to

Re: no clustering on ec2 due to ClassNotFoundException[org.elasticsearch.gateway.blobstore.BlobStoreGatewayModule]

2014-07-25 Thread Anthony Oleary
hi David May I ask, if i want to use opsworks, copy my ec2 image onto opswork , and auto scale, how can i specify a master node or is it automatic? anthony On Thursday, July 24, 2014 5:36:51 PM UTC+1, David Pilato wrote: It's a matter of EC2 configuration. Elasticsearch does not really

Kibana Not Display GELF Fields

2014-07-25 Thread Didjit
Folks, I'm stumped. Here is what I have: NXLOG (gelp out)-Logstash (gelf in)-Elasticsearch-Kibana This is pushing to a new index ([mynewindex-]-.mm.dd). Logstash shows it has all the fields parsed in the logs and Elasticsearch has docs listed. Even in In the default Kibana UI for Logstash,

Re: rest api or java client?

2014-07-25 Thread Ivan Brusic
Answers inline. On Fri, Jul 25, 2014 at 3:06 AM, CB chen.be...@gmail.com wrote: thanks for the answers, here are my thoughts: 1. If using pure REST client - Using a Load Balancer will make sure that the endpoint address goes to any of the live nodes (round robin) so that if one of those

Re: no clustering on ec2 due to ClassNotFoundException[org.elasticsearch.gateway.blobstore.BlobStoreGatewayModule]

2014-07-25 Thread David Pilato
Automatic master node election. HTH -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 25 juil. 2014 à 20:07, Anthony Oleary anthony.ole...@kweekweek.com a écrit : hi David May I ask, if i want to use opsworks, copy my ec2 image onto opswork , and auto scale, how can i

Re: elasticsearch-hadoop: bulk indexing JSON

2014-07-25 Thread M_20
Hi Guys, Could you please give me a java sample code of mapper and reducer in Elasticsearch-hadoop? I'd appreciate it. Thanks -- You received this message because you are subscribed to the Google Groups elasticsearch group. To unsubscribe from this group and stop receiving emails from it,

Re: elasticsearch-hadoop: bulk indexing JSON

2014-07-25 Thread Costin Leau
Have you looked at the docs? http://www.elasticsearch.org/guide/en/elasticsearch/hadoop/current/mapreduce.html On Fri, Jul 25, 2014 at 11:04 PM, M_20 rastegar...@gmail.com wrote: Hi Guys, Could you please give me a java sample code of mapper and reducer in Elasticsearch-hadoop? I'd

Re: need help for Hadoop and ES integration

2014-07-25 Thread M_20
Hi Chetana, Could you please share with me a java sample code of Map/reduce on Elastcisearch-Hadoop? Regards On Thursday, April 3, 2014 4:36:24 AM UTC-5, Chetana wrote: I have downloaded elasticsearch-hadoop-1.2.0.jar from github and trying to search. The code to search looks like

Re: need help for Hadoop and ES integration

2014-07-25 Thread Costin Leau
M_20, I've already replied to your initial query on where you can find some example - the official docs: http://www.elasticsearch.org/guide/en/elasticsearch/hadoop/current/mapreduce.html There's also google which points to other resources outside Elasticsearch. On Fri, Jul 25, 2014 at 11:10 PM,

Re: need help for Hadoop and ES integration

2014-07-25 Thread M_20
Costin, Thank you for your reply. I've read the official docs before. But it seems I am missing something. So, I wanted to see a complete example to make sure I am understanding ES-hadoop correctly. For example, in the official docs, it talks about writing data to ES, and this is the mapper

Schema Update, replication

2014-07-25 Thread Timothy Dilbert
Hello, I am very new to ElasticSearch so have a quick question re: schemas updating and the effect it has on replication. Let's say you have an index filled with a number of json documents and one of the fields in the json document is a blob. Now you perform an update on the json document and

Search by cardinality of a field

2014-07-25 Thread Eric Brunson
I have a doc type which includes a field that is a list of strings. I'd like to query/filter based on the number of items in the list, either exactly equal to n or greater than/less than. Is that possible? I haven't found anything in the Query DSL that seems to lend itself to that. Thanks!

[ANN] Elasticsearch Mapper Attachment plugin 2.3.0 released

2014-07-25 Thread Elasticsearch Team
Heya, We are pleased to announce the release of the Elasticsearch Mapper Attachment plugin, version 2.3.0. The mapper attachments plugin adds the attachment type to Elasticsearch using Apache Tika.. https://github.com/elasticsearch/elasticsearch-mapper-attachments/ Release Notes -

Re: Numerical range in Kibana 3 not working

2014-07-25 Thread deepak deore
By default logstash takes all input as a string, added :int to the fields which i wanted as number. eg. %{NUMBER:apache_bytes:int} %{NUMBER:apache_response_time:int} Ref: https://groups.google.com/forum/#!topic/logstash-users/2ewrcovttSY On Tuesday, 22 July 2014 09:24:20 UTC-7, deepak deore

Re: cluster.routing.allocation.awareness.attributes not working as documented

2014-07-25 Thread Mark Walkom
Take a look at the example here http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/modules-cluster.html#allocation-awareness Basically a shard and it's replica will never be allocated to the same instance, which is what you are seeing. Regards, Mark Walkom Infrastructure

Sorting Problem, ClassCastException.

2014-07-25 Thread Curt Hu
In my document I have field called created like 2014-07-25T06:02:00, the mappings is: created: { format: dateOptionalTime type: date } Here with the range query on that is fine, but why sort on that field I got: { error: ReduceSearchPhaseException[Failed to execute phase [query], [reduce] ];

Use java Api to set a document's field as _id

2014-07-25 Thread Chia-Eng Chang
I want to ask if the unique field _id be assigned by certain field within document. I see with Rest, it can achieve by path: { tweet : { _id : { path : post_id } } } But if I want to do it with java API, is there any way to achieve

Re: cluster.routing.allocation.awareness.attributes not working as documented

2014-07-25 Thread Yongtao You
Hmm... I'm reading the same document and came to the opposite conclusion. :( My understanding is that if all the nodes have the same attribute, then _all_ shards will be deployed among them. As soon as a node with different attribute arrives, that's when the replica will not be deployed to

Re: prevent 'match_phrase' from evaluating score

2014-07-25 Thread Ivan Brusic
Since scoring is not important, can't you simply sort against one of the fields? -- Ivan On Thu, Jul 24, 2014 at 10:59 PM, 陳智清 walker0...@gmail.com wrote: The only solution I could come out is also to modify the source code. I just think it would be nice if there is a existing flag or

Register snapshot repositories via config file?

2014-07-25 Thread Eric Jain
Would it make sense to allow snapshot repositories to be registered via the config file? The docs have an example, but it's for running the tests only. repositories: s3: bucket: bucket_name region: us-west-2 -- You received this message because you are subscribed to the

Re: Is this link still applicable on EC2? http://www.elasticsearch.org/tutorials/elasticsearch-on-ec2/

2014-07-25 Thread Eric Jain
The S3 Gateway has been dropped, so you'll either need to use EBS, or set up some mechanism to do snapshots to S3. Other than that, no major changes. On Thursday, July 24, 2014 10:12:25 PM UTC-7, vjbangis wrote: Hi, Is this link

Re: cluster.routing.allocation.awareness.attributes not working as documented

2014-07-25 Thread Mark Walkom
If you only have one node the replicas will never be assigned. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 26 July 2014 09:23, Yongtao You yongtao@gmail.com wrote: Hmm... I'm reading the same document and

Re: Deploying ElasticSearch 1.3 to Wildfly 8.1

2014-07-25 Thread John D. Ament
I can try upgrading ASM locally to see if it works. I'm assuming it's backwards compatible? On Friday, July 25, 2014 3:06:13 AM UTC-4, Jörg Prante wrote: As a side note, I run Wildfly 8.1 on Java 8. ASM 3 does not work with Java 8, so I wonder how things can work out at all, even without

Re: n:m lookup filter

2014-07-25 Thread Don Clore
Does anyone know the status of that pull request? Is it likely to be approved? thanks, Don On Saturday, July 19, 2014 12:14:01 AM UTC-7, Jörg Prante wrote: Yes, I think this is somehow related to Matt's Join Filter https://github.com/elasticsearch/elasticsearch/pull/3278 Jörg On

Parent child relationships

2014-07-25 Thread Vijay kumar
Hi all, I have the following type mappings. How do i get all the children of a parent document in elasticsearch? How to get all users of a particular tenant? Please help me tenant: { - properties: { - organization: { - index: not_analyzed - type: string }

Re: n:m lookup filter

2014-07-25 Thread Matt Weber
It's currently blocked until we can figure out a way to prevent a bad query from triggering an OOM error. The goal (as far as I've been told) is to get this in, but no ETA. I need to update the PR to the latest master as there have been significant changes as well. Thanks, Matt Weber On Jul