Re: MoreLikeThis can't identify that 2 documents with exactly same attachments are duplicates

2014-05-08 Thread Zoran Jeremic
Hi Alex, Thank you for this explanation. This really helped me to understand how it works, and now I managed to get results I was expecting just after setting max_query_terms value to be 0 or some very high value. With these results in my tests I was able to identify duplicates. I noticed

Re: Changing node and cluster name Version: 1.1.1

2014-05-08 Thread David Pilato
Can you see anything in elasticsearch logs or in system logs? --  David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr Le 7 mai 2014 à 23:13:01, Francesco Audisio (cesco...@gmail.com) a écrit: yes i have uncomment the line but now the Elasticsearch server not

Re: Changing node and cluster name Version: 1.1.1

2014-05-08 Thread Francesco Audisio
Today evening i check, thank you very much. Il giorno giovedì 8 maggio 2014 08:19:35 UTC+2, David Pilato ha scritto: Can you see anything in elasticsearch logs or in system logs? -- *David Pilato* | *Technical Advocate* | *Elasticsearch.com* @dadoonet https://twitter.com/dadoonet |

Re: Locking a shard to one data path

2014-05-08 Thread Mark Walkom
If you are using single disk machines, then all your segments will be created in the one data path (ie system directory). On linux with a package install, that's usually /var/lib/elasticsearch/ Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web:

Re: Changing node and cluster name Version: 1.1.1

2014-05-08 Thread David Pilato
Please use GIST instead of attaching files. Did you set name to be Blind Faith? Could you gist your elasticsearch.yml file? --  David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr Le 8 mai 2014 à 08:44:49, Francesco Audisio (cesco...@gmail.com) a écrit: i

Re: Changing node and cluster name Version: 1.1.1

2014-05-08 Thread Francesco Audisio
No i haven't set Blind Faith,what is it? this is my elasticsearch-yml: https://gist.github.com/Fraaud/89114d2ad6b70daa3437#file-elasticsearch-yml Il giorno giovedì 8 maggio 2014 09:10:49 UTC+2, David Pilato ha scritto: Please use GIST instead of attaching files. Did you set name to be Blind

Re: ANN: new elasticsearch discovery plugin - eskka

2014-05-08 Thread shikhar
All Elasticsearch nodes will end up being part of the Akka cluster :) I think you're really asking how many seed nodes you should specify. The seed node list is probably going to be similar to what you might use for zen.unicast.hosts. Worth noting that besides being initial contact points for

Re: ANN: new elasticsearch discovery plugin - eskka

2014-05-08 Thread shikhar
On Thu, May 8, 2014 at 1:02 PM, shikhar shik...@schmizz.net wrote: Worth noting that besides being initial contact points for when the cluster is starting up, with eskka they are also used for resolving partitions. Given this requirement, you would ideally have 3 or more specified. It is

Re: more like this on numbers

2014-05-08 Thread Alex Ksikes
Hi Valentin, For these types of searches, have you looked into range queries, perhaps combined in a boolean query? Alex On May 7, 2014 4:14 PM, Valentin plet...@gmail.com wrote: Hi Alex, thanks. Good idea to convert the numbers into strings. But converting the number fields to string won't

Re: MoreLikeThis can't identify that 2 documents with exactly same attachments are duplicates

2014-05-08 Thread Alex Ksikes
On May 8, 2014 8:09 AM, Zoran Jeremic zoran.jere...@gmail.com wrote: Hi Alex, Thank you for this explanation. This really helped me to understand how it works, and now I managed to get results I was expecting just after setting max_query_terms value to be 0 or some very high value. With these

Re: Changing node and cluster name Version: 1.1.1

2014-05-08 Thread David Pilato
You should write something like: node.name: 192.168.0.12.raspi1 Does it work? --  David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr Le 8 mai 2014 à 09:22:06, Francesco Audisio (cesco...@gmail.com) a écrit: No i haven't set Blind Faith,what is it? this is my

Strange appearance of dynamic field

2014-05-08 Thread Michał
Last month in my elasticsearch index appeared weird dynamic mapping http://pastebin.com/dykbXEJy. Is seems as if this mapping would create itself... What might be the reason for this? Can I remove those dynamic mappings? I use version: 0.20.4 -- You received this message because you are

embedded es test server hangs on startup

2014-05-08 Thread Jilles van Gurp
I'm trying run elasticsearch as part of my jruby tests. Here's some of the code I use to do that: Settings settings = ImmutableSettings.settingsBuilder() .put(name, nodeName) .put(cluster.name, linko-dev-cluster)

Re: red status after unexpected stop

2014-05-08 Thread Arnau Bria
Hello, just an update as I was able to solve the issue. The health was: cluster_name: BigLog status: red timed_out: false number_of_nodes: 1 number_of_data_nodes: 1 active_primary_shards: 1175 active_shards: 1175 relocating_shards: 0 initializing_shards: 0 unassigned_shards: 1195 And the

Re: AmazonClientException[Unable to unmarshall error response...] when creating snapshot in S3

2014-05-08 Thread Pete Michel
Paulo, Did you ever figure out your error? I just encountered the exact same problem and was hoping you had found a solution Thanks, Pete On Friday, April 25, 2014 6:14:38 PM UTC-4, Paulo Correa wrote: I`ve set up ES v.1.1.1 + AWS-cloud-plugin 2.1.1 on an EC2 instance, using a Role to

Re: AmazonClientException[Unable to unmarshall error response...] when creating snapshot in S3

2014-05-08 Thread Paulo Correa
Hi Pete, we ran some more tests to see if we could narrow down the problem. We found out that whenever we had an instance that was not in AWS' default-vpc launch an snapshot, the problem occurred. That was the only factor that made the problem happen, so it is not an issue with permissions,

NEST: How can I get the raw JSON that comprises a doc just before it's indexed?

2014-05-08 Thread rianjs
Basically, I'm trying to get the raw JSON that would be sent to elasticsearch for indexing, *without actually indexing it*. Is there any way to do this? I didn't see anything in the docs or unit tests that would be useful. Thanks, -Rian -- You received this message because you are subscribed

Re: Snapshot Duration increasing over time

2014-05-08 Thread Igor Motov
Hi Dipesh, I have a few questions. Are you still on S3? Which version of elasticsearch are you using? How many snapshots do you currently keep in S3? How fast is your index growing over time? Igor On Wednesday, May 7, 2014 6:58:05 AM UTC-4, Dipesh Patel wrote: Hi We've noticed recently

Re: Snapshot Duration increasing over time

2014-05-08 Thread Dipesh Patel
Hi Igor We are using elasticsearch 1.1.1. Currently we are keeping all snapshots that we make in s3, we haven't yet decided on an archive strategy/solution. So at the moment we have 131 snapshots in the s3 bucket. So we have about 112 new indices a day. I'll explain our set up a bit it may

[ANN] Elasticsearch for Apache Hadoop 2.0 RC1 has been released

2014-05-08 Thread Costin Leau
Hi everyone, I'm happy to announce that Elasticsearch for Apache Hadoop (aka es-hadoop) 2.0 RC1 has been released. You can read more about it on our blog at [1]. Cheers! [1] http://www.elasticsearch.org/blog/es-hadoop-20-rc1 -- Costin -- You received this message because you are

Scripts reload on demand

2014-05-08 Thread Thomas
Hi, I was wondering whether there is a way to reload the scripts on demand provided under config/scripts. I'm facing a weird situation were although the documentation describes that the scripts are loaded every xx amount of time (configuration) I do not see that happening and there is no way

Re: AmazonClientException[Unable to unmarshall error response...] when creating snapshot in S3

2014-05-08 Thread Pete Michel
I was able to resolve my issue. We have elasticsearch in a private subnet. Access to S3 goes through squid proxy. I was able to alter the squid proxy configuration to fix the issue. I added the following to /etc/squid/squid.conf... ignore_error_100 on For reference, here is the squid proxy

Re: Node restart?

2014-05-08 Thread Ivan Brusic
To answer my own question, it is in fact disabled: https://github.com/elasticsearch/elasticsearch/issues/265 -- Ivan On Wed, May 7, 2014 at 5:18 PM, Ivan Brusic i...@brusic.com wrote: Does nodes restart action work? It is not documented, and whenever I try to use it I get: { error :

default _ttl causes MapperParsingException due to already expired document

2014-05-08 Thread Ravi Gairola
I have a river importing data from Big Query and I import it into an index via bulk that has a default _ttl of 30 days configured. I don't set the ttl anywhere on the document when importing, so every document should just get the ttl set from the default value. Unfortunately though I keep

Using wildcards in query_string fields

2014-05-08 Thread ltuhuru
I'm a new user, so hopefully this is something really simple. I have two message objects in my index, typeA and typeB, each of which has a net field. I'd like to be able to search for all objects whose net fields match a pattern. If I specify the fields explicitly, the query returns the

performance slowdown after upgrading from 0.90 to 1.1.1

2014-05-08 Thread slushi
We are testing out release 1.1.1 and during our indexing performance testing, we seemed to get significantly slower throughput, the document/second rate is about 30% slower. We used the exact same yml file and startup settings. The code is also identical except for the breaking changes in the

Re: Using wildcards in query_string fields

2014-05-08 Thread ltuhuru
Found my problem. Get rid of the leading type message in the field path and wildcards in the fields work. curl -XGET localhost:9200/nettest/_search?pretty -d '{ query : { query_string : { fields : [*.ne*], query : NET* } } }' -- View this

Re: counting filtered queries

2014-05-08 Thread spicylobe
This worked! I think the docs should be updated though, they are still wrong at: http://www.elasticsearch.org/guide/en/elasticsearch/client/javascript-api/current/api-reference.html#api-count -- You received this message because you are subscribed to the Google Groups elasticsearch group. To

Re: maxDocs different between primary and replica shards

2014-05-08 Thread Csaba Dezsényi
I exactly have the same issue! Does someone have solution for this? Thanks, Csaba 2013. november 28., csütörtök 14:26:51 UTC+1 időpontban Klaus Brunner a következőt írta: We're running Elasticsearch (currently 0.90.6) in what I'd call a replicated architecture: our indexes are quite small

Wildcards in field names

2014-05-08 Thread ltuhuru
Is there a way to aggregate across fields with wildcards in the name? I have documents with a variety of structures, and I want to be able to aggregate across all fields with the name special_label. That field may occur in various structural places within the document. Something like this would

Kibana Password Protected

2014-05-08 Thread Joshua Bitto
Hello All, Im trying to find documentation on how to setup Kibana to be password protected. I'm using Centos 6.5(apache) and right now with the basic install you can just go to the configured url and see logs without having to input credentials. Is there a way to add this? -- You received

Unknown error in TransportShardSingleOperationAction.java

2014-05-08 Thread ashish jain
Hello, I have a program where I am sending bursts of bulk index requests within a short time to elasticsearch (1.1) using the java API. I send in 1000 documents (in a bulk request) every 2-5 seconds - initially I was running into NoNodeAvailableException/NoShardAvailableException (on the

Re: Kibana Password Protected

2014-05-08 Thread Mark Walkom
Yep, look up apache basic authentication or try something like https://github.com/fangli/kibana-authentication-proxy Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 9 May 2014 07:05, Joshua Bitto

Time taken from issue closure on github to ES release?

2014-05-08 Thread T Vinod Gupta
Does someone have visibility into ES release process? I am desperately waiting for a release to come out that will fix the below issue. it says that the bug is fixed/closed 5 days ago.. https://github.com/elasticsearch/elasticsearch/issues/4887 MultiSearch hangs forever +

Re: How to define mappings using Java API

2014-05-08 Thread Ethan Pailes
This was quite helpful. I do have one question though. Is elasticSearchMappings of type Settings? Ethan On Thursday, January 12, 2012 8:09:01 PM UTC-5, Lorrin Nelson wrote: Hi David, That helped, thanks. After getting the builder stuff working following the example, I switched over to

Re: Time taken from issue closure on github to ES release?

2014-05-08 Thread Ivan Brusic
Hard to say. There are currently two open issues for version 1.1.2, so it probably will not be released until those are addressed. https://github.com/elasticsearch/elasticsearch/issues?labels=v1.1.2page=1state=open They have been releasing minor versions at the rate of about once per month, so

Re: performance slowdown after upgrading from 0.90 to 1.1.1

2014-05-08 Thread Mark Walkom
Can you yry increasing your bulk count to 1000, or more? Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 9 May 2014 05:01, slushi kireetre...@gmail.com wrote: We are testing out release 1.1.1 and during our

Re: performance slowdown after upgrading from 0.90 to 1.1.1

2014-05-08 Thread Kireet
I tried various counts, after a certain point they didn't make much difference. Also I am not necessarily concerned with improving performance as much as figuring out why I got a slowdown with the exact same settings in 1.1.1. I want to be sure we didn't miss some configuration somewhere or

Re: performance slowdown after upgrading from 0.90 to 1.1.1

2014-05-08 Thread Kireet
1.7.0_17 On May 8, 2014, at 6:52 PM, Mark Walkom ma...@campaignmonitor.com wrote: What java version? Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 9 May 2014 08:39, Kireet kireetre...@gmail.com

Re: performance slowdown after upgrading from 0.90 to 1.1.1

2014-05-08 Thread Kireet
oracle. Yes I saw that, but I didn't see anything in the release notes mentioning a performance difference in the 2 jdks. I think there was something about some bugfixes. Unless there was something specific, I wouldn't think a minor jdk version would cause a huge performance drop, though I

Query string operators seem to not be working correctly

2014-05-08 Thread Erich Lin
My query is in this format: { query: { query_string: { default_field: _all, query: QUERY, default_operator: AND } } } Here are the different outputs for QUERY and their counts: sofa 2,818 rugs 75,309 red 33,839 red AND rugs 9,441 red AND sofa 149 rugs AND sofa

Re: Retrieve 6 products for top 3 users and each one has 2 with highest matching score

2014-05-08 Thread Yao Li
What about nested or parent/child query? How to achieve? On Thursday, May 8, 2014 4:45:36 PM UTC-7, Yao Li wrote: I have a collection of products which belong to few users, like [ { id: 1, user_id: 1, description: blabla..., ... }, { id: 2, user_id: 2, description: blabla..., ...

Re: Official .NET client

2014-05-08 Thread Paige Cook
Elasticsearch.Net and NEST are the official .NET Clients for Elasticsearch. You can read about them in the recent blog post - introducing elasticsearch.net and nest 1.0.0-beta1http://www.elasticsearch.org/blog/introducing-elasticsearch-net-nest-1-0-0-beta1/ Both aggregations and integrated

Re: Help with ES 1.x percolator query plz

2014-05-08 Thread JGL
Can anybody help plz? On Wednesday, May 7, 2014 6:29:35 PM UTC+12, JGL wrote: Can anybody help plz? On Tuesday, May 6, 2014 11:53:32 AM UTC+12, JGL wrote: Can anybody help plz? On Monday, May 5, 2014 10:24:09 AM UTC+12, JGL wrote: Hi Martjin, The percolator query in the 1st post

custom stemmer with elasticsearch / tire / rails

2014-05-08 Thread Oto Iashvili
Hi, Im' searchinkg to ass new stemmer to elastisearch to use with tire / rails I've found java file (https://github.com/emilis/PolicyFeed/blob/master/src/search/java/org/tartarus/snowball/ext/LithuanianStemmer.java) I've created a jar from this file I've put it in elasticsearch's lib folder

Kibana - Changing text color based on content

2014-05-08 Thread Chris Laplante
We have some data that uses the common Red/Yellow/Green verbiage. When presenting this data in a Kibana table panel, I would like to modify the color of the text. I notice Marvel does this for the global status. Anyone doing anything like this, or have ideas where to inject it? CSS? Thanks,