Re: Match every token position in the field when using synonyms

2014-01-22 Thread Sebastian Briesemeister
I am also very keen on answer!! If you find a solution, let me know! Sebastian On Thursday, 16 January 2014 15:12:23 UTC+1, Dany Gielow wrote: In my Elasticsearch index I have documents that have multiple tokens at the same position. I want to get a document back when I match at least one

Re: A Question on Plugin redundancy

2014-01-22 Thread joergpra...@gmail.com
Roy, from what I understand, you want plugins that are somehow coordinated and do not require to get installed on every node. A similar situation is possible in the HTTP area. Some ES nodes may provide HTTP, some not, by disabling HTTP. The deploy of a set of coordinated plugins is possible by

Re: Encountering Index shard gateway recovery exception while manually moving the shards across nodes

2014-01-22 Thread hrishikesh prabhune
Thanks for a precise and prompt reply! Will updating to Elasticsearch 0.90.10 help me in recovering these shards? This bug https://github.com/elasticsearch/elasticsearch/issues/4674 potentially talks about a similar case that I am facing. And this bug is fixed in 0.90.10. Also , the above issue

Re: update api consistency

2014-01-22 Thread Luca Cavanna
Write consistency in elasticsearch is quite different to what has been described here. You can think of it as a check done before indexing, to make sure that enough copies of the data are available. It doesn't have anything to do with when the response is returned and only affects whether the

Re: Classification with percolator

2014-01-22 Thread Arthur Denning
Hey Binh, Thanks a lot and it is really nice to hear from someone with practical experience on this. Is it correct to say if I had a thousand tags, I would need to make thousands of curl -XPUT 'localhost:9200/my-index1/.percolator/tagname1' to register each tags? In your implementation is

Using updates with external versioning

2014-01-22 Thread David Klotz
Hi, I am using the external version-type for all my documents (they come from another primary database and I want to keep the versions between them consistent). Am I right in assuming that this basically means that I can't use the _update API (with a script or partial documents), because it

Set default_operator to AND, except in field queries

2014-01-22 Thread Nicolas F
Hi, I wonder if I can set the default operator to AND, except in field queries. The goal is for the user to write something like: roger moore tags:(marketing twitter), and it would be translated to roger AND moore AND tags:(marketing OR twitter). I know I can do it with term filters. roger

Score depending on position in the term on the field

2014-01-22 Thread Nikolay Chankov
I am playing with elasticsearch so far, and i noticed something: If I search for a word in a string, the _score is equal no matter where is placed the word. Here I have prepared a test case: curl -XDELETE 'http://localhost:9200/test_search' curl -XPUT 'http://localhost:9200/test_search/' -d ' {

How does ES scale with many fields?

2014-01-22 Thread Bernhard Berger
I want to save a filter over a category-field (with about 5 distinct values) of my Elasticsearch documents and ask me if I could use different field names for every category instead. F.ex. instead of the fields: category, date I would use dateRed, dateBlue, dateGreen So to get all

Encountered problem while starting elasticsearch on AWS

2014-01-22 Thread Jong Min Kim
Hi, I'm from Korea, country of kimchi :) I'm trying to run elasticsearch on AWS EC2 system. I was following instruction on http://www.elasticsearch.org/tutorials/elasticsearch-on-ec2/ page. I installed aws plugin and configure config/elasticsearch.yml as follow. cluster.name: candi cloud:

Re: kibana - show average of a numerical field

2014-01-22 Thread Andrew Vine
Can anyone help me out here? On Tuesday, 21 January 2014 22:58:32 UTC+2, Andrew Vine wrote: Hi All! I hope this is the place to ask questions about Kibana. I am new to Kibana and so far I have enjoyed playing around with it. Could someone help me out with the following.. If I have a

[ANN] es-nozzle 0.4.0 - index documents from filesystem

2014-01-22 Thread Ralf Schmitt
es-nozzle can be used to index documents from the local filesystem or from network shares. It's similar in purpose to dadoonet's filesystem river, but it's not an elasticsearch plugin. Instead es-nozzle takes advantage of RabbitMQ in order to provide a fault tolerant and scalable system for

Re: elasticsearch 0.90.9, exclude or downgrade asm

2014-01-22 Thread Fabian Lange
Hi, we would be also interested to limit jar usage by an elastic search client node. Does the elasticsearch client need asm? or lucene spatial or any other? Would it be possible to provide a jar / maven artifact that is client only? Fabian On Wednesday, January 8, 2014 11:20:46 AM UTC+1, Vadim

Re: WARNING failed to prepare/warm after upgrading from 0.90.3 to 0.90.10

2014-01-22 Thread Benoît
Thank you for your feedback, i'm happy to know that i'm not alone. I will report it as an issue. Benoît On Tuesday, January 21, 2014 9:16:06 PM UTC+1, Emilie Lavigne wrote: I'm having the same problem and would really love to know if there's a fix for it. On Friday, January 17, 2014

Re: Using updates with external versioning

2014-01-22 Thread InquiringMind
David, Using the Java API: try { /* Create using the 3 strings: index, type, and id */ IndexRequestBuilder irb = client.prepareIndex(index, Name, id); */* Set the version number (a long integer value) */* * irb.setVersion(versionNumber);* */* Set external versioning */* *

Re: A Question on Plugin redundancy

2014-01-22 Thread joergpra...@gmail.com
Yes, you could ramp up one (or more) Elasticsearch node(s) without data, without http, not master eligible - but with gatherer plugin ;) Jörg -- You received this message because you are subscribed to the Google Groups elasticsearch group. To unsubscribe from this group and stop receiving

Re: [ANN] es-nozzle 0.4.0 - index documents from filesystem

2014-01-22 Thread David Pilato
Awesome improvement! I really like this: One of the highlights is support for generating thumbnail previews for images and  PDF documents. Great job!  --  David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr Le 22 janvier 2014 at 15:38:48, Ralf Schmitt

Re: Using updates with external versioning

2014-01-22 Thread joergpra...@gmail.com
Here is version and version type support for UpdateRequest https://github.com/elasticsearch/elasticsearch/blob/master/src/main/java/org/elasticsearch/action/update/UpdateRequest.java#L299 Jörg On Wed, Jan 22, 2014 at 12:07 PM, David Klotz david.kl...@fileee.comwrote: And I don't see a way of

Stemming acronyms ending in s; keyword marker token filter; minimal english stemmer

2014-01-22 Thread Loren
Using the minimal_english stemmerhttp://www.elasticsearch.org/guide/en/elasticsearch/reference/current/analysis-stemmer-tokenfilter.html, acronym tokens like irs and nps get stemmed to ir and np. I can use the keyword marker token

Re: Encountered problem while starting elasticsearch on AWS

2014-01-22 Thread David Pilato
You are trying to use AWS plugin 2.0.0.RC1 with elasticsearch 0.90.x. This version only works with elasticsearch 1.0.0.RC1 and For 0.90.x, latest version is 1.16.0 See README:     --  David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr Le 22 janvier 2014 at

Re: Query score based on aggregated values

2014-01-22 Thread Kevin Pearson
Thank you Alexander. The first possibility is what we ended up doing for now. We plan on looking into the aggregations framework as a replacement once we upgrade to elasticsearch 1.0 so we don't need to return thousands of documents to the client. Thank you. On Monday, January 20, 2014

Re: A Question on Plugin redundancy

2014-01-22 Thread Lukáš Vlček
Roy, Sounds a bit like similar approach to Sematext SPM to me (except they use CollectD - which is based on RRD is I am not mistaken) - but that shouldn't stop you. As for RHQ it is upstream for JON (JBossON). See

TransportClient not connecting

2014-01-22 Thread ZenMaster80
I can't seem to figure out this problem, Node from NodeBuilder works, but If I use transportclient like below, I get an exception. //I am using all default settings //elasticsearch-0.90.9 Settings settings = ImmutableSettings.settingsBuilder().put(cluster.name, elasticsearch).build();

Re: Too Many Open Files

2014-01-22 Thread Ivan Brusic
The first thing to do is check if your limits are actually being persisted and used. The elasticsearch site has a good writeup: http://www.elasticsearch.org/tutorials/too-many-open-files/ Second, it might be possible that you are reaching the 128k limit. How many shards per node do you have? Do

Re: Classification with percolator

2014-01-22 Thread Binh Ly
Arthur, I am assuming that you will define a query/rule for each tag, so in your case yes, that would be the way to define the percolator queries. Couple of things that you might want to be aware: 1) Percolation is CPU intensive 2) The lesser the queries you can percolate against, the better.

Re: don't improve performances with BitSet filters

2014-01-22 Thread Binh Ly
Camilo, I didn't look in detail at your query, but you should be able to see a performance improvement on a larger index (i.e. millions of docs). If you only have a couple hundred/thousand docs, you may or may not see a difference (depending on your data and queries of course). Also the

Re: keyword tokenizer

2014-01-22 Thread Binh Ly
Paul, Is it possible that your syns_filter is affecting your ngram filter? What happens when you remove the syns_filter? On Wednesday, January 22, 2014 6:17:12 AM UTC-5, paul wrote: My mapping looks as below autocomplete_index:{ type:custom,

Re: A Question on Plugin redundancy

2014-01-22 Thread Roy Russo
Lukas, Yes, very similar approach in tech, but different in distribution model. The use of RRD makes sense for them, as I believe their offering monitors more than just ES clusters. It's a New Relic type of model. Alternatively, New Relic has ES monitoring plugins available as well. On

ES hardware setup advice

2014-01-22 Thread briche arnaud
Hi, I know that the easy response for the following would be to benchmark be myself to see what best fit my workload; however, what I ask for is to help me make a kind of pros/cons list for two different setup, and for which kind of workload one or the other would be more adapted. The first

Re: TransportClient not connecting

2014-01-22 Thread ZenMaster80
Anyone using transportclient from java? On Wednesday, January 22, 2014 12:04:30 PM UTC-5, ZenMaster80 wrote: I can't seem to figure out this problem, Node from NodeBuilder works, but If I use transportclient like below, I get an exception. //I am using all default settings

Re: TransportClient not connecting

2014-01-22 Thread InquiringMind
ImmutableSettings.Builder settingsBuilder = ImmutableSettings.settingsBuilder(); settingsBuilder.put(cluster.name, clusterName); TransportClient client = new TransportClient(settingsBuilder.build()); for (String host : hostNames) { InetSocketTransportAddress server_address = new

Re: How to configure elasticsearch to sort the scored documents on a field after score for documents is calculated ?

2014-01-22 Thread Ivan Brusic
Some pseudo-ish code using the Java API: SearchRequestBuilder builder = ... SortBuilder sortBuidler = org.elasticsearch.search.sort.fieldSort(field); builder.setTrackScores(true); builder.addSort(sortBuidler); builder.addSort(scoreSort().order(SortOrder.DESC)); Hope this helps, Ivan On Tue,

Re: TransportClient not connecting

2014-01-22 Thread ZenMaster80
Brian, This is no different from what I have. I googled the problem, and I guess this may come from the fact that ES js using a different java version. I have added the es 0.90.0.jar to java from the es installation folder. I have no clue what I am missing. On Wednesday, January 22, 2014

Re: TransportClient not connecting

2014-01-22 Thread David Pilato
You don't use Maven for your project? If not, don't forget to add all needed dependencies. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 22 janv. 2014 à 20:23, ZenMaster80 sabdall...@gmail.com a écrit : Brian, This is no different from what I have. I googled the

Authorization filtering?

2014-01-22 Thread David Haimson
Our data is stored in MongoDB 2.4.8, and indexed to ElasticSearch 0.90.7 using the ElasticSearch MongoDB River. Our data indexes correctly, and I can successfully search the fields we want to search. But I also need to filter on permission - of course we only want to return results the calling

Re: TransportClient not connecting

2014-01-22 Thread joergpra...@gmail.com
You wrote that you have a 0.90.9 cluster but you added 0.90.0 jars to the client. Is that correct? Please check: - if your cluster nodes and client node is using exactly the same JVM - if your cluster and client use exactly the same ES version - if your cluster and client use the same cluster

Re: TransportClient not connecting

2014-01-22 Thread InquiringMind
IMHO, you don't need maven. You just need the Java trick to include all of the ES dependent jar files by setting the classpath to a colon-separated list of directory names, but append /* to the end of each directory name. If you do this, you don't need to list all of the jars in the directory:

Re: retrieve localhost:9200/_aliases using the java api

2014-01-22 Thread Emilie Lavigne
Logically, I understand that the curl statement cannot be faster. Which is how I reckon that I'm doing something wrong. This is how I am timing my methods (from my own dev machine pointing to a vm) using the Google stopwatch and ticker. *Client connection to myvm:9300 using java:*

Re: Score depending on position in the term on the field

2014-01-22 Thread Johan Rask
Lucene will calculate you score based on a scoring formula. I am pretty sure that the location of the word is not part of this formula but rather how common the word is in your sentence. I.e multiple occurences of 'venue' should increase scoring and adding other words to your sentence should

Re: retrieve localhost:9200/_aliases using the java api

2014-01-22 Thread joergpra...@gmail.com
I have written a test case, in the hope it is useful ... (it's for 1.0.0.RC1) https://gist.github.com/jprante/8568886 Jörg -- You received this message because you are subscribed to the Google Groups elasticsearch group. To unsubscribe from this group and stop receiving emails from it, send

get count by certain field based on a sub query

2014-01-22 Thread Chen Wang
Guys, I just successfully imported my data to ES, e,g. It has looks like this: activity: 'viewed', sessionId: 00143198107b3fe510b041138cd33fdd9252aab9808c, campaign_id: , activity: 'campaign_viewed', sessionId: 00143198107b3fe510b041138cd33fdd9252aab9808c, campaign_id:my_campaign

How to Parse Response from Elastic Search to Java Objects (Map) ?

2014-01-22 Thread Rahul Kalra
Hi, Please tell me how to convert the Response from Elastic Search to Java Object (Map) I want to get the values of user attributes under _source like lastName, firstName, Manager, email Here is the sample response from Elastic Search.

Re: TransportClient not connecting

2014-01-22 Thread Ross Simpson
java -version will tell you the exact version, patch level, vendor, and architecture of that JVM. The tricky bit can be finding out which JVM you're actually using (usually the value in $JAVA_HOME or `which java` will lead you in the right direction). If you're running your example under an

Prioritize exact match using nGram

2014-01-22 Thread Nikhil Podduturi
Hello All, I am wondering how to rank exact match higher than the nGram matches. For example: search string: *abcd* results - abcd, abcde, abcdegl not in a particular order. But I want to rank 'abcd' higher than the rest. Is it possible?? I am adding my mappings, analyzers and query below.

How to Configure LDAP River to fetch only updated records ?

2014-01-22 Thread Rahul Kalra
Hi, Please help me to identify the correct configuration for LDAP River, so that it fetches only *updated records * in polling after initial full fetch. I am using below mentioned configuration. This fetches all the records from LDAP after 86400s (1 day).

Re: TransportClient not connecting

2014-01-22 Thread ZenMaster80
java version 1.7.0_11 Java(TM) SE Runtime Environment (build 1.7.0_11-b21) Java HotSpot(TM) 64-Bit Server VM (build 23.6-b04, mixed mode) I spent too much time on this, I gave up. I'll ask the question differently, I wanted to use the transport client at 9300 so I can index a file, and the

Difference about the nested type and object type?

2014-01-22 Thread Ivan Ji
Hi, all I am wondering what the difference between the two types in the mappings. I know the nested object can be query using nested query. but can the object type field be query using other queries, such as term query? ex, what's the difference when I change the type of variation to object?

Re: Difference about the nested type and object type?

2014-01-22 Thread Ivan Brusic
The elasticsearch site has a short writeup about the benefits of the nested type: http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-nested-type.html Basically, with object types, multiple instances of a field are flattened as an unordered array. In your example, if you

Re: TransportClient not connecting

2014-01-22 Thread joergpra...@gmail.com
If you do not post error message or log files, I'm afraid nobody can help you. NodeClient is not local as long as you do not set local(true). But I am afraid you understand something different under local. If you want HTTP, neither NodeClient nor TransportClient is the right thing. 9300 is the