More Like This Query Returning 0 Similar Documents

2014-11-30 Thread sekharreddy mandapati
I have more than 500 documents. While executing more_like_this query elasticsearch returning zero results. Below is my query. POST /espoc/_search { "from": 0, "size": 2, "query": { "filtered": { "query": { "more_like_this": { "fields": [

What does the "aggs" field in my index settings represent.

2014-11-30 Thread Anil Karaka
{ - test_index: { - settings: { - index: { - creation_date: "1416836998944", - warmer: { - enabled: "false" }, - uuid: "R2eL4uvvQbGtLKAaDPrmaQ", - number_of_replicas

Re: How do I get all the distinct field values that satisfy a particular query..

2014-11-30 Thread Anil Karaka
This is a silly question. All I need to do is treat "aggs" like "size", "query", "_source" fields in the json "size" limits the number of queries that I will get in the end. "aggs" aggregates the given query results accordingly. So my resultant aggregation query would be { "query": {

Get Kibana to render text as HTML?

2014-11-30 Thread Kevin Burton
I have some fields that are just raw HTML. Is there a way to get Kibana to render them as HTML? Right now it's just showing the raw HTML source. I want it to render the HTML -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe fr

Parent child relation to 2nd level and more - has_child query

2014-11-30 Thread Madhav Ayyagari
I had partial success implementing parent child relation. I was able to create the mappings correctly (I presume), and post documents without any issue. In the example below, "family" index has types "dad", "child", "grandchild". "child" has parent type as "dad", and "grandchild" has parent typ

Re: Java client - setTimeout vs actionGet(timeout)

2014-11-30 Thread Itamar Syn-Hershko
IIRC the Java API doesn't have any default client-side timeout for search requests, its an opt-in feature -- Itamar Syn-Hershko http://code972.com | @synhershko Freelance Developer & Consultant Author of RavenDB in Action On Sun,

Re: Marvel / ES query document count major discrepancy

2014-11-30 Thread Boaz Leskes
Hi Mike, That number includes all the docs currently in the index, including deleted docs which are not yet physically merged away (the same as the ES Stats API does, check the delete stats in the same part). Did you do some documents deletion before checking? Cheers, Boaz On Thursday, Novemb

Re: Marvel template upload

2014-11-30 Thread Boaz Leskes
Hi Jaguar, Marvel requires an index template to make sure some fields get the correct mappings. Similar in nature to the logstash template (see https://github.com/logstash-plugins/logstash-output-elasticsearch/blob/master/lib/logstash/outputs/elasticsearch/elasticsearch-template.json ) . Mar

Re: 'Shard Allocation' dashboard in Marvel 1.2 showing SearchParseExceptions

2014-11-30 Thread Boaz Leskes
Hi Teo, Max, >From the query, I can see that it that cluster_state is not shipped. The marvel index template adds the cluster_type with some non-default fields but all fields (like @timestamp) are added when the first document is shipped. Can you validate there are no documents of that type?

Re: ES java api: how to handle connectivity problems?

2014-11-30 Thread joergpra...@gmail.com
The client may hang because I assume the cluster state got unavailable, and minimum master condition is no longer met. If you re-add the failed node, the cluster state will be available again, and I think the client will continue. Jörg On Sun, Nov 30, 2014 at 11:07 PM, wrote: > > 2 nodes are no

Re: ES java api: how to handle connectivity problems?

2014-11-30 Thread msbreuer
> 2 nodes are not enough to form a distributed system, such a cluster is > prone to split brains, because there is no algorithm that can decide what > node shall continue as master in case of a single node failure. > Agreed! > ES has come precautions built in to suspend execution in this c

Re: Why is Kibana 4.0 no longer a status webapp?

2014-11-30 Thread Mark Walkom
There's info from some of the devs in this thread - https://groups.google.com/forum/?hl=en-GB#!searchin/logstash-users/kibana/logstash-users/X_ClBEKH6tk/L9z9i9BOFtUJ On 1 December 2014 at 04:29, Kevin Burton wrote: > Curious why Kibana 4.0 is no longer a static webap and why it requires a > Java

Re: Java client - setTimeout vs actionGet(timeout)

2014-11-30 Thread Nikolas Everett
Default for server side timeout is none and I don't know client side timeout. I imagine it is a long time. On Nov 30, 2014 1:46 PM, "Ron Sher" wrote: > Thanks for the info. > > Do you know what are the defaults? > > On Sunday, November 30, 2014 5:53:49 PM UTC+2, Nikolas Everett wrote: >> >> Timeo

Re: ES java api: how to handle connectivity problems?

2014-11-30 Thread joergpra...@gmail.com
2 nodes are not enough to form a distributed system, such a cluster is prone to split brains, because there is no algorithm that can decide what node shall continue as master in case of a single node failure. ES has come precautions built in to suspend execution in this case. Please use at least

Re: poor performance of "full text" searches

2014-11-30 Thread joergpra...@gmail.com
One note, Elasticsearch is not designed to run on one big host, but on many small servers - it scales out, not up. If you run 3 nodes on 1 server, this will slow down the overall system performance. Avoid left truncation in the form *word. Always use right truncation word*. Jörg On Fri, Nov 28,

Re: char_filter for German

2014-11-30 Thread joergpra...@gmail.com
Hi, by using my langdetect plugin, and the analyzer-by-path selection of Elasticsearch, it is possible to analyze input by detected language. See https://github.com/jprante/elasticsearch-langdetect http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-analyzer-field.html

Re: Java client - setTimeout vs actionGet(timeout)

2014-11-30 Thread Ron Sher
Thanks for the info. Do you know what are the defaults? On Sunday, November 30, 2014 5:53:49 PM UTC+2, Nikolas Everett wrote: > > Timeouts are server side and best effort. I believe action get(timeout) is > client side. > > I use the http client but use both and set the server side timeout to

Why is Kibana 4.0 no longer a status webapp?

2014-11-30 Thread Kevin Burton
Curious why Kibana 4.0 is no longer a static webap and why it requires a Java daemon. Is this strictly required? I guess I'm fine if it is but why? What does the Java daemon provide? I guess I liked that it was a static webapp as it allows for easier integration on our end. -- You receive

Re: poor performance of "full text" searches

2014-11-30 Thread David Pilato
Indeed. If you want to run terms agg on a analyzed field, it might have no sense. In that case you should use multi field to analyze you field for search and don’t analyze it for aggs. See http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-core-types.html#_multi_field

Re: Releasing elasticsearch connection

2014-11-30 Thread Jason Wee
Have you try using client.close() ? Jason On Sun, Nov 30, 2014 at 8:23 PM, wrote: > Hi, > > I am deploying a webapp, which internally queries elasticsearch. I am > creating only one connection, and I want to close it when the webapp shuts > down. I am using singleton pattern for creating the ES

Re: char_filter for German

2014-11-30 Thread Beatrix Willius
Hi, I'm in the preliminary stages for implementing Elasticsearch so I'm interested in this, too. What about mixed languages or where I even don't know the language? My data are emails. Therefore the data could be any language. > On 30.11.2014, at 17:20, joergpra...@gmail.com

Re: char_filter for German

2014-11-30 Thread joergpra...@gmail.com
Do not use regex, this will give wrong results. Elasticsearch comes with full support for german umlaut handling. If you install ICU plugin, you can use something like this analysis setting { "index" : { "analysis" : { "filter" : { "german_normalize_stem"

Re: Java client - setTimeout vs actionGet(timeout)

2014-11-30 Thread Nikolas Everett
Timeouts are server side and best effort. I believe action get(timeout) is client side. I use the http client but use both and set the server side timeout to lower than the client side timeout. The server side timeout should return partial results if possible. On Nov 30, 2014 10:41 AM, "Ron Sher"

Java client - setTimeout vs actionGet(timeout)

2014-11-30 Thread Ron Sher
Hi all, I want to make sure the search query doesn't exceed some limit. I've seen the option to use a setTimeout vs actionGet(timeout). Can someone please explain the difference? Also, I've read somewhere that there's a default connection timeout. Can that be used instead. If so, how? Thanks

GeoJson cannot make it work

2014-11-30 Thread CB
I need some help - i'm trying to configure dynamic templates to pick up GeoJSON format in order to use the better map in Kibana 3.x BUT the parser throws exceptions: java.lang.RuntimeException: indexing error MapperParsingException[failed to parse]; nested: ElasticsearchParseException[geo_poi

Releasing elasticsearch connection

2014-11-30 Thread prachicsa
Hi, I am deploying a webapp, which internally queries elasticsearch. I am creating only one connection, and I want to close it when the webapp shuts down. I am using singleton pattern for creating the ES client and how do I destroy/close ES client? When the application shuts down, will the clie

Re: Failing Replica Shards

2014-11-30 Thread Jakub Podeszwik
Small mistake. 1. should be: 1. If shard had more than one segment then optimizing it to one segment usually worked. On Sunday, 30 November 2014 12:00:37 UTC+1, Jakub Podeszwik wrote: > > I've had similar problems. Two things that helped: > 1. If index had more than one shard then optimizing it t

Re: Failing Replica Shards

2014-11-30 Thread Jakub Podeszwik
I've had similar problems. Two things that helped: 1. If index had more than one shard then optimizing it to one shard usually worked. 2. In other case manually copying shard files from node with master shard to one of nodes that kept failing. On Sunday, 30 November 2014 00:57:02 UTC+1, David Kl

Re: char_filter for German

2014-11-30 Thread Krešimir Slugan
Hi Jürgen, Currently we don't have big volumes of data to index so we would like to yield more results in hope that proper ones would still be shown in the top. In future, when we have more data, we'll have to sacrifice some use cases in order to provide more precise results for the rest of use