Re: GROUP BY query in ElasticSearch

2014-03-25 Thread David Pilato
Sorry. I misread the question. For now we don't support it. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 25 March 2014 at 04:02, Otis Gospodnetic otis.gospodne...@gmail.com wrote: David was suggesting that this IS doable with ES aggregations, so I'd love to see if

Re: Benchmarks (again)

2014-03-25 Thread Robin Clarke
Each node had 8 cores (2.4GHz Xeon), 32GB RAM, SSD disks (I never saw IOWait, but was also focusing on ingestion rate). I always had 2 master nodes, and in addition tried the configurations 20, 10 and 5 data nodes. Running Elasticsearch 1.0.1 (but with Logstash 1.3.3) -Robin- On 25 March 2014

Searhch Request

2014-03-25 Thread Praveenkumar Arepalli
Hi Team, I have a little knowledge of Elasticsearch, but I am not satisfied with what I know and want to learn more. Here is my question: 1. How can I tell which field matched in an indexed document when fetching from the index? Ex: structure of a document in the index: Document #1 { "name" :

Re: Elasticsearch configuration for uninterrupted indexing

2014-03-25 Thread Rujuta Deshpande
Well it was for the entire machine. Now, I have changed it to a 4 GB machine. Even 4 GB is not enough right now and I do face the same problem. I am trying to benchmark the max/min Heap size I will have to allocate to an elasticsearch instance to be able to achieve uninterrupted indexing

Re: Benchmarks (again)

2014-03-25 Thread Mark Walkom
Java version? Also what OS? Just as a general note, it's always good to have an uneven number of masters to ensure you get a majority quorum. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 25 March 2014 17:35,
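The advice about the number of masters can be checked with a little arithmetic. The sketch below assumes the usual strict-majority rule (quorum = floor(n / 2) + 1); the function names are illustrative and are not part of any Elasticsearch API.

```python
def majority_quorum(master_nodes: int) -> int:
    """Smallest number of master-eligible nodes that forms a strict majority."""
    if master_nodes < 1:
        raise ValueError("need at least one master-eligible node")
    return master_nodes // 2 + 1


def tolerated_failures(master_nodes: int) -> int:
    """How many master-eligible nodes can be lost while a majority remains."""
    return master_nodes - majority_quorum(master_nodes)


if __name__ == "__main__":
    # Print: node count, quorum size, failures tolerated.
    for n in (1, 2, 3, 4, 5):
        print(n, majority_quorum(n), tolerated_failures(n))
```

Under this rule, 2 masters need both nodes for a majority, so they tolerate no failure; 3 masters tolerate one, and a 4th master tolerates no more than 3 do.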

word hit maximum time by users

2014-03-25 Thread robin thakur
I want to know whether we can get the words or phrases that are searched most often, in descending order of frequency. E.g., users search for America most often, then Tree, then Sea, then mountains, so America is at the top of the list and mountains at the bottom. And tree and Sea on

Re: Searhch Request

2014-03-25 Thread David Pilato
Maybe by using the explain option. But the question is rather why a user would want to know that. Could you describe your use case a little more and what you are trying to achieve? -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 25 March 2014 at 07:37, Praveenkumar Arepalli

Re: Searhch Request

2014-03-25 Thread David Pilato
I understand that you as a developer want to know it. My question is what are you going to do with that information? If it's for debugging purpose then explain is fine. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 25 March 2014 at 08:07, Praveenkumar Arepalli

Re: Searhch Request

2014-03-25 Thread Praveenkumar Arepalli
How to use explain David? On Tue, Mar 25, 2014 at 12:44 PM, David Pilato da...@pilato.fr wrote: I understand that you as a developer want to know it. My question is what are you going to do with that information? If it's for debugging purpose then explain is fine. -- David ;-) Twitter

Re: Benchmarks (again)

2014-03-25 Thread Robin Clarke
Thanks for the tip with the number of masters! java version 1.6.0_45 on Debian 3.2.54-2 On 25 March 2014 07:55, Mark Walkom ma...@campaignmonitor.com wrote: Java version? Also what OS? Just as a general note, it's always good to have an uneven number of masters to ensure you get a majority

Re: Searhch Request

2014-03-25 Thread David Pilato
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-request-explain.html#search-request-explain -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 25 March 2014 at 08:18, Praveenkumar Arepalli arepalli.praveen6...@gmail.com wrote: How to use
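As a rough illustration of the explain option referred to above: it is a top-level `explain` flag in the search request body. The sketch below only builds such a body; the field name `name` and the search text are made-up examples, and sending the request to a cluster is left out.

```python
import json


def build_explain_search(field: str, text: str) -> dict:
    """Build a search body that asks Elasticsearch to explain each hit's score."""
    return {
        "explain": True,  # request a score explanation for every hit
        "query": {"match": {field: text}},
    }


if __name__ == "__main__":
    # Hypothetical field and text, for illustration only.
    body = build_explain_search("name", "praveen")
    print(json.dumps(body, indent=2))
```

With the flag set, each hit in the response should carry an `_explanation` section describing how its score was computed, which is mainly useful for debugging.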

Re: Searhch Request

2014-03-25 Thread Praveenkumar Arepalli
companyName : { type : multi_field, fields : { companyName : { type : string, analyzer : apptivo_whitespace_wdf_lcf_analyzer }, sortable : { type : string, analyzer : apptivo_sort_analyzer,

Re: Benchmarks (again)

2014-03-25 Thread Mark Walkom
Ouch, why such an old version of java? (And is it Open or Oracle?) v7 is the minimum for ES and LS. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 25 March 2014 18:20, Robin Clarke ro...@robinclarke.net wrote:

Re: Searhch Request

2014-03-25 Thread Michael Kleen
Hi Praveenkumar, what is your index layout, your input data and what is your query request? Can you post your setup as a working example using curls similar to https://gist.github.com/mkleen/4739479 ? That way it's easy to help you here. Regards, Michael On 25 March 2014 08:29,

query search maximum time

2014-03-25 Thread robin thakur
hi, Is there any API in elastic search to get information regarding the query which is searched the most times, meaning the query with the maximum hit count? thanks -- You received this message because you are subscribed to the Google Groups elasticsearch group. To unsubscribe from this group and

Re: query search maximum time

2014-03-25 Thread Michael Kleen
Hi robin, have a look at http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/index-modules-slowlog.html Michael On 25 March 2014 09:07, robin thakur thakurrob...@gmail.com wrote: hi, Is there any API in elastic search to get information regarding the query which is

Re: Kibana has problems connecting to Elasticsearch

2014-03-25 Thread Dušan Đorđević
Maybe it is a cross-domain issue, but I have no idea what might be wrong... On Tue, Mar 25, 2014 at 1:06 AM, Harry Waye ha...@arachnys.com wrote: Is it likely that you have a cross domain issue here? On Monday, March 24, 2014 3:59:35 PM UTC, Dušan Đorđević wrote: On Mon, Mar 24, 2014 at 3:49

Re: query search maximum time

2014-03-25 Thread robin thakur
Hi mkleen, I am not getting an answer from that. I want to know whether Elasticsearch stores the query history itself, so that I can get the history of all queries with their counts.

Re: Searhch Request

2014-03-25 Thread Praveenkumar Arepalli
{ index.analysis.analyzer.test_whitespace_wdf_lcf_analyzer.filter.1 : test_lowercase_filter, index.analysis.analyzer.test_whitespace_wdf_lcf_analyzer.filter.0 : test_word_delimiter_filter, index.analysis.analyzer.test_whitespace_wdf_lcf_analyzer.tokenizer : test_whitespace_tokenizer,

popular keyword search

2014-03-25 Thread robin thakur
Hi, Is there any functionality in Elasticsearch for popular keyword search (the keywords that are searched the most times)? Thanks

Re: Thread Pools and Queues

2014-03-25 Thread Paweł Młynarczyk
Thank you for your answer. I did some tests and it appears that the lowest queue size among the data nodes is the effective one (as far as I know, the search requests are spread across all nodes with data) and queue sizes in non-data nodes are ignored (I guess that it is not ignored

Reduce the number of results returned in elasticsearch

2014-03-25 Thread soumya sengupta
I am using an ngram analyzer for doing auto complete search on a particular field using elasticsearch. By default I have set the number of results to return to 10 using size:10 inside the query. But what I want is that as my search nears the exact match, the results returned by elastic search

Re: Elasticsearch configuration for uninterrupted indexing

2014-03-25 Thread joergpra...@gmail.com
While it is possible to create an ES cluster with dedicated reader/writer nodes, this is not the default and in many cases, dedication of nodes is not required at all. ES has some better heuristics built in to relieve the admin of tedious jobs like setting up dedicated nodes. So I wonder how you

Re: Install Elasticsearch 1.0.1 on Ubuntu 12.04 fails with sudo dpkg -i elasticsearch-1.0.1.deb

2014-03-25 Thread Patrick Lock
If I run curl -XGET 'localhost:9200/_cluster/health?pretty' I get { cluster_name : elasticsearch, status : yellow, timed_out : false, number_of_nodes : 1, number_of_data_nodes : 1, active_primary_shards : 5, active_shards : 5, relocating_shards : 0,

Re: Does snapshot/restore do any analysis on restore?

2014-03-25 Thread Harry Waye
Excellent great thanks David On Tuesday, March 25, 2014 5:56:55 AM UTC, David Pilato wrote: Snapshot/restore just save Lucene files (and some metadata). Restoring does not imply any analysis process. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 25 March 2014 at

not getting results from java search api

2014-03-25 Thread Subhadip Bagui
Hi, I'm using the below method to get results from ES. But search hit result is 0. Please let me know the correct way to get results, public static SearchResponse searchIndex(Client client, Node node) { SearchRequestBuilder srequest = client.prepareSearch(testindex).setTypes(testtype)

Re: Custom analysis, phonetic filter and highlighting

2014-03-25 Thread Erdal Gunyar
Well, since I posted this message I haven't found any reliable solution; it looks like a bug in the phonetic plugin. There is at least a workaround: as I'm using my custom analysis in _all, the workaround consists of putting a more standard analyser on the wanted field (for ex. content) and to

Re: not getting results from java search api

2014-03-25 Thread joergpra...@gmail.com
Have you tried matchQuery instead of termQuery? Jörg On Tue, Mar 25, 2014 at 1:19 PM, Subhadip Bagui i.ba...@gmail.com wrote: Hi, I'm using the below method to get results from ES. But search hit result is 0. Please let me know the correct way to get results, public static

Re: Reduce the number of results returned in elasticsearch

2014-03-25 Thread Binh Ly
You probably want to override the search_analyzer to make sure that the ngram is not applied to your query string. For example, if you use the match query, just set analyzer in your query to something like standard.
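A toy model may help show why the suggestion above narrows the results. It is not Elasticsearch code: the tokenizer, the gram sizes (2 to 3) and the function names are all assumptions chosen for the illustration. Documents are always indexed as ngrams; the query is either ngrammed as well or left as whole lowercase words, which is roughly what a standard analyzer would produce.

```python
def ngrams(token: str, min_n: int = 2, max_n: int = 3) -> set:
    """All character ngrams of a token with lengths from min_n to max_n."""
    return {
        token[i:i + n]
        for n in range(min_n, max_n + 1)
        for i in range(len(token) - n + 1)
    }


def ngram_terms(text: str) -> set:
    """Index-time analysis in this toy model: lowercase, split, ngram."""
    terms = set()
    for token in text.lower().split():
        terms |= ngrams(token)
    return terms


def matches(doc_text: str, query: str, ngram_query: bool) -> bool:
    """True if any query term appears among the document's indexed ngrams."""
    doc_terms = ngram_terms(doc_text)
    if ngram_query:
        query_terms = ngram_terms(query)        # query is split into grams too
    else:
        query_terms = set(query.lower().split())  # query words kept whole
    return bool(doc_terms & query_terms)


if __name__ == "__main__":
    for doc in ("elastic", "plastic"):
        print(doc, matches(doc, "ela", True), matches(doc, "ela", False))
```

In this model, the query "ela" matches "plastic" when the query is ngrammed (both contain the gram "la"), but only "elastic" when the query is kept whole. One caveat of the toy: a whole-word query longer than the largest gram would match nothing, so the gram sizes have to cover the expected query lengths.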

Stree Address Queries

2014-03-25 Thread Henri van den Bulk
I have a field that contains the full address, e.g. 1200 S FUN ST. I need to accomplish two types of searches: one is a suggest search with synonyms, so that users can type ahead 1200 Fun ave, which can be 1200 Fun Avenue or 1200 E Fun Ave. How have others accomplished this type of search and what

Re: Install Elasticsearch 1.0.1 on Ubuntu 12.04 fails with sudo dpkg -i elasticsearch-1.0.1.deb

2014-03-25 Thread Patrick Lock
I have managed to install elasticsearch-1.0.1 by using an old method cd ~ wget https://download.elasticsearch.org/elasticsearch/elasticsearch/elasticsearch-1.0.1.tar.gz tar -xf elasticsearch-1.0.1.tar.gz cd /usr/local/ sudo mkdir elasticsearch cd elasticsearch cd ~ sudo mv elasticsearch-1.0.1

Re: query search maximum time

2014-03-25 Thread Binh Ly
It is possible to turn on the query slowlog to get what you want. For example, you can go into the elasticsearch.yml file and set this line (then restart ES): index.search.slowlog.threshold.query.warn: 1ms And that will likely log all queries that you run into the slowlog.log file.

Elasticsearch not matching a 1-token analysed string value

2014-03-25 Thread Emanuil Tolev
Hello everybody, I'm using a dynamic template in order to peruse 2 versions of each field - an analysed one, and a non-analysed one. Elasticsearch 0.90.7 with no plugins, HTTP transport. This is the mapping for the type in question: { journal: { dynamic_templates: [ {

Re: how to modify term frequency formula?

2014-03-25 Thread geantbrun
Thanks again for the answer Ivan. Would it be simpler to modify directly in the source code the way tf is calculated? I mean replacing somewhere something like tf = sqrt(n) by tf = min(10,sqrt(n)). Cheers, Patrick On Friday, March 21, 2014 6:01:51 PM UTC-4, Ivan Brusic wrote: Term frequencies
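To make the proposed change concrete, here is a small sketch of the two formulas mentioned in the message, tf = sqrt(n) and tf = min(10, sqrt(n)). It is plain arithmetic rather than Lucene or Elasticsearch code, and the function names are invented for the illustration.

```python
import math


def default_tf(freq: int) -> float:
    """Term-frequency factor as described in the message: sqrt(n)."""
    return math.sqrt(freq)


def capped_tf(freq: int, cap: float = 10.0) -> float:
    """Proposed variant: the same factor, but never larger than the cap."""
    return min(cap, math.sqrt(freq))


if __name__ == "__main__":
    # Print: raw frequency, uncapped factor, capped factor.
    for n in (1, 4, 100, 10000):
        print(n, default_tf(n), capped_tf(n))
```

With a cap of 10, the two formulas agree up to 100 occurrences of a term; beyond that, further repetitions no longer raise the factor.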

Disable merge

2014-03-25 Thread vineeth mohan
Hello, I am doing lots of bulk inserts, hundreds of millions of feeds. But index merging gets in the way at times. Is there any way to disable merging until I complete the bulk loading process? Thanks Vineeth

Getting connections idle , closing log on running facet-script

2014-03-25 Thread vineeth mohan
Hi, While running a script using - https://github.com/imotov/elasticsearch-facet-script I am seeing the following error in the debug log. Are there any settings I can tune to increase this timeout, or some other workaround to deal with it? [2014-03-25

Re: Date calculation does not work in DELETE

2014-03-25 Thread Boaz Leskes
Hi Norbert, This looks like a bug. Will dig deeper and get back to you. Cheers, Boaz On Sunday, March 23, 2014 7:16:24 PM UTC+1, Norbert Hartl wrote: btw. I forgot to add I’m using elasticsearch 1.0.1 Norbert On 22.03.2014 at 17:28, Norbert Hartl wrote

Re: Getting connections idle , closing log on running facet-script

2014-03-25 Thread Igor Motov
Vineeth, The script plugin is not using Apache http client. Are you using it in your code? Igor On Tuesday, March 25, 2014 11:02:48 AM UTC-4, vineeth mohan wrote: Hi , While running a script using - https://github.com/imotov/elasticsearch-facet-script I am seeing following error in

Embedded elasticsearch on Websphere Z/OS?

2014-03-25 Thread binoyaf
hi, I am trying to get elasticsearch to work in an embedded mode inside a websphere application server (V8.5.0.1) on Z/OS. After startup, it keeps logging the below error trace and the CPU spikes to 100%. - [ZOSB] version[1.0.0], pid[50397324], build[a46900e/2014-02-12T16:18:34Z] - [ZOSB]

jetty support in Elastic Search NEST API

2014-03-25 Thread dinuswt22
I have authenticated a machine which hosts Elastic Search, using Jetty plugin. Everything works fine with respect to security. But my problem is I need to add documents / update documents in the same index which is secured using Jetty. In NEST I tried to find anything related to a method

Re: Kibana Histogram unexpected line curve with cumulative value

2014-03-25 Thread Xwilly Azel
On Thursday, March 20, 2014 2:30:55 PM UTC+1, Xwilly Azel wrote: I have the same issue with your version and I don't see where in Kibana I can disable the zero-fill checkbox. On Thursday, March 20, 2014 2:29:24 PM UTC+1, Xwilly Azel wrote: On Thursday, March 20, 2014 1:15:44 PM UTC+1,

Re: word hit maximum time by users

2014-03-25 Thread Otis Gospodnetic
Hi Robin, Are you looking for something like http://www.sematext.com/search-analytics/ ? Check out the Top Queries report there. You can see it in this live demo, actually: https://apps.sematext.com/demo#sa Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr

Re: Disable merge

2014-03-25 Thread Otis Gospodnetic
Hey Vineeth, Don't think so. But you can play with flush and merge throttle params. Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr Elasticsearch Support * http://sematext.com/ On Tuesday, March 25, 2014 11:04:05 AM UTC-4, vineeth mohan wrote: Hello , I am doing

Re: Disable merge

2014-03-25 Thread Ivan Brusic
Especially lowering the max_merge_at_once value or the number of segments. Querying will be affected if you change the settings. -- Ivan On Tue, Mar 25, 2014 at 8:42 AM, Otis Gospodnetic otis.gospodne...@gmail.com wrote: Hey Vineeth, Don't think so. But you can play with flush and merge

Re: how to modify term frequency formula?

2014-03-25 Thread Ivan Brusic
Did you see Britta's slides? She has a slide called Cosine similarity as script which mimics the Lucene scoring as a script. You can replace the call to _index[field][word].tf() with your own implementation. You can deploy the script as a native Java script (note: not Javascript) for performance.

Re: Elasticsearch not matching a 1-token analysed string value

2014-03-25 Thread Emanuil Tolev
Thanks David, that worked beautifully. TermQuery does say your term won't be analysed! Thanks, Emanuil On Tuesday, March 25, 2014 2:17:18 PM UTC, David Pilato wrote: Because it has been analyzed and converted to lowercase. Try with MatchQuery. -- David ;-) Twitter : @dadoonet /
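The difference between the two query types can be shown with a toy inverted index. This is not how Elasticsearch is implemented; the analyzer (lowercase and split on whitespace), the class and the sample value are assumptions made for the sketch.

```python
def analyze(text: str) -> list:
    """Toy analyzer: lowercase the text and split it on whitespace."""
    return text.lower().split()


class ToyIndex:
    """Minimal inverted index mapping analyzed tokens to document ids."""

    def __init__(self):
        self.postings = {}

    def index(self, doc_id: int, text: str) -> None:
        for token in analyze(text):
            self.postings.setdefault(token, set()).add(doc_id)

    def term_query(self, term: str) -> set:
        # The term is looked up exactly as given, with no analysis.
        return set(self.postings.get(term, ()))

    def match_query(self, text: str) -> set:
        # The query text goes through the same analyzer as the documents.
        hits = set()
        for token in analyze(text):
            hits |= self.postings.get(token, set())
        return hits


if __name__ == "__main__":
    idx = ToyIndex()
    idx.index(1, "Nature")  # hypothetical one-token field value
    print(idx.term_query("Nature"), idx.match_query("Nature"))
```

Here the document is stored under the token "nature", so a term lookup for "Nature" finds nothing, while the match lookup lowercases the query first and finds document 1.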

Real Time Python Update on a River

2014-03-25 Thread Thibaut Lapierre
Hi, I use the Twitter river, which uses bulk indexing. I have a Python script that analyses tweets and returns some data. So I want to analyse each tweet and add two fields to the river with the returned data. Maybe I can build a second schema with id and processing status in order to run the script

ElasticSearch Init Script w/ Java Change ...

2014-03-25 Thread Matt Wise
Last night we ran into an interesting issue. We pushed out a change to our hosts via Puppet that installed Oracle's Java 7 as the default JRE/JDK on all of our hosts -- previously it had been the default only on a small subset of our systems. When this happened, our ElasticSearch hosts broke in a

Re: Are certain fields excluded from being part of _all grouping ?

2014-03-25 Thread pulkitsinghal
After reading up on '_all' a bit more, I now realize that it's not implemented to collect the resulting tokens from fields but their _source values instead! So of course it won't work .. boo hoo :P On Tuesday, March 25, 2014 10:53:10 AM UTC-5, pulkitsinghal wrote: I ran the following two

Bulk indexing with EC2 cluster?

2014-03-25 Thread IronMan2014
I am having some issues and I would like some feedback: #1 - I run a test with 250 MB worth of documents against my local machine which is an i7, it takes total of 130 secs to index. I run it against a cluster of 2 i2x4 large EC2 instances, much more powerful than my local machine, yet it

type filtering

2014-03-25 Thread Björn Ebers
Hi, is there a query performance difference between type filter and filtering with type-url-segment if an index contains a huge amount of documents? POST /index/type/_search { query: {match_all: {}} } vs. POST /index/_search { query: { filtered: { query:

[ANN] Elasticsearch Mapper Attachment plugin 2.0.0 released

2014-03-25 Thread Elasticsearch Team
Heya, We are pleased to announce the release of the Elasticsearch Mapper Attachment plugin, version 2.0.0. The mapper attachments plugin adds the attachment type to Elasticsearch using Apache Tika. https://github.com/elasticsearch/elasticsearch-mapper-attachments/ Release Notes -

Re: [ANN] Elasticsearch Mapper Attachment plugin 2.0.0 released

2014-03-25 Thread IronMan2014
Great, but, I thought I downloaded this 2 days ago, it might have been 2.0.0 RC1 though? On Tuesday, March 25, 2014 2:06:59 PM UTC-4, David Pilato wrote: Heya, We are pleased to announce the release of the *Elasticsearch Mapper Attachment plugin*, *version 2.0.0* The mapper

Re: [ANN] Elasticsearch Mapper Attachment plugin 2.0.0 released

2014-03-25 Thread David Pilato
It was 2.0.0.RC1 -- David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr On 25 March 2014 at 19:24:54, IronMan2014 (sabdall...@hotmail.com) wrote: Great, but, I thought I downloaded this 2 days ago, it might have been 2.0.0 RC1 though? On Tuesday, March 25,

Re: [ANN] Elasticsearch Mapper Attachment plugin 2.0.0 released

2014-03-25 Thread IronMan2014
Ok. Do you know if there is any update on the AWS plugin too? On Tuesday, March 25, 2014 2:25:25 PM UTC-4, David Pilato wrote: It was 2.0.0.RC1 -- *David Pilato* | *Technical Advocate* | *Elasticsearch.com* @dadoonet https://twitter.com/dadoonet |

Re: [ANN] Elasticsearch Mapper Attachment plugin 2.0.0 released

2014-03-25 Thread David Pilato
2.0.0.RC1 should work fine although we will release a 2.0.0 version very soon I hope. -- David Pilato | Technical Advocate | Elasticsearch.com @dadoonet | @elasticsearchfr On 25 March 2014 at 19:28:49, IronMan2014 (sabdall...@hotmail.com) wrote: Ok. Do you know if there is any update on the

Re: create panel to display latency time using two datetime fields

2014-03-25 Thread computer engineer
Thanks for your response but I am completely confused here: I am actually stuck on what the script would be. Technically I could put in a table format as well but the actual query is what I am unsure about; the script? On Tuesday, March 18, 2014 7:40:17 PM UTC-4, Binh Ly wrote: If you are

Re: Are certain fields excluded from being part of _all grouping ?

2014-03-25 Thread InquiringMind
What I did with the high-performance query engine I built in 2001-2010 was to OR the queries for individual fields, creating a query-time version of the _all field. It was blindingly fast. What I now do with Elasticsearch is to disable the _all field because of the issues you've found (and
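The idea of OR-ing per-field queries into a query-time stand-in for _all could be sketched as below. The poster's engine is not shown in the thread, so this is only one possible way to express it in the Elasticsearch query DSL, using a `bool` query with `should` clauses; the field names are made up.

```python
import json


def any_field_query(fields: list, text: str) -> dict:
    """Build a bool query that matches if the text matches in any listed field."""
    if not fields:
        raise ValueError("at least one field is required")
    return {
        "query": {
            "bool": {
                # One match clause per field; "should" clauses are OR-ed.
                "should": [{"match": {field: text}} for field in fields],
                "minimum_should_match": 1,
            }
        }
    }


if __name__ == "__main__":
    # Hypothetical field names, for illustration only.
    print(json.dumps(any_field_query(["title", "body"], "merge policy"), indent=2))
```

Because each field is queried through its own mapping, each clause uses that field's analyzer, which is the behaviour the _all field was not providing in the thread above.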

Re: how to modify term frequency formula?

2014-03-25 Thread geantbrun
Yes I saw Britta's slides but I find it difficult to implement my own scoring for complex queries (ex: with AND and OR). Do you have a concrete example or a link to share to explain with more details the override alternative? Thanks again Ivan, Patrick On Tuesday, March 25, 2014 12:04:26 PM UTC-4, Ivan

ES query returning unusual results

2014-03-25 Thread al.smith951
I have an ES cluster running ES 1.0.1 on 10 nodes, where an index has 5 shards each with 1 replica. Doing the following query kept alternately returning two distinct result counts, say 103799 and 104430. I've since carefully removed 5 nodes from my cluster and now for the same query I'm

Re: ElasticSearch Init Script w/ Java Change ...

2014-03-25 Thread Mark Walkom
I'd say it was the java swap that caused it, as ES will not start another process if it can see one running; markw@es00-fv:~$ ps -ef|grep java 106 20801 1 5 Feb25 ?1-14:27:46 /usr/bin/java -Xms4g -Xmx4g -Xss256k -Djava.awt.headless=true -XX:+UseParNewGC

Re: Tutorial, how to, guide, standard procedure to upgrade ES from 0.9 to 1.0 ?

2014-03-25 Thread Mark Walkom
It'll depend a lot on the version you are running in 0.90.N, due to underlying Lucene changes. We rebuilt our cluster and reindexed, as we took the chance to make a few changes to data and cluster structures and we had the capacity. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor

JVM uptime less than JAVA process uptime?

2014-03-25 Thread Nelson Jeppesen
I'll keep this short and I hope it's just a misunderstanding of Java on my part. All my Elasticsearch nodes are reporting a JVM uptime between 1 and 25 hours. The strange part is that all the Java processes have a start date of 9+ days ago. How can the JVM uptime be less than the Java process time? I see

Re: JVM uptime less than JAVA process uptime?

2014-03-25 Thread Mark Walkom
At a guess, it might mean the actual time it spent processing, as opposed to service runtime. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 26 March 2014 12:03, Nelson Jeppesen nelson.jeppe...@gmail.com wrote:

How to apply score/boost factor in ElasticSearch1.0.0 on filters

2014-03-25 Thread Kina Shah
Hi, I am new to ElasticSearch. I want to apply some boost factor on the filter that I have. Is it possible to apply a score/boost factor on filters? If yes, can someone show me how to do that in Java? Thanks!

Re: JVM uptime less than JAVA process uptime?

2014-03-25 Thread Nelson Jeppesen
I was thinking the same thing, except every ~25 hours the jvm uptime resets back to zero, but the java pid and start time remain unchanged. On Tuesday, March 25, 2014 6:03:15 PM UTC-7, Nelson Jeppesen wrote: I'll keep this short and I hope its just a misunderstanding of java on my part.