How to Upgrade from 1.1.1 to 1.2.2 in a windows enviroment (as windows service)

2014-07-17 Thread Wesley Creteur
Hi, I'm a recent user of elasticsearch and was wondering what steps i should take on upgrade to a newer version of elasticsearch on my windows server 2012? I've installed elasticsearch as running service on windows with the following commands: (3 nodes)

[ERROR][bootstrap] {1.2.2}: Initialization Failed ... - NullPointerException[null]

2014-07-17 Thread vjbangis
Does anyone experience below elasticsearch]$ tail -f /var/log/elasticsearch/lastikman.log [2014-07-17 05:37:01,470][INFO ][node ] [Karma] version[ 1.2.2], pid[12325], build[9902f08/2014-07-09T12:02:32Z] [2014-07-17 05:37:01,471][INFO ][node ] [Karma]

Re: How to Upgrade from 1.1.1 to 1.2.2 in a windows enviroment (as windows service)

2014-07-17 Thread Costin Leau
Hi, Remove the old service (service remove) then install it again using the new path. Going forward you might want to look into using file-system links (which Windows Vista + supports) so that you can make an alias to the folder, install the service for it and reuse that across installs. That

Re: No efect refresh_interval

2014-07-17 Thread Marek Dabrowski
Hello Mike My ES version is 1.2.1 I checked utilization nodes my cluster. Common valus ofr all nodes are: java proces cpu utilization: 6% os load: 1 io stat: 15kB/s write I checked indexing process 2 methods: a) indexing by native json data (13GB splited to 100MB chunks) time for i in

Re: [ERROR][bootstrap] {1.2.2}: Initialization Failed ... - NullPointerException[null]

2014-07-17 Thread vineeth mohan
Hi, Can you enabled the debug mode in the lo config and paste the debug log here. Thanks Vineeth On Thu, Jul 17, 2014 at 1:15 PM, vjbangis jessviray0...@gmail.com wrote: Does anyone experience below elasticsearch]$ tail -f /var/log/elasticsearch/lastikman.log [2014-07-17

Elasticsearch query SQL Server LAG function analog

2014-07-17 Thread Dmitriy Shilonosov
Hi, I am looking for a SQL Server LAG http://msdn.microsoft.com/en-us/library/hh231256(v=sql.110).aspx/LEAD http://msdn.microsoft.com/en-us/library/hh213125(v=sql.110).aspxfunctions analog in Elasticsearch. Assume I have a list of documents in result set found by particular criteria. The

Re: [Hadoop] : Parsing error in MR integration

2014-07-17 Thread Costin Leau
Hi, Looked again at your code sample and your configuration is incorrect. For some reason you are using FileInput/OuputFormat to set the input and output; since you are using es-hadoop you need to specify only the input and not the output. Moreover in your case, you are not using the input so

Re: [ERROR][bootstrap] {1.2.2}: Initialization Failed ... - NullPointerException[null]

2014-07-17 Thread joergpra...@gmail.com
Do you try to execute Elasticsearch on a non-executable file system? Jörg On Thu, Jul 17, 2014 at 9:45 AM, vjbangis jessviray0...@gmail.com wrote: Does anyone experience below elasticsearch]$ tail -f /var/log/elasticsearch/lastikman.log [2014-07-17 05:37:01,470][INFO ][node

Re: Unique/Distinct values from elasticsearch query

2014-07-17 Thread jigish thakar
Hey Soumya, I needed exactly same in my implementation. and hardly 2 days old with ElasticSearch. Can you please post code snippet you used in mapping and then to fetch unique values? Thanks in advance. On Thursday, February 20, 2014 12:39:08 PM UTC+5:30, soumya sengupta wrote: Thanks,

Re: How many tcp connections should ES/logstash generate ?

2014-07-17 Thread Bastien Chong
My issue is fixed by creating and dropping daily index. The resouce temporarily unavailable was due to the 1024 maximum process for elasticsearch user. By not deleting per range, it decreased by 10x the number of process, and I also increase the ulimit for nproc. Thanks all for your help. On

Re: Cross Fields w/ Fuzziness

2014-07-17 Thread Elliott Bradshaw
I realize that this post is getting a little old, but does the community have any feedback on the feasibility of this? On Friday, May 16, 2014 10:21:53 AM UTC-4, Tom wrote: +1 fuzziness would be great when using cross_fields Am Mittwoch, 7. Mai 2014 22:00:25 UTC+2 schrieb Ryan Tanner: Any

Re: No efect refresh_interval

2014-07-17 Thread Marek Dabrowski
Hello I found reason my problems. Refresh index during usage perl depend on parameters max_count and max_size for $e-bulk_helper Values for this parameters determine when refresh will be done on index. Tnx for help. Regards W dniu czwartek, 17 lipca 2014 09:59:55 UTC+2 użytkownik Marek

Re: Does insert order matter for date range queries

2014-07-17 Thread John Smith
Thanks On Tuesday, 15 July 2014 11:49:56 UTC-4, Nikolas Everett wrote: I don't believe it matters, no. On Tue, Jul 15, 2014 at 11:47 AM, John Smith java.d...@gmail.com javascript: wrote: Say I insert a few documents that have my own date field (NOT the ES insert stamp) but not inserted

IP geolocation without Logstash

2014-07-17 Thread Justin Koehler
I'm working on a system to record usage data for an application that submits its data to an ES cluster. I would like to record the location of each data point based on IP geolocation. I found the Logstash plugin that uses the GeoIP databases, but I was unable to find any solutions built for

Bloom filter codec?

2014-07-17 Thread Nikolas Everett
Has anyone had success adding a bloom filter to the codec for any of their fields? http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/index-modules-codec.html#bloom-postings I imagine it'd help reduce IO from (non multi-term) queries that frequently don't match. Like if you

Re: High memory usage on dedicated master nodes

2014-07-17 Thread David K Smith
They are dedicated masters and no queries are going through them. smonasco, that's it I believe. It's ParNew for young gen. I made a mistake in our puppet configs and gave the same amount of memory to both data nodes and master nodes for young generation (Xmn) even though master nodes only

Re: Random sort with a seed changes after updating a doc

2014-07-17 Thread Arny
I'm facing the same issue. Is there no way to choose what the seed actually should pick for the random score calculation? Or just let it pick the uid which never changes. On Tuesday, February 11, 2014 1:00:57 AM UTC+1, Brandon Williams wrote: I'm using random_score to perform a search with

How can I break down and diagnose this query error resulting in a NumberFormatException?

2014-07-17 Thread Ryan V
I just converted our ES cluster from 0.90.12 to 1.1.1 and our app from NEST 0.12 to 1.0.0-rc1 and have had a really difficult time fixing all the breaking changes. I'm stuck on the following error. It occurs when I execute what is a rather complex search query: { [my_index][0]:

counting unique root objects on nested aggregations

2014-07-17 Thread Kallin Nagelberg
I'm trying to build a query to aggregate on some fields in a nested document, but instead of returning the count of the nested documents for each aggregation, I'd like to know the number of root objects. IE., I have a mapping like (from the docs): product : { properties : {

Trading index performance for search performance

2014-07-17 Thread jnortey
At the moment, we're able to bulk index data at a rate faster than we actually need. Indexing is not as important to use as being able to quickly search for data. Once we start reaching ~30 million documents indexed, we start to see performance decreasing in ours search queries. What are the

Re: Trading index performance for search performance

2014-07-17 Thread Nikolas Everett
It might be useful to fiddle with the merge configuration http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/index-modules-merge.html to try to end up with fewer segments. That'll reduce the IO cost of the underlying Lucene operations that filter your query before the

Re: Trading index performance for search performance

2014-07-17 Thread joergpra...@gmail.com
The 30m docs may have characteristics (volume, term freqs, mappings) so ES limits are reached within your specific configuration. This is hard to guess without knowing more facts. Beside improving merge configuration, you might be able to sacrifice indexing time by assigning limited daily

Need to import lots of lucene libraries to run example code

2014-07-17 Thread Morris Chang
Hi, I am using java to learn elasticsearch API. I tried to start from the index api with the example code on website Resource page: Node node = nodeBuilder().node(); Client client = node.client(); IndexResponse response = client.prepareIndex(twitter, tweet, 1 )

Re: Using script fields in Kibana

2014-07-17 Thread Darby Sager
Gal, I too would appreciate seeing your solution. Thank you! On Tuesday, April 1, 2014 9:27:42 AM UTC-7, Gal Zolkover wrote: Ok thank you , I'm up for the chalange  -- You received this message because you are subscribed to the Google Groups elasticsearch group. To unsubscribe from this

ElasticSearch Hadoop

2014-07-17 Thread James Cook
I've read through much of the documentation for es-hadoop, but I might be coming away with some misunderstandings. The setup docs for elasticsearch for apache hadoop (es-hadoop) uses the word *interact* which is a bit vague. Elasticsearch for Apache Hadoop is an open-source, stand-alone,

Re: counting unique root objects on nested aggregations

2014-07-17 Thread Kallin Nagelberg
I realized this could be simplified by simply leaving out the 'value_count' aggregation within the reverse_nested, as that information is already provided by the included 'doc_count'. I guess it can't be simplified much beyond this. Would it be worth including this information by default when

Re: Trading index performance for search performance

2014-07-17 Thread jnortey
Thanks to both of you for the advise. Unfortunately setting daily indexing times isn't an option for us, however I think I have a good idea of what we should try next. On Thursday, July 17, 2014 10:56:31 AM UTC-5, jnortey wrote: At the moment, we're able to bulk index data at a rate faster

Re: ElasticSearch Hadoop

2014-07-17 Thread Costin Leau
On 7/17/14 8:38 PM, James Cook wrote: I've read through much of the documentation for es-hadoop, but I might be coming away with some misunderstandings. The setup docs for elasticsearch for apache hadoop (es-hadoop) uses the word /interact/ which is a bit vague. Elasticsearch for Apache

Re: Need to import lots of lucene libraries to run example code

2014-07-17 Thread joergpra...@gmail.com
Use the Maven dependencies of the ES jar in the Maven repo to let the IDE build and run your code. If you want to run your code, you have to include all jars under the lib folder in ES_HOME into your classpath. Maven knows about these dependencies automatically. Jörg On Thu, Jul 17, 2014 at

Kibana with (non-basic) User Authentication

2014-07-17 Thread Stephan Buys
Hi all, I'm exploring options to enable user management and authentication for Kibana. The idea is to have a nice looking (not just basic auth) authentication screen for users of a web-based monitoring solution that we are developing (powered by Elasticsearch). We're not a big group, so I'm

Re: Bloom filter codec?

2014-07-17 Thread Adrien Grand
Hi Nik, The trade-off is not easy indeed. First, the default terms dictionary can already save some disk seeks. By storing the prefixes of the terms that are in the terms dictionary in a FST in memory, it can avoid going to disk when the term that you are looking up cannot match this FST. A bloom

Re: Bloom filter codec?

2014-07-17 Thread Nikolas Everett
Thanks for replying. I've been looking to reduce my IO. Pushing everything into an all field is really going to be the biggest thing, I think, but I was wondering about the bloom filters. It doesn't sound worth it. It feels like everything but the default codec is pretty unlikely to be useful?

Question regarding Shard Distribution while adding a replica to cluster

2014-07-17 Thread Rahul Sharma
Hi, My ES Version -- 0.90.2 1) I used to have a 3 node ES cluster where I had 5 shard indices. I used to have around 1000 shards (all primary as it was running with 0 replica). 2) Then I change the replica to 1 and add 3 additional nodes hoping the shards will get evenly balanced with a replica.

How to figure out field type?

2014-07-17 Thread Adrian
I've added some data to my ES. JSON format: { doc:{ site : marriage.com, name : amount-active-users, daily : { dataX: [1,2,3], dataY: [1388538061, 1388624461, 1388710861], startDate:1388538061, endDate:1388710861 } } } If you look at dataX field, it's an array. ES

Python package index in elasticsearch

2014-07-17 Thread Maciej Dziardziel
Hi Being frustrated with speed and inflexibility of pip search, I played with elasticsearch and set up my own index. Maybe someone will find it useful too. Site:http://pypisearch.linuxcoder.co.uk Code: https://github.com/Fiedzia/pypisearch Full lucene syntax is allowed. Note: indexing

Re: Bloom filter codec?

2014-07-17 Thread Adrien Grand
On Thu, Jul 17, 2014 at 10:37 PM, Nikolas Everett nik9...@gmail.com wrote: Thanks for replying. I've been looking to reduce my IO. Pushing everything into an all field is really going to be the biggest thing, I think, but I was wondering about the bloom filters. It doesn't sound worth it.

Re: Kibana with (non-basic) User Authentication

2014-07-17 Thread Mark Walkom
There are a few such wrappers around that community members have written. Have a search through the archives here and you may get some ideas and even code to leverage. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On

Re: Cluster interface

2014-07-17 Thread Mark Walkom
ES needs direct access to the interface for the instance, so NAT won't work. Regards, Mark Walkom Infrastructure Engineer Campaign Monitor email: ma...@campaignmonitor.com web: www.campaignmonitor.com On 18 July 2014 03:39, avery.ro...@insecure-it.com wrote: I've setup three kvm guests with

Re: Garbage collection pauses causing cluster to get unresponsive

2014-07-17 Thread Srinath C
Hi Michael, Did you get a chance to look at the hot_threads and iostat output? I also tried with EBS Provisioned SSB with 4000 IOPS and with that I was able to ingest only at around 30K per second after which there are EsRejectedExecutionException. There were 4 elasticsearch instances of

too many open files

2014-07-17 Thread Seungjin Lee
hello, I'm using elasticsearch with storm, Java TransportClient. I have total 128 threads across machines which communicate with elasticsearch cluster. From time to time, error below occurs org.elasticsearch.common.netty.channel.ChannelException: Failed to create a selector. at

Re: too many open files

2014-07-17 Thread Andrew Selden
This is a fairly common problem and not necessarily specific to Elasticsearch. It is simple to solve. In Linux you can increase the operating system's max file descriptor limit. Other Unix-like operating systems have the same concept. You can find how to do this for your specific Linux

Re: Python package index in elasticsearch

2014-07-17 Thread Honza Král
Nice! Have you looked at Warehouse (0)? It's a similar effort by the pypa initiative, also using elasticsearch. Honza 0 - https://github.com/pypa/warehouse On Fri, Jul 18, 2014 at 6:58 AM, Maciej Dziardziel fied...@gmail.com wrote: Hi Being frustrated with speed and inflexibility of pip

Treat Dot as a normal character in query_string query

2014-07-17 Thread Curt Hu
How can I treat the Dot '.' as the normal character in the query_string, as I want to search www.google.com as the whole string in the query_string, the current results for me are so strange.. -- View this message in context:

Re: percolator throughput decreases as time passes

2014-07-17 Thread Seungjin Lee
not really, amount of queries were same throughout process lifecycle 2014-07-16 19:04 GMT+09:00 Martijn v Groningen martijn.v.gronin...@gmail.com: Do the amount of registered percolate queries also increase? On 15 July 2014 12:02, Seungjin Lee sweetest0...@gmail.com wrote: ​ hi all,

Re: Python version for curator

2014-07-17 Thread Honza Král
Hi Brian, you seem to have hit an issue we have had with curator, there are some solutions and workarounds on the github issue: https://github.com/elasticsearch/curator/issues/77 hope this helps, Honza On Thu, Jul 17, 2014 at 6:22 AM, Brian brian.from...@gmail.com wrote: No joy: $ pip