Re: Elasticsearch and MongoDB without River

2015-05-24 Thread Michael Sick
I'd use Apache Storm, especially if it was used elsewhere in your organization.
--Mike

On Sat, May 23, 2015 at 4:28 AM, sriharshakiran <kiran.srihar...@imomentous.com> wrote:
> Hi All
>
> Now that rivers are deprecated, I need to index data into ES from MongoDB.
> Can anyone suggest an approach?

Re: Elasticsearch 1.1.0 - Optimize broken?

2014-04-04 Thread Michael Sick
Have you tried max_num_segments=1 on your optimize? On Fri, Apr 4, 2014 at 11:27 AM, Elliott Bradshaw wrote: > Any thoughts on this? I've run optimize several more times, and the > number of segments falls each time, but I'm still over 1000 segments per > shard. Has anyone else run into somethi
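For readers unfamiliar with the suggestion, here is a minimal sketch of the request it refers to. It assumes an Elasticsearch 1.x node, where the endpoint is `_optimize`; the base URL `http://localhost:9200` and the index name `myindex` are hypothetical. The helper only builds the URL, which would then be sent as a POST.

```python
from urllib.parse import urlencode


def optimize_url(base_url, index, max_num_segments=1):
    """Build the ES 1.x optimize URL for one index (to be sent as a POST).

    max_num_segments=1 asks for each shard to be merged down to one segment.
    """
    query = urlencode({"max_num_segments": max_num_segments})
    return "{}/{}/_optimize?{}".format(base_url.rstrip("/"), index, query)


if __name__ == "__main__":
    # Hypothetical host and index name, for illustration only.
    print(optimize_url("http://localhost:9200", "myindex"))
```

A forced merge down to one segment can be expensive on a large shard, so it is probably best tried outside peak indexing hours.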

Re: Advice for implementing a secure graph index with ElasticSearch

2014-03-05 Thread Michael Sick
ex on ElasticSearch if all the data was > open to everyone. I'd be able to consolidate all of a vertex's or edge's > properties in a single document. Unfortunately, that's not the case. The > project I'm working on is at http://lumify.io if that's helpful in

Re: Advice for implementing a secure graph index with ElasticSearch

2014-03-05 Thread Michael Sick
https://github.com/thinkaurelius/titan/wiki "Titan is a distributed graph database optimized for storing and querying graphs represented over a cluster of machines. The cluster can elastically scale to s

Re: Insert later feature

2014-02-23 Thread Michael Sick
Also, if there are no other clients wanting a faster refresh, you can set index.refresh_interval to a higher value than the 1s default either in general for your index or just during the times when you're doing your bulk updates. http://www.elasticsearch.org/guide/en/elasticsearch/reference/current
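As a rough sketch of the second option (raising the interval only around bulk updates), the snippet below builds the two update-settings requests that would bracket a bulk load. The index name `myindex` and the `30s` value are assumptions chosen for illustration; only `index.refresh_interval` and its 1s default come from the message above.

```python
import json


def refresh_interval_settings(interval):
    """JSON body for PUT /<index>/_settings that sets index.refresh_interval."""
    return json.dumps({"index": {"refresh_interval": interval}})


def bulk_load_plan(index, slow="30s", normal="1s"):
    """Return the (method, path, body) requests to send around a bulk load."""
    path = "/{}/_settings".format(index)
    return [
        ("PUT", path, refresh_interval_settings(slow)),    # before the bulk updates
        ("PUT", path, refresh_interval_settings(normal)),  # once they have finished
    ]


if __name__ == "__main__":
    # Hypothetical index name, for illustration only.
    for method, path, body in bulk_load_plan("myindex"):
        print(method, path, body)
```

Restoring the interval in a separate, explicit step keeps other clients from being stuck with slow refreshes if the bulk job is rerun or aborted partway.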

Re: Problem with keeping in sync Elasticsearch across two data centers

2014-02-22 Thread Michael Sick
he primary center fails, > the fail over data center still has most of the data (may be except for the > last few seconds/minutes/hours). > > Overall I am looking for a right way to implement cross data center > deployment of elastic-search! > > -Amit. > > >

Re: Problem with keeping in sync Elasticsearch across two data centers

2014-02-21 Thread Michael Sick
Dario, I believe that you're looking for tribe nodes (http://www.elasticsearch.org/guide/en/elasticsearch/reference/master/modules-tribe.html). ES is not built to cluster consistently across DCs or over links with larger network lag. On Fri, Feb 21, 2014 at 11:24 AM, Dario Rossi wrote: > Hi, > I've the following

Re: performing range summary on the result of a metric within a bucket

2014-01-31 Thread Michael Sick
Forgot to post the mapping (here: https://gist.github.com/mikeasick/8738689) and to mention that it's using 1.0.0.Beta2 and Java 1.7.0_25 on Windows 7. Thanks! On Fri, Jan 31, 2014 at 10:42 AM, Michael Sick < michael.s..

performing range summary on the result of a metric within a bucket

2014-01-31 Thread Michael Sick
I have what will be a long-running time series table that I would like to aggregate by time buckets and analyze:
1. Query & filter (in this case by dates & a term query on user_guid) (working)
2. Bucket by time for analysis (working using the aggs "date_aggregate" and "heartRate_stats"
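The two steps listed so far might look roughly like the request body sketched below, written in the 1.x query DSL. Only `user_guid`, `date_aggregate` and `heartRate_stats` come from the message; the field names `timestamp` and `heartRate`, the `hour` interval, and the choice of a `date_histogram` with a nested `stats` aggregation are assumptions, since the actual mapping and query are not shown here.

```python
import json


def build_search_body(user_guid, start, end, interval="hour"):
    """Build a search body: filter by user and date range, then bucket by time.

    Field names "timestamp" and "heartRate" are hypothetical placeholders.
    """
    return {
        "size": 0,  # only the aggregation results are of interest
        "query": {
            "filtered": {
                "query": {"term": {"user_guid": user_guid}},
                "filter": {"range": {"timestamp": {"gte": start, "lt": end}}},
            }
        },
        "aggs": {
            "date_aggregate": {
                "date_histogram": {"field": "timestamp", "interval": interval},
                # One stats summary (min, max, avg, sum, count) per time bucket.
                "aggs": {"heartRate_stats": {"stats": {"field": "heartRate"}}},
            }
        },
    }


if __name__ == "__main__":
    body = build_search_body("some-user-guid", "2014-01-01", "2014-02-01")
    print(json.dumps(body, indent=2))
```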