Re: Comparing arrays

2014-09-24 Thread Tim Uckun
On Tuesday, September 23, 2014 2:18:35 PM UTC+12, Tim Uckun wrote: > > I have documents with an array field. The array contains unique elements > only. In this case the data is strings and but it could be numbers. > > I want to search for documents using this array field.

Comparing arrays

2014-09-22 Thread Tim Uckun
I have documents with an array field. The array contains unique elements only. In this case the data is strings and but it could be numbers. I want to search for documents using this array field. Ideally I would like to pass in an array and find the "nearest match" meaning documents which contain

Re: Cassandra + Elasticsearch or Just Elasticsearch for Primary data store.

2014-07-14 Thread Tim Uckun
> > > I'm just confused if Cassandra can really make a difference here, since > looks to me ES can suffice here. > > > If you are not going to be using Cassandra for indexing then there is no reason to have it. If you want durability in case something goes wrong with ES you can just store your

Reverse River?

2014-05-22 Thread Tim Uckun
I would like to have a river in reverse. Every time a document is inserted or modified I would like to push that into another destination like a database. Ideally this would be async or maybe even in batches. Has anybody done anything like this before? -- You received this message because you

Re: Capacity Planning with ElasticSearch

2014-04-23 Thread Tim Uckun
To follow up on this... As a general rule is it better to have one horse size index or a hundred duck sized indices. I am thinking about those types of searches where you might frequently search a subset of the data. For example keeping a separate index for every customer because normally the a

cardinality not giving expected results.

2014-04-14 Thread Tim Uckun
I have roughly 50 million records in ES. The data was generated artificially and is mostly duplicates. I am executing the following query { size: 0, aggregations: { by_month: { date_histogram: { field:"time_stamp", interva

Re: sbuqueries and distinct counts.

2014-04-10 Thread Tim Uckun
I hate to bump myself but does anybody have any input on this at all? On Thursday, April 10, 2014 11:59:43 AM UTC+12, Tim Uckun wrote: > > I want to do something like this. > > select date_trunc('month', time_stamp), sum(distinct_count) from ( > select date_tru

sbuqueries and distinct counts.

2014-04-09 Thread Tim Uckun
I want to do something like this. select date_trunc('month', time_stamp), sum(distinct_count) from ( select date_trunc('week', time_stamp) as time_stamp, count(distinct field_name) as distinct_count from blah group by date_trun('week', time_stamp) ) group by date_trunc('mon

Re: Min, max dates not formatting?

2014-04-09 Thread Tim Uckun
On Wednesday, April 9, 2014 7:49:13 PM UTC+12, Alexander Reelsen wrote: > > Hey, > > can you try to use date formats instead of the named identificators like > date_time and see if it works? Also, can you check the exception of the > elasticsearch logs? > > I tried the "-MM-dd .." it gives

Min, max dates not formatting?

2014-04-08 Thread Tim Uckun
I have a search like this { size: 0, query:{ match_all: {} }, aggregations: { min_date: {min: {field: 'time_stamp', format: 'date_time'}}, max_date: {max: {field: 'time_stamp', format: "-MM-dd HH:mm"}} } } This fails with a parse e

Re: java 8, elasticsearch, and MVEL

2014-04-07 Thread Tim Uckun
You should take a look at mirah. On Monday, April 7, 2014 11:12:39 AM UTC+12, kimchy wrote: > > We are planning to address this on Elasticsearch itself. The tricky bit is > the fact that we want to have a highly optimized concurrent scripting > engine. You can install the Rhino one which shou

ES instead of Cassandra.

2014-03-23 Thread Tim Uckun
Is ES a suitable replacement for Cassandra for an analytics platform? I need high speed data ingestion, time series analysis, rollups and aggregations etc. Cassandra is used for this kind of task often but it seems to me ES might be a suitable if not better replacement. Cheers. -- You receiv