Undo delete feature

2014-07-19 Thread Pulkit Agrawal
Hi All,

I like to if ElasticSearch has any undo delete operation.

So any document deleted using delete api can be recovered until it is
deleted from disk using optimize api flag only_expunge_deletes.

If it is not there I like to know what are the challenges to develop that
as we already have document in index and just marked it as delete.

Thanks,
Pulkit Agrawal
Product Analyst
MckinseyCompany

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAMVa6up86Rphmh7C7fNVBANgXMvi8ruV7GD78iVHhzw1tuWQCw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: Heap / GC Issues

2014-07-19 Thread joergpra...@gmail.com
You are on the right track and you found already the answer to your
question, examine your queries. They seem to be cached and eat your heap.

http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/filter-caching.html

Jörg


On Fri, Jul 18, 2014 at 5:09 PM, Ned Campion nedcamp...@gmail.com wrote:



 We are doing a lot of bool conditions in our queries, so that may be a
 factor in the hefty filter cache.



-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoE06%2BdSouhX1-3xnmrenTDJbCzDOqD4Ma_NxCkFh5QQJw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: n:m lookup filter

2014-07-19 Thread joergpra...@gmail.com
Yes, I think this is somehow related to Matt's Join Filter

https://github.com/elasticsearch/elasticsearch/pull/3278

Jörg


On Sat, Jul 19, 2014 at 4:24 AM, Don Clore cloredo...@gmail.com wrote:

 I am pretty sure this is not supported, but it'd be great to explicit
 confirmation/denial.

 Sodocument types A and B, where there's an N:M relationship between A
 and B, and document type B has a list of the document A instances that
 relate to it.

 More concretely  A == a sports Player data type, and B is a set of new
 stories.   The Story type has a list of the ids of Players that the story
 is about/related to.

 SoI know the terms lookup filter allows one to use a single document
 as the source of the terms for the lookup.   What we'd like to be able to
 do is expose a faceted/aggregations-based UI to the user that allows her to
 perform a variety of filtering operations on Players over a fairly
 extensive set of criteria, and then have the resulting set of Player
 document ids serve as the lookup into the Story stories, i.e., get all the
 stories that relate to the Player result set.

 Obviously, we'd ideally like to do this in a single query, or failing
 that, have some reasonably efficient way to issue the two query/filters
 (passing a large result set of ids over the wire seems like a bad idea; I'm
 new to ES, but...this kind of thing was never great with Solr).

 One idea I had (perhaps half-baked) was to create a PlayerResultSet type,
 with an id deterministically fashioned from the query/filter predicates
 such that the same user filtering action would result in the same
 PlayerResultSet id each time; we'd issue a terms lookup filter request
 using the PlayerResultSet id, if it fails because the PlayerResultSet
 document doesn't exist, then we'd have to issue the filter for the Players,
 construct a PlayerResultSet doc and index it, and query for the Stories
 that have those Player Ids; not sure if it would be worse to issue all the
 ids in a query, or index the PlayerResultSet doc with Refresh==true (or
 issue the query and queue up the PlayerResultSet doc for later indexing, or
 whatever).

 The Player data should be fairly static; we could delete the documents and
 recreate them each time we refresh Player data.

 Ok, that sounds pretty awful, I'm hoping someone has a less Rube-Goldberg
 approach; obviously, I'm sort of building in my filter query caching
 mechanism, hopefully something like this can be more easily achieved with
 the built-in filter caching.

 thanks for any insights,
 Don

 --
 You received this message because you are subscribed to the Google Groups
 elasticsearch group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to elasticsearch+unsubscr...@googlegroups.com.
 To view this discussion on the web visit
 https://groups.google.com/d/msgid/elasticsearch/91919a48-0892-4878-890b-e14c67fd40b5%40googlegroups.com
 https://groups.google.com/d/msgid/elasticsearch/91919a48-0892-4878-890b-e14c67fd40b5%40googlegroups.com?utm_medium=emailutm_source=footer
 .
 For more options, visit https://groups.google.com/d/optout.


-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoEMzKNuuBvuTt5XTLN6gMuePrVDP-%3DyjyQ0pWnPJ5NK9w%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: Certified platforms for ElasticSearch

2014-07-19 Thread joergpra...@gmail.com
Not sure how you could find that Linux and Windows are certified or if
there are certifications at all.

The platform on which Elasticsearch runs is server-side Java so this is
probably your question.

I for myself run Elasticsearch on Java 7 and Java 8, on Red Hat Linux
Enterprise 6, Mac OS X 10.9, Solaris 10.

I know that people were successful in running Elasticsearch on Windows 7
and FreeBSD, but I do not use these systems.

AIX is more painful, but this is not necessarily related to Elasticsearch
https://github.com/elasticsearch/elasticsearch/issues/1805

Flawless operations are most predictable with the Oracle distribution of
JDK 7 and 8. Use always the most recent releases, in older versions the
probability of bugs is higher, especially related to Lucene. Lucene and
OpenJDK teams are working together to find bugs before releases.

OpenJDK 7, the reference implementation, which is included in recent Linux
distributions, should work equivalently well. If you run Elasticsearch 
1.2.0, OpenJDK 6 should be avoided, some versions have annoying bugs.

It is also possible to use Server JRE for Elasticsearch. This is a
tailored Java distribution with tools for JVM monitoring, but without
browser plugin integration or graphics display.

For Java 8, you need Elasticsearch = 1.0.4, 1.1.2, 1.2.0 if MVEL scripting
is used https://github.com/elasticsearch/elasticsearch/issues/5877

Jörg



On Wed, Jul 16, 2014 at 8:27 AM, bsa shashi.b.an...@gmail.com wrote:

 Hello,

 Can somebody provide the list of platforms on which ElasticSearch is
 certified, besides Linux and Windows?

 -bsa

 --
 You received this message because you are subscribed to the Google Groups
 elasticsearch group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to elasticsearch+unsubscr...@googlegroups.com.
 To view this discussion on the web visit
 https://groups.google.com/d/msgid/elasticsearch/98b9d852-a7ef-48c6-bb01-bd5bfdd5a7fb%40googlegroups.com
 https://groups.google.com/d/msgid/elasticsearch/98b9d852-a7ef-48c6-bb01-bd5bfdd5a7fb%40googlegroups.com?utm_medium=emailutm_source=footer
 .
 For more options, visit https://groups.google.com/d/optout.


-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoFzHJ_%3DUPCbybiZSd4jjTDCkK1rLzNTGE-Uoa65nAapdA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: Undo delete feature

2014-07-19 Thread vineeth mohan
Hello Pulkit ,

The best option i can suggest would be to take snapshot of the index before
the delete opertaion.
And later retrieve the document that you need.

Else you might need to look at lucene level.
I believe lucene has features to tag a version and restoring to it later.

Thanks
   Vineeth


On Sat, Jul 19, 2014 at 11:51 AM, Pulkit Agrawal pulkitdot...@gmail.com
wrote:

 Hi All,

 I like to if ElasticSearch has any undo delete operation.

 So any document deleted using delete api can be recovered until it is
 deleted from disk using optimize api flag only_expunge_deletes.

 If it is not there I like to know what are the challenges to develop that
 as we already have document in index and just marked it as delete.

 Thanks,
 Pulkit Agrawal
 Product Analyst
 MckinseyCompany

 --
 You received this message because you are subscribed to the Google Groups
 elasticsearch group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to elasticsearch+unsubscr...@googlegroups.com.
 To view this discussion on the web visit
 https://groups.google.com/d/msgid/elasticsearch/CAMVa6up86Rphmh7C7fNVBANgXMvi8ruV7GD78iVHhzw1tuWQCw%40mail.gmail.com
 https://groups.google.com/d/msgid/elasticsearch/CAMVa6up86Rphmh7C7fNVBANgXMvi8ruV7GD78iVHhzw1tuWQCw%40mail.gmail.com?utm_medium=emailutm_source=footer
 .
 For more options, visit https://groups.google.com/d/optout.


-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAGdPd5nPyRYrPOZjO4kvWT1AAT%2BOvjDs%2BuPoeZhSh8C%3D9LV9pw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Possibly to use current date as synonim?

2014-07-19 Thread Bdfy

http://stackoverflow.com/questions/24841835/possibly-to-use-current-date-as-synonim#
 
  
Possibly to use current date as synonim ? For example for query 
latest/breaking news I want to get last news in search. How to do it ?

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/5da5e274-c203-4542-8b10-1a06973764bc%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [ANN] Log4j2 Elasticsearch appender

2014-07-19 Thread Alfredo Serafini
I'll try it as soon as I can! 
thanks, 
Alfredo
:-)

Il giorno venerdì 18 luglio 2014 10:08:14 UTC+2, Jörg Prante ha scritto:

 Hi,

 I released a Log4j2 Elasticsearch appender

 https://github.com/jprante/log4j2-elasticsearch

 in the hope it is useful.

 Best,

 Jörg


-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/dce481d9-ac3e-4fd0-aaba-3a4c69d07d34%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: Heap / GC Issues

2014-07-19 Thread Clinton Gormley
Your filter cache is only taking up 3GB of the heap, which fits with the
default limit of 10% of heap space. So the filter cache is not at fault
here.

I would look at the two usual suspects:

* field data - how much space is this consuming? Try:
curl 'localhost:9200/_nodes/stats/indices/fielddata?fields=*pretty'
* swap - is it completely disabled?

The swap is a common gotcha.  If you have any data in swap it will slow GCs
down terribly, and give you the scenario that you are seeing.  See the docs
for how to disable swap:
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/setup-configuration.html#setup-configuration-memory

(i prefer disabling swap completely, rather than relying on mlockall or
swappiness)




On 19 July 2014 09:06, joergpra...@gmail.com joergpra...@gmail.com wrote:

 You are on the right track and you found already the answer to your
 question, examine your queries. They seem to be cached and eat your heap.


 http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/filter-caching.html

 Jörg


 On Fri, Jul 18, 2014 at 5:09 PM, Ned Campion nedcamp...@gmail.com wrote:



 We are doing a lot of bool conditions in our queries, so that may be a
 factor in the hefty filter cache.

  --
 You received this message because you are subscribed to the Google Groups
 elasticsearch group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to elasticsearch+unsubscr...@googlegroups.com.
 To view this discussion on the web visit
 https://groups.google.com/d/msgid/elasticsearch/CAKdsXoE06%2BdSouhX1-3xnmrenTDJbCzDOqD4Ma_NxCkFh5QQJw%40mail.gmail.com
 https://groups.google.com/d/msgid/elasticsearch/CAKdsXoE06%2BdSouhX1-3xnmrenTDJbCzDOqD4Ma_NxCkFh5QQJw%40mail.gmail.com?utm_medium=emailutm_source=footer
 .

 For more options, visit https://groups.google.com/d/optout.


-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAPt3XKRnLmMMVxFppy%3D3BG%3DD1afTfXpeeOni%3DRkJNunKkTfh0A%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


[ANN] JDBC river/feeder plugin 1.2.2.0

2014-07-19 Thread joergpra...@gmail.com
Hi,

here is a new release of JDBC river/feeder plugin for Elasticsearch

https://github.com/jprante/elasticsearch-river-jdbc/releases/tag/1.2.2.0

Highlights:

- update to Elasticsearch 1.2.2
- more reliable bulk indexing
- properly handle SQL insert/update/select statement types
- dropping empty maps after ignoring null values
- overlap time for column strategy
- option for creating time-window based index names

See also README at

https://github.com/jprante/elasticsearch-river-jdbc

As always, feedback/contributions are most welcome!

Best,

Jörg

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoF_8xRHYGGqoggDkWx34pzatjM8xRsAL2pPg9u2J221nQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Parsing exceptions

2014-07-19 Thread Simon O
Is there any easy way to get just the relevant error messages out of an 
exception?

I know there are multiple shards and each one can potentially return a 
different error message but 99% of the time, the error is because I've made 
a mistake.

This means I get the same message 50+ times in a complete soup of brackets 
and braces.

The best solution I have at the moment is to scroll to the end of the error 
and read backwards.

Is there any way to parse the error messages into something useful? As you 
can see, it's not valid json...

SearchPhaseExecutionException[
   Failedtoexecutephase[
 query
   ],
   allshardsfailed;shardFailures{
 [
   kieUEJ4uS_eoaBV3cBz3Rw
 ][
   elmail
 ][
   7
 ]: SearchParseException[
   [
 elmail
   ][
 7
   ]: from[
 -1
   ],
   size[
 0
   ]: ParseFailure[
 Failedtoparsesource[
   {
 size: 0,
 aggs: {
   senders: {
 terms: {
   field: from.emailAddress,
   size: 50
 },
 aggs: {
   senderDate: {
 date_histogram: {
   field: date,
   interval: quarter
 }
   }
 }
   }
 },
 filter: {
   range: {
 date: {
   gte: -3year
 }
   }
 }
   }
 ]
   ]
 ];nested: ElasticsearchParseException[
   failedtoparsedatefield[
 -3year

 

...Snip... 

 

  }{
 [
   kieUEJ4uS_eoaBV3cBz3Rw
 ][
   elmail
 ][
   0
 ]: SearchParseException[
   [
 elmail
   ][
 0
   ]: from[
 -1
   ],
   size[
 0
   ]: ParseFailure[
 Failedtoparsesource[
   {
 size: 0,
 aggs: {
   senders: {
 terms: {
   field: from.emailAddress,
   size: 50
 },
 aggs: {
   senderDate: {
 date_histogram: {
   field: date,
   interval: quarter
 }
   }
 }
   }
 },
 filter: {
   range: {
 date: {
   gte: -3year
 }
   }
 }
   }
 ]
   ]
 ];nested: ElasticsearchParseException[
   failedtoparsedatefield[
 -3year
   ],
   triedbothdateformat[
 dateOptionalTime
   ],
   andtimestampnumber
 ];nested: IllegalArgumentException[
   Invalidformat: -3yearismalformedatyear
 ];
   }
 ]

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/669d35b5-c489-4099-b21f-e96a60ab4beb%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.