Updating single field in large documents

2014-10-17 Thread Dragan Bošnjak
What strategy could one do when you need to frequently update single field in large document? What can you do to improve update performance in case like that? Thanks -- You received this message because you are subscribed to the Google Groups elasticsearch group. To unsubscribe from this

update mapping

2014-10-17 Thread eunever32
Is the way to update mapping of large index as follows Create empty index with new mapping Copy old data into new index Alias new index to previous If so, what are recommended tools? Ideally there would be a user interface for IT people to use? Thanks -- You received this message because

Re: Cluster discovery on Amazon EC2 problem - need urgent help

2014-10-17 Thread Norberto Meijome
I am pretty sure you can open the ports for the sec group the elb belongs to , regardless of the az. (Az, not region). Unless you r using network acls. Anyway, not really ES... pm me if u want to continue the AWS discussion :-) On 16/10/2014 3:37 pm, Zoran Jeremic zoran.jere...@gmail.com wrote:

Re: ElasticSearch- IndexReaders cannot exceed 2147483647

2014-10-17 Thread Prasanth R
Dear All, Thanks for your replies. Conclusion is, we can not store more than 2147483647 records per shard as of now. The only option is we need to increase the shard count. Thanks Prasath Rajan On Tuesday, October 14, 2014 9:34:33 PM UTC+5:30, Jörg Prante wrote: You can not store more than

Re: What MongoDB can do and ES cannot?

2014-10-17 Thread samant
hi Clinton Considering the enormous amount of value addition in ES since this original question was posted . Wondering, if the answer has tilted in favor of ElasticSearch ? Can we safely say - ElasticSearch can be considered as a primary data store ? -- View this message in context:

Re: Using a nested object property within custom_filters_score script

2014-10-17 Thread meganeinu7
Hi Veda, I run into a similar issue like yours. Have you found a solution to your problem? Thanks, Vincent -- View this message in context: http://elasticsearch-users.115913.n3.nabble.com/Using-a-nested-object-property-within-custom-filters-score-script-tp4046901p4064981.html Sent from the

Re: Many indices.fielddata.breaker errors in logs and cluster slow...

2014-10-17 Thread Kimbro Staken
This is caused by elasticsearch trying to load fielddata. Fielddata is used for sorting and faceting/aggregations. When a query has a sort parameter the node will try to load the fielddata for that field for all documents in the shard, not just those included in the query result. The breaker is

Re: Announcing elasticsearch plugin for Liferay - elasticray

2014-10-17 Thread Alexandre Rafalovitch
Elastic (without Search) should be ok, I believe. At least according to the official source: http://www.elasticsearch.org/trademarks/ Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers

River MongoDB-Elasticsearch (parent/child)

2014-10-17 Thread Ludovic MEYER
Hello, I'm looking for a solution to creat parent/child relation with the script of the river mongodb-ES plugine. I don't know if the relation parent/child must be present already in MongoDB to do that. For now, I just have the field parent_id in the all document with an ID which is the same

Understanding HEAP usage

2014-10-17 Thread karthik jayanthi
Hi, We are using Elasticsearch for one of our applications. As a part of which we indexed about 3M documents and have built two indices around them. We have used a cluster of 2 Nodes each with 7.5 GB RAM and have dedicated 4 GM to the ES. What we are seeing is that on one of the nodes, the

Re: Filter by specific value without mapping

2014-10-17 Thread Vladimir Krylov
Tried to remove papping and make not_analizable curl -XPUT http://$HOST:9200/reports; -d' { mappings: { _default_: { dynamic_templates: [ { store_generic : { match : *, match_mapping_type : string,

Set _score field value in Elasticsearch

2014-10-17 Thread Kruti Shukla
Hello All, I'm trying achieve one functionality in Elasticsearch but I'm not able to do it. In SQL we can do it like -- select SET score_1 = _score from sometable I trying to assign value of score in one field. That means Elastic search will return 2 columns having same values _score and

APT repository sync

2014-10-17 Thread Yapeng Wu
Hi, Can someone point me in the right direction for running a local mirror of the elasticsearch APT repositories? Specifically, is there an rsync connection available? Thanks! Yapeng -- You received this message because you are subscribed to the Google Groups elasticsearch group. To

Re: Understanding HEAP usage

2014-10-17 Thread Nikolas Everett
Measuring heap usage in Java applications is very different than measuring memory usage for other stuff. 1. Usually java allocates all the heap its going to need up front at startup. At least, we do that in server applications. 2. Java's garbage collection is very lazy so heap usage will go up

Re: copy index

2014-10-17 Thread joergpra...@gmail.com
You can use the knapsack plugin for export/import data and change mappings (and much more!) For a 1:1 online copy, just one curl command is necessary, yes. https://github.com/jprante/elasticsearch-knapsack Jörg On Thu, Oct 16, 2014 at 7:55 PM, euneve...@gmail.com wrote: Hi I can see there

Re: ElasticSearch spark esRDD not returing the aggregate values in aggregated query

2014-10-17 Thread Jeff Steinmetz
Siva, Try the latest build of elasticsearch-hadoop, ver 2.1.0 Beta 2 http://www.elasticsearch.org/overview/hadoop/download/ The esRDD has been changed to sparks PairRDD https://spark.apache.org/docs/1.1.0/api/scala/index.html#org.apache.spark.rdd.PairRDDFunctions The RDD will now be key/value

Re: Sorting by nested fields

2014-10-17 Thread Elke
Has somebody another idea? Or it is not possible at all? -- You received this message because you are subscribed to the Google Groups elasticsearch group. To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscr...@googlegroups.com. To view this

Minimum double score in a native script

2014-10-17 Thread Alejandro Sierra
Hi!, I am writing a Java plugin with a customized score script (native) returning a double. Basically I wrote a class extending AbstractDoubleSearchScript. For some documents which don't pass a specific test, the score should be the lowest possible, meaning they should be at the bottom of the

How to count tuples of 3 variables, sorted

2014-10-17 Thread Artur Martins
Greetings community, I'm new to elasticsearch, so first of all sorry for my questions being so basic. I developed a flow collector which dumps flows to my elasticsearch server. Right now i use Kibana to perform the Top 10 destination and Top 10 source IPs filters, and such. But the query I'm

Kibana group by terms

2014-10-17 Thread Michael Irwin
I'm using Kibana w/ logstash to view web server logs. I'd like to add a graph that displays uniques of the *entire* User-Agent string. I've tried adding a terms graph, but that breaks the UA string into separate words, which is less than desirable in this situation. Is there a way to do this?

managing snapshots

2014-10-17 Thread Matthias Johnson
I'm investigating snapshots and came across some things that aren't clear in the docs. My understanding is that the snapshots are incremental and only transfer things that were changed since the last snapshot. (Is that shards, lucene stuff, something else ???) One thing that isn't clear is

Re: Get only ids with no source Java API

2014-10-17 Thread Ivan Brusic
Have you tried setting no fields to be returned or the explicit setNoFields() method? http://jenkins.elasticsearch.org/job/Elasticsearch%20Master%20Branch%20Javadoc/Elasticsearch_API_Documentation/org/elasticsearch/action/search/SearchRequestBuilder.html#setNoFields() -- Ivan On Thu, Oct 16,

Re: Scaling strategies without shard splitting

2014-10-17 Thread Ian Rose
Hey Nik - Thanks for the response. - Ian On Mon, Oct 13, 2014 at 4:28 PM, Nikolas Everett nik9...@gmail.com wrote: On Mon, Oct 13, 2014 at 11:12 AM, Ian Rose ianr...@fullstory.com wrote: Hi - My team has used Solr in it's single-node configuration (without SolrCloud) for a few years

Re: Filters: odd behavior

2014-10-17 Thread Ivan Brusic
They are indeed executed in the defined order. Filters that are more specific should be placed early on and those that cannot be cached (geo/timebased) should be placed last. Cheers, Ivan On Thu, Oct 16, 2014 at 5:16 AM, @mromagnoli marce.romagn...@gmail.com wrote: Hi everyone, I have a

elasticsearch fields and elasticsearch-hadoop

2014-10-17 Thread Akil Harris
Is there an easy way to rename the fields on an index? I have a field named searchTerm that I use for some event tracking. But the elasticsearch-hadoop library assumes all elasticsearch fields are lowercase and is converting all field names to lower case. When hadoop tries to retrieve the data

Re: Filters: odd behavior

2014-10-17 Thread Alexandre Rafalovitch
And there is post-filter as well: http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-request-post-filter.html Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr

Future of cardinality aggregation feature.

2014-10-17 Thread Govind M
Guys, I see that the cardinality aggregation feature is marked as experimental feature. We are using this feature and feel it is very useful. But would like to how if this feature will be supported going forward or any chance of getting removed? Thanks in advance. Regards, -G -- You

Re: Scaling strategies without shard splitting

2014-10-17 Thread joergpra...@gmail.com
In my use case I have indexed a union catalog for some hundred libraries, where each library can have a search service, plus adding their own catalog data they do not want to share. Elasticsearch offers far more flexibility and performance than Solr with the ability of automatic extending the

Reduce Disk Space Requirements

2014-10-17 Thread PARTH GANDHI
Details: Elastic Search version used: 1.3.4 Docs to index: ~ 2.2 Million Growth in docs: few 100 docs every week. Number of fields per doc: ~10-15 tokenizers used: ngram (min:2, max:15), path_hierarchy filters used: word_delimiter, pattern_capture, lowercase, unique Size on disk: ~ 150 GB (No

Re: Reduce Disk Space Requirements

2014-10-17 Thread joergpra...@gmail.com
ngram min=2 kills your index space. Use min=3 or higher. Also maybe edge ngram tokenizer might be an alternative. Jörg On Sat, Oct 18, 2014 at 12:06 AM, PARTH GANDHI parth.gandh...@gmail.com wrote: Details: Elastic Search version used: 1.3.4 Docs to index: ~ 2.2 Million Growth in docs: few

Re: Java 8 recommended version?

2014-10-17 Thread Adrien Grand
Hi Jilles, 1.7u55 has indeed be the recommended version for a long time, but JDK 8u25 is fine too. The page that you linked is from elasticsearch-hadoop and might be a bit outdated, we are trying to keep up to date information about recommended JVMs at the following URL:

Re: Multi Field Aggregation

2014-10-17 Thread Artur Martins
Hello, I'm having the exact same problem. Have you managed to find a solution? My thread is here: LINK https://groups.google.com/forum/?fromgroups#!topic/elasticsearch/Oum03VSBzHQ Thanks On Thursday, October 16, 2014 1:57:35 PM UTC+1, Alastair James wrote: Hi there. I am trying to create

Re: best practice for thread pool queue size

2014-10-17 Thread Zaki Agha
Yes the particular error is from July. How can I determine the optimal setting for queue size? On Monday, October 13, 2014 3:21:32 PM UTC-7, Mark Walkom wrote: Increasing queues isn't going to help if there are underlying problems stopping the processing. Based on those errors it looks like

Re: [ANN] Elasticsearch CSV plugin for formatting search responses as CSV

2014-10-17 Thread Artur Martins
This is priceless. Thank you. On Wednesday, July 16, 2014 12:23:11 AM UTC+1, Jörg Prante wrote: Hi, I wrote a little plugin for formatting search responses as CSV (comma separated values) This format is useful for extracting some (or all) fields from ES JSON and wrap it into a tabular

word delimiter

2014-10-17 Thread Nick Tackes
Hello, I am experimenting with word_delimiter and have an example with a special character that is indexed. The character is in the type table for the word delimiter. analysis of the tokenization looks good, but when i attempt to do a match query it doesnt seem to respect tokenization as