What strategy can you use when you need to frequently update a single field
in a large document?
What can you do to improve update performance in a case like that?
Thanks
--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this
Is the way to update the mapping of a large index as follows?
Create an empty index with the new mapping
Copy the old data into the new index
Alias the new index to the previous one
If so, what are the recommended tools?
Ideally, there would be a user interface for IT people to use.
Thanks
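The last of the three steps above can be sketched as a single request body for the `_aliases` endpoint. This is only a sketch under assumptions: the names `reports`, `reports_v1` and `reports_v2` are hypothetical, and it assumes clients already query an alias rather than the old index name (an alias cannot share its name with an existing index).

```python
import json


def alias_swap_actions(alias, old_index, new_index):
    """Build the body for POST /_aliases that moves `alias` from
    `old_index` to `new_index` in one request."""
    return {
        "actions": [
            {"remove": {"index": old_index, "alias": alias}},
            {"add": {"index": new_index, "alias": alias}},
        ]
    }


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    body = alias_swap_actions("reports", "reports_v1", "reports_v2")
    print(json.dumps(body, indent=2))
```

Both actions go in one request so that the alias never points at no index or at both indices in between.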
I am pretty sure you can open the ports for the security group the ELB
belongs to, regardless of the AZ (AZ, not region), unless you are using
network ACLs.
Anyway, this is not really about ES... PM me if you want to continue the AWS
discussion :-)
On 16/10/2014 3:37 pm, Zoran Jeremic zoran.jere...@gmail.com wrote:
Dear All,
Thanks for your replies.
The conclusion is that we cannot store more than 2147483647 records per shard
as of now. The only option is to increase the shard count.
Thanks
Prasath Rajan
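As a rough illustration of that conclusion, the limit quoted above equals 2^31 - 1, so a lower bound on the number of primary shards follows from simple division. The function name and the example document count below are hypothetical, and the sketch assumes documents are spread evenly across shards, which real routing does not guarantee.

```python
MAX_DOCS_PER_SHARD = 2**31 - 1  # 2147483647, the per-shard limit quoted above


def min_primary_shards(total_docs):
    """Lower bound on primary shards needed to hold `total_docs`,
    assuming a perfectly even spread of documents (an assumption)."""
    if total_docs <= 0:
        return 1
    # Ceiling division with integers only, to avoid float rounding.
    return -(-total_docs // MAX_DOCS_PER_SHARD)


if __name__ == "__main__":
    print(min_primary_shards(10_000_000_000))
```

In practice you would leave headroom above this bound, since the split across shards is never perfectly even.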
On Tuesday, October 14, 2014 9:34:33 PM UTC+5:30, Jörg Prante wrote:
You can not store more than
Hi Clinton,
Considering the enormous amount of value added to ES since this original
question was posted, I wonder whether the answer has tilted in favor of
Elasticsearch.
Can we safely say that Elasticsearch can be considered as a primary data
store?
--
View this message in context:
Hi Veda,
I ran into an issue similar to yours.
Have you found a solution to your problem?
Thanks,
Vincent
--
View this message in context:
http://elasticsearch-users.115913.n3.nabble.com/Using-a-nested-object-property-within-custom-filters-score-script-tp4046901p4064981.html
Sent from the
This is caused by elasticsearch trying to load fielddata. Fielddata is used
for sorting and faceting/aggregations. When a query has a sort parameter
the node will try to load the fielddata for that field for all documents in
the shard, not just those included in the query result. The breaker is
Elastic (without Search) should be ok, I believe. At least according
to the official source: http://www.elasticsearch.org/trademarks/
Regards,
Alex.
Personal: http://www.outerthoughts.com/ and @arafalov
Solr resources and newsletter: http://www.solr-start.com/ and @solrstart
Solr popularizers
Hello,
I'm looking for a way to create a parent/child relation with the script
of the MongoDB river plugin for ES.
I don't know if the parent/child relation must already be present in
MongoDB to do that. For now, I just have the field parent_id in all the
documents, with an ID which is the same
Hi,
We are using Elasticsearch for one of our applications. As part of that,
we indexed about 3M documents and built two indices around them. We
use a cluster of 2 nodes, each with 7.5 GB RAM, and have dedicated 4 GB
to ES.
What we are seeing is that on one of the nodes, the
I tried to remove the mapping and make the fields not_analyzed:
curl -XPUT "http://$HOST:9200/reports" -d '
{
  "mappings": {
    "_default_": {
      "dynamic_templates": [
        {
          "store_generic": {
            "match": "*",
            "match_mapping_type": "string",
Hello All,
I'm trying to achieve one piece of functionality in Elasticsearch, but I'm
not able to do it.
In SQL we can do it like -- select SET score_1 = _score from sometable
I am trying to assign the value of the score to a field. That means
Elasticsearch will return 2 columns having the same values, _score and
Hi,
Can someone point me in the right direction for running a local mirror of
the elasticsearch APT repositories? Specifically, is there an rsync
connection available?
Thanks!
Yapeng
Measuring heap usage in Java applications is very different from measuring
memory usage for other stuff.
1. Usually Java allocates all the heap it's going to need up front at
startup. At least, we do that in server applications.
2. Java's garbage collection is very lazy, so heap usage will go up
You can use the knapsack plugin to export/import data and change mappings
(and much more!)
For a 1:1 online copy, just one curl command is necessary, yes.
https://github.com/jprante/elasticsearch-knapsack
Jörg
On Thu, Oct 16, 2014 at 7:55 PM, euneve...@gmail.com wrote:
Hi
I can see there
Siva,
Try the latest build of elasticsearch-hadoop, version 2.1.0 Beta 2
http://www.elasticsearch.org/overview/hadoop/download/
The esRDD has been changed to Spark's PairRDD
https://spark.apache.org/docs/1.1.0/api/scala/index.html#org.apache.spark.rdd.PairRDDFunctions
The RDD will now be key/value
Does anybody have another idea? Or is it not possible at all?
Hi,
I am writing a Java plugin with a customized score script (native)
returning a double. Basically I wrote a class extending
AbstractDoubleSearchScript.
For some documents which don't pass a specific test, the score should be
the lowest possible, meaning they should be at the bottom of the
Greetings community,
I'm new to Elasticsearch, so first of all, sorry for my questions being so
basic.
I developed a flow collector which dumps flows to my Elasticsearch server.
Right now I use Kibana to perform the Top 10 destination and Top 10 source
IP filters, and such.
But the query I'm
I'm using Kibana w/ logstash to view web server logs. I'd like to add a
graph that displays uniques of the *entire* User-Agent string. I've tried
adding a terms graph, but that breaks the UA string into separate words,
which is less than desirable in this situation. Is there a way to do this?
I'm investigating snapshots and came across some things that aren't clear
in the docs.
My understanding is that the snapshots are incremental and only transfer
things that were changed since the last snapshot. (Is that shards, Lucene
stuff, something else?)
One thing that isn't clear is
Have you tried setting no fields to be returned or the explicit
setNoFields() method?
http://jenkins.elasticsearch.org/job/Elasticsearch%20Master%20Branch%20Javadoc/Elasticsearch_API_Documentation/org/elasticsearch/action/search/SearchRequestBuilder.html#setNoFields()
--
Ivan
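For readers using the REST API rather than the Java client, the first option mentioned above (returning no fields) can be sketched as a search body with an empty `fields` array, which in Elasticsearch 1.x returns hits without their stored fields or `_source`. The helper name and the `match_all` query are assumptions for illustration only.

```python
import json


def search_body_without_fields(query):
    """Search request body asking for hits without any fields.
    An empty "fields" array is the REST-side counterpart of requesting
    no fields (assumes the Elasticsearch 1.x search API)."""
    return {"query": query, "fields": []}


if __name__ == "__main__":
    # match_all is only a placeholder query.
    print(json.dumps(search_body_without_fields({"match_all": {}})))
```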
On Thu, Oct 16,
Hey Nik -
Thanks for the response.
- Ian
On Mon, Oct 13, 2014 at 4:28 PM, Nikolas Everett nik9...@gmail.com wrote:
On Mon, Oct 13, 2014 at 11:12 AM, Ian Rose ianr...@fullstory.com wrote:
Hi -
My team has used Solr in its single-node configuration (without
SolrCloud) for a few years
They are indeed executed in the defined order. Filters that are more
specific should be placed early on, and those that cannot be cached
(geo/time-based) should be placed last.
Cheers,
Ivan
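The ordering advice above can be sketched as a request body that places a selective, cacheable filter first and an uncacheable geo filter last. This is only an illustration under assumptions: it uses the Elasticsearch 1.x `filtered` query with an `and` filter, and the field names `status` and `location` and their values are hypothetical.

```python
import json


def ordered_and_filter(cacheable_filters, uncacheable_filters):
    """Combine filters so the cacheable, more specific ones run first
    and the uncacheable (geo/time-based) ones run last."""
    return {"and": list(cacheable_filters) + list(uncacheable_filters)}


def filtered_query(filter_clause):
    """Wrap a filter in a 1.x-style filtered query over all documents."""
    return {"query": {"filtered": {"query": {"match_all": {}},
                                   "filter": filter_clause}}}


if __name__ == "__main__":
    # Hypothetical fields and values.
    specific = [{"term": {"status": "published"}}]
    geo = [{"geo_distance": {"distance": "10km",
                             "location": {"lat": 40.7, "lon": -74.0}}}]
    print(json.dumps(filtered_query(ordered_and_filter(specific, geo)), indent=2))
```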
On Thu, Oct 16, 2014 at 5:16 AM, @mromagnoli marce.romagn...@gmail.com
wrote:
Hi everyone,
I have a
Is there an easy way to rename the fields on an index?
I have a field named searchTerm that I use for some event tracking. But
the elasticsearch-hadoop library assumes all elasticsearch fields are
lowercase and is converting all field names to lower case. When hadoop
tries to retrieve the data
And there is post-filter as well:
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-request-post-filter.html
Regards,
Alex.
Personal: http://www.outerthoughts.com/ and @arafalov
Solr resources and newsletter: http://www.solr-start.com/ and @solrstart
Solr
Guys,
I see that the cardinality aggregation is marked as an experimental
feature. We are using this feature and find it very useful.
But we would like to know whether this feature will be supported going
forward, or whether there is any chance of it being removed.
Thanks in advance.
Regards,
-G
In my use case I have indexed a union catalog for some hundred libraries,
where each library can have a search service, plus adding their own catalog
data they do not want to share.
Elasticsearch offers far more flexibility and performance than Solr with
the ability of automatic extending the
Details:
Elastic Search version used: 1.3.4
Docs to index: ~ 2.2 Million
Growth in docs: few 100 docs every week.
Number of fields per doc: ~10-15
tokenizers used: ngram (min:2, max:15), path_hierarchy
filters used: word_delimiter, pattern_capture, lowercase, unique
Size on disk: ~ 150 GB (No
ngram min=2 kills your index space. Use min=3 or higher. The edge ngram
tokenizer might also be an alternative.
Jörg
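To get a feel for the numbers behind this advice, the sketch below counts the tokens a plain ngram and an edge ngram scheme would emit for a single term. It is a simplified model written for illustration, not the Lucene tokenizers themselves, and actual index size also depends on how many distinct terms and postings result.

```python
def ngram_tokens(text, min_gram, max_gram):
    """All substrings of `text` with length between min_gram and max_gram."""
    return [text[i:i + n]
            for n in range(min_gram, max_gram + 1)
            for i in range(len(text) - n + 1)]


def edge_ngram_tokens(text, min_gram, max_gram):
    """Only the prefixes of `text` with length between min_gram and max_gram."""
    return [text[:n] for n in range(min_gram, min(max_gram, len(text)) + 1)]


if __name__ == "__main__":
    term = "elasticsearch"  # example term, 13 characters
    print(len(ngram_tokens(term, 2, 15)))       # ngram, min=2
    print(len(ngram_tokens(term, 3, 15)))       # ngram, min=3
    print(len(edge_ngram_tokens(term, 2, 15)))  # edge ngram, min=2
```

Short grams also match far more documents than long ones, so in this model the 2-grams are the tokens most likely to produce long postings lists.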
On Sat, Oct 18, 2014 at 12:06 AM, PARTH GANDHI parth.gandh...@gmail.com
wrote:
Details:
Elastic Search version used: 1.3.4
Docs to index: ~ 2.2 Million
Growth in docs: few
Hi Jilles,
1.7u55 has indeed been the recommended version for a long time, but JDK 8u25
is fine too. The page that you linked is from elasticsearch-hadoop and
might be a bit outdated. We are trying to keep up-to-date information about
recommended JVMs at the following URL:
Hello,
I'm having the exact same problem.
Have you managed to find a solution?
My thread is here:
https://groups.google.com/forum/?fromgroups#!topic/elasticsearch/Oum03VSBzHQ
Thanks
On Thursday, October 16, 2014 1:57:35 PM UTC+1, Alastair James wrote:
Hi there.
I am trying to create
Yes, the particular error is from July.
How can I determine the optimal setting for queue size?
On Monday, October 13, 2014 3:21:32 PM UTC-7, Mark Walkom wrote:
Increasing queues isn't going to help if there are underlying problems
stopping the processing.
Based on those errors it looks like
This is priceless. Thank you.
On Wednesday, July 16, 2014 12:23:11 AM UTC+1, Jörg Prante wrote:
Hi,
I wrote a little plugin for formatting search responses as CSV (comma
separated values).
This format is useful for extracting some (or all) fields from ES JSON and
wrapping them into a tabular
Hello, I am experimenting with word_delimiter and have an example with a
special character that is indexed. The character is in the type table for
the word delimiter. Analysis of the tokenization looks good, but when I
attempt to do a match query it doesn't seem to respect tokenization as