Adrien,
Just an FYI, after resetting the cluster, things seem to have improved.
Optimize calls now lead to CPU/IO activity over their duration.
Max_num_segments=1 does not seem to be working for me on any given call, as
each call would only reduce the segment count by about 600-700. I ran
Any thoughts on this?
The search type is really fantastic and we're already using it in
production, but a fuzziness capability would add so much.
On Monday, April 7, 2014 9:59:54 AM UTC-4, Elliott Bradshaw wrote:
Hi Elasticsearch,
I've been playing with the new cross_fields multi match
hi,
I am using the s3 gateway in an application, elasticsearch version 1.x. I
had a strange exception while starting my nodes; please take a look:
Error injecting constructor, java.lang.UnsupportedOperationException
at org.elasticsearch.gateway.s3.S3Gateway.init(Unknown Source)
while
Hi all,
How do I benchmark ElasticSearch? Thanks!
Best,
Jianjun
--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email
to
Hi all,
We are having some issues with differences between source() and doc() in a
search script. We're running 0.90.2 with the following mapping
mappings: {
  features: {
    properties: {
      id: {
        type: integer
      }
      histogram: {
Hi all
I have two sums after aggregating, for example `click total num` and
`search total num`. Is there an operator that can give me a number which is
`click total num`/`search total num`?
Thanks
Zhao Chun
Hi Jianjun,
On Thu, Apr 10, 2014 at 3:02 PM, Jianjun Hu 4nexu...@gmail.com wrote:
Hi all,
How do I benchmark ElasticSearch? Thanks!
The answer to this question is: it depends. It would be very helpful to
describe a bit more about what you are working on so people can give a
better
As well, the uncommented sections of our elasticsearch.yml:
bootstrap.mlockall: true
gateway.type: local
gateway.recover_after_nodes: 4
gateway.recover_after_time: 5m
gateway.expected_nodes: 4
discovery.zen.minimum_master_nodes: 2
discovery.zen.ping.timeout: 20s
So I'm working on the JSON mapping for several of our documents and one of
the fields happens to be a string array.
I looked at the documentation for how it's supposed to be done and it left
me confused. Could someone clarify it for me?
Here is essentially what I want to happen (part of my
Dear All,
I am new to elasticsearch, but have been asked to update our (Java) code
from ES version 0.90 to version 1. Can anyone tell me how to do the
equivalent of builder.setPercolate(String) in version 1.0. A short example
would also be greatly appreciated. Can this be done without the
Hi Leslie,
Thanks for your reply.
We have some data in MySQL database and some PDF documents. We want to
index them and make them available to our users, so we decided to use ElasticSearch.
Before using ElasticSearch, we want to know how it performs,
so we want to do a benchmark
I've also closed all indices, then started one only and waited... Still
ended up with a red status and some unallocated shards.
Hi Jianjun,
Thank you for the additional details. I think we will need some more
information but I will let someone who is better skilled at benchmarking
Elasticsearch ask you about it.
Welcome to the mailing list! We are happy to have you here and to help.
Cheers,
LH
On Thu, Apr 10, 2014 at
Hello Dan, thanks again for all the help but I can't seem to get anything
to return. Here is my full code, there has to be something I'm missing.
Maybe in the indexing, the mapping?
Mapping:
putMappingResponse = new PutMappingRequestBuilder(newClient
.admin().indices())
.setIndices(INDEX_NAME)
Hello,
I'm trying to create an ElasticSearch mapping in Symfony 2 with the help of
FOSElasticaBundle. It is probably not relevant that I use Symfony 2;
what is relevant is that I'm trying to create a mapping as similar to the
MySQL schema as I can. I stumbled upon a situation where I have foreign
I'd like to give this a try. Can you please post a recreation (maybe gist)
of a bunch of sample parent and child documents and how you want the
results returned exactly? Thanks.
Thanks very much for confirming that..
On Thursday, 10 April 2014 14:37:07 UTC+1, Itamar Syn-Hershko wrote:
It has been deprecated, you cannot percolate while indexing in version 1.
The new percolator API is documented here:
Also, the uncommented portions of our elasticsearch.yml:
bootstrap.mlockall: true
gateway.type: local
gateway.recover_after_nodes: 4
gateway.recover_after_time: 5m
gateway.expected_nodes: 4
indices.recovery.max_size_per_sec: 500mb
discovery.zen.minimum_master_nodes: 2
discovery.zen.ping.timeout:
Here's the stack trace of ES on the busy
server: https://gist.github.com/huyphan/e543119a65861ab89e54
And here's on another low CPU load server:
https://gist.github.com/huyphan/ee0c1a3bcd48f5b8ba28
On Thursday, 10 April 2014 17:37:15 UTC+8, Huy Phan wrote:
Hi guys,
I think I'm having a
The best way to know is to test it yourself.
It's very dependent on your hardware, your settings and the data that you
are indexing.
Regards,
Mark Walkom
Infrastructure Engineer
Campaign Monitor
email: ma...@campaignmonitor.com
web: www.campaignmonitor.com
On 10 April 2014 23:49, Leslie
Which exact versions are you using?
Plugin
ES
Note that gateways will be removed in 1.2 IIRC.
--
David ;-)
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs
On 10 Apr 2014, at 14:41, Ankur Goel ankrug...@gmail.com wrote:
hi,
I am using the s3 gateway in an application, elastic
Am I on the mailing list now?
Could you please give me some more detailed suggestions on how to test the
elasticsearch? thanks.
On Thursday, April 10, 2014 10:13:49 AM UTC-4, Mark Walkom wrote:
The best way to know is to test it yourself.
It's very dependent on your hardware, your settings and the data that you
are
Title: Re: ElasticSearch Benchmark
Hello!
Depends on what you want to test. If you want to test indexing, start from scratch: see how many documents you can index with a clean Elasticsearch cluster and when you start to see slowdowns. Try to find out what the bottleneck is, if any. In general, such
Hi,
I'm currently trying to build an elasticsearch index
and experience some trouble.
Indexing is based on database data and basically has three
steps:
a) index all database content
b) index incremental changes that happened during step a
(until almost all are done)
c) permanently index
Hi there,
is there any way to figure out a maximum theoretical score for a non text
search query - something like:
query: {
query_string: {
query: location:(1)^3 accommodation_comfort:(137 OR
193)^2 accommodation_facilities:(459 OR 403 OR 319),
Update: I've tried to clear the cache, to refresh the index, closing and
reopening the index, no joy. Then everything started to work automagically
after a while (tens of minutes later). The index itself has a total of 620
docs.
On Wednesday, April 9, 2014 1:45:32 PM UTC-5, Jozsef Szalay
Hi,
Currently I have used:
public PercolateRequestBuilder setPercolateQuery(QueryBuilder queryBuilder) {
    sourceBuilder().setQueryBuilder(queryBuilder);
    return this;
}
With a 'match all' query builder as the parameter. Does ES do this by default
Hi guys,
I am using elasticsearch embedded in my web application. I really need the
methods isStarted and isStopped on the Node API. Currently I wait in my
current thread for several seconds for starting/stopping, but I think that
it would bring more code clarity to have these methods in the java
We use Flume 1.4 to pass logs into HDFS as well as ElasticSearch for
storage. The pipeline looks roughly like this:
Client to Server Flow...
(local_app - local_host_flume_agent) AVRO/SSL
(remote_flume_agent)...
Agent Server Flow ...
(inbound avro - FC1 - ElasticSearch)
(inbound avro
One additional thing.. we have two ES sinks actually pointing to the same
cluster. The config looks more like this actually:
(inbound avro - FC1 - ElasticSearch)
(inbound avro - FC2 - S3/HDFS)
(inbound avro_2 - FC3 - ElasticSearch)
(inbound avro_2 - FC4 - S3/HDFS)
On Thursday, April 10, 2014
Hi,
I have a different requirement this time. I am trying to use Kibana to show
certain values rather than their count. Ideally, I am looking for something
like a pie chart. Let's say my elasticsearch data has the following tags
{x:5, y:10, z:15,a:2}, so I want to create a pie chart showing
I discovered the root cause. The current master was on a VM that was in a
bad state. Sadly I could not get on to the host to debug the issue but it
was still listening on 9200 and not accessible via ssh. I forced a master
change by shutting down the node using the cluster admin api. Once the
Matt,
I don't know if this helps, but we are seeing similar issues with Flume
using log4j2 (not log4j v1 as used by ES). For tomcat-hosted servlets,
flume failover works fine. But for non-tomcat applications (such as looping
batch-mode applications and Netty-based servers with static main
Is it right that _all_ my shards are showing up in /_cat/recovery?
It looks like every one of them is marked done. It is cool to be able to
see the last replication, as well as the relatively huge cost of relocation
while writing but it isn't what I expected reading the docs.
Example:
and sent this to the wrong group. Sorry!
On Thursday, April 10, 2014 12:12:07 PM UTC-7, Ryan Bellows wrote:
Hi All,
Since upgrading to 1.4 (and ES 1.1) I've had no luck at all consuming
logs, with events averaging about 10k a second, peak around 15k. What
happens is Logstash processing
Is there a way to search and retrieve only specific fields using simple
query string?
Nik,
Sorry for the confusion. The recovery API is meant to show the most recent
recovery for each shard, as well as active recoveries. I'll see if we can
improve the documentation around this.
To see only active, on-going recoveries, use the active_only flag. For example:
curl -XGET
I've been working on a new highlighter on and off for a few weeks and I'd
love for other folks to try it out:
https://github.com/wikimedia/search-highlighter
You should try it because:
1. It's pretty quick.
2. It supports many of the features of the other highlighters and lets you
combine them
I see now. Cool. Thanks.
Nik
On Thu, Apr 10, 2014 at 3:57 PM, Andrew Selden
andrew.sel...@elasticsearch.com wrote:
Nik,
Sorry for the confusion. The recovery API is meant to show the most recent
recovery for each shard, as well as active recoveries. I'll see if we can
improve the
I'm receiving this error at the top of the page:
Oops!
ClassCastException[org.elasticsearch.index.fielddata.plain.PagedBytesIndexFieldData
cannot be cast to org.elasticsearch.index.fielddata.IndexNumericFieldData]
I have a mapping that I'm applying, it's a catch-all for iis, eventlog, and
On Thu, Apr 10, 2014 at 4:52 PM, Mohit Anchlia mohitanch...@gmail.com wrote:
I am trying to understand the Pros and Cons of using scroll vs from/size
for pagination? Scroll document advises against using it for real time
users but it doesn't say why.
I believe scrolls pin some state in memory
Hi Mohit,
It is possible, as shown here:
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-request-fields.html#search-request-fields
On Thu, Apr 10, 2014 at 4:27 PM, Mohit Anchlia mohitanch...@gmail.com wrote:
Is there a way to search and retrieve only specific
Hey guys. I have two instances of what should be the same index (one of
those datacenter redundancy things). I send the same updates to both DCs.
However the two indices appear to be different (for example the same query
returns a different set of results and has a different number of hits).
I should be more explicit: is it possible when using the q= type of query
string? For example: http://host/_search?q=field:hello
Is it possible to add a query string parameter to achieve that?
On Thu, Apr 10, 2014 at 3:04 PM, Luiz Guilherme Pais dos Santos
luizgpsan...@gmail.com wrote:
Hi Mohit,
It
I hate to bump myself but does anybody have any input on this at all?
On Thursday, April 10, 2014 11:59:43 AM UTC+12, Tim Uckun wrote:
I want to do something like this.
select date_trunc('month', time_stamp), sum(distinct_count) from (
select date_trunc('week', time_stamp) as
Thanks Adrien and Nikolas it's very helpful.
On Thu, Apr 10, 2014 at 3:19 PM, Adrien Grand
adrien.gr...@elasticsearch.com wrote:
On Thu, Apr 10, 2014 at 11:13 PM, Nikolas Everett nik9...@gmail.com wrote:
This one is easy. Elasticsearch/lucene has to keep a min heap of all the
documents you
Elastisch [1] is a small, feature complete Clojure client for ElasticSearch.
Release notes:
http://blog.clojurewerkz.org/blog/2014/04/11/elastisch-2-dot-0-0-beta4-is-released/
1. http://clojureelasticsearch.info
--
MK
http://github.com/michaelklishin
http://twitter.com/michaelklishin
How many nodes do you have? How many replicas?
After a quick glance, it looks like both stack overflow answers have the
correct way to disable allocation.
--
Ivan
On Wed, Apr 9, 2014 at 8:57 AM, Madeleine mxmadal...@gmail.com wrote:
I've also closed all indices, then started one only and
Hi,
Unfortunately, this is not possible. But maybe you could do it on client
side?
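A minimal client-side sketch of that suggestion, assuming the two sums come back as ordinary sum aggregations whose results carry a "value" key. The aggregation names click_total and search_total are hypothetical placeholders, not names from the original question:

```python
def ratio_from_aggs(aggs, numerator="click_total", denominator="search_total"):
    """Divide one sum aggregation by another on the client side.

    `aggs` is the "aggregations" object of a search response.
    The aggregation names are hypothetical placeholders.
    """
    num = aggs[numerator]["value"]
    den = aggs[denominator]["value"]
    if not den:
        # Avoid dividing by zero when there were no searches at all.
        return None
    return num / den


# Example with a hand-written response fragment.
example_aggs = {
    "click_total": {"value": 30.0},
    "search_total": {"value": 120.0},
}
print(ratio_from_aggs(example_aggs))
```

Returning None for an empty denominator is just one possible choice; the caller may prefer 0 or an error.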
On Thu, Apr 10, 2014 at 3:04 PM, Zhao Chun buaa.zh...@gmail.com wrote:
Hi all
I have two sums after aggregating, for example `click total num` and
`search total num`. Is there some operator that can give
Hi,
There is no need for a particular type for arrays[1]. Elasticsearch
automatically detects arrays, so instead of declaring a field as an array,
you need to declare what it contains. For example, assuming that
variablesUsed is an array of strings, you could define it as
variablesUsed : {type :
Thanks for reporting this, the behavior is definitely unexpected. I'll test
_optimize on very large numbers of shards to see if I can reproduce the
issue.
On Thu, Apr 10, 2014 at 2:10 PM, Elliott Bradshaw ebradsh...@gmail.com wrote:
Adrien,
Just an FYI, after resetting the cluster, things
Hi,
You would get better results with the phrase suggester by using an analyzer
that has a shingle filter[1]. This filter will help Elasticsearch compute
frequencies of pairs of terms. Without it, it falls back to single-term
frequencies, so in your case it would only consider the frequencies of
Hi,
The only way to implement it today would be on client side. However, I
think this would be an interesting feature given that while Elasticsearch
is fast at retrieving the top hits, going deeper in the result set can use
lots of resources (both CPU-wise and memory-wise).
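To illustrate why deep pages get expensive, here is a rough sketch in plain Python. It is not how Elasticsearch is implemented; it only assumes that every shard has to hand back its own best from + size candidates before the requested page can be cut out. The function name and the toy scores are made up for the illustration:

```python
import heapq


def top_hits(shard_scores, from_, size):
    """Return the requested page of scores and how many candidates were held.

    Illustration only: each shard contributes its best (from_ + size)
    scores, because any of them could end up on the requested page.
    """
    k = from_ + size
    per_shard = [heapq.nlargest(k, scores) for scores in shard_scores]
    candidates_held = sum(len(p) for p in per_shard)
    # Merge the per-shard candidates and keep the global top k.
    merged = heapq.nlargest(k, (s for p in per_shard for s in p))
    return merged[from_:], candidates_held


shards = [[9, 7, 5, 3, 1], [8, 6, 4, 2, 0]]
print(top_hits(shards, 0, 2))  # first page
print(top_hits(shards, 4, 2))  # a deeper page holds more candidates
```

The number of candidates held grows with from + size on every shard, even though the page returned to the user stays the same size.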
Can you open an issue
Hi,
Although it is generally not advised to return scores as percentages[1], in
this particular case it would work given that you are only querying
structured attributes and don't care about term frequencies and so on.
For this problem, I would recommend using the function_score query[2],
which
Cool. Thanks!
On Thu, Apr 10, 2014 at 7:05 PM, Adrien Grand
adrien.gr...@elasticsearch.com wrote:
Hey Nikolas,
You can run the tests in parallel by specifying -Dtests.jvms=${number of
tests to run concurrently} The default has been 1 for a very long time but
Simon recently changed it so
Hi,
Could anybody let me know the behavior of scroll when I am scanning
through an index and the actions I take during this take more time than the
scheduled scroll time?
The documentation states that the scroll parameter is a time value
parameter (for example: scroll=5m), indicating for how
Thanks for your reply, I will do it on client side.
On April 11, 2014, at 7:29 AM, Adrien Grand adrien.gr...@elasticsearch.com wrote:
Hi,
Unfortunately, this is not possible. But maybe you could do it on client side?
On Thu, Apr 10, 2014 at 3:04 PM, Zhao Chun buaa.zh...@gmail.com wrote:
Hi all
I
Set the min to 2 and max to either 2 or 3 and emit unigrams. That'll get it
working. Also you probably want to set max errors to a number of 1 or more, because .5
means 50%, which can get quite large if someone types 20 terms.
That's been my experience with it.
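As a sketch of the settings described above, the index analysis section could look roughly like this, built here as a Python dict and dumped to JSON. The filter and analyzer names (shingle_filter, shingle_analyzer) and the choice of the standard tokenizer with a lowercase filter are assumptions for the example, not taken from the thread:

```python
import json


def shingle_analysis_settings(min_size=2, max_size=3):
    """Build index settings with a shingle filter that also emits unigrams.

    The filter and analyzer names are hypothetical; only the shingle
    options (min/max size, unigram output) follow the advice above.
    """
    return {
        "settings": {
            "analysis": {
                "filter": {
                    "shingle_filter": {
                        "type": "shingle",
                        "min_shingle_size": min_size,
                        "max_shingle_size": max_size,
                        "output_unigrams": True,
                    }
                },
                "analyzer": {
                    "shingle_analyzer": {
                        "type": "custom",
                        "tokenizer": "standard",
                        "filter": ["lowercase", "shingle_filter"],
                    }
                },
            }
        }
    }


print(json.dumps(shingle_analysis_settings(), indent=2))
```

The resulting JSON would be sent as the body when creating the index, and the field used by the phrase suggester would then be mapped with this analyzer.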
Nik
Sent from my iPhone
On Apr 10,
curl -XGET 'http://localhost:9200/_search?fields=field1'
curl -XGET 'http://localhost:9200/_search?fields=field1,field2'
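For the q= case asked about in this thread, the two parameters can be combined in one URL. A small sketch that builds such a URL, assuming the fields parameter of the 1.x releases discussed here; the helper name and the host are placeholders:

```python
from urllib.parse import urlencode


def uri_search_url(base, query, fields):
    """Build a URI search URL that combines q= with a list of fields.

    `base` is the cluster address; the helper itself is hypothetical.
    """
    params = {"q": query, "fields": ",".join(fields)}
    # urlencode percent-encodes ':' and ',' so the URL is safe to pass around.
    return f"{base}/_search?{urlencode(params)}"


print(uri_search_url("http://localhost:9200", "field:hello", ["field1", "field2"]))
```

The printed URL can be passed to curl -XGET as in the commands above.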
On Thu, Apr 10, 2014 at 7:46 PM, Mohit Anchlia mohitanch...@gmail.com wrote:
I should be more explicit, is it possible when using q= type of query
string? For eg:
I have one more follow up question, how can one know if there are more
documents or not? This is to avoid one extra last call if possible.
On Thu, Apr 10, 2014 at 3:47 PM, Mohit Anchlia mohitanch...@gmail.com wrote:
Thanks Adrien and Nikolas it's very helpful.
On Thu, Apr 10, 2014 at 3:19 PM,
I am curious to hear some thoughts on this question.
On Wednesday, April 9, 2014 4:05:52 PM UTC-7, Srinivasan Ramaswamy wrote:
Hi
I am porting an existing search system over to elastic search. We
have a few custom requirements for search. I played around with ES RESTful
APIs and
Hi,
I would like to know whether there is any way to enable scoring when using a wildcard in a
query_string query. Below is my query:
{
from: 0,
size: 100,
explain: true,
track_scores: true,
query: {
query_string: {
fields: [
DISPLAY_NAME^8,
Hi,
With the new cardinality feature introduced in 1.1.0, how are missing
values treated? For example, in a data set with a field city, are all
rows with no city value counted as 1?
-thanks
Hi,
The following facets query works fine in 0.90.3, but it throws an
exception in 1.0.1.
curl -XGET
'http://elasticsearch-0903.x.net:9200/index123-*/_search?pretty' -d '{
{
facets: {
terms: {
terms: {
field: events.actingUserName,
I did a quick test, this works:
*Create mapping*
curl -XPUT 'http://localhost:9200/example/test/_mapping' -d '{
  "test": {
    "properties": {
      "my_suggest": {
        "type": "completion"
      }
    }
  }
}'
*Index value*
curl -XPUT 'http://localhost:9200/example/test/1/' -d
'{
my_suggest:
You should read the error in detail.
Although you did not paste all the error, I think your concern is not with
facets but with queries.
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/_deprecations.html
Field Query has been removed.
HTH
--
David ;-)
Twitter : @dadoonet
The scoring is computed using the Lucene scoring:
https://lucene.apache.org/core/3_6_2/api/all/org/apache/lucene/search/Similarity.html
the *idf* is the inverse document frequency, which gives a higher score to
the rarer terms in the index. The term '*Hap*penings' appears four times in
your
There should be no difference. Ultimately, both types are part of the
same Lucene index. The cache, posting format and idf are all tied to a
field, so if the different types share the same field, it is the same data
structure underneath.
--
Ivan
On Thu, Apr 10, 2014 at 9:58 AM, Mohit
Hi David,
thanks for replying.
I am using version 1.0.0.
We have the AWS plugin; we have removed the S3 gateway for now and
will switch to local, but we just wanted to understand why we are getting this
error.
It would be really helpful to avoid any surprises in the future.
On Thursday, 10 April
Thanks Dan.
How about queryWeight? It is only computed when using wildcard or prefix.
Also, when you say The term '*Hap*penings' appears four times in your index,
that is regardless of which field, right?
Thanks
On Fri, Apr 11, 2014 at 1:25 PM, Dan Tuffery dan.tuff...@gmail.com wrote:
The scoring is