Hi,
i want write a rest filter to accomplish the following:
I know that the rest request is a search request and i want to parse the
rest requests content and modify the search query and then execute the
modified request.
I tried a lot of things like
SearchSourceBuilder sb =
Hi,
I am using transport client for ES. I have a search request for 1 million
records.
I am getting No node available exception after getting 8k records itself.
What can be done to overcome this issue?
Its very urgent please help.
Thanks.
--
View this message in context:
I forgot to mention that; it's commented out because the service I'm
attempting to access refuses POST requests. I've been informed that this
is a sort of low-hanging fruit security measure to prevent mutating methods.
On Wednesday, January 21, 2015 at 1:17:07 PM UTC-5, Clinton Gormley wrote:
You could everything through the API.
- list all the snapshots in the repository
- parse the JSON returned
- for each snapshot, if it's older than 7 days, delete the snapshot
Le mardi 20 janvier 2015 19:19:38 UTC-2, abutalib zaidi a écrit :
I am fairly new to ElasticSearch and was trying to do
Hi,
I have the following issue when I search on a not_analyzed String.
Mapping:
filename: {
type: string,
index: not_analyzed
}
field: filename
value: Stream_Flash_Video_129518.log
query_string: filename:*lash* -- get results
query_string: filename:*Flash* -- get no results
query_string:
Hi Andrew
The code looks correct. You have send_get_body_as POST commented out - I'm
guessing that is the problem. Probably the service you're using does not
allow GET requests with bodies.
I'd uncomment that and try again.
Ping me on https://github.com/elasticsearch/elasticsearch-perl/issues
Hi,
I have the following issue when I search on a not_analyzed String.
Mapping:
filename: {
type: string,
index: not_analyzed
}
field: filename
value: Stream_Flash_Video_129518.log
query_string: filename:*lash* -- get results
query_string: filename:*Flash* -- get no results
query_string:
I have submitted an issue. Thanks for your attention!
On Wednesday, January 21, 2015 at 1:41:39 PM UTC-5, Andrew Walker wrote:
I forgot to mention that; it's commented out because the service I'm
attempting to access refuses POST requests. I've been informed that this
is a sort of
Hello,
I have been writing a script that for properly restarting all the nodes of
an ES cluster and I have noticed a few particular things in the process.
The process is quite simple, it is the same described in the official ES
documentation
Is there another mailing list I should be using, perhaps? I have still
found no solution to this problem.
On Thursday, January 15, 2015 at 3:44:11 PM UTC-5, Andrew Walker wrote:
I have a remote node that I am attempting to connect to that requires an
api key as a URL parameter in addition to
YouTube videos of
U.S. Congress money laundering hearing
of
Saudi Billionaire Maan Al sanea
with *bank of America*
and The owner of Saad Hospital and Schools
in the Eastern Province in *Saudi Arabia*
and the Chairman of the Board of Directors of Awal Bank in *Bahrain*
With
Hi all,
I'm quite new to ES and wonder how/if jdbc river is ready for indexing
RDBMS in production enviromnent as in GitHub
https://github.com/jprante/elasticsearch-river-jdbc#river-or-feeder it
says about River:
*about to be deprecated by Elasticsearch core team*
So is it a good idea to use
n00b question, apologies:
I need to understand how my colleagues are using ES, which we're using to
provide search services on our intranet (alongside a host of other nobler
purposes). Is there a reasonably idiot-proof means to interrogate the
server logs using ES, to answer the usual usage
Which cluster would give better read performance on two separate servers?
ES cluster Foo
[] serverA - 1 primary shard
[] serverB - 2 replica Shards
ES cluster Bar
[] serverA - 1 primary shard and 1 replica shard
[] serverB - 1 primary shard and 1 replica shard
--
You received this message
Hi all,
I'm trying to figure out how to configure jdbc river in production
environment.
We want to index a DB with appr. 10% write queries and 90% read queries and
need to keep ES index in sync with DB.
So I wonder how two configure river to keep Index in near-realtime Sync
with DB.
I'm
Yes, you can set up the JDBC plugin with a 'schedule' parameter so that it
is started every minute.
See the doc at
https://github.com/jprante/elasticsearch-river-jdbc
Note, if you delete rows in DB, this is not per se detected by JDBC plugin.
There is no synchronization between DB rows and JSON
Hello,
I'm using the html_strip char filter in my mapping, and this gives me a
NoClassDefFoundError when indexing documents. See
https://gist.github.com/stof/fb8c0daa6dd7d6f2fae6 for a reproducing script
(and the output I got)
The same config used to work on an older 1.3 version of
I'm curious what the release cycle for Elasticsearch and the es-cloud-aws
plugin
are. https://github.com/elasticsearch/elasticsearch-cloud-aws/issues/165
lays out why, but I'll include it here for completeness:
Right now, it's very difficult to spin up a cluster in Frankfurt
(eu-central-1)
Just another note. A sequence of GET and POST can cause concurrency issues
if more than one of our application nodes are communicating with the ES
instance, if they try update at the same/similar time, so this is really
not ideal.
--
You received this message because you are subscribed to the
I'm working on a solution that will act as a dictionary validator by
performing the following:
- input: phrase
- processing: shingles phrase match with fuzziness
- output: rewritten phrase
- data: dictionary like, with entries that are short phrases up to 5 words
(e.g know it all, merry go
As suggested here in the documentation
(http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/index-modules-store.html),
we have performed the following command
curl -XPUT hostName:9200/indexName -d '{ settings: { index.store.type:
memory } }';
when creating the index.
This
Oh, I see what you mean...sorry about that. Thanks!
On Wednesday, January 21, 2015 at 10:17:38 AM UTC-6, Darin Hensley wrote:
Which cluster would give better read performance on two separate servers?
ES cluster Foo
[] serverA - 1 primary shard
[] serverB - 2 replica Shards
ES cluster Bar
The current default scripting language, groovy, is sandboxed. If you still
don't want to use it your only option is the get update put sequence.
On Jan 19, 2015 1:29 PM, Jason Lee pump.min...@gmail.com wrote:
I'm trying to add new values to an existing array field in a document.
I've noticed
I think the best thing about open source is, you can clone the github repo
branch es-1.4 of the plugin and build a snapshot version 2.4.2-SNAPSHOT for
yourself.
Last commit on es-1.4 was adding eu-central-1 so it should work then:
On
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/replica-shards.html
in Figure 52. Adjust the number of replicas to balance the load between
nodes they show Node 3 which contains only 2 replica shards. They state in
it The fact that node 3 holds two replicas and no
You won't ever be able to store two of the same replicas shards on the same
server, so your first example won't work.
On 22 January 2015 at 05:17, Darin Hensley darin.hens...@gmail.com wrote:
Which cluster would give better read performance on two separate servers?
ES cluster Foo
[] serverA
You need to test this yourself.
Though 100GB of data should fit in 8GB of heap based on what I have seen.
But again, test this as your dataset and usage may be unique.
On 22 January 2015 at 00:57, Svetlozar Penev svet...@celtis.net wrote:
Hello,
Can you tell what server specification I need
Can you elaborate on how you are storing this index in memory?
On 22 January 2015 at 08:17, bplsi...@chaordicsystems.com wrote:
Hello,
I have been writing a script that for properly restarting all the nodes of
an ES cluster and I have noticed a few particular things in the process.
The
Ah ok, so I don't have to worry about any security risks then if I just use
the defaults?
On Thursday, January 22, 2015 at 10:50:32 AM UTC+13, Nikolas Everett wrote:
The current default scripting language, groovy, is sandboxed. If you still
don't want to use it your only option is the get
I have a situation where, using the Java API, I initiate a bunch of
indexing operations, but throw away the Future object (I don't need the
return status). This is so I can do a lot of indexing reasonably
asynchronously, so I don't have to hold up the GUI that triggers these
calls.
However,
Hi,
I am using the more like this function to score a series of documents by
their ids. I now need to determine the rank of these documents within the
more like this corpus.
I am working on an algorithm to perform to build a binary search tree split
by rank which also includes the top score
I have a situation where, using the Java API, I initiate a bunch of
indexing operations, but throw away the Future object (I don't need the
return status). This is so I can do a lot of indexing reasonably
asynchronously, so I don't have to hold up the GUI that triggers these
calls.
However,
Final bit of info, and this seems important; one TransportClient is being
run in one thread, the other in another... so it goes like this:
Thread 1: Fork Thread 1
Thread 2: Create new TransportClient; Index list of items, not waiting for
Future objects (i.e. not calling actionGet())
Thread 1:
I made a mistake in my statement above - I am using different
TransportClients for the indexing and the searching... so, one is doing the
indexing, and while that is going on, I'm doing the searching on a
different one.
- Tim
--
You received this message because you are subscribed to the
I wish I could just go back and edit prior postings. :-)
... I should mention, if I do the above, but don't create 2 threads - that
is, synchronously index each item, close the TransportClient, create
another TransportClient and do the search, it works. So this is ONLY
happening when the two
When i set the ping timeout for transport client to a higher value, the issue
is fixed automatically.
--
View this message in context:
http://elasticsearch-users.115913.n3.nabble.com/No-node-available-exception-tp4069387p4069408.html
Sent from the ElasticSearch Users mailing list archive at
I'm trying to make a new heatmapjs using data from twitter-river
elasticsearch. I can extract geolocation from Twitter data, but how do I
pass the data the heatmapjs. Here is sample dataset from heatmapjs:
var testData = {
max: 8,
data: [{lat: 24.6408, lng:46.7728, count:
Hi Jorg,
I’ve personally been using Java 8 with ES for a while. Although I can’t say
when we’ll stop support for Java 7, as you can see from our support matrix
recent versions of ES are supported on Java 8.
http://www.elasticsearch.com/support/matrix
Hi Messias,
You should set “lowercase_expanded_terms” param to false.
Default it true.
See :
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/query-dsl-query-string-query.html#query-dsl-query-string-query
“query_string” query is changed automatically query-terms to
Hi,
You’d need to set lowercase_expanded_terms to false in query string query.
It’s true by default. That’s why *Flash*” doesn't get any result.
Masaru
On January 22, 2015 at 03:17:51, Messias (schubert.torste...@gmail.com) wrote:
Hi,
I have the following issue when I search on a
Some ideas:
You can/should share the same client within all threads. So only one client for
the full JVM.
You should create first the index and wait for the index to be created, using
actionGet(). It's a quick operation. Then run your code as you wrote.
My 2 cents.
David
Le 22 janv. 2015 à
Yes, it is round robin.
On Wed, Jan 21, 2015 at 4:14 PM, David Pilato da...@pilato.fr wrote:
But on different nodes?
--
David ;-)
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs
Le 21 janv. 2015 à 07:28, Hajime placeofnomemor...@gmail.com a écrit :
I run on parallel using REST API
Il 02/12/2014 09:21, Alessandro
Bonfanti ha scritto:
Il 12/11/2014 17:43, Alessandro
Bonfanti ha scritto:
Il 12/11/2014 17:20, Nikolas
Everett ha scritto:
Hello,
Can you tell what server specification I need for 100GB of data.
I use it only for text search(not more then 3-4 words per search).
Currently I`m using Amazon. I`ll need some advise.
Which server is better for elasticsearch ?
How much CPU, RAM and Disk space(HDD or SDD) for 100GB?
I am trying to implement fuzziness and wild card in a single query. I tried
the below code but no success
match : { Keyword : { query : B*a*c*k* O*b*a*, operator : and,
fuzziness : 9, analyze_wildcard: true } }
But when I try the same query from postman its working
I am trying to implement fuzziness and wild card in a single query. I tried
the below code but no success
match : {
Keyword : {
query : B*a*c*k* O*b*a*,
operator : and,
fuzziness : 9,
analyze_wildcard: true
}
}
But when I try the same
Hi,
Wondering if anyone can offer any advise on this as I have looked around
and cant really seem to find an answer on this.
Since updating to Java 7 64 bit from Java7 32 bit and increasing the
MAX_HEAP_SIZE I am seeing slower query performance when trying to query
data via the Plugin GUI's
any update?
On Thursday, December 11, 2014 at 3:07:59 PM UTC+2, PMOPMO wrote:
Hello all,
I want to run a simple *sql group by query* in kibana 4 Discover page.
Each record in my elastic search index represent a log and has 3 columns:
process_id (not unique value), log_time, log_message.
Forgot to mention were currently running elasticsearch
version elasticsearch-1.3.2-1
--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email
to
Hi Gordon,
This `from` is kind of ignored for scroll search. I don't remember why that
was the case, but it seems to me that scroll search can/should take into
account the `from` option during the first scroll search request.
Martijn
On 20 January 2015 at 23:33, Gordon Tillman gor...@gmail.com
Thanks Martijn,
That is what I have observed. But is is a regression from ES version 1.0.1
and before. And I can't find anything that even implies that `from` is not
supported for a scroll search, except of course when `search_type=scan`.
I would love to get support for that back into the
According to the documentation on _id
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-id-field.htmlit
is possible to store _id but it never gives a reason why that would be
useful.
I have a use case where I am exporting all ids from ES using scan/scroll
with
100 shards per index on a 10 node cluster only feel like being fast,
because the file structures sizes are very small in the beginning and most
of the memory is not allocated. But they are not. You will run into trouble
much later when indexes grow and segment merges get bigger, or indexes must
be
Using TransportClient has just solved the problem.I bulk 4 docs per
seconds but loadaverage,searching,everything is perfect.
On Wed, Jan 21, 2015 at 8:25 PM, Hajime placeofnomemor...@gmail.com wrote:
Yes, it is round robin.
On Wed, Jan 21, 2015 at 4:14 PM, David Pilato da...@pilato.fr
I agree, this should be fixed:
https://github.com/elasticsearch/elasticsearch/issues/9373
On 21 January 2015 at 14:27, Gordon Tillman gor...@gmail.com wrote:
Thanks Martijn,
That is what I have observed. But is is a regression from ES version
1.0.1 and before. And I can't find anything that
Thank you Martijn!
On Wednesday, January 21, 2015 at 8:04:23 AM UTC-6, Martijn v Groningen
wrote:
I agree, this should be fixed:
https://github.com/elasticsearch/elasticsearch/issues/9373
On 21 January 2015 at 14:27, Gordon Tillman gor...@gmail.com
javascript: wrote:
Thanks Martijn,
56 matches
Mail list logo