minimum_master_nodes still doesn't protect you from all possible failure
scenarios, see http://aphyr.com/posts/317-call-me-maybe-elasticsearch
What version are you running?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer &
It may worth looking at 2 things:
1. Using the latest Elasticsearch version (1.4). Many work went on
optimizing those type of scenarios on the server side.
2. Disabling refresh / flush - I assume this is an ETL process and as such
this could greatly help.
--
Itamar Syn-Hershko
http://code972
I personally recommend https://github.com/elasticsearch/stream2es
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Tue, Oct 21, 2014 at 3
Yes
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Tue, Oct 21, 2014 at 10:47 AM, Costya Regev wrote:
> Hi,
>
> It's not cle
Incremental. See
http://www.elasticsearch.org/blog/introducing-snapshot-restore/
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Wed, Oct 15,
See
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/query-dsl-not-filter.html
You should probably switch to a bool and a should clause before instead of
an and filter
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Fre
No - you should definitely use the snapshot and restore as its the most
stable and efficient way for backups there is.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http:
Yes, you don't want to use anything other than local storage for
Elasticsearch. Not EBS and definitely not S3. You can use the
snapshot/restore API to continously backup to S3 and get all the data
protection you need.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twi
HTTP status codes are used to communicate errors, for example a runtime
error would return HTTP status 500
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhers
Yes, they are. Pretty much like any search request with Elasticsearch.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Tue, Sep 30, 2014 at 1
This is probably a bug of the .NET client API, and you should log it on
github where they monitor issues for it
You might find this alternative library useful:
https://github.com/synhershko/NElasticsearch available from nuget as well
https://www.nuget.org/packages/NElasticsearch/
--
Itamar Syn
cess, it seems to me that would be the solution for you anyway.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Mon, Sep 29, 2014 at 8:13 AM, CB
Mainly the ability to parse log texts easily and restructure it as json
("grok")
There's no "better" way really, if your app knows what it's doing and you
use batching and have proper error recovery its probably better to have it
that way
--
Itamar Syn-Hershko
ht
No, it is just that sort is overriding the random_score. I would say just
do the sorting on the client side - while it is possible to do still on ES
(using scripted fields for example) it will just get too complicated.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.
) can only be
changed via manually editing elasticsearch.yml.
My advice to you would be to use Puppet, Chef or any other configuration
and deployment management tools to avoid unmoderated changes to
elasticsearch.yml
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/syn
No, but since you should never expose your clusters to end users directly
you could always impose this limit on the software facade that connects
between your application and the cluster
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Dev
This will still mean less overhead than having those distinct field in
discreet indexes. I wouldn't worry about that.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <
?
Maybe look at your data model and try to re-arrange it.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
--
You received this message because yo
Seems like it will in the next version (looking at the couchbase elastic
transport plugin commits)
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
Sure thing
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Sun, Sep 14, 2014 at 7:19 PM, Niv Penso wrote:
> Amazing answer helped me so
ent
manner using the cluster meta-data it has stored locally. This is by design
and has many optimizations in place as well, and also allows you as a
client to use round-robin or make requests to a load balancer.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.c
servers instead of having them on one server ("virtual shards"). This will
help fan out the indexing load.
5. If you don't specify the document IDs yourself, make sure you use the
latest ES, there's a significant improvement there in the ID generation
mechanism which could help
yes, that'd be wise to do
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Mon, Sep 8, 2014 at 4:22 PM, James wrote:
> Ah cool ok.
1000 entries is a very small set. If you can have this local to your code
that would be best. Otherwise yes a 2-phase query is probably your best bet
- note you could also use the suggesters to improve speed
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhers
This is just a key/value lookup. If it's not too big I'd just hold it all
in memory. Otherwise of course you can use Elasticsearch for that.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of Rav
You will need to reindex to another index (as this change of mappings isn't
backwards compatible) or to a field with a different name.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB
rom you as I'll be available for
short time contracts and consultancy gigs.
More details:
http://code972.com/elasticsearch-consulting-and-development-services
Thanks,
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & C
Yes
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Thu, Sep 4, 2014 at 12:49 AM, Employ wrote:
> Thank you. And no plugin is re
Well, this response is also public :)
I'll ping you sometime next week with more details, juggling with too many
things currently. Would definitely love to have an extra set of eyes.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelanc
summarizes all this very well:
http://stackoverflow.com/a/15041966/135701
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Wed, Sep 3, 2014 at 12:33
You don't need any external geo-data, see
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/query-dsl-geo-distance-filter.html
You only need external data sources if you want to give some coordinates /
polygons names (like countries, neighborhoods etc)
--
Itamar Syn-He
7;t thing Jorg
actually meant that..). I was involved with
https://lucene.apache.org/openrelevance/ but its now discontinued and in
some spare time I have I'm trying to take that initiative forward.
Ping me privately if that sounds interesting and we can continue discussing.
--
Itamar Syn-Hershko
http:/
- Yumbo <-- synonyms to Mulaló (multiple
tokens at the same position) etc
And then you would use a tokenizer normally (and tokenize on commas, for
example)
Then you still lose the full-text search capabilities but in exchange for
more precision (and more setup work on your part)
--
Itamar Syn-Hers
Take a look at suggesters - they are meant for that plus they are more
performant! http://www.elasticsearch.org/blog/you-complete-me/
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Ac
What would be the usecase for such a process (removing stop words without
tokenization)?
This may be a good read btw:
http://www.elasticsearch.org/blog/stop-stopping-stop-words-a-look-at-common-terms-query/
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhers
I'm going to bet on Head. Disable it and see what happens.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Thu, Aug 21, 2014 at 7:22
to an ES index?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Wed, Aug 20, 2014 at 11:53 PM, Ivan Brusic wrote:
> Very strange query ind
un on a decent
sized installation.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Wed, Aug 20, 2014 at 10:14 PM, Chris Neal
wrote:
> Hi guys,
&
You can do this using the timestamp (with a range, and grow it if
necessary) or if you have a serial ID of some sort on the log message you
can do a range query on that
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consult
You should bring the price over to Elasticsearch and not the other way
around. Scoring against an external service is an added friction with huge
performance costs.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
It would also depends on the caching abilities and that the currently
paralelized threads are not sharing a resource. In the end, like with all
Computer Science stuff, the answer is "it depends" :)
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhersh
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/docs-delete-by-query.html
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
Aggregations operate on the results of a search query, so you can
definitely use that total also when you have sub-aggregations. As for
filter aggregations, you can have a subtract which acts as a sink for all
unused docs and subtract it's count from the total count
--
Itamar Syn-Hershko
You get it from the search request that wraps the terms aggregation, under
total hits in the root of the response
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.c
your server side facade
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Mon, Jul 14, 2014 at 6:21 PM, Danny Lieberman wrote:
> Itamar
>
I would strongly suggest against that. Never expose ES to the public,
always put it behind a server facade. To get a glimpse of what you are
exposing yourself, see this recent blog post
http://www.elasticsearch.org/blog/scripting-security/
--
Itamar Syn-Hershko
http://code972.com | @synhershko
You don't
You either pull a small page of data to display (match_all query or any
filtering query), or ask ES to aggregate the data for you and get back the
metrics or buckets. You can also do both at the same time.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twi
ly
so, but definitely not to 230GB on a 16GB server even when there's no
aggregation involved.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Mon,
uery types.
One advice I could tell you is to try and avoid introducing too much
friction, like duplicating the model too many times (DTO, DAO etc). If you
can use the same structure for display as you use for indexing in ES, use
that.
HTH
--
Itamar Syn-Hershko
http://code972.com | @synhers
What you are seeing is the Lucene numeric field terms. Try having the
applicationid field as a not_analyzed string field for the purpose of
faceting, instead of having it as a numeric field (which is usable for
range queries or sorts)
--
Itamar Syn-Hershko
http://code972.com | @synhershko <ht
ot of aggregation operations (aka faceting)
You probably can find ways to fine tune and squeeze more performance out of
what you currently have (again - using filters, codecs and other advanced
configs) but it's probably just wiser to scale out
--
Itamar Syn-Hershko
http://code972.com | @synhers
erver
isn't cool, especially if you use aggregations), look into codecs and much
more. There's no need for you to look into segments, especially since if
this is a live index which is being written to there's a large cost (CPU,
IO and GC) associated with merging segments
--
Itamar Syn-He
How did you arrive at this number of 5?
To being with, what sizes are your shards? what are the specs of your
servers?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http:
onse=401. But this will not distinguish between A->B->C and
>> B->A->C. Perhaps I could use the script filter for the "last mile" and from
>> the term filtered results throw out B-A-C and it will run more quickly
>> because of the reduced docu
t, see
http://code972.com/blog/2014/07/74-the-definitive-guide-for-elasticsearch-on-windows-azure
HTH,
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhersh
If it was corrupted you would have seen other errors, not 503. Check your
network settings.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On S
This error means indexing has stopped at one point, up to that point
everything is preserved.
See
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/index-modules-allocation.html#disk
for how to avoid this from now on
--
Itamar Syn-Hershko
http://code972.com | @synhershko
Is this 1 Elasticsearch instance running locally or do multiple servers /
nodes participate?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
ree/feature/improve_zen and
there may be other related tickets as well
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Tue, Jun 24, 2014 at 7:04 PM,
ures, but what Aphyr
showed is that when failure conditions happen the chances you will are
pretty high. Thanks to the Fallacies of Distributed Computing, that
basically means those are bound to happen every now and then. If and how
much data you lose will vary based on volumes, setups etc.
HTH
--
bucketing is
one example. So if you need exact values, I'd go for a model that does it.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
ok ID. Unless you use data from the book
level along with full-text searches on the texts, which even then in some
scenarios I would consider denormalization.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Autho
This is usually something that's being solved using parent-child, but the
question here really is what do you mean by needing to retrieve both books
& pages.
Can you describe the actual scenario and what you are trying to achieve?
--
Itamar Syn-Hershko
http://code972.com | @synhersh
FWIW, this is usually why you use replicas, so even if a shard goes down
there's a back-up shard (ideally more than one) that you can fallback to.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of Rav
Take a look here: http://www.elasticsearch.org/blog/you-complete-me/
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Tue, Jun 17, 2014
This is just to debug this, to make sure results are indeed not sorted by
_timestamp, as you claim. Probably easier to just set _timestamp to stored.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of
Possibly, because it's not provided in the _source, or just use this:
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-timestamp-field.html#_path_2
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelanc
This is weird. Are you sure what you are seeing is not overridden documents
(can happen if you specify the ID yourself)? Can you add the _timestamp
field to the results and verify the documents are indeed not sorted by
_timestamp?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <ht
If you mean Kibana dashboards, take a look here this might be of help:
https://github.com/synhershko/RavenDB.ElasticsearchReplication/tree/master/Kibana.Host
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author
I'm pretty sure you're right
On Jun 6, 2014 8:03 PM, "Nikolas Everett" wrote:
> I'm in the position where I need to make _sure_ a document is deleted from
> the index when something occurs in my source system. I want to just hit it
> with a DELETE every time. Is that a good idea?
>
> It looks t
number of lines where? you can always show a Count facet that will count
the number of results of a query
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhersh
;path" and buckets on the user. To
check the condition of the previous path you should be able to bucket again
using a script, or maybe even with a query on a nested type.
This is just from the top of my head but should definitely work if you can
get to that model
--
Itamar Syn-Hershko
http:
You should have released this before my talk last week, I could have
mentioned it :\
https://www.youtube.com/watch?v=FbAO2k57bdg
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Ac
emory requirements etc as a data node.
Finally, there has been (and still is) a lot of work put into this so I
strongly recommend upgrading to the latest (currently it is 1.2.1).
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Co
you can use ES properly using
queries or the aggregations framework?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Thu, Jun 5, 2014 at
Depending on your corpus, this should happen automatically. That's what
TF/IDF is about.
What you can do further is use NLP methods to tag those items in search and
indexing. Look up POS tagging and entity extraction.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twi
Just enable multicast using the plugin for your cloud provider...
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Mon, Jun 2, 2014 at 2:08 P
th as one
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Mon, Jun 2, 2014 at 1:49 PM, Martin Harris <
martin.har...@cloudsoftcorp.com>
When you search, what does the _id field of the result indicate?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Mon, May 26, 2014 at 11:
What is the exact URLs you're approaching? Are you specifying the index
name and type name as well in your GET ?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://mann
Aye, and then you can use edit distance on single words (fuzzy query) to
cope with fast typers
On May 22, 2014 8:22 PM, "Robert Muir"
wrote:
> On Wed, May 21, 2014 at 6:01 PM, Erik Rose wrote:
> > I'm trying to move Mozilla's source code search engine (dxr.mozilla.org)
> > from a custom-written
most out of your document store. More details
here:
http://code972.com/blog/2014/05/71-using-elasticsearch-as-a-storage-for-git-repositories
I'll have some concrete use-cases / demos to share soon
Cheers,
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhersh
What are the details of that exception? can it be that ES has issues
parsing the docs?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Mon,
That doesn't seem right, try making larger bulk sizes. Also, what size is
your docs?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On M
That's a very low rate. Are you importing locally or via remote connection?
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Mon, Ma
use percentages for the
watermark values.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Fri, May 16, 2014 at 3:26 PM, Michel Con
This is effective only when you run with the debugger attached, but yes.
This is effective for all response types.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.c
What version are you using? the latest one (v 1.0.0 beta1, a pre-release on
nuget) should have this feature
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synh
ten times multi-lingual search demands a lot of attention
Not long ago I gave a talk about this topic, you might find it helpful:
https://skillsmatter.com/skillscasts/4968-approaches-to-multi-lingual-text-search-with-elasticsearch-and-lucene
--
Itamar Syn-Hershko
http://code972.com | @sy
And then what analyzer you will use for that?
It is doable, but I'd strongly suggest against it unless you know what you
are doing:
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/query-dsl-query-string-query.html#_multi_field_2
--
Itamar Syn-Hershko
http://code97
while your code is a client written in
.NET, so no
Otherwise, you can define the analyzer via index settings from client code
as well
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Ac
This should be possible to do using script fields:
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-facets-terms-facet.html#_script_field
However, you will need to figure out how to do date manipulation there
--
Itamar Syn-Hershko
http://code972.com | @synhershko
It would have to be defined on ES, the analyzer name has to be the one that
ES recognizes (as the plugin defines it)
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.c
This should work:
[ElasticProperty(Analyzer = "my_analyzer")]
public string Content { get; set;}
You can also specify Index/SearchAnalyzer this way (for fine-grained
control)
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Fre
u may want to upgrade gradually (0.90 and then 1.x) just to be safe, but
you don't have to reindex.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/&
There's no need to reindex, it is enough to do full cluster restart after
upgrading the binaries and ES/Lucene will take care of the rest
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB
. There's no thumb rule for that one I'm afraid.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On Thu, Apr 24, 2014 at 4:37 PM, wr
or delete an entire index and not use
TTLs or delete-by-query processes.
Deciding on the optimal size of an index in that scenario highly depends on
your data, usage patterns and a lot of experimenting.
That's to answer 1 & 2
3. Definitely, 0.20 is a very old version
--
Itamar Syn-Her
even
4-5 grams perform much better than the above two, you should try those.
They do not require a dictionary.
I can't seem to find that paper now, the link I had to it seems to be
broken.
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Dev
search.org/guide/en/elasticsearch/reference/current/search-facets-terms-facet.html#_script_field
--
Itamar Syn-Hershko
http://code972.com | @synhershko <https://twitter.com/synhershko>
Freelance Developer & Consultant
Author of RavenDB in Action <http://manning.com/synhershko/>
On
101 - 200 of 245 matches
Mail list logo