Elasticsearch stemmer issue

2014-06-08 Thread Александр Шаманов
Hello everyone, I have follow index mapping: curl -XPUT 'http://localhost:9200/some_content/' -d ' { "settings":{ "query_string":{ "default_con":"content", "default_operator":"AND" }, "index":{ "analysis":{ "analyzer":{ "

restore snapshot not working

2014-06-08 Thread Hermanto Phang
Hi All, I try to restore my backup snapshot using command curl -XPOST "localhost:9200/_snapshot/my_backup/snapshot_1/_restore" But it was come up error {"error":"SnapshotRestoreException[[my_backup:snapshot_1] cannot restore index [cobbler_api] because it's open]","status":500} Any input to

Re-creating ES Index

2014-06-08 Thread karthik jayanthi
Hi, I have a few questions with respect to the situation of needing to re-create an index. 1) Is there is any process to re-create an index apart from deleting the current one and creating it again with the documents ? 2) During the situations of re-create, can we use the data already store

Re: Queries, filters and match_all

2014-06-08 Thread Arkadiy Zabazhanov
Guys, I still need help, A've tried to change filtered query strategies. It returns all the filtered results anyway for versions 1.0.0 - 1.2.1. When this behavior was changed and how? Why don't I need match_all for filtered query with empty query? пятница, 6 июня 2014 г., 7:14:28 UTC+7 пользова

Losing data after Elasticsearch restart

2014-06-08 Thread Rohit Jaiswal
Hello Everyone, We lost data after restarting Elasticsearch cluster. Restarting is a part of deploying our software stack. We have a 20-node cluster running 0.90.2 and we have Splunk configured to index ES logs. Looking

Sort nested documents in search result

2014-06-08 Thread Zdenek Pizl
Hallo, let's say I have following structure of document with nested (multiple) documents : { "host": "server-001", "plugins": [ { "plugin_name": "function-c", "plugin": {"function-c": "some C content" } }, { "plugin_name": "function-a", "plugin": {"f

Re: compresstion in ES 1.2.1

2014-06-08 Thread sri
Hello Jorg, Thanks a lot for the info., i tried applying the template provided by you but the size is not reducing.On the other hand, I was noticing decrease in size when i was disabling the fields via Mapping API. Thanks and Regards Sri On Sunday, June 8, 2014 4:37:58 PM UTC-4, Jörg Prante wr

Re: compresstion in ES 1.2.1

2014-06-08 Thread joergpra...@gmail.com
Try this index template for new index creations curl -XPUT 'localhost:9200/_template/template1' -d ' { "template" : "*", "mappings" : { "_default_" : { "_source" : { "enabled" : false }, "_all" : { "enabled" : false} } } } ' See also http://www

Re: compresstion in ES 1.2.1

2014-06-08 Thread sri
Thanks a lot for the insight Patrick. I have a few more queries: - it is possible to disable the '_source' and '_all' fields by default for all the indices that would be created later (possibility define in the elasticsearch.yml file) - what happens if my index is created and then

Re: compresstion in ES 1.2.1

2014-06-08 Thread joergpra...@gmail.com
Lucene uses LZ4 compression http://blog.jpountz.net/post/35667727458/stored-fields-compression-in-lucene-4-1 so you should not run ES on a ZFS file system with compression enabled. Jörg On Sun, Jun 8, 2014 at 8:47 PM, Patrick Proniewski wrote: > Hello, > > I don't know how it's compressed b

Find all the geoshapes that insersects with a given latitude/longitude

2014-06-08 Thread Vidal Chriqui
Hi My goal is to find for a given latitude/longitude all the indexed documents (circular geoshapes with specific radius for each) that contains this lat/lon. If necessary i'm ok to transform the circular zones with envelope, but this does not seem to be the issue. I need help to find the way t

Re: compresstion in ES 1.2.1

2014-06-08 Thread Patrick Proniewski
Hello, I don't know how it's compressed but it appears that data is compressed up to an amount of 4k. ie. it's useless to store data on a compressed (lz4) filesystem if fs block size is 4k: Filesystem SizeUsed Avail Capacity Mounted on zdata/ES-lz4 1.1T1.9G1

Re: Nested object type and join:false and geo_shape

2014-06-08 Thread bants
Hey - no, sorry I didn't ever get a response. Picking up this work over the next couple of times so I'll see if I come up with anything. On Thursday, May 29, 2014 2:16:07 PM UTC+1, horse.bad...@gmail.com wrote: > > Hello, > > I am trying to achieve something very similar, where a nested filter i

options for accessing ES repository from traditional BI tools that do not aupport REST API

2014-06-08 Thread elitem way
Since ES support aggregation query now, I am thinking using ES as a data warehouse staging area. The challenge is pulling the summary data from ES using traditional BI tool like Tableau. I know Hive ODBC driver - ES-Hadoop Hive is an option, but it is very slow compared with native REST API. Are

Re: issue of elasticsearch-hadoop-2.0.0 with Hive (cloudera and hortonworks), helps are needed

2014-06-08 Thread elitem way
Here is the Hive log when running the "select count(*) from cars2;": application_1402243729361_0009 14/06/08 10:27:19 INFO log.PerfLogger: 14/06/08 10:27:19 INFO log.PerfLogger: 14/06/08 10:27:19 INFO parse.ParseDriver: Parsing command: select count(*) from cars2 14/06/08 10:27:19 INFO par

Re: compresstion in ES 1.2.1

2014-06-08 Thread joergpra...@gmail.com
The Elasticsearch file size does not only contain compressed fields, but much more. For example, term vectors, norms, etc. You would have to disable field attributes you do not want. Also note, Elasticsearch has replica enabled by default, and segment count is not optimized automatically. Jörg O

Re: compresstion in ES 1.2.1

2014-06-08 Thread sri
Okay i will make the changes and upload the new stats. I am just curious, could you explain how the results were making sense, i just want to get a proper idea of what ES is actually doing to the data. Thanks and Regards Sri On Sunday, June 8, 2014 12:56:55 PM UTC-4, David Pilato wrote: > > Wel

Re: compresstion in ES 1.2.1

2014-06-08 Thread joergpra...@gmail.com
Compression is always enabled by default. Jörg On Sun, Jun 8, 2014 at 6:01 PM, sri <1.fr@gmail.com> wrote: > Hello everyone, > > I have read posts and blogs on how elasticsearch compression can be > enabled in the previous versions(0.17 - 0.19). > > I am currently using ES 1.2.1, i wasn't a

issue of elasticsearch-hadoop-2.0.0 with Hive (cloudera and hortonworks), helps are needed

2014-06-08 Thread elitem way
I am learning the elasticsearch-hadoop. I have a few issues that I do not understand. I am using ES 1.12 on Windows, elasticsearch-hadoop-2.0.0 and cloudera-quickstart-vm-5.0.0-0-vmware sandbox with Hive. 1. I loaded only 6 rows to ES index car/transactions. Why did Hive return 14 rows instead?

Re: compresstion in ES 1.2.1

2014-06-08 Thread David Pilato
Well. Think that you index all field individualy, that you are storing source (compressed) and that you are indexing _all field as well. So with defaults, this results make sense to me. Try disable _all field and see what gain you can get. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @

Re: compresstion in ES 1.2.1

2014-06-08 Thread sri
Hi David, Thank you very much for the prompt reply. Below are the stats that i got when i was testing the ES cluster: Number of Nodes :2 Input format : rsyslog input file size(Mb) ES file size per node(Mb) 1 1.8 2 3.6 3 5.3 4 6.8 5 8.5 6 10.1 7 11.7 8 13 9 14.1 10 16 I am sorry

Re: compresstion in ES 1.2.1

2014-06-08 Thread David Pilato
It's compressed by default now. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 8 juin 2014 à 18:01, sri <1.fr@gmail.com> a écrit : Hello everyone, I have read posts and blogs on how elasticsearch compression can be enabled in the previous versions(0.17 - 0.19). I

compresstion in ES 1.2.1

2014-06-08 Thread sri
Hello everyone, I have read posts and blogs on how elasticsearch compression can be enabled in the previous versions(0.17 - 0.19). I am currently using ES 1.2.1, i wasn't able to find out how to enable compression in this version or if at all there is any such option for it. I know that i ca

Re: XGET to _mapping does not match the mapping I used to create the index, is this normal?

2014-06-08 Thread Jun Ohtani
Hi, You’re welcome. Btw Twitterとかで聞いてもらっても答えますので、お気軽に。もちろん、MLのほうが皆さんに見えるのでいいのですが。 Jun Ohtani joht...@gmail.com blog : http://blog.johtani.info twitter : http://twitter.com/johtani 2014/06/07 0:17、Enno Shioji のメール: > Hi Jun, > > Ah, I must be doing something wrong then. I'll corr

Tribe problem in creating native thread

2014-06-08 Thread Srećko Morović
Hello, We are trying to start up a tribe node that would connect to ~60 clusters all consisting of several nodes. We use unicast from the tribe server to discover all of the cluster master nodes. Master nodes are fixed (if a master fails, then other machines in the cluster will be out of functi

Re: Understanding merge statistics from Marvel

2014-06-08 Thread John Smith
I know benchmarking is a tough subject! But what do those number mean? On Friday, 6 June 2014 12:17:22 UTC-4, John Smith wrote: > > Running Elasticsearch 1.2.1 with Java 1.7_55 on CentOs 6.5 > > The machine is a 32 core 96GB with standard spinning disk, but I also > installed 1 Samsung Evo 840 fo

Re: scala elastic4s usage question

2014-06-08 Thread Stephen Samuel
"as" is only used to start a mapping definition, ie in the outer block. You are mapping nested fields, in which case you want to use "nested", or "inner" depending on your use case. Here is an example taken from the unit tests: create.index("users").shards(2).mappings( "tweets" as (

Adding relevance to query_string query, help required

2014-06-08 Thread see613
I am using a "query_string" with morphology and wildcard. 'searchAnalyzer' => array( 'type' => 'custom', 'tokenizer' => 'standard', 'filter' => array('lowercase', 'word