Re: how to see individual node stats on marvel?

2015-02-15 Thread Jinyuan Zhou
, so use with care. On Mon, Feb 16, 2015 at 7:07 AM, Jinyuan Zhou zhou.jiny...@gmail.com wrote: Another one is the query box. Just type the full or partial name of the node(s). Each search box corresponds to a display of the combined metrics of the nodes whose names match the typed-in text. Jinyuan

Re: how to see individual node stats on marvel?

2015-02-15 Thread Jinyuan Zhou
button to view both nodes in the Node Stats dashboard and be able to compare them. On Wednesday, January 28, 2015 at 10:43:46 PM UTC+1, Jinyuan Zhou wrote: Thanks, Jinyuan (Jack) Zhou On Wed, Jan 28, 2015 at 1:05 PM, Mark Walkom markwal...@gmail.com wrote: You can click on a node name

Re: how to see individual node stats on marvel?

2015-01-28 Thread Jinyuan Zhou
Thanks, Jinyuan (Jack) Zhou On Wed, Jan 28, 2015 at 1:05 PM, Mark Walkom markwal...@gmail.com wrote: You can click on a node name and it will take you to node level stats. You can also filter by name using the search box in the nodes panel. On 29 January 2015 at 05:23, Jinyuan Zhou

how to see individual node stats on marvel?

2015-01-28 Thread Jinyuan Zhou
Hi, I am playing with Marvel. There is a node stats page, but it seems to show only overall stats for the cluster. What filter key can I use to see an individual node? Thanks, -- You received this message because you are subscribed to the Google Groups elasticsearch group. To unsubscribe from

Re: In JVM GC log message, what are the two numbers after [gc][old] and before duration?

2015-01-20 Thread Jinyuan Zhou
I figured out that the second number, 169, is the number of old GC events, by running jstat -gcold pid. But I still don't know what the first number, 15221, is. Jinyuan (Jack) Zhou On Mon, Jan 19, 2015 at 11:10 AM, Jinyuan Zhou zhou.jiny...@gmail.com wrote: Hi, I copied this log message from some postings. There are two

Why no exception when circuit breaker is triggered?

2015-01-19 Thread Jinyuan Zhou
The following line indicates that the fielddata would exceed the configured breaker limit, indices.breaker.fielddata.limit, which by default is 60% of heap. According to the docs, it should throw an exception. But it seems my query returned OK. I noticed that a large number of such queries eventually leads to an OOM error. I

In JVM GC log message, what are the two numbers after [gc][old] and before duration?

2015-01-19 Thread Jinyuan Zhou
Hi, I copied this log message from some postings. There are two numbers between [gc][old] and duration: [15221][169]. What are they? [2014-04-16 16:32:57,505][WARN ][monitor.jvm ] [elasticsearch2.trend1] [gc][old][15221][169] duration [1.3m], collections [1]/[1.3m],
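
To make the two numbers in question easy to pick out, here is a minimal Python sketch that extracts the bracketed fields following [gc] from a line shaped like the one quoted above. The field names `first` and `second` are placeholders chosen for this illustration; the sketch does not claim to know what the numbers mean, and the regular expression only assumes the layout visible in the quoted line.

```python
import re

# Assumed layout, taken from the quoted log line only:
#   [gc][<collector>][<first number>][<second number>] duration [<time>]
GC_FRAGMENT = re.compile(
    r"\[gc\]\[(?P<collector>\w+)\]"
    r"\[(?P<first>\d+)\]\[(?P<second>\d+)\]"
    r"\s+duration\s+\[(?P<duration>[^\]]+)\]"
)


def parse_gc_fragment(line):
    """Return the bracketed GC fields as a dict, or None if the line does not match."""
    match = GC_FRAGMENT.search(line)
    if match is None:
        return None
    return {
        "collector": match.group("collector"),
        "first": int(match.group("first")),    # placeholder name, meaning not asserted
        "second": int(match.group("second")),  # placeholder name, meaning not asserted
        "duration": match.group("duration"),
    }


# The quoted line, with the bold markers and stray spaces removed.
SAMPLE = (
    "[2014-04-16 16:32:57,505][WARN ][monitor.jvm ] [elasticsearch2.trend1] "
    "[gc][old][15221][169] duration [1.3m], collections [1]/[1.3m],"
)

if __name__ == "__main__":
    print(parse_gc_fragment(SAMPLE))
```

Comparing `second` across successive log lines against the counts reported by jstat is one way to check what that field tracks.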

Re: Is it possible to install plugin into a directory other than ${ES_HOME}/plugins?

2015-01-17 Thread Jinyuan Zhou
. Or symlinks. Always your friend. On Jan 16, 2015 7:11 PM, Jinyuan Zhou zhou.jiny...@gmail.com wrote: Thanks,

Is it possible to install plugin into a directory other than ${ES_HOME}/plugins?

2015-01-16 Thread Jinyuan Zhou
Thanks,

what will happen when shard size is too large

2015-01-04 Thread Jinyuan Zhou
I read somewhere that the max recommended shard size is about 50G. What are the worst consequences when a shard is too large? For some indices, I have shard sizes of more than 100G on a JVM with a 29G fixed heap size. I suspect this is one factor that caused an OutOfMemory problem. However I

Re: this query brought down my cluster :{query: {function_score: {query: {match_all:{}}, random_score: {seed: 123456}}}}

2014-12-23 Thread Jinyuan Zhou
, Dec 22, 2014 at 7:28 PM, Jinyuan Zhou zhou.jiny...@gmail.com wrote: Hi, I'd like to share my experience and at the same time hope I can get some tips. The query was run against an index with about 700 million documents. Two things happened: 1. The node that ran this query crashed. It is the node

this query brought down my cluster :{query: {function_score: {query: {match_all:{}}, random_score: {seed: 123456}}}}

2014-12-22 Thread Jinyuan Zhou
Hi, I'd like to share my experience and at the same time hope I can get some tips. The query was run against an index with about 700 million documents. Two things happened: 1. The node that ran this query crashed. It is the node configured not to process data. 2. The data nodes start crazy on
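
The query in the subject line lost its quotes in the archive. A minimal Python sketch of the same request body as valid JSON follows; the structure and the seed 123456 come from the subject line, while the helper name `random_score_query` is made up for this illustration.

```python
import json


def random_score_query(seed):
    """Build the function_score / random_score body from the subject line."""
    return {
        "query": {
            "function_score": {
                "query": {"match_all": {}},
                "random_score": {"seed": seed},
            }
        }
    }


# Serialize with the seed used in the original post.
body = json.dumps(random_score_query(123456), indent=2)

if __name__ == "__main__":
    print(body)
```

Because the inner query is match_all, every document in the index is scored, which fits the heavy load described for an index of about 700 million documents.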

what data are cached exactly in filter cache?

2014-11-24 Thread Jinyuan Zhou
is it the set of keys, e.g, the unique identifiers, which corresponds to exactly the set of documents that match the filter? Thanks,

Re: what data are cached exactly in filter cache?

2014-11-24 Thread Jinyuan Zhou
at 8:39 PM, Jinyuan Zhou zhou.jiny...@gmail.com wrote: is it the set of keys, e.g, the unique identifiers, which corresponds to exactly the set of documents that match the filter? Thanks,

Re: issues with using repository-hdfs plug in for snapshot/restore operation

2014-11-21 Thread Jinyuan Zhou
`hadoop classpath` to ES_CLASSPATH? I tried to exec `hadoop classpath` and added the result to export ES_CLASSPATH=pasted output but am getting the same error. Is that what you meant? Thank you, On Monday, July 7, 2014 2:42:04 PM UTC-7, Jinyuan Zhou wrote: I am using elasticsearch 1.2.1 and CDH 4.6

cost of automatic refresh

2014-11-19 Thread Jinyuan Zhou
I am curious about how much automatic refresh costs, in both CPU and memory, while I am not indexing anything new. For example, when it is time to refresh, ES has to look at the buffer to see if there is anything to refresh. How much would this cost even when there is nothing to flush? Thanks,

why does my refresh take more than 200 milliseconds on average?

2014-11-19 Thread Jinyuan Zhou
I have an average of 220 milliseconds per refresh. It seems very slow. I thought refresh should be very fast: it flushes the buffer to a Lucene segment and everything happens in memory. Does IO speed affect refresh speed? I do have spinning disks. Thanks,

index.refresh_interval: -1

2014-11-17 Thread Jinyuan Zhou
If I set this value, do I have to make an API call to refresh manually? By manually, I mean that ES will not refresh without a client request. Thanks,
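
For reference, a minimal Python sketch of the two requests this question is about: the index settings update that sets index.refresh_interval to -1 (which, as I understand it, disables the periodic refresh), and an explicit call to the refresh API. The base URL http://localhost:9200 and the index name `my_index` are assumptions for illustration, and the requests are only constructed here, not sent.

```python
import json
from urllib.request import Request

BASE_URL = "http://localhost:9200"  # assumption: a local node on the default HTTP port


def disable_auto_refresh(index):
    """Build a PUT /<index>/_settings request setting refresh_interval to -1."""
    body = json.dumps({"index": {"refresh_interval": "-1"}}).encode("utf-8")
    return Request(
        f"{BASE_URL}/{index}/_settings",
        data=body,
        headers={"Content-Type": "application/json"},
        method="PUT",
    )


def manual_refresh(index):
    """Build a POST /<index>/_refresh request that triggers one explicit refresh."""
    return Request(f"{BASE_URL}/{index}/_refresh", method="POST")


if __name__ == "__main__":
    # "my_index" is a hypothetical index name.
    for req in (disable_auto_refresh("my_index"), manual_refresh("my_index")):
        print(req.get_method(), req.full_url)
```

Passing either request to `urllib.request.urlopen` would send it; a common pattern is to disable refresh during a bulk load, issue one refresh at the end, and then restore the interval.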

Re: Elasticsearch-Hadoop repository plugin Cloudera Hadoop 2.0.0-cdh4.6.0

2014-10-14 Thread Jinyuan Zhou
My ES cluster nodes and Hadoop nodes are not collocated. The light version does not work for me without putting enough correct versions of the hadoop-related jars in place. Right now I don't want to create my own jar as Brent did, and I don't want to install hadoop or copy jars onto the es nodes either. Right now I

Re: Elasticsearch-Hadoop repository plugin Cloudera Hadoop 2.0.0-cdh4.6.0

2014-10-14 Thread Jinyuan Zhou
one. Cheers, On Tue, Oct 14, 2014 at 10:00 PM, Jinyuan Zhou zhou.jiny...@gmail.com wrote: My ES cluster nodes and Hadoop nodes are not collocated. The light version does not work for me without putting enough correct versions of the hadoop-related jars in place. Right now I don't want to create my jar

Re: elasticsearch 1.3.2 processes logging.yml as JSON format when logging.yml is deployed from a Subversion repo

2014-10-09 Thread Jinyuan Zhou
What I did was rename logging.yml to something else and then, at startup time, copy the contents to logging.yml. Thanks, Jack Jinyuan (Jack) Zhou On Thu, Oct 9, 2014 at 2:28 AM, Daniel P. thetuxkee...@gmail.com wrote: We have the same problem is reading .svn/text-base/logging.yml.text-base

Re: elasticsearch 1.3.2 processes logging.yml as JSON format when logging.yml is deployed from a Subversion repo

2014-10-09 Thread Jinyuan Zhou
/elasticsearch/issues/8040 Let's see what happens. :) On Thursday, October 9, 2014 5:08:19 PM UTC+2, Jinyuan Zhou wrote: What I did was rename logging.yml to something else and then, at startup time, copy the contents to logging.yml. Thanks, Jack Jinyuan (Jack) Zhou On Thu, Oct 9, 2014 at 2

Re: Can't open file to read checksums

2014-09-25 Thread Jinyuan Zhou
Could it be a bug? I saw exactly the same warning in the log file during a rolling restart of all nodes. I am running 1.3.2 on all nodes. I saw it about 4 times when restarting a total of 12 nodes. The cluster is OK; all indices are eventually in green state. One more detail: the warning happens on the node

how big the directory for es.path.logs and es.path.work has to be?

2014-09-22 Thread Jinyuan Zhou
I understand es.path.data is the place where elasticsearch stores data. I am just wondering if the log directory and work directory need to be somewhat big too when we store more and more data in ES. Are there any known cases where the storage size of these directories, especially the work directory, caused

Re: how big the directory for es.path.logs and es.path.work has to be?

2014-09-22 Thread Jinyuan Zhou
: www.campaignmonitor.com On 23 September 2014 09:47, Jinyuan Zhou zhou.jiny...@gmail.com wrote: I understand es.path.data is the place where elasticsearch stores data. I am just wondering if the log directory and work directory need to be somewhat big too when we store more and more data in ES. Is there any known

Re: Any zen discovery difference between master eligible and non eligible nodes?

2014-09-18 Thread Jinyuan Zhou
client node. -- *David Pilato* | *Technical Advocate* | *Elasticsearch.com* @dadoonet https://twitter.com/dadoonet | @elasticsearchfr https://twitter.com/elasticsearchfr On 18 September 2014 at 02:54:31, Jinyuan Zhou (zhou.jiny...@gmail.com) wrote: yes. Jinyuan (Jack) Zhou On Wed

Re: Any zen discovery difference between master eligible and non eligible nodes?

2014-09-17 Thread Jinyuan Zhou
...@pilato.fr wrote: A cluster needs a master node. You need at least one node which could be elected as master. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 17 Sept. 2014 at 20:08, Jinyuan Zhou zhou.jiny...@gmail.com wrote: I have two nodes

Re: Any zen discovery difference between master eligible and non eligible nodes?

2014-09-17 Thread Jinyuan Zhou
Yes. The cluster name is set to be the same on all 12 nodes. Jinyuan (Jack) Zhou On Wed, Sep 17, 2014 at 12:24 PM, David Pilato da...@pilato.fr wrote: Did you set the same cluster name? -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 17 Sept. 2014 at 21:09, Jinyuan Zhou

Re: Any zen discovery difference between master eligible and non eligible nodes?

2014-09-17 Thread Jinyuan Zhou
or a plugin for discovery? -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 17 Sept. 2014 at 21:49, Jinyuan Zhou zhou.jiny...@gmail.com wrote: Yes. The cluster name is set to be the same on all 12 nodes. Jinyuan (Jack) Zhou On Wed, Sep 17, 2014 at 12:24 PM, David Pilato da

Re: Any zen discovery difference between master eligible and non eligible nodes?

2014-09-17 Thread Jinyuan Zhou
wrote: When you start the client node, did you set the list of master nodes as unicast hosts? -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 17 Sept. 2014 at 22:13, Jinyuan Zhou zhou.jiny...@gmail.com wrote: I am using unicast with out-of-the-box zen discovery

Re: better places to store es.nodes and es.port in ES Hive integration?

2014-09-16 Thread Jinyuan Zhou
On 6/17/14 2:42 AM, Jinyuan Zhou wrote: Sure, I was able to run the following command against my remote es cluster: hive -i init.hive -f search.hql. Below are the contents of init.hive, search.hql and the data file in hdfs /user/cloudera/hivework/foobar/foobar.data I replaced value for es.nodes

Re: elasticsearch hadoop, dynamically decide index name too (not just type name), is it possible?

2014-09-11 Thread Jinyuan Zhou
This is great. Thanks, Jinyuan (Jack) Zhou On Thu, Sep 11, 2014 at 1:56 PM, Costin Leau costin.l...@gmail.com wrote: Yes, simply use a different pattern - {my-index-pattern}/{my-type}-foobar On 9/11/14 9:56 PM, Jinyuan Zhou wrote: I saw hadoop documentation regarding setting up index

Re: should ES_HEAP_SIZE be less than 31G?

2014-09-04 Thread Jinyuan Zhou
Say I have two boxes for data nodes in the cluster: A and B. However, A has double the capacity of B. Now I can start two data nodes on A and one on B. With the help of a rack id attribute, ES will make sure a replica and its primary are not located on the same box. In this case, it seems to me that

should ES_HEAP_SIZE be less than 31G?

2014-09-03 Thread Jinyuan Zhou
I read somewhere that ES_HEAP_SIZE is best kept below 31G; in that case the JVM can use a 32-bit number to address memory locations. If my server has about 64G of RAM, this seems perfect. But what if I have a server with 128G of RAM and sufficient CPUs? Which configuration is better

Spring Elasticsearch vs Spring Data Elasticsearch:

2014-09-02 Thread Jinyuan Zhou
Hi, I'm planning to use the Java native client mainly to bulk index a lot of data. I see there are two projects listed on the elasticsearch guide's integration page. One is a sub-project of Spring Data at https://github.com/spring-projects/spring-data-elasticsearch. One is a third party project

Re: Spring Elasticsearch vs Spring Data Elasticsearch:

2014-09-02 Thread Jinyuan Zhou
-- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs On 2 Sept. 2014 at 21:44, Jinyuan Zhou zhou.jiny...@gmail.com wrote: Hi, I'm planning to use the Java native client mainly to bulk index a lot of data. I see there are two projects listed on the elasticsearch guide's

Re: Is it possible to register a RestFilter without creating a plugin?

2014-08-26 Thread Jinyuan Zhou
this is possible. In such a provision, how will you define what the REST API will do? Thanks Vineeth On Tue, Aug 26, 2014 at 2:41 AM, Jinyuan Zhou zhou.jiny...@gmail.com wrote: Thanks,

Is it possible to register a RestFilter without creating a plugin?

2014-08-25 Thread Jinyuan Zhou
Thanks,

elasticsearch 1.3.2 processes logging.yml as JSON format when logging.yml is deployed from a Subversion repo

2014-08-15 Thread Jinyuan Zhou
I deploy these files from an svn repository. Since logging.yml is under subversion, there is a .svn/text-base/logging.yml.text-base file. ES loads this file, which ends with text-base, and assumes it is in JSON format. An org.elasticsearch.common.jackson.core.JsonParseException is raised. Is there a way to

explanation of each field of _stats api data

2014-07-21 Thread Jinyuan Zhou
Hi there, where can I find documentation explaining the JSON data returned by _stats for an index? I mean the meaning of each field, not just a high-level description. Specifically, for the following data primaries: {docs: {count: 1789457,deleted: 0},store: { size_in_bytes:

issues with using repository-hdfs plug in for snapshot/restore operation

2014-07-07 Thread Jinyuan Zhou
I am using elasticsearch 1.2.1 and the CDH 4.6 quick start vm. My ES server is installed on the same vm. I have one successful scenario: I used the light version and added the result of the command `hadoop classpath` to ES_CLASSPATH. But I encountered errors with the default version and hadoop2 version.

Re: How to see all document types for a given index?

2014-07-01 Thread Jinyuan Zhou
Thanks, I guess this may be the perfect solution. Jinyuan (Jack) Zhou On Tue, Jul 1, 2014 at 6:23 AM, smonasco smona...@gmail.com wrote: I believe http://localhost:9200/index/_mapping will give you types. It is an indirect method for sure, but that kind of metadata is going to be in
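
The _mapping approach suggested in this reply can be sketched in Python as follows. The sketch assumes the response layout of Elasticsearch 1.x, where the body is keyed by index name and then by "mappings", with one entry per document type; the sample response, index name and type names below are hypothetical.

```python
def types_in_index(mapping_response, index):
    """List the document type names found in a parsed GET /<index>/_mapping response."""
    mappings = mapping_response.get(index, {}).get("mappings", {})
    return sorted(mappings.keys())


# Hypothetical parsed response, shaped like an Elasticsearch 1.x _mapping body.
SAMPLE_RESPONSE = {
    "megacorp": {
        "mappings": {
            "employee": {"properties": {"first_name": {"type": "string"}}},
            "office": {"properties": {"city": {"type": "string"}}},
        }
    }
}

if __name__ == "__main__":
    print(types_in_index(SAMPLE_RESPONSE, "megacorp"))
```

In practice the dictionary would come from parsing the JSON returned by the _mapping URL quoted in the reply.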

How to see all document types for a given index?

2014-06-29 Thread Jinyuan Zhou
Thanks,

Re: better places to store es.nodes and es.port in ES Hive integration?

2014-06-17 Thread Jinyuan Zhou
AM, Jinyuan Zhou wrote: Sure, I was able to run the following command against my remote es cluster: hive -i init.hive -f search.hql. Below are the contents of init.hive, search.hql and the data file in hdfs /user/cloudera/hivework/foobar/foobar.data I replaced value for es.nodes with fake name. Other

Re: better places to store es.nodes and es.port in ES Hive integration?

2014-06-16 Thread Jinyuan Zhou
creation hql inside myscript.hql doesn't have to set es.* properties as long as they appear in the init.hive file. This solves my problem. Thanks, Jinyuan (Jack) Zhou On Sun, Jun 15, 2014 at 10:24 AM, Jinyuan Zhou zhou.jiny...@gmail.com wrote: Thanks Costin, I am aiming at modifying the existing

Re: better places to store es.nodes and es.port in ES Hive integration?

2014-06-15 Thread Jinyuan Zhou
and thus its changes discarded. Cheers, On 6/14/14 1:57 AM, Jinyuan Zhou wrote: Hi, I am playing with elasticsearch and hive integration. The documentation says to set configuration like es.nodes, es.port in TBLPROPERTIES. It works. But it can cause a lot of redundant code. If I have ten data

better places to store es.nodes and es.port in ES Hive integration?

2014-06-13 Thread Jinyuan Zhou
Hi, I am playing with elasticsearch and hive integration. The documentation says to set configuration like es.nodes, es.port in TBLPROPERTIES. It works. But it can cause a lot of redundant code. If I have ten data sets to index to the same es cluster, I would have to repeat this information ten

How to set replication to async for an index

2014-06-09 Thread Jinyuan Zhou
Hi there, I am new to ES. I understand the concept but cannot find documentation about where to put the setting. Thanks, Jack

Re: How to set replication to async for an index

2014-06-09 Thread Jinyuan Zhou
June 2014 11:01, Jinyuan Zhou zhou.jiny...@gmail.com wrote: Hi there, I am new to ES. I understand the concept but cannot find documentation about where to put the setting. Thanks, Jack

Re: beginner question

2014-04-26 Thread Jinyuan Zhou
Solr Elasticsearch Support http://sematext.com/ Hi, Can you try the insert using POST and paste the result. Thanks Vineeth On Sat, Apr 26, 2014 at 9:23 PM, Jinyuan Zhou zhou.j...@gmail.com wrote: I am following the definitive guide. I added this docume

Re: beginner question

2014-04-26 Thread Jinyuan Zhou
On 26 Apr. 2014 at 17:53, Jinyuan Zhou zhou.j...@gmail.com wrote: I am following the definitive guide. I added this document by issuing this in Sense: PUT /megacorp/employee/1 { first_name : John, last_name : Smith, age :25, about : I love to go