Re: Need Clarification on Shards Replication

2014-01-03 Thread David Pilato
Until existing(not newly created) segments are fully copied to the new machine 
, no indexing operation will happen on replica shard right ? rather it notes 
down those new indexing documents in transaction log alone ?((Correct me if i 
am wrong))
Once all segments are copied it replays the transaction logs . if so no new 
documents visible for search , till segments copying process over. is it right 
? 

Correct. Replica shard won't be in STARTED state so it won't be searchable.

indices.store.throttle.type: merge  indices.store.throttle.max_bytes_per_sec: 
50mb these properties related to only lucene segment merges alone am i right ?
See whole definition: https://github.com/elasticsearch/elasticsearch/issues/2041



HTH

David

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/etPan.52c69681.643c9869.11bb1%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/groups/opt_out.


Re: Need Clarification on Shards Replication

2014-01-02 Thread Alexander Reelsen
Hey,

replication is done per document (as opposed to relocation). So the
document is indexed on the primary first, and if it was successful there,
the document is indexed on all replicas of a shard in parallel. If that
index operation on the replica(s) has returned, the index requests is
returned to the client.

The throttling of merges (which is a heavy I/O and CPU intensive background
process) ensures, you have enough I/O performance available for index and
search operations.

Hope this helps...


--Alex


On Thu, Jan 2, 2014 at 6:43 AM, Anantha Govindarajan 
ananthagovindara...@gmail.com wrote:

 I have one es master and data-node and indexing documents to that (1 shard
 + 1 Replica), after indexing few documents (say 1 million and still
 indexing docs), adding one more data node to the cluster , now the shards
 started replicating to new node. How this replication happens ?  In the
 mean i am still indexing new documents to that index.

1. Whether datanode1 will send index segments to datanode2 ?
2. Whether datanode1 will send documents one by one (as IndexRequests)
to datanode2 instead of copying segments ?
3. Whether datanode1 will send whole index to datanode2 ?


 How will *indices.store.throttle.type: merge
  indices.store.throttle.max_bytes_per_sec: 50mb* these settings react
 with respect to the above test scenario ?



 Anantha Govindarajan.

 --
 You received this message because you are subscribed to the Google Groups
 elasticsearch group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to elasticsearch+unsubscr...@googlegroups.com.
 To view this discussion on the web visit
 https://groups.google.com/d/msgid/elasticsearch/326cfecc-b59c-4e4c-b5e9-e369e841a02e%40googlegroups.com
 .
 For more options, visit https://groups.google.com/groups/opt_out.


-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAGCwEM_EZ_bb1hsVpLyW7Pt0UWM47GRU2iuQt_mJPV0xoO5iSQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


Re: Need Clarification on Shards Replication

2014-01-02 Thread Anantha Govindarajan
Hi Alex 

Thanks for replying. If i understand correctly normal indexing flow is,


   - Document is indexed in primary shard machine , then replica shard 
   machine then return the index response  to client - in case of 
   ReplicationType.SYNC.
   - Document is indexed in primary shard machine , then sent it to replica 
   machine(s) if available, and wont wait for response - in case of 
   ReplicationType.ASYNC.

But my question is not normal indexing flow. I have already indexed 1 
million documents in primary shard alone , at that moment no node is 
available for replica.

after some time adding a machine to cluster , at this point new indexing 
documents follows normal indexing flow (am i right ? Not sure !). But my 
question is how existing 1 million documents in primary shard is replicated 
to new machine ? 

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/dd473bdb-8603-44b2-a59c-0a8f3033ad0d%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


Re: Need Clarification on Shards Replication

2014-01-02 Thread David Pilato
It's relocation. Segments are copied over the wire. New updates/insert/delete 
operations which happen in the meantime are replayed from the transaction log 
on the new shard. 

HTH

--
David ;-)
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 2 janv. 2014 à 10:39, Anantha Govindarajan ananthagovindara...@gmail.com a 
écrit :

 Hi Alex 
 
 Thanks for replying. If i understand correctly normal indexing flow is,
 
 Document is indexed in primary shard machine , then replica shard machine 
 then return the index response  to client - in case of ReplicationType.SYNC.
 Document is indexed in primary shard machine , then sent it to replica 
 machine(s) if available, and wont wait for response - in case of 
 ReplicationType.ASYNC.
 But my question is not normal indexing flow. I have already indexed 1 million 
 documents in primary shard alone , at that moment no node is available for 
 replica.
 
 after some time adding a machine to cluster , at this point new indexing 
 documents follows normal indexing flow (am i right ? Not sure !). But my 
 question is how existing 1 million documents in primary shard is replicated 
 to new machine ? 
 -- 
 You received this message because you are subscribed to the Google Groups 
 elasticsearch group.
 To unsubscribe from this group and stop receiving emails from it, send an 
 email to elasticsearch+unsubscr...@googlegroups.com.
 To view this discussion on the web visit 
 https://groups.google.com/d/msgid/elasticsearch/dd473bdb-8603-44b2-a59c-0a8f3033ad0d%40googlegroups.com.
 For more options, visit https://groups.google.com/groups/opt_out.

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/2DA64523-60B8-4A18-86C7-4A737FADD6B1%40pilato.fr.
For more options, visit https://groups.google.com/groups/opt_out.


Re: Need Clarification on Shards Replication

2014-01-02 Thread Anantha Govindarajan
Hi David , 

Thanks for your reply . 

Until existing(not newly created) segments are fully copied to the new 
machine , no indexing operation will happen on replica shard right ? rather 
it notes down those new indexing documents in transaction log alone ?((Correct 
me if i am wrong))

Once all segments are copied it replays the transaction logs . if so no new 
documents visible for search , till segments copying process over. is it 
right ? 

*indices.store.throttle.type: merge 
 indices.store.throttle.max_bytes_per_sec: 50mb *these properties related 
to only lucene segment merges alone am i right ?


Ananth

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/051a21a8-ba9f-4401-84ce-fce31a28b5fc%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.