[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-13 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15419878#comment-15419878
 ] 

ASF subversion and git services commented on SOLR-9320:
---

Commit 5fc35a65c39b7fb4ca49675d3ef9cd7f9d6c0fa8 in lucene-solr's branch 
refs/heads/branch_6x from [~varunthacker]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=5fc35a6 ]

SOLR-9320: Improve logging


> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
>Assignee: Noble Paul
> Fix For: 6.2, master (7.0)
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320_followup.patch, 
> SOLR-9320_followup.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-13 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15419877#comment-15419877
 ] 

ASF subversion and git services commented on SOLR-9320:
---

Commit 70d27aec83f9257da459f157acd9fc70764f7195 in lucene-solr's branch 
refs/heads/master from [~varunthacker]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=70d27ae ]

SOLR-9320: Improve logging


> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
>Assignee: Noble Paul
> Fix For: 6.2, master (7.0)
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320_followup.patch, 
> SOLR-9320_followup.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-11 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15416675#comment-15416675
 ] 

ASF subversion and git services commented on SOLR-9320:
---

Commit 4bd7d7fadbc72883f8ec9f2266daf0cd1f50b514 in lucene-solr's branch 
refs/heads/branch_6x from [~noble.paul]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=4bd7d7f ]

SOLR-9320: SOLR-9318 The last commit screwed up


> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
>Assignee: Noble Paul
> Fix For: 6.2, master (7.0)
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-11 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15416658#comment-15416658
 ] 

ASF subversion and git services commented on SOLR-9320:
---

Commit 519be6acf0866ce2f27e825b5b0035d39147af0b in lucene-solr's branch 
refs/heads/branch_6x from [~noble.paul]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=519be6a ]

SOLR-9320: A REPLACENODE command to decommission an existing node with another 
new node and SOLR-9318 the DELETENODE command that deletes all replicas in a 
node


> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
>Assignee: Noble Paul
> Fix For: 6.2, master (7.0)
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-11 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15416650#comment-15416650
 ] 

ASF subversion and git services commented on SOLR-9320:
---

Commit ae60c74f8c6ea2f62e1870802accbcd73bbfdc48 in lucene-solr's branch 
refs/heads/master from [~noble.paul]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=ae60c74 ]

SOLR-9320: A REPLACENODE command to decommission an existing node with another 
new node and SOLR-9318 the DELETENODE command that deletes all replicas in a 
node


> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-09 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15413372#comment-15413372
 ] 

Noble Paul commented on SOLR-9320:
--

We would need a node level locking, ideally. But, we don't have such a thing now

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-09 Thread Varun Thacker (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15413326#comment-15413326
 ] 

Varun Thacker commented on SOLR-9320:
-

bq. REPLACENNODE just adds/removes replicas, so. it is no big deal. The 
operation happens across collections, so we have no idea what to lock here


Won't we need to block updates to all collection that belongs to the source 
node? What if we are doing a split shard - Replace node might copy the wrong 
data?

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-09 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15413307#comment-15413307
 ] 

Noble Paul commented on SOLR-9320:
--

bq.In CollectionParams, should REPLACENODE have lock level as none? What if we 
are replicaing a node and someone fires a split shard request? To be correct we 
need to lock right?

REPLACENNODE just adds/removes replicas, so. it is no big deal. The operation 
happens across collections, so we have no idea what to lock here

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-09 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15413069#comment-15413069
 ] 

Noble Paul commented on SOLR-9320:
--

Thanks [~varunthacker]
bq.CollectionAdminRequest#DeleteNode and ReplaceNode - Can we add a little bit 
of Javadocs explaining what the parameters are?

sure.

bq.Any reason why we needed to change modify 
AsyncCollectionAdminRequest#setAsyncId ?

Yeah. Keeping that abstract was bad. That is deprecated

bq.In OverseerCollectionMessageHandler#addReplica can't we make node and 
coreName final to start with?

No, they are modified in between

bq.The logging in this patch needs to be improved. It's too verbose and 
redundant.


Logging is screwed up. I added them for my testing. I shall clean them up.


> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-09 Thread Varun Thacker (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15412998#comment-15412998
 ] 

Varun Thacker commented on SOLR-9320:
-

Patch looks good!  Here are a few comments on that patch

Any reason why we needed to change modify 
AsyncCollectionAdminRequest#setAsyncId ?

CollectionAdminRequest#DeleteNode and ReplaceNode - Can we add a little bit of 
Javadocs explaining what the parameters are?

In CollectionParams, should REPLACENODE have lock level as none? What if we are 
replicaing a node and someone fires a split shard request? To be correct we 
need to lock right?

In OverseerCollectionMessageHandler#addReplica can't we make node and coreName 
final to start with?
{code}
final String fnode = node;
final String fcoreName = coreName;
{code}

Can the tests please extend SolrCloudTestCase ? On my machine it takes one min 
for RepliceNodeTest to run. I'd guess it will be a lot faster when its using 
SolrCloudTestCase

The logging in this patch needs to be improved. It's too verbose and redundant.

For example these log line is so verbose and adds little value -

{code}
42134 INFO  (TEST-ReplaceNodeTest.test-seed#[F960E2C81499AC90]) [] 
o.a.s.c.ReplaceNodeTest excluded_node : 127.0.0.1:51079_x%2Ftc  coll_state : {
  "replicationFactor":"2",
  "shards":{
"shard1":{
  "range":"8000-b332",
  "state":"active",
  "replicas":{
"core_node1":{
  "core":"replacenodetest_coll_shard1_replica2",
  "base_url":"http://127.0.0.1:51075/x/tc;,
  "node_name":"127.0.0.1:51075_x%2Ftc",
  "state":"active",
  "leader":"true"},
"core_node5":{
  "core":"replacenodetest_coll_shard1_replica1",
  "base_url":"http://127.0.0.1:51089/x/tc;,
  "node_name":"127.0.0.1:51089_x%2Ftc",
  "state":"active"}}},
"shard2":{
  "range":"b333-e665",
  "state":"active",
  "replicas":{
"core_node2":{
  "core":"replacenodetest_coll_shard2_replica1",
  "base_url":"http://127.0.0.1:51084/x/tc;,
  "node_name":"127.0.0.1:51084_x%2Ftc",
  "state":"active"},
"core_node9":{
  "core":"replacenodetest_coll_shard2_replica2",
  "base_url":"http://127.0.0.1:51094/x/tc;,
  "node_name":"127.0.0.1:51094_x%2Ftc",
  "state":"active",
  "leader":"true"}}},
"shard3":{
  "range":"e666-1998",
  "state":"active",
  "replicas":{
"core_node6":{
  "core":"replacenodetest_coll_shard3_replica2",
  "base_url":"http://127.0.0.1:51071/x/tc;,
  "node_name":"127.0.0.1:51071_x%2Ftc",
  "state":"active"},
"core_node8":{
  "core":"replacenodetest_coll_shard3_replica1",
  "base_url":"http://127.0.0.1:51065/x/tc;,
  "node_name":"127.0.0.1:51065_x%2Ftc",
  "state":"active",
  "leader":"true"}}},
"shard4":{
  "range":"1999-4ccb",
  "state":"active",
  "replicas":{
"core_node3":{
  "core":"replacenodetest_coll_shard4_replica2",
  "base_url":"http://127.0.0.1:51075/x/tc;,
  "node_name":"127.0.0.1:51075_x%2Ftc",
  "state":"active",
  "leader":"true"},
"core_node7":{
  "core":"replacenodetest_coll_shard4_replica1",
  "base_url":"http://127.0.0.1:51089/x/tc;,
  "node_name":"127.0.0.1:51089_x%2Ftc",
  "state":"active"}}},
"shard5":{
  "range":"4ccc-7fff",
  "state":"active",
  "replicas":{
"core_node4":{
  "core":"replacenodetest_coll_shard5_replica1",
  "base_url":"http://127.0.0.1:51084/x/tc;,
  "node_name":"127.0.0.1:51084_x%2Ftc",
  "state":"active",
  "leader":"true"},
"core_node10":{
  "core":"replacenodetest_coll_shard5_replica2",
  "base_url":"http://127.0.0.1:51094/x/tc;,
  "node_name":"127.0.0.1:51094_x%2Ftc",
  "state":"active",
  "router":{"name":"compositeId"},
  "maxShardsPerNode":"3",
  "autoAddReplicas":"false"}
{code}

On the redundant part:

In this patch I guess the purpose of adding logging to 
OverseerCollectionMessageHandler#deleteReplica and 
OverseerCollectionMessageHandler#addReplica was to be able to see when 
replicanode/deletenode calles addReplica and deleteReplica.

That log line in ReplaceNodeCmd should be like - "calling addReplica for 
collection=[] shards=[] on node=[]" . We shouldn't add a generic line to 
OverseerCollectionMessageHandler#addReplica . That will make anyone use 
AddReplica directly to see the same entry being logged twice ( the overseer 
logs it automatically and second one is the new line that is added in this 
patch) . Also in this patch its logged as {{log.info("going to create replica 
{}", Utils.toJSONString(sourceReplica));}} which is tougher to read then the 
approach 

[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-08 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15412691#comment-15412691
 ] 

Nitin Sharma commented on SOLR-9320:


Most of the patch looks good. In ReplaceNodeCmd, instead of calling 
DeleteNodeCmd.deletereplicas, can you just called DELETENODE cmd on the source 
node itself? Looks much cleaner that way and one command doesnt need to know 
the details of the other cmd. 

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-08 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15411693#comment-15411693
 ] 

Noble Paul commented on SOLR-9320:
--

this contains SOLR-9318 as well

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-06 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15410678#comment-15410678
 ] 

Nitin Sharma commented on SOLR-9320:


Functionality seems to be fine to me. The spec you mentioned in the description 
was that REPLACE does a create and then invokes a DELETENODE. The recent patch 
seems to do iterate and do a delete replica instead of having a DELETENODE 
action and calling that. Hence the original question. 

I tried the first one and it works fine. I will try your new patch. 

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-05 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15410477#comment-15410477
 ] 

Noble Paul commented on SOLR-9320:
--

Is the current patch missing anything?

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-05 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15410472#comment-15410472
 ] 

Nitin Sharma commented on SOLR-9320:


The patch i attached follows the original spec you proposed. Replace node after 
creating new replicas will call "DELETENODE" on the destination host which will 
remove all cores on the node. Can you merge this with that as well? Thanks. 

I will patch this on top of 6.1 and give it a shot



> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-02 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404513#comment-15404513
 ] 

Noble Paul commented on SOLR-9320:
--

The best we can do is pass an extra parameter to speed up the execution (say 
parallel=true). But the real risk of screwing up the node/network exists

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-02 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404456#comment-15404456
 ] 

Nitin Sharma commented on SOLR-9320:


Would be good to establish context on when this command should/can be run. In 
our current version of rebalance, we use this to switch serving nodes in 
production and hence the time taken to migrate a node is limited (time bound). 
If this is considered an offline process, then may be serial works just fine. 

I agree with Erick on throttling but we can do that at a system level (unix 
traffic shaping tc etc). That will leave the bandwidth management outside the 
responsibility of solr. The same node serving solr can be used to run other 
services in some production setups. Keeping that outside solr will put that 
onus on the maintainer.  Just my $0.02

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-02 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404435#comment-15404435
 ] 

Noble Paul commented on SOLR-9320:
--

bq.And doing one after the other could still be throttled by maxWriteMBPerSec

The problem is, we can't do that from the {{REPLACENODE}} command. The 
downloads happen in a different thread from the target node 

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-02 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404430#comment-15404430
 ] 

Erick Erickson commented on SOLR-9320:
--

[~noble.paul] That certainly works. My goal was just to be sure bandwidth 
figured in to the design if people wanted to multi-thread.

And doing one after the other could still be throttled by maxWriteMBPerSec if a 
single node overwhelmed the network or disks (I've actually seen this in the 
wild FWIW).



> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-02 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404382#comment-15404382
 ] 

Noble Paul commented on SOLR-9320:
--

[~erickerickson] I'm inclined to create cores one after another. Even if takes 
some time, it is better to ensure that we don't overload that one node. This is 
an admin command which is issued asynchronously. So, there is no harm even if 
it takes a while.

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-02 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404292#comment-15404292
 ] 

Erick Erickson commented on SOLR-9320:
--

If we're going to take advantage of multithreading we need to take some care to 
throttle replication. If we try to copy 500 cores' indexes to some other node 
all at once with 500 separate threads I'd worry about network bandwidth issues. 
I've seen saturated I/O cause nodes to go into recovery and the like. Not to 
mention beating up the disks on both machines.

The number of replace operations to carry out in parallel is probably the 
easiest. I can think of two tuning parameters, max # of simultaneous threads 
and max bandwidth consumption. 

Thinking about it for a bit, though, the bandwidth parameter seems like it's 
difficult to do well. There'd have to be some kind of cross-copy communications 
I should think. And other ugliness. I suppose one could get this behavior on a 
case-by-case basis by 
1> specifying the max # of replace ops
2> specifying ${maxWriteMbPerSec:10} in 
the replication handler and overriding with a sys var when doing a 
REPLACENODE



> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-01 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403350#comment-15403350
 ] 

Noble Paul commented on SOLR-9320:
--

Yeah, It's not in the patch. What I meant to say was the Multithreading 
framework is there and the code just needs to take advantage of that.

OTOH, I wonder how performant does it need to be ? Replacing a node may take a 
while and as long as it completes without timing out, is it no good enough? The 
health of the system is not degraded or no action is blocked  while the 
REPLACENODE is happening. I think, instead of creating 500 cores on the same 
node (vying for the same disk/CPU) it is better to do it serially

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-01 Thread Shalin Shekhar Mangar (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403340#comment-15403340
 ] 

Shalin Shekhar Mangar commented on SOLR-9320:
-

bq. I don't think so. The create replica call can be async as well.

It can be but in your patch it is not. The addReplica method call will wait for 
the async core create call to complete and also for the new replica to be 
visible in cluster state. Similarly, the deleteReplica method waits for the 
async core unload to complete and wait for coreNode name to be deleted from the 
cluster state.

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-01 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403331#comment-15403331
 ] 

Noble Paul commented on SOLR-9320:
--

I don't think so. The create replica call can be async as well.

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-01 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403326#comment-15403326
 ] 

Nitin Sharma commented on SOLR-9320:


Thanks for clarifying on async. 

Reg 500 collections: When you run replace node on a node x and the node happens 
to have 500 cores, then all these cores need to be moved to the destination 
(and then removed from source). If you have too many cores, adding a new 
replica for every core on the destination host will take time. Wondering if you 
ran any tests with such high capacity

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-01 Thread Shalin Shekhar Mangar (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403323#comment-15403323
 ] 

Shalin Shekhar Mangar commented on SOLR-9320:
-

bq. Multithreading is already of part of OverseerCollectionMessageHandler

That is only for executing collection APIs concurrently. It does not help when 
you are trying to add replica to target and remove replica from source serially 
for each replicas on a particular node. Nitin is right that it should be 
multi-threaded otherwise it will take a long time for a node hosting a lot of 
replicas.

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-01 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403320#comment-15403320
 ] 

Noble Paul commented on SOLR-9320:
--

Multithreading is already of part of {{OverseerCollectionMessageHandler}} . You 
just have to invoke the command with the {{async}} flag and It will do 
everything in a thread of its own

 BTW Why would you run {{REPLACENODE}} with 500 collections? It runs one node 
at a time , right?

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-08-01 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15402587#comment-15402587
 ] 

Nitin Sharma commented on SOLR-9320:


I have a concern when this comes to scaling to 100s of collections. If you have 
100+ collections, this command will eventually timeout (> 3 mins) and this does 
this in serial. The patch i sent (has multi threading capability), I ran with 
500 collections and it finished around 4+ mins. Can you share the numbers with 
collections of that scale?

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch, 
> SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-27 Thread Anshum Gupta (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15396244#comment-15396244
 ] 

Anshum Gupta commented on SOLR-9320:


I am having issues applying this patch against master. 
I tried to take a look at this without applying the patch and the patch does 
add complexity, as Noble pointed out.

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-27 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15396174#comment-15396174
 ] 

Nitin Sharma commented on SOLR-9320:


This has the patch for both REPLACENODE and DELETENODE inside it. I can reduce 
the num files again and re-upload if you like. 

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-27 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15396181#comment-15396181
 ] 

Noble Paul commented on SOLR-9320:
--

If I'm not wrong you just need to add one method. This is a trivial 
functionality

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-27 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15396160#comment-15396160
 ] 

Noble Paul commented on SOLR-9320:
--

I really didn't mean that the code was complex to understand. I meant adding so 
many files for such a simple functionality is not worth it. We generally follow 
a pattern for multiple commands. 

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-27 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15396152#comment-15396152
 ] 

Nitin Sharma commented on SOLR-9320:


[~noble.paul] Can you explain about the complex part? The code does what was 
mentioned in the spec. It identifies all the cores on the node to be replaced 
and recreates them on the destination node. After that it calls "DELETENODE" on 
the source node. Which part of this complex? The multithreaded part?

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-27 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15396107#comment-15396107
 ] 

Noble Paul commented on SOLR-9320:
--

REPLACENODE is just like any other command that we already have. This patch 
seems to be too complex for a very simple functionality like that. Please refer 
implementations of other commands.

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
> Attachments: DELETENODE.jpeg, REPLACENODE_After.jpeg, 
> REPLACENODE_Before.jpeg, REPLACENODE_call_response.jpeg, SOLR-9320.patch
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-20 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15386191#comment-15386191
 ] 

Nitin Sharma commented on SOLR-9320:


Thanks for clarifying. I am aligning towards option a) as well (for simplicity 
and performance) but wanted to confirm the semantics. 

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-20 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15386025#comment-15386025
 ] 

Erick Erickson commented on SOLR-9320:
--

(b) would copy the index twice, correct? I've seen (and I wouldn't lie) TB size 
indexes on a single replica so I think that would be prohibitively expensive.

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-19 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15385408#comment-15385408
 ] 

Noble Paul commented on SOLR-9320:
--

bq.That will leave uneven naming conventions in the cluster.

replica names do not matter. You can create a new replica in any name as you 
want

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9320) A REPLACENODE command to decommission an existing node with another new node

2016-07-19 Thread Nitin Sharma (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15385323#comment-15385323
 ] 

Nitin Sharma commented on SOLR-9320:


Clarification on this. When you mean recreate, does the naming matter? Lets say 
x_shard1_replica1 is on source node, if you want to move it to destination 
node, we can
a)  either create a new replica (x_shard1_replica2) and delete the source. That 
will leave uneven naming conventions in the cluster. (there will not be a 
replica1 but a replica2). 
b) Preserve the exact same name as the replica in source node. We can achieve 
this by creating a temp replica on destination first, deleting the  replica on 
source,  recreating the replica (with same name) on destination and then 
cleaning up the temp. 


Option (b) can be thought of as a migrate core. 

Let me know which sounds more usable. 

> A REPLACENODE command to decommission an existing node with another new node
> 
>
> Key: SOLR-9320
> URL: https://issues.apache.org/jira/browse/SOLR-9320
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Reporter: Noble Paul
> Fix For: 6.1
>
>
> The command should accept a source node and target node. recreate the 
> replicas in source node in the target and do a DLETENODE of source node



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org