Re: [Gluster-devel] Handling locks in NSR

Avra Sengupta Wed, 02 Mar 2016 02:44:08 -0800

On 03/02/2016 04:03 PM, Atin Mukherjee wrote:

-Atin
Sent from one plus one
On 02-Mar-2016 3:41 pm, "Avra Sengupta" <aseng...@redhat.com<mailto:aseng...@redhat.com>> wrote:
>
> On 03/02/2016 02:55 PM, Venky Shankar wrote:
>>
>> On Wed, Mar 02, 2016 at 02:29:26PM +0530, Avra Sengupta wrote:
>>>
>>> On 03/02/2016 02:02 PM, Venky Shankar wrote:
>>>>
>>>> On Wed, Mar 02, 2016 at 01:40:08PM +0530, Avra Sengupta wrote:
>>>>>
>>>>> Hi,
>>>>>
>>>>> All fops in NSR, follow a specific workflow as described in thisUML(https://docs.google.com/presentation/d/1lxwox72n6ovfOwzmdlNCZBJ5vQcCaONvZva0aLWKUqk/edit?usp=sharing).>>>>> However all locking fops will follow a slightly differentworkflow as>>>>> described below. This is a first proposed draft for handlinglocks, and we
>>>>> would like to hear your concerns and queries regarding the same.
>>>>>
>>>>> 1. On receiving the lock, the leader will Journal the lockhimself, and then>>>>> try to actually acquire the lock. At this point in time, if itfails to>>>>> acquire the lock, then it will invalidate the journal entry, andreturn a>>>>> -ve ack back to the client. However, if it is successful inacquiring the>>>>> lock, it will mark the journal entry as complete, and forwardthe fop to the
>>>>> followers.
>>>>
>>>> So, does a contending non-blocking lock operation check only onthe leader>>>> since the followers might have not yet ack'd an earlier lockoperation?
>>>
>>> A non-blocking lock follows the same work flow, and thereby checkson the>>> leader first. In this case, it would be blocked on the leader,till the
>>> leader releases the lock. Then it will follow the same workflow.
>>
>> A non-blocking lock should ideally return EAGAIN if the region isalready locked.>> Checking just on the leader (posix/locks on the leader serverstack) and returning>> EAGAIN is kind of incomplete as the earlier lock request might nothave been granted
>> (due to failure on followers).
>>
>> or does it even matter if we return EAGAIN during the transient state?
>>
>> We could block the lock on the leader until an earlier lock requestis satisfied>> (in which case return EAGAIN) or in case of failure try to satisfythe lock request.
>
> That is what I said, it will be blocked on the leader till theleader releases the already held lock.
>
>>
>>>>> 2. The followers on receiving the fop, will journal it, and thentry to>>>>> actually acquire the lock. If it fails to acquire the lock, thenit will>>>>> invalidate the journal entry, and return a -ve ack back to theleader. If it>>>>> is successful in acquiring the lock, it will mark the journalentry as
>>>>> complete,and send a +ve ack to the leader.
>>>>>
>>>>> 3. The leader on receiving all acks, will perform a quorumcheck. If quorum>>>>> meets, it will send a +ve ack to the client. If the quorumfails, it will
>>>>> send a rollback to the followers.
>>>>>
>>>>> 4. The followers on receiving the rollback, will journal itfirst, and then>>>>> release the acquired lock. It will update the rollback entry inthe journal
>>>>> as complete and send an ack to the leader.
>>>>
>>>> What happens if the rollback fails for whatever reason?
>>>
>>> The leader receives a -ve rollback ack, but there's little it cando about>>> it. Depending on the failure, it will be resolved duringreconciliation
>>>>>
>>>>> 5. The leader on receiving the rollback acks, will journal it's own
>>>>> rollback, and then release the acquired lock. It will update therollback
>>>>> entry in the journal, and send a -ve ack to the client.
>>>>>
>>>>> Few things to be noted in the above workflow are:
>>>>> 1. It will be a synchronous operation, across the replica volume.
>
> Atin, I am not sure how AFR handles it.
If AFR/EC handle them asynchronously do you see any performancebottleneck with NSR for this case?

Well it's not synchronous to the point that the follwers would performit one after the other. AFR/EC clients would also have to wait for acksfrom a quorum of servers till they can ack the client. The same is truewith the NSR leader, who will have to wait till it gets ack from aquorum of followers.

>>>>> 2. Reconciliation will take care of nodes who have missed outthe locks.>>>>> 3. On a client disconnect, there will be a lock-timeout on whoseexpiration

>>>>> all locks held by that particular client will be released.
>>>>>
>>>>> Regards,
>>>>> Avra
>>>>> _______________________________________________
>>>>> Gluster-devel mailing list
>>>>> Gluster-devel@gluster.org <mailto:Gluster-devel@gluster.org>
>>>>> http://www.gluster.org/mailman/listinfo/gluster-devel
>
>
> _______________________________________________
> Gluster-devel mailing list
> Gluster-devel@gluster.org <mailto:Gluster-devel@gluster.org>
> http://www.gluster.org/mailman/listinfo/gluster-devel

_______________________________________________
Gluster-devel mailing list
Gluster-devel@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-devel

Re: [Gluster-devel] Handling locks in NSR

Reply via email to