Re: [pkg-discuss] after upgrade to snv_124 pkg ( cli or GUI ) does not work

Shawn Walker Tue, 29 Sep 2009 11:41:49 -0700

[email protected] wrote:

This might also explain how manifests are getting corrupted in bug 6011.
They parse correctly when we download them, but subsequent re-loads
fail.  Two processes writing to the same file without any locking could
certainly cause this problem.  The manifest code doesn't use a method to
keep its tempfiles unique, so two writers could be modifying the same
file, and then rename it into place.
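
To make the suspected failure mode concrete, here is a minimal sketch of the safer pattern: each writer gets its own uniquely named temporary file, and only a complete file is renamed into place. This is not the actual manifest code; the helper name write_atomically and its arguments are hypothetical.

```python
import os
import tempfile


def write_atomically(path, data):
    """Write `data` (a str) to `path` via a unique temp file, then rename.

    Hypothetical helper for illustration; not the real manifest code.
    """
    directory = os.path.dirname(os.path.abspath(path))
    # mkstemp() creates the file exclusively under a unique name, so two
    # concurrent writers never end up sharing one temporary file.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".tmp-")
    try:
        with os.fdopen(fd, "w") as tmp:
            tmp.write(data)
            tmp.flush()
            os.fsync(tmp.fileno())
        # A rename within one filesystem is atomic on POSIX: readers see
        # either the old contents or the new ones, never a mixture.
        os.replace(tmp_path, path)
    except BaseException:
        # Remove the temp file if anything failed before the rename.
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

With a fixed temporary name instead of mkstemp(), two writers could interleave their writes in the same file and then rename the mixed result into place, which would match the "parses on download, fails on reload" symptom.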


This is an extension of similar comments Shawn made earlier this
morning on bug 11169.

http://defect.opensolaris.org/bz/show_bug.cgi?id=11169#c4

I should note that it's just a theory, but I think it's a sound one.
The other problem in being able to consistently reproduce this is that
the cron entry for the update-refresh script:


30 0,9,12,18,21 * * * /usr/lib/update-manager/update-refresh.sh

...means that it will only be triggered (if I remember how to read
crontab correctly :P) at the half-hour mark for the hours 0, 9, 12, 18,
and 21. In addition to that, there is a "dither" that the
update-refresh.sh script adds to attempt to introduce an additional
random amount of delay before the update refresh is actually performed
to prevent all clients from accessing the server at the same time.
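
As a rough illustration of why the timing is hard to hit, the sketch below models the crontab entry plus a random dither. The hours and minute come from the crontab line above; the upper bound MAX_DITHER_SECONDS and both function names are assumptions, since the real value used by update-refresh.sh is not given here.

```python
import random

# From the crontab entry: minute 30 of hours 0, 9, 12, 18 and 21.
CRON_HOURS = (0, 9, 12, 18, 21)
CRON_MINUTE = 30

# Assumed upper bound for the dither; the script's real value is not
# stated in this thread.
MAX_DITHER_SECONDS = 30 * 60


def dither_seconds(rng=random):
    """Return a random delay in [0, MAX_DITHER_SECONDS] seconds."""
    return rng.randint(0, MAX_DITHER_SECONDS)


def refresh_window(hour):
    """Return (earliest, latest) refresh start, in seconds since midnight."""
    if hour not in CRON_HOURS:
        raise ValueError("cron does not fire at hour %d" % hour)
    earliest = hour * 3600 + CRON_MINUTE * 60
    return earliest, earliest + MAX_DITHER_SECONDS
```

Under these assumptions a refresh can only start inside five windows per day, and at an unpredictable point within each one, so another client has to be touching the image at just the wrong moment to collide with it.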

Digging further, I attempted to reproduce this on an upgraded image
(122 -> 124), and while I saw the pkg/catalog directory disappearance
behaviour described, I was unable to get a client to crash. This isn't
surprising given that only specific race condition cases would expose
it.

The good news is that, as far as I can tell, the current image upgrade
code should not leave the system in a bad state. That is, it attempts
to do all of the conversion work first before altering the image
structure, and the very last thing it does is remove the old
/var/pkg/catalog directory.

That should mean that if any of the clients is interrupted before it has
a chance to perform the final step of removing the /var/pkg/catalog
directory, the next time a client runs, it should be able to complete
the upgrade without a problem.
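
The ordering described above can be sketched roughly as follows. This is not the real image upgrade code: the function upgrade_image, the convert callback, the staging directory, and the "state" directory name are all hypothetical. The only point is that the old directory is removed last, so its presence acts as the "upgrade still needed" marker.

```python
import os
import shutil


def upgrade_image(root, convert):
    """Convert the old catalog layout, removing the old directory last.

    `root`, `convert`, and the "catalog"/"state" names are hypothetical
    stand-ins, not the real image layout.
    """
    old_dir = os.path.join(root, "catalog")
    new_dir = os.path.join(root, "state")
    if not os.path.isdir(old_dir):
        return False  # Old directory is gone: already upgraded.

    # Step 1: do all conversion work in a staging area. Leftovers from an
    # interrupted earlier run are discarded and rebuilt.
    staging = new_dir + ".partial"
    shutil.rmtree(staging, ignore_errors=True)
    os.makedirs(staging)
    convert(old_dir, staging)

    # Step 2: put the converted data in place.
    shutil.rmtree(new_dir, ignore_errors=True)
    os.rename(staging, new_dir)

    # Step 3 (last): remove the old directory, marking the upgrade done.
    shutil.rmtree(old_dir)
    return True
```

If a run is interrupted anywhere before step 3, the old directory is still there, so the next client simply redoes the conversion. Note that this makes a single interrupted client safe; it does not by itself protect against two clients running the steps at the same time.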

In a worst-case scenario, should the image upgrade completely and
unexpectedly fail, the user always has the old boot environment to
fall back to.

For now, based on a conversation with Danek, I think release noting this
is sufficient. The long-term solution is to have proper image-locking
mechanisms in the client, since I imagine that this is not the last time
we'll have an image format change and that would solve many other issues
as well.
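
For discussion purposes, here is a minimal sketch of what an image lock could look like, using an advisory flock() on a lock file inside the image. It is only one possible approach, not a proposal for the actual client design; the image_lock name, the blocking parameter, and the ".image.lock" file name are hypothetical, and flock() semantics can differ between platforms and filesystems.

```python
import contextlib
import fcntl
import os


@contextlib.contextmanager
def image_lock(image_root, blocking=True):
    """Hold an exclusive advisory lock on an image for the `with` body.

    The lock file name ".image.lock" is a hypothetical choice.
    """
    lock_path = os.path.join(image_root, ".image.lock")
    # Open for append: creates the file if missing, never truncates it.
    with open(lock_path, "a") as lock_file:
        flags = fcntl.LOCK_EX
        if not blocking:
            flags |= fcntl.LOCK_NB  # Fail at once instead of waiting.
        fcntl.flock(lock_file, flags)  # Raises OSError if busy and LOCK_NB.
        try:
            yield
        finally:
            fcntl.flock(lock_file, fcntl.LOCK_UN)
```

Any operation that modifies the image (an upgrade, a catalog refresh, a manifest write) would then run inside "with image_lock(root):". Since the lock is advisory, it only helps if every client, including the update-refresh job, takes it before touching the image.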


Cheers,
--
Shawn Walker
_______________________________________________
pkg-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/pkg-discuss
