> I guess revert what we did (unpin) and return an error. The interesting 
> question is what can make migration/isolation fail

OK, I will make the necessary changes so that errors are handled
properly. Whatever causes a failure, we will find out when it happens
and the error is returned. I think I will replace the infinite retry
that we currently have with a 10-times retry, like the one we already
have in the hot-remove path.
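
A minimal sketch of that flow (bounded retry, then unpin and return the
error), written as plain userspace C rather than actual kernel code.
Every name here (MIGRATE_MAX_RETRIES, struct pin_ops, struct fake_ctx,
and so on) is hypothetical and only stands in for the real
pin/migrate/unpin steps; the fake callbacks exist purely so the sketch
can be exercised on its own.

```c
#include <errno.h>
#include <stddef.h>

/* Hypothetical limit, mirroring the "10 times" mentioned above. */
#define MIGRATE_MAX_RETRIES 10

/* Hypothetical callbacks standing in for the real migrate/unpin steps. */
struct pin_ops {
	int  (*migrate)(void *ctx);	/* 0 on success, negative errno on failure */
	void (*unpin)(void *ctx);	/* revert the pin taken before migration */
};

/*
 * Try to migrate up to MIGRATE_MAX_RETRIES times. If every attempt
 * fails, revert the pin and hand the last error back to the caller
 * instead of looping forever.
 */
int migrate_with_bounded_retry(const struct pin_ops *ops, void *ctx)
{
	int ret = -EAGAIN;

	for (int attempt = 0; attempt < MIGRATE_MAX_RETRIES; attempt++) {
		ret = ops->migrate(ctx);
		if (ret == 0)
			return 0;
	}

	ops->unpin(ctx);
	return ret;
}

/* Fake backend (assumption, for illustration only). */
struct fake_ctx {
	int failures_left;	/* how many migrate calls should still fail */
	int attempts;		/* number of migrate calls made so far */
	int unpinned;		/* set to 1 once unpin has been called */
};

int fake_migrate(void *p)
{
	struct fake_ctx *c = p;

	c->attempts++;
	if (c->failures_left > 0) {
		c->failures_left--;
		return -EBUSY;
	}
	return 0;
}

void fake_unpin(void *p)
{
	struct fake_ctx *c = p;

	c->unpinned = 1;
}

const struct pin_ops fake_ops = {
	.migrate = fake_migrate,
	.unpin   = fake_unpin,
};
```

With this shape, a transient failure that clears within the retry budget
is invisible to the caller, while a persistent one surfaces as an error
with the pin already reverted.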

>
> a) out of memory: smells like a zone setup issue. Failures are acceptable I 
> guess.
>
> b) short term pinnings: process dying - not relevant I guess. Other cases? 
> (Fork?)
>
> c) ?
>
> Once we clarified that, we actually know how likely it will be to return an 
> error (and making vfio pinnings fail etc).
