On 6/25/25 10:43, Stefano Garzarella wrote:
> On Fri, Jun 20, 2025 at 09:52:43PM +0200, Michal Luczaj wrote:
>> vsock_find_cid() and vsock_dev_do_ioctl() may race with module unload.
>> transport_{g2h,h2g} may become NULL after the NULL check.
>>
>> Introduce vsock_transport_local_cid() to protect from a potential
>> null-ptr-deref.
>>
>> KASAN: null-ptr-deref in range [0x0000000000000118-0x000000000000011f]
>> RIP: 0010:vsock_find_cid+0x47/0x90
>> Call Trace:
>> __vsock_bind+0x4b2/0x720
>> vsock_bind+0x90/0xe0
>> __sys_bind+0x14d/0x1e0
>> __x64_sys_bind+0x6e/0xc0
>> do_syscall_64+0x92/0x1c0
>> entry_SYSCALL_64_after_hwframe+0x4b/0x53
>>
>> KASAN: null-ptr-deref in range [0x0000000000000118-0x000000000000011f]
>> RIP: 0010:vsock_dev_do_ioctl.isra.0+0x58/0xf0
>> Call Trace:
>> __x64_sys_ioctl+0x12d/0x190
>> do_syscall_64+0x92/0x1c0
>> entry_SYSCALL_64_after_hwframe+0x4b/0x53
>>
>> Fixes: c0cfa2d8a788 ("vsock: add multi-transports support")
>> Suggested-by: Stefano Garzarella <sgarz...@redhat.com>
>> Signed-off-by: Michal Luczaj <m...@rbox.co>
>> ---
>> net/vmw_vsock/af_vsock.c | 23 +++++++++++++++++------
>> 1 file changed, 17 insertions(+), 6 deletions(-)
>>
>> diff --git a/net/vmw_vsock/af_vsock.c b/net/vmw_vsock/af_vsock.c
>> index 
>> 2e7a3034e965db30b6ee295370d866e6d8b1c341..63a920af5bfe6960306a3e5eeae0cbf30648985e
>>  100644
>> --- a/net/vmw_vsock/af_vsock.c
>> +++ b/net/vmw_vsock/af_vsock.c
>> @@ -531,9 +531,21 @@ int vsock_assign_transport(struct vsock_sock *vsk, 
>> struct vsock_sock *psk)
>> }
>> EXPORT_SYMBOL_GPL(vsock_assign_transport);
>>
>> +static u32 vsock_transport_local_cid(const struct vsock_transport 
>> **transport)
> 
> Why we need double pointer?

Because of a possible race. If @transport is `struct vsock_transport*` and
we pass `transport_g2h`, the passed non-NULL pointer value may immediately
become stale (due to module unload). But if it's `vsock_transport**` and we
pass `&transport_g2h`, then we can take the mutex, check `*transport` for
NULL and safely go ahead.

Or are you saying this could be simplified?

>> +{
>> +    u32 cid = VMADDR_CID_ANY;
>> +
>> +    mutex_lock(&vsock_register_mutex);
>> +    if (*transport)
>> +            cid = (*transport)->get_local_cid();
>> +    mutex_unlock(&vsock_register_mutex);
>> +
>> +    return cid;
>> +}
>> +
>> bool vsock_find_cid(unsigned int cid)
>> {
>> -    if (transport_g2h && cid == transport_g2h->get_local_cid())
>> +    if (cid == vsock_transport_local_cid(&transport_g2h))
>>              return true;
>>
>>      if (transport_h2g && cid == VMADDR_CID_HOST)
>> @@ -2536,18 +2548,17 @@ static long vsock_dev_do_ioctl(struct file *filp,
>>                             unsigned int cmd, void __user *ptr)
>> {
>>      u32 __user *p = ptr;
>> -    u32 cid = VMADDR_CID_ANY;
>>      int retval = 0;
>> +    u32 cid;
>>
>>      switch (cmd) {
>>      case IOCTL_VM_SOCKETS_GET_LOCAL_CID:
>>              /* To be compatible with the VMCI behavior, we prioritize the
>>               * guest CID instead of well-know host CID (VMADDR_CID_HOST).
>>               */
>> -            if (transport_g2h)
>> -                    cid = transport_g2h->get_local_cid();
>> -            else if (transport_h2g)
>> -                    cid = transport_h2g->get_local_cid();
>> +            cid = vsock_transport_local_cid(&transport_g2h);
>> +            if (cid == VMADDR_CID_ANY)
>> +                    cid = vsock_transport_local_cid(&transport_h2g);
> 
> I still prefer the old `if ... else if ...`, what is the reason of this
> change? I may miss the point.

Ah, ok, I've just thought such cascade would be cleaner.

So is this what you prefer?

if (transport_g2h)
        cid = vsock_transport_local_cid(&transport_g2h);
else if (transport_h2g)
        cid = vsock_transport_local_cid(&transport_h2g);

Thanks,
Michal


Reply via email to