Dear All,






This draft explores a timely intersection of networking and AI. It identifies 
significant network challenges in distributed LLM inference clouds—specifically 
highly concurrent model loading and severe cold start latency caused by massive 
model downloads (70GB to 1TB+). We analyze why these scenarios are a perfect 
fit for multicast and discuss the applicability of PIM-SM, SR P2MP, and BIER 
technologies.

We believe this work can serve as a useful discussion starter  on how to evolve 
multicast to better serve emerging AI workloads.



We would be very grateful for your review comments and feedback on the draft. 
Your insights will be invaluable in helping us improve its technical depth and 
clarity in the next revision.



Thank you for your time and consideration.




Best regards,


Yisong Liu





----邮件原文----发件人:internet-drafts <[email protected]>收件人:Junye Zhang 
<[email protected]>,Yisong Liu <[email protected]>,Zheng Zhang 
<[email protected]>抄 送: (无)发送时间:2026-02-12 16:52:29主题:New Version 
Notification for draft-liu-rtgwg-llmsync-multicast-00.txtA new version of 
Internet-Draft draft-liu-rtgwg-llmsync-multicast-00.txt hasbeen successfully 
submitted by Yisong Liu and posted to theIETF repository.Name:     
draft-liu-rtgwg-llmsync-multicastRevision: 00Title:    Multicast Use Cases for 
Large Language Model SynchronizationDate:     2026-02-12Group:    Individual 
SubmissionPages:    6URL:      
https://www.ietf.org/archive/id/draft-liu-rtgwg-llmsync-multicast-00.txtStatus: 
  https://datatracker.ietf.org/doc/draft-liu-rtgwg-llmsync-multicast/HTML:     
https://www.ietf.org/archive/id/draft-liu-rtgwg-llmsync-multicast-00.htmlHTMLized:
 
https://datatracker.ietf.org/doc/html/draft-liu-rtgwg-llmsync-multicastAbstract:
   Large Language Models (LLMs) deployments are becoming increasingly   
widespread, with inference services being the most common   application.  This 
draft will discuss multicast use cases for   inference cloud services.The IETF 
SecretariatSubject:New Version Notification for 
draft-liu-rtgwg-llmsync-multicast-00.txtA new version of Internet-Draft 
draft-liu-rtgwg-llmsync-multicast-00.txt hasbeen successfully submitted by 
Yisong Liu and posted to theIETF repository.Name:     
draft-liu-rtgwg-llmsync-multicastRevision: 00Title:    Multicast Use Cases for 
Large Language Model SynchronizationDate:     2026-02-12Group:    Individual 
SubmissionPages:    6URL:      
https://www.ietf.org/archive/id/draft-liu-rtgwg-llmsync-multicast-00.txtStatus: 
  https://datatracker.ietf.org/doc/draft-liu-rtgwg-llmsync-multicast/HTML:     
https://www.ietf.org/archive/id/draft-liu-rtgwg-llmsync-multicast-00.htmlHTMLized:
 
https://datatracker.ietf.org/doc/html/draft-liu-rtgwg-llmsync-multicastAbstract:
   Large Language Models (LLMs) deployments are becoming increasingly   
widespread, with inference services being the most common   application.  This 
draft will discuss multicast use cases for   inference cloud services.The IETF 
Secretariat

_______________________________________________
rtgwg mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to