GitHub user Alanxtl edited a discussion: [AI] AI gateway milestones

https://jimmysong.io/blog/ai-gateway-in-depth/

Step1: 基础

- [x] 基础 AI 协议支持 (SSE/HTTP) #657
- [x] Streamable HTTP 
([doc](https://spec.modelcontextprotocol.io/specification/2025-03-26/basic/transports/#streamable-http)/[pr](https://github.com/modelcontextprotocol/specification/pull/206))
 #674
- [x] Token 用量获取 #659

Step2: 兼容

- [x] 面向密钥的负载均衡 [apache/dubbo-go-pixiu#696 
(comment)](https://github.com/apache/dubbo-go-pixiu/discussions/696#discussioncomment-14020512)
- [x] 密钥的健康性检测 
[apache/dubbo-go-pixiu#731](https://github.com/apache/dubbo-go-pixiu/pull/731) 
[apache/dubbo-go-pixiu#696 
(comment)](https://github.com/apache/dubbo-go-pixiu/discussions/696#discussioncomment-14013522)
- [x] fallback策略 #685 #692
- [x] retry策略 #685 #692
- [x] LLM 供应商的兼容 #678(需持续适配)
- [x] OpenAI 请求适配层,该功能不由 pixiu 网关层面实现 [apache/dubbo-go-pixiu#696 
(comment)](https://github.com/apache/dubbo-go-pixiu/discussions/696#discussioncomment-13973597)
- [x] LLM服务发现 (Nacos 注册中心) #746

Step2.1: 遗留问题

- [x] 删除 provider 有关的字段和功能(根据供应商名称自动组装endpoint地址)#764

Step3: 完善

- [x] AI 可观测性 
[apache/dubbo-go-pixiu#733](https://github.com/apache/dubbo-go-pixiu/pull/733)
  - [x] Token 用量
  - [x] 响应时间
- [x] 响应合规性和请求合规性
- [x] 日志
- [x] 更智能的路由

GitHub link: https://github.com/apache/dubbo-go-pixiu/discussions/696

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: 
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to