Nvidia

模型名延迟发布日期
z-ai/glm57s,常超时2026-2-11
z-ai/glm4.77s2025-12-22
moonshotai/kimi-k2.5超时
moonshotai/kimi-k2-thinking9.6s2025-12-08
moonshotai/kimi-k2-instruct-09051s2025-09-05
moonshotai/kimi-k2-instruct1s
minimaxai/minimax-m2.51s2026-02-12
qwen/qwen3.5-397b-a17b24s,常超时2026-02-16
qwen/qwen3.5-122b-a10b15s2026-02-24
qwen/qwen3-coder-480b-a35b-instruct1.2s
qwen/qwen3-next-80b-a3b-instruct0.7s
qwen/qwen3-next-80b-a3b-thinking3.8s
qwen/qwq-32b2.7s
deepseek-ai/deepseek-v3.21m10s,常超时2025-12-01
deepseek-ai/deepseek-v3.1-terminus2s2025-08-21
deepseek-ai/deepseek-v3.11.5s
stepfun-ai/step-3.5-flash1.4s2026-02
nvidia/nemontron-3-super-120b-a12b2.4s2026-03-11
openai/gpt-oss-120b1s2025-08-05
openai/gpt-oss-20b0.7s

注:延迟为发送 hello 消息收到完整响应的耗时,并不稳定。质量排名参考 https://artificialanalysis.ai

腾讯

IDNamePrice
glm-5.0GLM-5.0x0.80
glm-5.0-turboGLM-5.0-Turbox0.95
glm-4.7GLM-4.7x0.23
minimax-2.7MiniMax-M2.7x0.26
minimax-m2.5MiniMax-M2.5x0.18
kimi-k2.5Kimi-K2.5x0.45
deepseek-v-2-volcDeepSeek-V3.2x0.29
hunyuan-2.0-thinkingTencent HY 2.0 Thinkx0.04

美团

智谱

超算互联网