| tags: [ AI LLM ] categories: [ Development ]
Awesome LLM
排行榜
- https://artificialanalysis.ai/
- https://llm-stats.com/
- https://www.swebench.com/
- https://openrouter.ai/rankings
- https://pinchbench.com/
- https://kilo.ai/leaderboard
- https://llmdb.com/
- https://crfm.stanford.edu/helm/lite/latest/#/leaderboard
| 序号 | 模型 | 参数规模 | 输入价格($/1M) | 输出价格($/1M) | 窗口 |
|---|---|---|---|---|---|
| 1 | GPT-5.4 | 2.5 | 15 | 1m | |
| 2 | Claude Opus 4.6 | 5 | 25 | 1m | |
| 3 | Gemini 3.1 Pro | 2 | 12 | 1m | |
| 4 | GLM-5 | 744B A40B | 0.72 | 2.3 | 200k |
| 5 | MiMo-V2-Pro | 1000B A42B | 1 | 3 | 1m |
| 6 | Minimax M2.7 | 230B A10B | 0.3 | 1.2 | 200k |
| 7 | Kimi K2.5 | 1000B A32B | 0.45 | 2.2 | 256k |
| 8 | Qwen3.5 | 397B A17B | 0.39 | 2.34 | 256k |
| 27B | 0.195 | 1.56 | |||
| 122B A10B | 0.26 | 2.08 | |||
| 9 | Minimax M2.5 | 230B A10B | 0.2 | 1.17 | 200k |
| 10 | DeepSeek V3.2 | 685B A37B | 0.26 | 0.38 | 128k |
| 11 | Grok 4 | 3 | 15 | ||
| 12 | MiMo-V2-Flash | 309B A15B | 0.09 | 0.29 | 256k |
| 13 | NVIDIA Nemontron 3 Super | 120B A12B BF16 | |||
| 14 | gpt-oss-120B | 0.039 | 0.19 | 128k | |
| 15 | gpt-oss-20B | 0.03 | 0.11 | 128k | |
| 16 | Step-3.5-Flash | 196B A11B | 0.1 | 0.3 | 256k |
| 17 | LongCat-Flash-Thinking | 560B A27B | 0.09 | 0.29 | 256k |
| 18 | Mistral Large 3 | ||||
| 19 | Google Gemma-4-31b | 31B | 0.14 | 0.40 | 256k |
| 20 | Cognition SWE-1.5 | ||||
| 21 | NVIDIA Nemotron 3 Nano | 30B A3B | |||
| 22 | K2 Think V2 | ||||
| 23 | Olmo 32B | 32B |
备注:NVIDIA Nemontron 和 K2 Think 开放程度高,Olmo 是示范用途,完全开放。更早的开源开放模型有 Pythia、LLM360、OpenCoder、BLOOM。
价格
- https://openrouter.ai/models
- https://artificialanalysis.ai/
- https://models.litellm.ai/
- https://github.com/BerriAI/litellm/blob/main/model_prices_and_context_window.json
- https://www.dmxapi.cn/rmb
- https://llm-stats.com/leaderboards/llm-leaderboard
- https://whatllm.org/
- https://pricepertoken.com/
- https://llm.minprices.com/
- https://github.com/syaoranwe/LLM-Price
运行
CLI
- https://github.com/ggerganov/llama.cpp
- https://github.com/foldl/chatllm.cpp
- https://github.com/ggerganov/whisper.cpp
- https://github.com/google/gemma.cpp
- https://github.com/EricLBuehler/mistral.rs
- https://github.com/ml-explore/mlx-lm
Desktop
- https://ollama.com
- https://lmstudio.ai
- https://cherry-ai.com
- https://msty.app
- https://github.com/open-webui/open-webui
- https://github.com/lobehub/lobe-chat
- https://github.com/chatboxai/chatbox
- https://github.com/danny-avila/LibreChat
- https://anythingllm.com
- https://www.nomic.ai/gpt4all
- https://github.com/lostruins/koboldcpp
- https://github.com/Mozilla-Ocho/llamafile
Server
- https://github.com/vllm-project/vllm
- https://github.com/sgl-project/sglang
- https://github.com/triton-inference-server/tensorrtllm_backend https://github.com/NVIDIA/TensorRT-LLM
- https://github.com/skypilot-org/skypilot
- https://github.com/huggingface/text-generation-inference
- https://github.com/lm-sys/FastChat
- https://github.com/bentoml/OpenLLM
- https://haystack.deepset.ai
Proxy
- https://github.com/BerriAI/litellm
- https://github.com/maximhq/bifrost
- https://github.com/tensorzero/tensorzero
- https://github.com/QuantumNous/new-api
- https://github.com/songquanpeng/one-api
- https://github.com/helicone/helicone
- https://portkey.ai/
SDK
- https://github.com/badlogic/pi-mono 21.1k stars
- https://github.com/pydantic/pydantic-ai 15.6k
- https://github.com/langchain-ai/deepagents 13.6k stars
- https://github.com/openai/openai-agents-python
- https://github.com/openai/openai-agents-js
- https://github.com/anthropics/claude-agent-sdk-python
- https://github.com/anthropics/claude-agent-sdk-typescript
- https://google.github.io/adk-docs/ 支持 Java, Go, TypeScript, Python
- https://ai-sdk.dev/ Vercel AI SDK for TypeScript
- https://www.llamaindex.ai/
- https://agentscope.io/
- https://github.com/microsoft/agent-framework
智能体
Github star 数量统计于 2026-03-08。
- https://github.com/openclaw/openclaw 278.6k stars
- https://github.com/ultraworkers/claw-code 158k stars
- https://github.com/HKUDS/nanobot 30.4k stars
- https://github.com/bytedance/deer-flow 25.9k stars
- https://github.com/zeroclaw-labs/zeroclaw 24.5k stars
- https://github.com/sipeed/picoclaw 22.9k stars
- https://github.com/badlogic/pi-mono 21.1k stars
- https://github.com/qwibitai/nanoclaw 20.3k stars
- https://github.com/AstrBotDevs/AstrBot 19.8k stars
- https://github.com/langbot-app/LangBot 15.5k stars
- https://github.com/RightNow-AI/openfang 12.4k stars
- https://github.com/agentscope-ai/CoPaw 9.6k stars
- https://github.com/nearai/ironclaw 7.3k stars
- https://github.com/nullclaw/nullclaw 5.9k stars
- https://github.com/HKUDS/OpenHarness 5.2k stars
- https://github.com/microclaw/microclaw 507 stars
- https://github.com/qhkm/zeptoclaw 433 stars
- https://github.com/loongclaw-ai/loongclaw 28 stars
应用
- 知识问答
- 智谱清言: https://chatglm.cn/ 、 https://z.ai ,支持 agent
- Minimax: https://agent.minimaxi.com/ ,只有 agent 模式,原来的 https://chat.minimaxi.com 域名已经跳转到 agent 域名
- Kimi: https://kimi.com , agent 模式收费,每月三次免费额度
- 千问: https://qianwen.com , https://qwen.ai ,有「深度研究」和「任务助理」,后者每月限制 20 次免费额度
- 腾讯元宝: https://yuanbao.tencent.com/
- DeepSeek: https://chat.deepseek.com/
- 小米 Mimo: https://aistudio.xiaomimimo.com/
- 字节豆包: https://www.doubao.com/ , 有 「深入研究」和「超能模式」
- 美团龙猫: https://longcat.chat/ , 有「深度研究」
- 阶越 AI: https://www.stepfun.com/
- ChatGPT: https://chatgpt.com/
- Nvidia: https://build.nvidia.com/openai/gpt-oss-120b
- 代码阅读:
- 个人桌面助理
- Anthropics Claude CoWork
- 基于 OpenCode 的 OpenWork
- 基于 OpenCode 的 Accomplish,曾用名 OpenWork
- Open Cowork
- Proma
- Craft Agents
- Kuse Cowork
- Eigent
- Open Claude Cowork
- OpenCowork
- Agent Cowork
- OpenClaw 包装或仿品
- AionUi
- 阶跃 AI 桌面伙伴
- 昆仑天工 Skywork Desktop
- Anthropics Claude CoWork
- CLI
技能
训练
- Pre-training & fine-tuning
- Fine-tuning only