2.9 KiB
2.9 KiB
OpenRouter Models Quick Reference for Claude Code
Top 5 Models for Tool/Function Calling
🥇 1. Mistral Small 3.1 24B - BEST VALUE
Model: mistralai/mistral-small-3.1-24b
Cost: $0.02/M input | $0.04/M output
Speed: ⚡⚡⚡⚡ Fast
Tool Support: ⭐⭐⭐⭐⭐ Excellent
Use for: Production deployments - best cost/performance ratio
🥈 2. Cohere Command R7B - CHEAPEST PAID
Model: cohere/command-r7b-12-2024
Cost: $0.038/M input | $0.15/M output
Speed: ⚡⚡⚡⚡⚡ Very Fast
Tool Support: ⭐⭐⭐⭐⭐ Excellent
Use for: Budget-conscious deployments with agent workflows
🥉 3. Qwen Turbo - LARGE CONTEXT
Model: qwen/qwen-turbo
Cost: $0.05/M input | $0.20/M output
Speed: ⚡⚡⚡⚡⚡ Very Fast
Tool Support: ⭐⭐⭐⭐ Good
Context: 1M tokens
Use for: Projects needing massive context windows
🆓 4. DeepSeek V3 0324 - FREE
Model: deepseek/deepseek-chat-v3-0324:free
Cost: $0.00 (FREE!)
Speed: ⚡⚡⚡⚡ Fast
Tool Support: ⭐⭐⭐⭐ Good
Use for: Development, testing, cost-sensitive production
⭐ 5. Gemini 2.0 Flash - FREE (MOST POPULAR)
Model: google/gemini-2.0-flash-exp:free
Cost: $0.00 (FREE!)
Speed: ⚡⚡⚡⚡⚡ Very Fast
Tool Support: ⭐⭐⭐⭐⭐ Excellent
Limits: 20 req/min, 50/day if <$10 credits
Use for: Development, testing, low-volume production
Quick Command Examples
# Best value - Mistral Small 3.1
agentic-flow --agent coder --task "..." --provider openrouter \
--model "mistralai/mistral-small-3.1-24b"
# Free tier - Gemini
agentic-flow --agent researcher --task "..." --provider openrouter \
--model "google/gemini-2.0-flash-exp:free"
# Cheapest provider auto-routing
agentic-flow --agent optimizer --task "..." --provider openrouter \
--model "deepseek/deepseek-chat:floor"
Cost Comparison (per Million Tokens)
| Model | Input | Output | 50/50 Mix |
|---|---|---|---|
| Mistral Small 3.1 | $0.02 | $0.04 | $0.03 |
| Command R7B | $0.038 | $0.15 | $0.094 |
| Qwen Turbo | $0.05 | $0.20 | $0.125 |
| DeepSeek FREE | $0.00 | $0.00 | $0.00 |
| Gemini FREE | $0.00 | $0.00 | $0.00 |
Pro Tips
- Use
:freesuffix for free models - Use
:floorsuffix for cheapest provider - Filter models: https://openrouter.ai/models?supported_parameters=tools
- No extra fees for tool calling - only token usage
- Free tier limits: 20 req/min, 50/day (unlimited with $10+ balance)
When to Use Which Model
- Development/Testing: Gemini 2.0 Flash Free
- Production (Budget): Mistral Small 3.1 24B
- Production (Cheapest): Command R7B
- Large Context: Qwen Turbo
- Complex Reasoning: DeepSeek Chat
- Maximum Savings: DeepSeek V3 0324 Free
Full research report: /workspaces/agentic-flow/agentic-flow/.claude/openrouter-models-research.md