OpenRouter Models Quick Reference for Claude Code

Top 5 Models for Tool/Function Calling

🥇 1. Mistral Small 3.1 24B - BEST VALUE

Model: mistralai/mistral-small-3.1-24b
Cost: $0.02/M input | $0.04/M output
Speed: ⚡⚡⚡⚡ Fast
Tool Support: ⭐⭐⭐⭐⭐ Excellent

Use for: Production deployments - best cost/performance ratio

🥈 2. Cohere Command R7B - CHEAPEST PAID

Model: cohere/command-r7b-12-2024
Cost: $0.038/M input | $0.15/M output
Speed: ⚡⚡⚡⚡⚡ Very Fast
Tool Support: ⭐⭐⭐⭐⭐ Excellent

Use for: Budget-conscious deployments with agent workflows

🥉 3. Qwen Turbo - LARGE CONTEXT

Model: qwen/qwen-turbo
Cost: $0.05/M input | $0.20/M output
Speed: ⚡⚡⚡⚡⚡ Very Fast
Tool Support: ⭐⭐⭐⭐ Good
Context: 1M tokens

Use for: Projects needing massive context windows

🆓 4. DeepSeek V3 0324 - FREE

Model: deepseek/deepseek-chat-v3-0324:free
Cost: $0.00 (FREE!)
Speed: ⚡⚡⚡⚡ Fast
Tool Support: ⭐⭐⭐⭐ Good

Use for: Development, testing, cost-sensitive production

⭐ 5. Gemini 2.0 Flash - FREE (MOST POPULAR)

Model: google/gemini-2.0-flash-exp:free
Cost: $0.00 (FREE!)
Speed: ⚡⚡⚡⚡⚡ Very Fast
Tool Support: ⭐⭐⭐⭐⭐ Excellent
Limits: 20 req/min, 50/day if <$10 credits

Use for: Development, testing, low-volume production

Quick Command Examples

# Best value - Mistral Small 3.1
agentic-flow --agent coder --task "..." --provider openrouter \
  --model "mistralai/mistral-small-3.1-24b"

# Free tier - Gemini
agentic-flow --agent researcher --task "..." --provider openrouter \
  --model "google/gemini-2.0-flash-exp:free"

# Cheapest provider auto-routing
agentic-flow --agent optimizer --task "..." --provider openrouter \
  --model "deepseek/deepseek-chat:floor"

Cost Comparison (per Million Tokens)

Model	Input	Output	50/50 Mix
Mistral Small 3.1	$0.02	$0.04	$0.03
Command R7B	$0.038	$0.15	$0.094
Qwen Turbo	$0.05	$0.20	$0.125
DeepSeek FREE	$0.00	$0.00	$0.00
Gemini FREE	$0.00	$0.00	$0.00

Pro Tips

Use :free suffix for free models
Use :floor suffix for cheapest provider
Filter models: https://openrouter.ai/models?supported_parameters=tools
No extra fees for tool calling - only token usage
Free tier limits: 20 req/min, 50/day (unlimited with $10+ balance)

When to Use Which Model

Development/Testing: Gemini 2.0 Flash Free
Production (Budget): Mistral Small 3.1 24B
Production (Cheapest): Command R7B
Large Context: Qwen Turbo
Complex Reasoning: DeepSeek Chat
Maximum Savings: DeepSeek V3 0324 Free

Full research report: /workspaces/agentic-flow/agentic-flow/.claude/openrouter-models-research.md

2.9 KiB Raw Blame History