Pricing and capabilities synced from the OpenRouter catalogue. Scores are editorial (0–10).
Choose Grok 4.20 if your agents genuinely need 2M tokens of context. Choose Qwen3 235B A22B if you want self-hosted or ultra-cheap open-weight agents.
In our editorial scoring, Qwen3 235B A22B leads in three of five dimensions (speed, value for money, and reliability), while Grok 4.20 leads in one. On price, Grok 4.20 is proprietary and runs about $1.63 per 1M tokens (blended); Qwen3 235B A22B is open-weight and runs about $0.09 per 1M tokens (blended).
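To get a feel for what the price gap means in practice, here is a rough sketch that applies the two blended rates quoted above to a workload. The 50M-token monthly volume and the `estimated_cost` helper are illustrative assumptions, not figures from the catalogue, and real bills depend on your actual mix of input and output tokens.

```python
# Blended prices in USD per 1M tokens, as quoted in the comparison above.
BLENDED_USD_PER_MILLION = {
    "Grok 4.20": 1.63,
    "Qwen3 235B A22B": 0.09,
}


def estimated_cost(model: str, tokens: int) -> float:
    """Estimate the USD cost of processing `tokens` tokens at the blended rate."""
    return BLENDED_USD_PER_MILLION[model] * tokens / 1_000_000


# Hypothetical workload: 50M tokens per month (an assumption for illustration).
monthly_tokens = 50_000_000

for model in BLENDED_USD_PER_MILLION:
    print(f"{model}: ${estimated_cost(model, monthly_tokens):.2f} per month")
```

At these rates Grok 4.20 costs roughly 18 times as much per token, so the larger context window is worth paying for only if your agents actually use it.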
The model picks the moves; the agent runs the loop, the tools, and the guardrails. Once you've chosen a model, see which agent gets the most out of it.