An in-depth comparison of Devin and Jules across output quality, autonomy, reliability, speed, value, and ease of use. Vote for your favorite.
Pick a winner in each category — you can change your vote anytime.
Choose Devin if you are teams that want to delegate well-scoped engineering tickets end-to-end. Choose Jules if you are developers who want to queue up fixes and features and review PRs later.
Editorially this matchup is a dead heat: each agent leads in 2 of our six categories. On price, Devin runs $20/mo entry + usage (acus) and is proprietary; Jules runs free tier / google ai plans and is proprietary.
Devin is Cognition's fully autonomous software engineer: give it a task in Slack, Linear, or the web IDE and it plans, writes code, runs tests, and opens a pull request in its own cloud sandbox — including several sessions in parallel. It shines on well-scoped, repetitive engineering work (migrations, test coverage, small features) and improved markedly through its 2.x releases, but it remains weaker on ambiguous, novel tasks, and ACU-based usage pricing means heavy use costs real money. Cognition also acquired Windsurf in 2025, folding its IDE technology into the same product family.
Jules is Google's asynchronous coding agent, powered by Gemini. Unlike interactive agents, you assign it tasks — bug fixes, dependency bumps, small features — and it clones your repo into a cloud VM, writes and tests the change, and comes back with a pull request and an audio changelog summary. The free tier makes it an easy add to any workflow, but the async model means it suits queued, well-defined tasks rather than tight pair-programming loops, and turnaround depends on task queue and complexity.
Whichever AI agent you choose, Standard Compute gives you unlimited LLM compute at one flat monthly price. No rate limits, no per-token billing.