Agents lie.
To you, and to themselves. They'll write hollow tests, declare victory, and never look back. The problem is they rarely have the tools to verify their own work. And even when they do, they don't bother.
Preview access
To you, and to themselves. They'll write hollow tests, declare victory, and never look back. The problem is they rarely have the tools to verify their own work. And even when they do, they don't bother.
You define a program — orchestrator, sub-agents, roles — and Orpheus provisions VMs, allocates agents, manages forks. Bring whatever agents you want. We'll manage the fleet.
Orpheus invests compute in validating code, not just producing it. Agents are required to run code end to end, execute the happy path, and review outputs before anything gets merged. We use defence in depth to keep agents honest.
Work locally with Claude, Codex, Gemini, or anything else you prefer. Point them at the Orpheus CLI and they become commanders of the agent fleet — launching test-time compute on fully provisioned machines and delivering reviewed, verified pull requests. Add it to any workflow.
test_oauth_e2e.py.