Open Agent Leaderboard

Exgentic

Systematic evaluation of AI agents across diverse environments — without domain-specific tuning.

Trade-offs

Cost-Performance Frontier

The Pareto frontier of agent efficiency — accuracy vs. spend.