Software Engineering Arena is an open-source initiative to transparently evaluate and track AI assistants across real-world software engineering tasks. We provide interactive platforms, tracking systems, and novel metrics to advance the field of AI-assisted software development.
"The easier it is to verify a solution, the faster an AI system can learn to master the task." > — Alperen Keles (@alpaylan), Andrej Karpathy (@karpathy), Jason Wei (@jasonwei20)
Our mission: We believe any evaluable task can eventually be automated with high-quality AI systems. We accelerate this transformation in software engineering by developing benchmarks and leaderboards that rigorously evaluate AI capabilities.
We welcome collaboration from research labs, independent contributors, and the broader SE community!
Evaluate AI assistants through pairwise comparisons in user-oriented software engineering scenarios:
- Compare foundation models in multi-round conversational workflows with repository-aware context, with results published on transparent leaderboards.
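The rating scheme behind these leaderboards is not specified in this overview. Purely as an illustration, the sketch below aggregates hypothetical pairwise votes into a ranking with an Elo-style update; the assistant names, baseline rating, and K-factor are placeholders, not the platform's actual configuration.

```python
from collections import defaultdict

K = 32  # update step size (placeholder; the platform's real method may differ)

def update_elo(ratings, winner, loser, k=K):
    """Apply one Elo-style update after a single pairwise vote."""
    ra, rb = ratings[winner], ratings[loser]
    expected_win = 1 / (1 + 10 ** ((rb - ra) / 400))  # winner's expected score
    ratings[winner] = ra + k * (1 - expected_win)
    ratings[loser] = rb - k * (1 - expected_win)

# Hypothetical votes: (preferred assistant, other assistant) per comparison.
votes = [
    ("assistant_a", "assistant_b"),
    ("assistant_b", "assistant_c"),
    ("assistant_a", "assistant_c"),
]

ratings = defaultdict(lambda: 1000.0)  # every assistant starts from a common baseline
for winner, loser in votes:
    update_elo(ratings, winner, loser)

# Print a simple leaderboard, highest rating first.
for name, score in sorted(ratings.items(), key=lambda kv: -kv[1]):
    print(f"{name:12s} {score:7.1f}")
```

A Bradley-Terry model fit over the full vote history is a common alternative to incremental Elo updates for arena-style leaderboards.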
Evaluate AI assistants through their actual GitHub activity:
- Track assistants via the issue tracking ecosystem: bug reports, feature requests, resolution of outstanding issues, community discussions, question answering, and polls.
- Track assistants via pull requests: merge rates, feature quality, and iterative improvements (see the sketch after this list).
- Track assistants via code reviews: issue identification, feedback timeliness, and collaborative atmosphere.
- Track assistants via product releases: release activity, version publishing, and real-world deployment patterns.
- Track assistants via wiki documentation: documentation contributions, wiki page edits, and knowledge base maintenance.
- Track assistants via team management: membership events, collaboration patterns, and team organization activities.
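All of these signals can be collected from the GitHub REST API. As a minimal, illustrative sketch (the repository, the tracked bot account, and the merge-rate definition below are assumptions, not the arena's actual pipeline), pull request merge rates might be computed like this:

```python
import requests

# Illustrative placeholders: the arena's actual repositories, tracked accounts,
# and authentication are not described in this README.
REPO = "octocat/Hello-World"
ASSISTANT_LOGINS = {"example-ai-bot"}  # hypothetical assistant account(s)

def fetch_pulls(repo, state="all", per_page=100):
    """Fetch one page of pull requests for a repository via the GitHub REST API."""
    url = f"https://api.github.com/repos/{repo}/pulls"
    resp = requests.get(url, params={"state": state, "per_page": per_page})
    resp.raise_for_status()
    return resp.json()

def merge_rate(pulls, logins):
    """Share of the tracked accounts' closed pull requests that were merged."""
    closed = [p for p in pulls
              if p["user"]["login"] in logins and p["state"] == "closed"]
    if not closed:
        return None  # no closed PRs from the tracked accounts on this page
    merged = sum(1 for p in closed if p.get("merged_at"))
    return merged / len(closed)

if __name__ == "__main__":
    pulls = fetch_pulls(REPO)
    print("merge rate:", merge_rate(pulls, ASSISTANT_LOGINS))
```

In practice, a collector would also need authentication (an `Authorization: Bearer <token>` header) and pagination to cover repositories with more than one page of pull requests.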
All projects under Software Engineering Arena are licensed under the Apache License 2.0, and the data we collect and open-source is released under the same license.