Back to Dashboard
CategoryWeight: 1.0x

Performance Efficiency

Tests whether generated code uses efficient algorithms and avoids unnecessary computation, memory allocation, and I/O.

Best Score

0.0

Avg Score

0.0

Tests

3

Performance Over Time — All Models

Model Rankings

1
Claude Sonnet 4.6

Category score

View
95.3BEST
Tokens42.6k
Total42.6k
2
Claude Opus 4.8

Category score

View
94.3-1.0 pts
Tokens26.1k
Total26.1k
3
GPT-5.5

Category score

View
90.3-5.0 pts
Tokens40.4k
Total40.4k
4
Grok

Category score

View
85.0-10.3 pts
Tokens85.1k
Total85.1k

Test Breakdown

Algorithm Complexity

Solve a problem in O(n log n) instead of naive O(n^2)

Claude Sonnet 4.6
95.3
Claude Opus 4.8
94.3
GPT-5.5
90.3
Grok
85.0

Memory-efficient Processing

Process a large file using streaming instead of loading into memory

Claude Sonnet 4.6
95.3
Claude Opus 4.8
94.3
GPT-5.5
90.3
Grok
85.0

Query Optimization

Write database queries that use indexes and avoid N+1 patterns

Claude Sonnet 4.6
95.3
Claude Opus 4.8
94.3
GPT-5.5
90.3
Grok
85.0