Back to Dashboard
CategoryWeight: 1.0x
Code Quality
Evaluates readability, idiomatic patterns, naming conventions, and adherence to language best practices.
Best Score
0.0Avg Score
0.0Tests
3Performance Over Time — All Models
Model Rankings
Test Breakdown
Idiomatic Python
Write Pythonic code using generators, comprehensions, and context managers
Claude Sonnet 4.6
97.7Grok
97.7Claude Opus 4.8
97.0GPT-5.5
93.7TypeScript Best Practices
Use strict types, discriminated unions, and proper error narrowing
Claude Sonnet 4.6
97.7Grok
97.7Claude Opus 4.8
97.0GPT-5.5
93.7Clean Architecture Patterns
Implement repository pattern with proper dependency inversion
Claude Sonnet 4.6
97.7Grok
97.7Claude Opus 4.8
97.0GPT-5.5
93.7