Compare, rank, and review AI agents and models — performance metrics and community ratings in one place.
19 models
| # | Model | Provider | Performance | User Rating | Reviews | Tags |
|---|---|---|---|---|---|---|
| 1 | GPT-4oUpdated 2 days ago | OpenAI | 92.5 | ★ 4.7 | 1240 | Text Generation, Small |
| 2 | Claude 3.5 SonnetUpdated 2 days ago | Anthropic | 91.2 | ★ 4.6 | 890 | Code Assistant, Small |
| 3 | Gemini 1.5 ProUpdated 2 days ago | 90.8 | ★ 4.5 | 654 | Multimodal, Small | |
| 4 | Claude 3 OpusUpdated 2 days ago | Anthropic | 90.5 | ★ 4.6 | 720 | Multimodal, Small, Text Generation |
| 5 | Qwen 3.5 397B-A17BUpdated 2 days ago | Alibaba | 89.2 | ★ 4.5 | 156 | Reasoning, Large, Code Assistant |
| 6 | Grok 2Updated 2 days ago | xAI | 89.0 | ★ 4.4 | 380 | Multimodal, Small |
| 7 | Llama 3.1 405BUpdated 2 days ago | Meta | 88.4 | ★ 4.4 | 420 | Reasoning, Large |
| 8 | Grok 3 BetaUpdated 2 days ago | xAI | 88.2 | ★ 4.5 | 145 | Text Generation, Small |
| 9 | Qwen 3.5 122B-A10BUpdated 2 days ago | Alibaba | 87.5 | ★ 4.4 | 98 | Text Generation, Large, Multimodal |
| 10 | Mistral LargeUpdated 2 days ago | Mistral AI | 87.1 | ★ 4.3 | 312 | Text Generation, Small |
| 11 | CodestralUpdated 2 days ago | Mistral AI | 86.2 | ★ 4.6 | 289 | Reasoning, Small |
| 12 | Qwen 3.5 27BUpdated 2 days ago | Alibaba | 85.8 | ★ 4.3 | 87 | Code Assistant, Medium, Reasoning |
| 13 | Grok 2 MiniUpdated 2 days ago | xAI | 85.5 | ★ 4.3 | 210 | Reasoning, Small |
| 14 | GPT-4o miniUpdated 2 days ago | OpenAI | 85.0 | ★ 4.5 | 2100 | Code Assistant, Small |
| 15 | DeepSeek CoderUpdated 2 days ago | DeepSeek | 84.5 | ★ 4.5 | 445 | Text Generation, Small |
| 16 | Qwen 2.5Updated 2 days ago | Alibaba | 83.8 | ★ 4.3 | 198 | Code Assistant, Small |
| 17 | Gemma 2 27BUpdated 2 days ago | 83.5 | ★ 4.3 | 245 | Reasoning, Medium, Code Assistant | |
| 18 | Claude 3 HaikuUpdated 2 days ago | Anthropic | 82.3 | ★ 4.4 | 567 | Multimodal, Small |
| 19 | Phi-3 MediumUpdated 2 days ago | Microsoft | 78.2 | ★ 4.2 | 312 | Multimodal, Medium, Text Generation |