AI Model Leaderboard
Compare cutting-edge language models based on comprehensive benchmark performance. Track progress in reasoning, coding, mathematics, and general knowledge.
Total Models
Top Score
Avg Score
Open Source
Multimodal
Latest Release
StarCoder2 15B
Hugging Face
Advanced code generation model with 4x larger context than StarCoder
Mistral Magistral
Mistral AI
Flagship multimodal model from Mistral AI with 405B parameters, designed for advanced reasoning and instruction following
Qwen3 Coder 480B
Alibaba
Massive-scale coding-specialized model with 480B parameters, designed for complex software development tasks
Llama 3.3 Nemotron Super 70B
Nvidia
Advanced Llama 3.3-based model enhanced by Nvidia with superior reasoning and instruction following capabilities
OpenReasoning Nemotron 27B
Nvidia
Open-source reasoning-focused model built on Nemotron architecture with enhanced logical thinking capabilities
Mistral Voxtral
Mistral AI
Multimodal voice and text model from Mistral AI with advanced audio processing and speech understanding capabilities
DBRX Instruct
Databricks
Instruction-tuned version of DBRX with enhanced capabilities
Qwen3 30B-A3B Thinking
Alibaba
Advanced reasoning-focused MoE model with enhanced thinking capabilities and step-by-step problem solving
Mistral Devstral
Mistral AI
Specialized coding model from Mistral AI with 22B parameters, optimized for software development and programming tasks
Kimi K2 Instruct
Moonshot AI
Advanced Chinese-English bilingual model with exceptional long-context understanding and instruction following
A.X 4.0 VL Light
SK Telecom
SK Telecom's vision-language model optimized for Korean and English
Qwen3 Coder 30B
Alibaba
Efficient coding-specialized model with 30B parameters, optimized for software development and programming tasks
DBRX
Databricks
132B parameter MoE model outperforming GPT-3.5 and competitive with GPT-4
Kanana 1.5 15.7B
Kakao
Kakao's Korean-optimized language model with strong instruction following
Kimi K2 Base
Moonshot AI
Base version of Kimi K2 model optimized for research and fine-tuning applications with strong foundational capabilities
Qwen3 30B-A3B Instruct
Alibaba
Advanced MoE model with 30B total parameters and 3B active, optimized for instruction following with high efficiency
OLMoCR 7B
Allen Institute for AI
Open language model optimized for OCR and document understanding
OpenReasoning Nemotron 7B
Nvidia
Compact reasoning-focused model with 7B parameters, optimized for efficient deployment with strong logical capabilities
Llama 3.3 70B Instruct
Meta AI
Llama 3.3 70B delivers performance nearly matching the 405B model while being significantly more efficient and cost-effective.
Falcon 180B
Technology Innovation Institute
180B parameter model, top open-source model at launch
Step 3
StepFun
Advanced multimodal model with strong reasoning capabilities
Stable LM 2 12B
Stability AI
Stability AI's 12B parameter language model with improved performance
AFM 4.5B
Arcee AI
Arcee AI's efficient 4.5B parameter model focused on domain adaptation
Llama 3.1 70B
Meta AI
Highly capable open model with extended context
Claude 3.5 Haiku
Anthropic
Claude 3.5 Haiku is Anthropic's fastest and most cost-effective model, optimized for speed while maintaining strong performance.
Mixtral 8x22B
Mistral AI
Large mixture-of-experts model with strong performance
StripedHyena 7B
Together AI
Efficient transformer alternative with state-space model architecture
Claude 3 Sonnet
Anthropic
Balanced model striking ideal balance between intelligence and speed for enterprise workloads
Kimi K1.5
Moonshot AI
Enhanced Kimi model with dramatically expanded context window and improved reasoning across all domains
Claude 3 Haiku
Anthropic
Fastest Claude model optimized for speed and efficiency
DeepSeek-R1
DeepSeek
DeepSeek-R1 is an advanced reasoning model that rivals OpenAI o1 performance through reinforcement learning and chain-of-thought reasoning.
Zephyr 7B Beta
Hugging Face
Fine-tuned Mistral-7B with improved helpfulness and harmlessness
Mixtral 8x22B (Historical)
Mistral AI
Early release version of Mixtral 8x22B with mixture-of-experts architecture, showing the initial capabilities before optimization
Gemini 2.0 Pro
Google DeepMind
Gemini 2.0 Pro is Google's most capable multimodal AI model, designed for the agentic era with advanced reasoning and tool use capabilities.
o1-mini
OpenAI
o1-mini is a cost-efficient reasoning model optimized for STEM tasks, particularly mathematics and coding, with performance matching o1 on technical benchmarks.
A.X 3.0
SK Telecom
Third generation A.X model with significantly improved reasoning capabilities and expanded context window for enterprise applications
Llama 3.2 11B Vision
Meta AI
Lightweight multimodal model for on-device deployment
Falcon 40B
Technology Innovation Institute
40B parameter model optimized for performance and efficiency
Kimi K1
Moonshot AI
First generation Kimi model with exceptional long context capabilities and strong Chinese-English bilingual performance
GPT-4 Turbo
OpenAI
Enhanced GPT-4 with improved performance and extended context
Claude 3 Opus
Anthropic
Powerful model for complex tasks requiring deep analysis
EXAONE 4.0.1 32B
LG AI
LG AI's flagship 32B parameter model with multilingual capabilities
Gemini 1.5 Flash
Google DeepMind
Lightweight multimodal model optimized for speed
BLOOM 176B
BigScience
Multilingual 176B parameter model supporting 59 languages
A.X 2.0
SK Telecom
Second generation of SK Telecom's A.X series, focused on Korean language understanding and telecommunications applications
Stable LM 2 1.6B
Stability AI
Compact 1.6B parameter model for efficient deployment
Jamba
AI21 Labs
Hybrid Mamba-Transformer architecture with 256K context window, combining efficiency and performance
Jurassic-2
AI21 Labs
Improved model with faster response times, better instruction following, and multilingual support
Mistral 7B v0.3
Mistral AI
Enhanced version of Mistral 7B with improved instruction following and extended context capabilities
OLMo 1.7 7B
Allen Institute for AI
Improved version of OLMo with enhanced training methodology and better performance across multiple benchmarks
OLMo 1.0 7B
Allen Institute for AI
Open Language Model from Allen AI, designed for transparency and research with fully open training data and process
GPT-NeoX 20B
EleutherAI
Open-source 20B parameter autoregressive language model
Midjourney V6
Midjourney
Most photorealistic model with significantly improved text rendering
Whisper Large v3
OpenAI
State-of-the-art speech recognition model supporting 99 languages with improved accuracy
Gen-3 Alpha
Runway ML
State-of-the-art video generation model with advanced motion and physics