Understand the AI landscape to choose the best model and provider for your use case
State of Generative Media 2025 Survey
Supported by fal
State of AI (Highlights) - Q2 2025
Analysis of the AI landscape and the key trends shaping AI
Highlights
Intelligence
Artificial Analysis Intelligence Index; Higher is better
Speed
Output Tokens per Second; Higher is better
Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5Flashgpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)o3Logo of o3 which relates to the data aboveo3GPT-5 (medium)Logo of GPT-5 (medium) which relates to the data aboveGPT-5(medium)Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProLlama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4 MaverickGPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5 (high)Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetClaude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGrok 4Logo of Grok 4 which relates to the data aboveGrok 4DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1192612251811781561421176551464644
Price
USD per 1M Tokens; Lower is better
gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4 MaverickGemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashDeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5 (high)GPT-5 (medium)Logo of GPT-5 (medium) which relates to the data aboveGPT-5(medium)Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProGPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1o3Logo of o3 which relates to the data aboveo3Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetGrok 4Logo of Grok 4 which relates to the data aboveGrok 4Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1Opus0.30.40.812.63.43.43.43.53.56630
Artificial AnalysisGPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)Grok 4Logo of Grok 4 which relates to the data aboveGrok 4o3Logo of o3 which relates to the data aboveo3Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProClaude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1Opusgpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetDeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashKimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BSolar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickMistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.16765656059585757545251504949454545434343383635
+ Add model from specific provider
Artificial Analysis Intelligence Index: Combination metric covering multiple dimensions of intelligence - the simplest way to compare how smart models are. Version 3.0 was released in September 2025 and includes: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME 2025, IFBench, AA-LCR, Terminal-Bench Hard, 𝜏²-Bench Telecom. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.
Artificial Analysis Intelligence Index: Combination metric covering multiple dimensions of intelligence - the simplest way to compare how smart models are. Version 3.0 was released in September 2025 and includes: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME 2025, IFBench, AA-LCR, Terminal-Bench Hard, 𝜏²-Bench Telecom. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.
Intelligence Evaluations
Intelligence evaluations measured independently by Artificial Analysis; Higher is better
Results claimed by AI Lab (not yet independently verified)
MMLU-Pro (Reasoning & Knowledge)
Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)Grok 4Logo of Grok 4 which relates to the data aboveGrok 4Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5Proo3Logo of o3 which relates to the data aboveo3DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetGLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashKimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BLlama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4Maverickgpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.188%87%87%86%85%85%85%84%84%84%83%83%82%82%81%81%81%81%81%81%79%74%68%
GPQA Diamond (Scientific Reasoning)
Grok 4Logo of Grok 4 which relates to the data aboveGrok 4GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5Proo3Logo of o3 which relates to the data aboveo3DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashQwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetKimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BDeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickGPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.188%85%84%83%81%81%79%79%78%78%78%78%77%75%74%74%73%69%67%67%67%62%59%
Humanity's Last Exam (Reasoning & Knowledge)
GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)Grok 4Logo of Grok 4 which relates to the data aboveGrok 4Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5Proo3Logo of o3 which relates to the data aboveo3gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashEXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BClaude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4Sonnetgpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Kimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickGPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.126.5%23.9%21.1%20.0%18.5%15.0%14.9%13.0%12.2%11.9%11.1%10.5%9.6%8.5%7.5%7.0%6.8%6.3%6.3%5.4%4.8%4.6%4.4%
LiveCodeBench (Coding)
Grok 4Logo of Grok 4 which relates to the data aboveGrok 4Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProQwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507o3Logo of o3 which relates to the data aboveo3DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BGLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashGPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetClaude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1Opusgpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Kimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4Maverick82%80%79%78%78%77%75%74%74%72%70%67%66%66%65%64%62%61%58%56%46%41%40%
SciCode (Coding)
Grok 4Logo of Grok 4 which relates to the data aboveGrok 4GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProQwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507o3Logo of o3 which relates to the data aboveo3Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusDeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetGemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashDeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BMistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickKimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro246%43%43%42%41%41%40%40%39%39%39%38%37%36%36%35%35%35%34%34%33%31%30%
IFBench (Instruction Following)
GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)o3Logo of o3 which relates to the data aboveo3gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusClaude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetGrok 4Logo of Grok 4 which relates to the data aboveGrok 4Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashGemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProGPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickKimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032B73%71%69%61%55%55%54%51%50%49%46%44%43%43%42%42%41%40%40%38%37%37%36%
AIME 2025 (Competition Math)
GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Grok 4Logo of Grok 4 which relates to the data aboveGrok 4Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1o3Logo of o3 which relates to the data aboveo3Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProClaude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusEXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BLlama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetGLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5Flashgpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Kimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4Maverick94%93%93%91%90%88%88%80%80%77%76%74%74%73%62%61%57%50%43%38%35%32%19%
AA-LCR (Long Context Reasoning)
GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)o3Logo of o3 which relates to the data aboveo3Grok 4Logo of Grok 4 which relates to the data aboveGrok 4Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProClaude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetGemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashGPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1Kimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickDeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032B76%69%68%67%66%66%65%62%61%55%53%52%51%48%48%46%45%34%25%20%19%14%
Terminal-Bench Hard
Grok 4Logo of Grok 4 which relates to the data aboveGrok 4o3Logo of o3 which relates to the data aboveo3Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetGemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProDeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Kimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashQwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4Maverickgpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BSolar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro238%35%32%31%30%25%24%23%23%22%21%18%16%15%13%13%13%10%6%6%5%4%3%
𝜏²-Bench Telecom
GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)o3Logo of o3 which relates to the data aboveo3Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Grok 4Logo of Grok 4 which relates to the data aboveGrok 4Kimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetGemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProQwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashSolar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickEXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032B85%81%76%75%73%71%67%66%65%54%53%50%47%41%37%37%35%32%28%28%25%18%17%
+ Add model from specific provider
While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.
Artificial Analysis Intelligence Index: Combination metric covering multiple dimensions of intelligence - the simplest way to compare how smart models are. Version 3.0 was released in September 2025 and includes: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME 2025, IFBench, AA-LCR, Terminal-Bench Hard, 𝜏²-Bench Telecom. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.
Output Tokens Used to Run Artificial Analysis Intelligence Index
Tokens used to run all evaluations in the Artificial Analysis Intelligence Index
Answer Tokens
Reasoning Tokens
Artificial AnalysisGrok 4Logo of Grok 4 which relates to the data aboveGrok 4gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Llama Nemotron Super 49B v1.5Logo of Llama Nemotron Super 49B v1.5 which relates to the data aboveLlamaNemotronSuper 49B v1.5Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BGemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProDeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashGrok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)o3Logo of o3 which relates to the data aboveo3Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetClaude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusKimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickDeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)120M110M110M110M100M100M100M99M93M87M87M63M56M56M50M43M30M16M15M14M12M7.4M4.4M120M110M100M100M96M96M89M91M78M84M84M57M50M53M46M37M23M12M16M
+ Add model from specific provider
Artificial Analysis Intelligence Index Tokens Use: The number of tokens required to run all evaluations in the Artificial Analysis Intelligence Index (excluding repeats).
Cost to Run Artificial Analysis Intelligence Index
Cost (USD) to run all evaluations in the Artificial Analysis Intelligence Index
Input Cost
Output Cost
Reasoning Cost
Artificial AnalysisClaude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGrok 4Logo of Grok 4 which relates to the data aboveGrok 4Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProQwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4Sonneto3Logo of o3 which relates to the data aboveo3Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashGLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BGPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)Kimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4Maverickgpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)$2335$1764$1012$910$873$665$407$235$232$219$141$132$106$69$69$50$48$31$26$17$14$11$1754$1725$888$839$837$549$371$511
+ Add model from specific provider
Cost to Run Artificial Analysis Intelligence Index: The cost to run the evaluations in the Artificial Analysis Intelligence Index, calculated using the model's input and output token pricing and the number of tokens used across evaluations (excluding repeats).
Intelligence vs. Cost to Run Artificial Analysis Intelligence Index
Artificial Analysis Intelligence Index; Cost to Run Intelligence Index
Most attractive quadrant
Alibaba
Anthropic
DeepSeek
Google
LG AI Research
Meta
Mistral
Moonshot AI
OpenAI
Upstage
xAI
Z AI
Artificial Analysis8163264128256512102420484096Cost to Run Intelligence Index (USD, Log Scale)303540455055606570Artificial Analysis Intelligence IndexLlama 4 MaverickLlama 4 MaverickMistral Medium 3.1Mistral Medium 3.1gpt-oss-20B (high)gpt-oss-20B (high)DeepSeek V3.1DeepSeek V3.1Solar Pro 2Solar Pro 2GPT-5 (minimal)GPT-5 (minimal)Kimi K2 0905Kimi K2 0905GPT-4.1GPT-4.1gpt-oss-120B (high)gpt-oss-120B (high)EXAONE 4.0 32BEXAONE 4.0 32BGrok Code Fast 1Grok Code Fast 1DeepSeek V3.1DeepSeek V3.1DeepSeek R1 0528DeepSeek R1 0528GLM-4.5GLM-4.5Gemini 2.5 FlashGemini 2.5 Flasho3o3Claude 4 SonnetClaude 4 SonnetGPT-5 (high)GPT-5 (high)Qwen3 235B 2507Qwen3 235B 2507Gemini 2.5 ProGemini 2.5 ProGrok 4Grok 4Claude 4.1 OpusClaude 4.1 Opus
+ Add model from specific provider
Cost to Run Artificial Analysis Intelligence Index: The cost to run the evaluations in the Artificial Analysis Intelligence Index, calculated using the model's input and output token pricing and the number of tokens used across evaluations (excluding repeats).
Artificial Analysis Intelligence Index: Combination metric covering multiple dimensions of intelligence - the simplest way to compare how smart models are. Version 3.0 was released in September 2025 and includes: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME 2025, IFBench, AA-LCR, Terminal-Bench Hard, 𝜏²-Bench Telecom. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.
Speed & Latency
Comparison of first-party API performance
Output Speed
Output Tokens per Second; Higher is better
Artificial AnalysisGemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5Flashgpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)o3Logo of o3 which relates to the data aboveo3Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProLlama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickGPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BKimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetClaude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGrok 4Logo of Grok 4 which relates to the data aboveGrok 4DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1201919261255247225181156142119117106877468655551464644
+ Add model from specific provider
Output Speed: Tokens per second received while the model is generating tokens (ie. after first chunk has been received from the API for models which support streaming).
Figures represent performance of the model's first-party API (e.g. OpenAI for o1) or the median across providers where a first-party API is not available (e.g. Meta's Llama models).
Latency: Time To First Answer Token
Seconds to First Answer Token Received; Accounts for Reasoning Model 'Thinking' time
Input processing
Thinking (reasoning models, when applicable)
Artificial AnalysisLlama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickMistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1Kimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Grok 4Logo of Grok 4 which relates to the data aboveGrok 4gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)o3Logo of o3 which relates to the data aboveo3Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashSolar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BGemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProGLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetClaude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.10.30.40.50.512.85.77.68.49.310.81220.327.228.137.140.144.54575.5105.3105.618.926.936.43943.143.4102.4102.7
+ Add model from specific provider
Time To First Answer Token: Time to first answer token received, in seconds, after API request sent. For reasoning models, this includes the 'thinking' time of the model before providing an answer. For models which do not support streaming, this represents time to receive the completion.
End-to-End Response Time
Seconds to Output 500 Tokens, including reasoning model 'thinking' time; Lower is better
'Thinking' time (reasoning models)
Input processing time
Outputting time
Artificial AnalysisLlama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickGPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1Grok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Kimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905gpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)o3Logo of o3 which relates to the data aboveo3Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashGrok 4Logo of Grok 4 which relates to the data aboveGrok 4Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProEXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BGLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507Claude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetClaude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1OpusGPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.13.94.85.26.27.77.810.411.513.513.918.925.129.331.33446.249.955.255.883.3130.9131.328.175.518.926.936.43943.143.4102.4102.726.525.625.7
+ Add model from specific provider
End-to-End Response Time: Seconds to receive a 500 token response. Key components:
Input time: Time to receive the first response token
Thinking time (only for reasoning models): Time reasoning models spend outputting tokens to reason prior to providing an answer. Amount of tokens based on the average reasoning tokens across a diverse set of 60 prompts (methodology details).
Answer time: Time to generate 500 output tokens, based on output speed
Figures represent performance of the model's first-party API (e.g. OpenAI for o1) or the median across providers where a first-party API is not available (e.g. Meta's Llama models).
Intelligence vs. Output Speed
Artificial Analysis Intelligence Index; Output Speed: Output Tokens per Second
Artificial Analysis Intelligence Index: Combination metric covering multiple dimensions of intelligence - the simplest way to compare how smart models are. Version 3.0 was released in September 2025 and includes: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME 2025, IFBench, AA-LCR, Terminal-Bench Hard, 𝜏²-Bench Telecom. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.
Output Speed: Tokens per second received while the model is generating tokens (ie. after first chunk has been received from the API for models which support streaming).
Figures represent performance of the model's first-party API (e.g. OpenAI for o1) or the median across providers where a first-party API is not available (e.g. Meta's Llama models).
Price
Pricing: Input and Output Prices
Price: USD per 1M Tokens
Input price
Output price
Artificial Analysisgpt-oss-20B (high)Logo of gpt-oss-20B (high) which relates to the data abovegpt-oss-20B(high)gpt-oss-120B (high)Logo of gpt-oss-120B (high) which relates to the data abovegpt-oss-120B(high)Solar Pro 2Logo of Solar Pro 2 which relates to the data aboveSolar Pro2Llama 4 MaverickLogo of Llama 4 Maverick which relates to the data aboveLlama 4MaverickDeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeek V3.1EXAONE 4.0 32BLogo of EXAONE 4.0 32B which relates to the data aboveEXAONE 4.032BGrok Code Fast 1Logo of Grok Code Fast 1 which relates to the data aboveGrok CodeFast 1Mistral Medium 3.1Logo of Mistral Medium 3.1 which relates to the data aboveMistral Medium3.1DeepSeek R1 0528Logo of DeepSeek R1 0528 which relates to the data aboveDeepSeekR1 0528DeepSeek V3.1Logo of DeepSeek V3.1 which relates to the data aboveDeepSeekV3.1GLM-4.5Logo of GLM-4.5 which relates to the data aboveGLM-4.5Gemini 2.5 FlashLogo of Gemini 2.5 Flash which relates to the data aboveGemini 2.5FlashKimi K2 0905Logo of Kimi K2 0905 which relates to the data aboveKimi K2 0905Qwen3 235B 2507Logo of Qwen3 235B 2507 which relates to the data aboveQwen3 235B2507GPT-4.1Logo of GPT-4.1 which relates to the data aboveGPT-4.1o3Logo of o3 which relates to the data aboveo3GPT-5 (high)Logo of GPT-5 (high) which relates to the data aboveGPT-5(high)GPT-5 (minimal)Logo of GPT-5 (minimal) which relates to the data aboveGPT-5 (minimal)Gemini 2.5 ProLogo of Gemini 2.5 Pro which relates to the data aboveGemini 2.5ProClaude 4 SonnetLogo of Claude 4 Sonnet which relates to the data aboveClaude 4SonnetGrok 4Logo of Grok 4 which relates to the data aboveGrok 4Claude 4.1 OpusLogo of Claude 4.1 Opus which relates to the data aboveClaude 4.1Opus0.050.150.50.240.270.60.20.40.550.550.590.310.7221.251.251.2533150.20.60.50.851.111.522.192.192.192.52.758.488101010151575
+ Add model from specific provider
Input Price: Price per token included in the request/message sent to the API, represented as USD per million Tokens.
Figures represent performance of the model's first-party API (e.g. OpenAI for o1) or the median across providers where a first-party API is not available (e.g. Meta's Llama models).
Intelligence vs. Price (Log Scale)
Artificial Analysis Intelligence Index; Price: USD per 1M Tokens; Inspired by prior analysis by Swyx
Most attractive quadrant
Alibaba
Anthropic
DeepSeek
Google
LG AI Research
Meta
Mistral
Moonshot AI
OpenAI
Upstage
xAI
Z AI
Artificial Analysis$64.00$32.00$16.00$8.00$4.00$2.00$1.00$0.50$0.25$0.13$0.06Price (USD per M Tokens, Log Scale, More Expensive to Cheaper)303540455055606570Artificial Analysis Intelligence IndexMistral Medium 3.1Mistral Medium 3.1Llama 4 MaverickLlama 4 MaverickSolar Pro 2Solar Pro 2EXAONE 4.0 32BEXAONE 4.0 32BGPT-4.1GPT-4.1GPT-5 (minimal)GPT-5 (minimal)gpt-oss-20B (high)gpt-oss-20B (high)DeepSeek V3.1DeepSeek V3.1Grok Code Fast 1Grok Code Fast 1GLM-4.5GLM-4.5Kimi K2 0905Kimi K2 0905Gemini 2.5 FlashGemini 2.5 FlashDeepSeek R1 0528DeepSeek R1 0528DeepSeek V3.1DeepSeek V3.1Claude 4 SonnetClaude 4 SonnetQwen3 235B 2507Qwen3 235B 2507gpt-oss-120B (high)gpt-oss-120B (high)Gemini 2.5 ProGemini 2.5 Proo3o3Grok 4Grok 4Claude 4.1 OpusClaude 4.1 OpusGPT-5 (high)GPT-5 (high)
+ Add model from specific provider
While higher intelligence models are typically more expensive, they do not all follow the same price-quality curve.
Artificial Analysis Intelligence Index: Combination metric covering multiple dimensions of intelligence - the simplest way to compare how smart models are. Version 3.0 was released in September 2025 and includes: MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME 2025, IFBench, AA-LCR, Terminal-Bench Hard, 𝜏²-Bench Telecom. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.
Price: Price per token, represented as USD per million Tokens. Price is a blend of Input & Output token prices (3:1 ratio).
Figures represent performance of the model's first-party API (e.g. OpenAI for o1) or the median across providers where a first-party API is not available (e.g. Meta's Llama models).
Smaller, emerging providers are offering high output speed and at competitive prices.
Price: Price per token, represented as USD per million Tokens. Price is a blend of Input & Output token prices (3:1 ratio).
Output Speed: Tokens per second received while the model is generating tokens (ie. after first chunk has been received from the API for models which support streaming).
Median: Figures represent median (P50) measurement over the past 72 hours to reflect sustained changes in performance.
Pricing (Input and Output Prices): gpt-oss-120B
Price: USD per 1M Tokens; Lower is better; 1,000 Input Tokens
Input price
Output price
Artificial AnalysisCompactifAILogo of CompactifAI which relates to the data aboveCompactifAIDeepinfraLogo of Deepinfra which relates to the data aboveDeepinfraNovitaLogo of Novita which relates to the data aboveNovitaParasailLogo of Parasail which relates to the data aboveParasailAmazonLogo of Amazon which relates to the data aboveAmazonNebius BaseLogo of Nebius Base which relates to the data aboveNebius BaseGoogle VertexLogo of Google Vertex which relates to the data aboveGoogle VertexAzureLogo of Azure which relates to the data aboveAzureFireworksLogo of Fireworks which relates to the data aboveFireworksGMILogo of GMI which relates to the data aboveGMITogether.aiLogo of Together.ai which relates to the data aboveTogether.aiGroqLogo of Groq which relates to the data aboveGroqCerebrasLogo of Cerebras which relates to the data aboveCerebrasCloudflareLogo of Cloudflare which relates to the data aboveCloudflare0.050.090.10.150.150.150.150.150.150.150.150.150.250.350.230.450.50.60.60.60.60.60.60.60.60.750.690.75
The relative importance of input vs. output token prices varies by use case. E.g. Generation tasks are typically more input token weighted while document-focused tasks (e.g. RAG) are more output token weighted.
Input Price: Price per token included in the request/message sent to the API, represented as USD per million Tokens.
Output Price: Price per token generated by the model (received from the API), represented as USD per million Tokens.
Output Speed: gpt-oss-120B
Output Speed: Output Tokens per Second; 1,000 Input Tokens
Artificial AnalysisCerebrasLogo of Cerebras which relates to the data aboveCerebrasGroqLogo of Groq which relates to the data aboveGroqFireworksLogo of Fireworks which relates to the data aboveFireworksAzureLogo of Azure which relates to the data aboveAzureTogether.aiLogo of Together.ai which relates to the data aboveTogether.aiNovitaLogo of Novita which relates to the data aboveNovitaGoogle VertexLogo of Google Vertex which relates to the data aboveGoogle VertexGMILogo of GMI which relates to the data aboveGMIAmazonLogo of Amazon which relates to the data aboveAmazonCompactifAILogo of CompactifAI which relates to the data aboveCompactifAINebius BaseLogo of Nebius Base which relates to the data aboveNebius BaseDeepinfraLogo of Deepinfra which relates to the data aboveDeepinfraParasailLogo of Parasail which relates to the data aboveParasailCloudflareLogo of Cloudflare which relates to the data aboveCloudflare2252242131851801771381313256492397289282271
Output Speed: Tokens per second received while the model is generating tokens (ie. after first chunk has been received from the API for models which support streaming).
Figures represent performance of the model's first-party API (e.g. OpenAI for o1) or the median across providers where a first-party API is not available (e.g. Meta's Llama models).
Output Speed, Over Time: gpt-oss-120B
Output Tokens per Second; Higher is better; 1,000 Input Tokens
Amazon
Azure
Cerebras
Cloudflare
CompactifAI
Deepinfra
Fireworks
GMI
Google Vertex
Groq
Nebius Base
Novita
Parasail
Together.ai
Aug 10Aug 17Aug 24Aug 31Sep 070500100015002000250030003500Artificial Analysis
Smaller, emerging providers offer high output speed, though precise speeds delivered vary day-to-day.
Output Speed: Tokens per second received while the model is generating tokens (ie. after first chunk has been received from the API for models which support streaming).
Over time measurement: Median measurement per day, based on 8 measurements each day at different times. Labels represent start of week's measurements.
See more information on any of our supported models