Pricing: Compare Groq API Pricing With Other API Providers

Check the latest prices of open-source LLM API providers. Evaluate and compare Groq API prices against other providers based on key metrics such as quality, context window, knowledge cutoff, and more.

API Providers Compared: Groq, Microsoft Azure, Amazon Bedrock, Together.ai, Fireworks AI, Baseten, Lepton AI, Deepinfra, Replicate, Databricks, Novita AI, and OctoAI.

groq api pricing

Groq API Pricing

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
googleGemma 2 9B8k$0.20$0.2071
googleGemma 7B Instruct8k$0.07$0.0745
metaLlama 3.1 Instruct 70B8k$0.59$0.7995
metaLlama 3 Instruct 70B8k$0.59$0.7983
metaLlama 3.1 Instruct 8B8k$0.05$0.0866
metaLlama 3 Instruct 8B8k$0.05$0.0864
mistralMixtral 8x7B Instruct33k$0.24$0.2461
microsoft azure api pricing

Microsoft Azure

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
metaLlama 3 Instruct 70B8k$0.59$0.7983
metaLlama 3 Instruct 8B8k$0.37$1.1064
amazon bedrock api pricing

Amazon Bedrock

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
metaLlama 3 Instruct 70B8k$2.65$3.5083
metaLlama 3.1 Instruct 8B128k$0.30$0.6066
metaLlama 3 Instruct 8B8k$0.30$0.6064
mistralMixtral 8x7B Instruct33k$0.45$0.7061
mistralMixtral 7B Instruct33k$0.15$0.2040
togetherai api pricing

Together.ai

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
googleGemma 2 9B8k$0.30$0.3071
googleGemma 2 27B8k$0.80$0.8078
googleGemma 7B Instruct8k$0.20$0.2045
metaLlama 3.1 Instruct 70B Turbo4k$0.88$0.8895
metaLlama 3.1 Instruct 405B Turbo4k$5.00$5.00100
metaLlama 3 Instruct 70B Turbo8k$0.88$0.8883
metaLlama 3.1 Instruct 8B Turbo33k$0.18$0.1866
metaLlama 3 Instruct 8B8k$0.20$0.2064
mistralMixtral 8x22B Instruct65k$1.20$1.2071
mistralMixtral 8x7B Instruct33k$0.60$0.6061
mistralMixtral 7B Instruct8k$0.20$0.2040
openchatOpenChat 3.5 (1210)8k$0.20$0.2050
alibabaQwen2 Instruct 72B33k$0.90$0.9083
nousresearchNous: Capybara 7B8k$0.18$0.18
fireworks ai api pricing

Fireworks AI

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
googleGemma 2 9B8k$0.20$0.2071
googleGemma 7B Instruct8k$0.20$0.2045
metaLlama 3.1 Instruct 405B128k$3.00$3.00100
metaLlama 3.1 Instruct 70B128k$0.90$0.9095
metaLlama 3 Instruct 70B8k$0.90$0.9083
metaLlama 3.1 Instruct 8B128k$0.20$0.2066
metaLlama 3 Instruct 8B8k$0.20$0.2064
mistralMixtral 8x22B Instruct65k$1.20$1.2071
mistralMixtral 8x7B Instruct33k$0.50$0.5061
mistralMixtral 7B Instruct33k$0.20$0.2040
alibabaQwen2 Instruct 72B33k$0.90$0.9083
baseten api pricing

Baseten

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
mistralMistral 7B Instruct4k$0.20$0.2040
lepton ai api pricing

Lepton AI

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
metaLlama 3.1 Instruct 405B128k$2.80$2.80100
metaLlama 3.1 Instruct 70B128k$0.80$0.8095
metaLlama 3 Instruct 70B8k$0.80$0.8083
metaLlama 3.1 Instruct 8B128k$0.70$0.7066
metaLlama 3 Instruct 8B8k$0.07$0.0764
mistralMixtral 8x7B Instruct33k$0.50$0.5061
mistralMixtral 7B Instruct33k$0.07$0.0740
deepinfra api pricing

Deepinfra

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
googleGemma 2 9B8k$0.09$0.0971
googleGemma 7B Instruct8k$0.07$0.0745
metaLlama 3.1 Instruct 405B33k$7.00$7.00100
metaLlama 3.1 Instruct 70B128k$0.52$0.7595
metaLlama 3 Instruct 70B8k$0.52$0.7583
metaLlama 3.1 Instruct 8B128k$0.09$0.0966
metaLlama 3 Instruct 8B8k$0.06$0.0664
mistralMixtral 8x22B Instruct65k$0.65$0.6571
mistralMixtral 8x7B Instruct33k$0.24$0.2461
mistralMixtral 7B Instruct8k$0.06$0.7740
microsoft azure api pricingPhi-3 Medium Instruct 14B4k$0.14$0.14
alibabaQwen2 Instruct 72B33k$0.56$0.9083
alibabaQwen2 Instruct 7B33k$0.07$0.07
openchatOpenChat 3.5 (1210)8k$0.07$0.0750
replicate api pricing

Replicate

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
metaLlama 3.1 Instruct 405B128k$9.50$9.50100
metaLlama 3 Instruct 70B8k$0.65$2.7583
metaLlama 3 Instruct 8B8k$0.05$0.2564
mistralMixtral 8x7B Instruct33k$0.30$1.0061
mistralMixtral 7B Instruct33k$0.05$0.2540
novita ai api pricing

Novita AI

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
googleGemma 2 9B8k$0.08$0.0871
metaLlama 3.1 Instruct 405B33k$2.75$2.75100
metaLlama 3.1 Instruct 70B8k$0.55$0.7695
metaLlama 3 Instruct 70B8k$0.51$0.7483
metaLlama 3.1 Instruct 8B8k$0.10$0.1066
metaLlama 3 Instruct 8B8k$0.06$0.0664
mistralMixtral 7B Instruct33k$0.06$0.0640
openchatOpenChat 3.5 7B33k$0.06$0.0650
octo ai api pricing

OctoAI

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
metaLlama 3.1 Instruct 70B128k$0.90$0.9095
metaLlama 3.1 Instruct 405B128k$3.00$3.00100
metaLlama 3 Instruct 70B8k$0.90$0.9083
metaLlama 3.1 Instruct 8B128k$0.15$0.1566
metaLlama 3 Instruct 8B8k$0.15$0.1564
mistralMixtral 8x22B Instruct65k$1.20$1.2071
mistralMixtral 8x7B Instruct33k$0.45$0.4561
mistralMixtral 7B Instruct33k$0.15$0.1540
alibabaQwen 2 7B Instruct33k$0.15$0.15
databricks api pricing

Databricks

CreatorModelContext WindowInput Price $/1MOuput Price $/1MQuality Index
metaLlama 3 Instruct 70B128k$1.00$3.0083
metaLlama 3.1 Instruct 70B128k$1.00$3.0095
metaLlama 3.1 Instruct 405B128k$10.00$30.00100
mistralMixtral 8x7B Instruct33k$0.75$2.2561

Key Definitions

  • Quality Index: A standardized score reflecting average performance across Chatbot Arena, MMLU, and MT-Bench benchmarks.
  • Context Window: The maximum combined number of input and output tokens. (Note: Output token limits are often lower than input limits.)
  • Input Price: Cost per token sent to the API in the request, in USD per million tokens.
  • Output Price: Cost per token generated by the model (received from the API), in USD per million tokens.
  • Knowledge Cutoff: The date the model’s training data was last updated. Information or events after this date may not be reflected in the model’s responses.