Foundation Models - IBM watsonx.ai

Foundation models with the power of choice

IBM watsonx™ models are designed for the enterprise and optimized for targeted business domains and use cases. Through the AI studio IBM® watsonx.ai™ we offer a selection of cost-effective, enterprise-grade foundation models developed by IBM, open-source models and models sourced from third-party providers to help clients and partners scale and operationalize artificial intelligence (AI) faster with minimal risk. You can deploy the AI models wherever your workload is, both on-premises and on hybrid cloud.

IBM takes a differentiated approach to delivering enterprise-grade foundation models:

Open: Bring best-in-class IBM and proven open-source models to watsonx foundation model library or your library.
Trusted: Train models on trusted and governed data for applications that require enterprise-level transparency, governance and performance.
Targeted: Designed for the enterprise and optimized for targeted business domains and use cases.
Empowering: Empower clients with competitively priced model choices to build AI that best suits their unique business needs and risk profiles.

IBM model Point-of-view : A differentiated approach to AI foundation models

Granite 3.1 is now available in watsonx foundation model library.

What's new

New model feature

New to Granite - Updated Granite 3.1 models, all-new embedding models and more

New model feature

Meta's Llama 3.3 70b Instruct model now available on watsonx.ai

New model feature

On-premise foundation models from Mistral AI now available on Watsonx

Ebook: Explore how to choose the right foundation model

IBM models

IBM watsonx foundation models library gives you the choice and flexibility to choose the model that best fits your business needs, regional interests and risk profiles from a library of proprietary, open-source and third-party models.

Granite, developed by IBM Research

IBM® Granite™ is our family of open, performant, and trusted AI models, tailored for business and optimized to scale your AI applications. With Granite 3.1, you’ll find open-sourced, enterprise-ready models that deliver exceptional performance across a wide range of enterprise tasks such as cybersecurity and RAG and against safety benchmarks.

Granite 3.1 8b and 2b: Instruct models trained on high-quality data optimized for natural language and enterprise use cases
Granite Guardian: LLM-based guardrails designed to detect harmful content like hate, profanity, social bias, etc.
Granite 13b chat: Chat model optimized for dialogue use cases and works well with virtual agent and chat applications
Granite 13b instruct: Instruct model trained on high-quality finance data to perform well in finance domain tasks
Granite Code: Family of models ranging from 3B to 34B parameter size and trained on 116 programming languages
Granite multilingual: Trained to understand and generate text in English, German, Spanish, French and Portuguese
Granite Japanese: Designed to perform language tasks on Japanese text

IBM Embedding Models

Use IBM developed and open-sourced embedding models, deployed in IBM watsonx.ai, for retrieval augmented generation, semantic search and document comparison tasks.

Granite-embedding-30M-english
Granite-embedding-125M-english
Granite-embedding-107M-multilingual
Granite-embedding-278M-multilingual

Try watsonx.ai for free

IBM Research report

See how Granite models were trained and data sources used

Why IBM Granite?

Learn more about Granite

Open

Choose the right model, from sub-billion to 34B parameters, open-sourced under Apache 2.0.

Performant

Don’t sacrifice performance for cost. Granite outperforms comparable models across a variety of enterprise tasks.

Trusted

Build responsible AI with a comprehensive set of risk and harm detection capabilities, transparency, and IP protection.

Foundation model library

Select a generative foundation model that best fits your needs. After you have a short list of models for your use case, systematically test the models by using prompt engineering techniques to see which ones consistently return the desired results.

See more watsonx pricing information

Model name

Provider

Use cases

Context length

Price

USD/1 million tokens*

granite-3-1-2b-instruct

New

Featured model

IBM

Supports questions and answers (Q&A), summarization, classification, generation, extraction, RAG, and coding tasks.

128k

0.20

granite-3-1-8b-instruct

New

Featured model

IBM

Supports questions and answers (Q&A), summarization, classification, generation, extraction, RAG, and coding tasks.

128k

0.10

granite-guardian-3-8b

New

Featured model

IBM

Supports detection of HAP/PII, jailbreaking, bias, violence, and other harmful content.

128k

0.20

granite-guardian-3-2b

New

Featured model

IBM

Supports detection of HAP/PII, jailbreaking, bias, violence, and other harmful content.

128k

0.10

granite-20b-multilingual

IBM

Supports Q&A, summarization, classification, generation, extraction, translation and RAG tasks in French, German, Portuguese, Spanish and English.

8192

0.60

granite-13b-chat

Deprecated

IBM

Supports questions and answers (Q&A), summarization, classification, generation, extraction and RAG tasks.

8192

0.60

granite-13b-instruct

IBM

Supports Q&A, summarization, classification, generation, extraction and RAG tasks.

8192

0.60

granite-34b-code-instruct

IBM

Task-specific model for code by generating, explaining and translating code from a natural language prompt.

8192

0.60

granite-20b-code-instruct

IBM

Task-specific model for code by generating, explaining and translating code from a natural language prompt.

8192

0.60

granite-8b-code-instruct

IBM

Task-specific model for code by generating, explaining and translating code from a natural language prompt.

128k

0.60

granite-3b-code-instruct

IBM

Task-specific model for code by generating, explaining and translating code from a natural language prompt.

128k

0.60

granite-8b-japanese

IBM

Supports Q&A, summarization, classification, generation, extraction, translation and RAG tasks in Japanese.

4096

0.60

granite-7b-lab

Deprecated

IBM

Supports questions and answers (Q&A), summarization, classification, generation, extraction and RAG tasks.

8192

0.60

llama-3-3-70b-instruct

New

Learn more

Meta

Supports Q&A, summarization, generation, coding, classification, extraction, translation and RAG tasks in English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai

128k

1.80

llama-3-2-90b-vision-instruct

New

Learn more

Meta

Supports image captioning, image-to-text transcription (OCR) including handwriting, data extraction and processing, context Q&A, object identification

128k

2.00

llama-3-2-11b-vision-instruct

New

Learn more

Meta

Supports image captioning, image-to-text transcription (OCR) including handwriting, data extraction and processing, context Q&A, object identification

128k

0.35

llama-guard-3-11b-vision

New

Learn more

Meta

Supports image filtering, HAP/PII detection, harmful content filtering

128k

0.35

llama-3-2-1b-instruct

New

Learn more

Meta

Supports Q&A, summarization, generation, coding, classification, extraction, translation and RAG tasks in English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai

128k

0.10

llama-3-2-3b-instruct

New

Learn more

Meta

Supports Q&A, summarization, generation, coding, classification, extraction, translation and RAG tasks in English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai

128k

0.15

llama-3-405b-instruct

Meta

Supports Q&A, summarization, generation, coding, classification, extraction, translation and RAG tasks in English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

128k

Input: 5.00 / Output: 16.00

llama-3-1-70b-instruct

Meta

Supports Q&A, summarization, generation, coding, classification, extraction, translation and RAG tasks in English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

128k

1.80

llama-3-1-8b-instruct

Meta

Supports Q&A, summarization, generation, coding, classification, extraction, translation and RAG tasks in English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

128k

0.60

llama-3-8b-instruct

Deprecated

Meta

Supports summarization, classification, generation, extraction and translation tasks.

8192

0.60

llama-3-70b-instruct

Deprecated

Meta

Supports RAG, generation, summarization, classification, Q&A, extraction, translation and code generation tasks.

8192

1.80

allam-1-13b-instruct

SDAIA

Supports Q&A, summarization, classification, generation, extraction, RAG, and translation in Arabic.

4096

1.80

codellama-34b-instruct

Meta

Task-specific model for code by generating and translating code from a natural language prompt.

16384

1.80

pixtral-12b

New

Mistral AI

Supports image captioning, image-to-text transcription (OCR) including handwriting, data extraction and processing, context Q&A, object identification

128k

0.35

mistral-large-2

New

Mistral AI

Supports Q&A, summarization, generation, coding, classification, extraction, translation and RAG tasks in French, German, Italian, Spanish and English.

128k*

Input: 3.00 / Output: 10.00

mixtral-8x7b-instruct

Mistral AI

Supports Q&A, summarization, classification, generation, extraction, RAG and code generation tasks.

32768

0.60

jais-13b-chat (Arabic)

core42

Supports Q&A, summarization, classification, generation, extraction and translation in Arabic.

2048

1.80

flan-t5-xl-3b

Google

Supports Q&A, summarization, classification, generation, extraction and RAG tasks. Available for prompt-tuning.

4096

0.60

flan-t5-xxl-11b

Google

Supports Q&A, summarization, classification, generation, extraction and RAG tasks.

4096

1.80

flan-ul2-20b

Google

Supports Q&A, summarization, classification, generation, extraction and RAG tasks.

4096

5.00

elyza-japanese-llama-2-7b-instruct

ELYZA

Supports Q&A, summarization, RAG, classification, generation, extraction and translation tasks.

4096

1.80

*Prices shown are indicative, may vary by country, exclude any applicable taxes and duties, and are subject to product offering availability in a locale.

Embedding model library

Embedding models convert input text into embeddings, which are dense vector representations of the input text. Embeddings capture nuanced semantic and syntactic relationships between words and passages in vector space.

Model name

Provider

Use cases

Context length

Price

USD/1 million tokens*

slate-125m-english-rtrvr-v2

New

IBM