Models

Ollama supports the models listed at ollama.com/library.

Here are some example models that can be downloaded:

| Model | Parameters | Size | Download |
| --- | --- | --- | --- |
| Llama 3.2 | 3B | 2GB | `ollama run llama3.2` |
| Llama 3.1 | 8B | 4.7GB | `ollama run llama3.1` |
| Llama 3.1 | 70B | 40GB | `ollama run llama3.1:70b` |
| Llama 3.1 | 405B | 231GB | `ollama run llama3.1:405b` |
| Phi 3 Mini | 3.8B | 2.3GB | `ollama run phi3` |
| Phi 3 Medium | 14B | 7.9GB | `ollama run phi3:medium` |
| Gemma 2 | 2B | 1.6GB | `ollama run gemma2:2b` |
| Gemma 2 | 9B | 5.5GB | `ollama run gemma2` |
| Gemma 2 | 27B | 16GB | `ollama run gemma2:27b` |
| Mistral | 7B | 4.1GB | `ollama run mistral` |
| mistral-small | 22B | 13GB | `ollama run mistral-small` |
| Moondream 2 | 1.4B | 829MB | `ollama run moondream` |
| Neural Chat | 7B | 4.1GB | `ollama run neural-chat` |
| Starling | 7B | 4.1GB | `ollama run starling-lm` |
| Code Llama | 7B | 3.8GB | `ollama run codellama` |
| Llama 2 Uncensored | 7B | 3.8GB | `ollama run llama2-uncensored` |
| LLaVA | 7B | 4.5GB | `ollama run llava` |
| Solar | 10.7B | 6.1GB | `ollama run solar` |
Ollama Model Library
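
The download commands above all take the form `ollama run <name>` or `ollama run <name>:<tag>`, where the tag selects a size or variant (for example `70b` or `medium`). As a minimal sketch, a script could assemble that command as follows. The helper name `ollama_run_command` is hypothetical and not part of Ollama.

```python
def ollama_run_command(model, tag=None):
    """Build the argument list for `ollama run <model>[:<tag>]`.

    Hypothetical helper for illustration; it only builds the command
    and does not invoke Ollama.
    """
    name = f"{model}:{tag}" if tag else model
    return ["ollama", "run", name]


# Commands matching two rows of the table above.
print(" ".join(ollama_run_command("llama3.1", "70b")))  # ollama run llama3.1:70b
print(" ".join(ollama_run_command("mistral")))          # ollama run mistral
```

The returned list can be passed to `subprocess.run` on a machine where Ollama is installed.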

Note:

You should have at least 8 GB of RAM available to run the 7B models, 16 GB to run the 13B models, and 32 GB to run the 33B models.
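
The RAM guidance in the note can be expressed as a small lookup. This is only a sketch: the note gives three data points (7B, 13B, 33B), so rounding in-between sizes up to the next tier and returning `None` above 33B are assumptions made here, and the names `RAM_TIERS` and `min_ram_gb` are hypothetical.

```python
# Tiers taken from the note: (model size in billions of parameters, RAM in GB).
RAM_TIERS = [(7, 8), (13, 16), (33, 32)]


def min_ram_gb(params_billion):
    """Return the suggested minimum RAM in GB for a model of the given size.

    Assumption: sizes between tiers are rounded up to the next tier.
    Returns None for models larger than 33B, which the note does not cover.
    """
    for max_params, ram_gb in RAM_TIERS:
        if params_billion <= max_params:
            return ram_gb
    return None


print(min_ram_gb(7))   # 8
print(min_ram_gb(13))  # 16
print(min_ram_gb(70))  # None
```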

Supported Models

Ollama supports a wide range of Large Language Models (LLMs) tailored for various tasks, such as coding, general-purpose language understanding, multilingual applications, and scientific discussions. These models vary in size and capabilities, with parameter counts ranging from a few billion to over 400 billion. Below is a table listing the supported models along with a brief description and associated tags.

| Model Name | Description | Tags |
| --- | --- | --- |
| Llama 3.2 | Meta’s Llama 3.2 goes small with 1B and 3B models. | Tools |
| Llama 3.1 | Meta’s latest state-of-the-art LLM available in 8B, 70B, and 405B parameter sizes. | Tools |
| Gemma 2 | High-performing model by Google, available in 2B, 9B, and 27B parameter sizes. | |
| Mistral-Nemo | A state-of-the-art 12B model with 128k context length, developed by Mistral AI and NVIDIA. | Tools |
| Mistral Large 2 | Mistral’s flagship model excels in code generation, mathematics, and reasoning, supporting a 128k context window. | Tools |
| Qwen 2 | A series of large language models by Alibaba, ranging from 0.5B to 72B parameters. | Tools |
| Qwen2.5 | The latest series of Qwen large language models, released as base and instruction-tuned models in sizes from 0.5B to 72B parameters. | Tools |
| DeepSeek Coder V2 | Open-source Mixture-of-Experts code language model comparable to GPT-4 Turbo in coding tasks. | Code |
| Phi-3 | Lightweight LLMs by Microsoft in 3B (Mini) and 14B (Medium) parameter sizes. | |
| Mistral | The updated 7B model by Mistral AI. | Tools |
| Mixtral | Mixture of Experts model with open weights available in 8x7B and 8x22B parameter sizes. | Tools |
| mistral-small | Mistral Small is a lightweight model designed for cost-effective use in tasks like translation and summarization. | Tools |
| CodeGemma | Powerful models for coding tasks like code generation and natural language understanding. | Code |
| Command R | LLM optimized for conversational interaction and long context tasks. | Tools |
| Command R+ | Scalable LLM designed for enterprise use cases. | Tools |
| LLaVA | Multimodal model combining vision encoder and Vicuna for visual and language understanding, updated to version 1.6. | Vision |
| Llama 3 | Meta’s most capable openly available LLM. | |
| Gemma | Lightweight state-of-the-art models by Google DeepMind, updated to version 1.1. | |
| Qwen | Large language models by Alibaba Cloud, available in sizes up to 110B parameters. | |
| Llama 2 | A collection of foundation language models ranging from 7B to 70B parameters. | |
| CodeLlama | LLM that generates and discusses code using text prompts, with sizes up to 70B parameters. | Code |
| Nomic Embed Text | High-performing open embedding model with a large token context window. | Embedding |
| Dolphin Mixtral | Uncensored models based on Mixtral, optimized for coding tasks, available in 8x7B and 8x22B sizes. | |
| Phi | A 2.7B language model by Microsoft Research, excelling in reasoning and language understanding. | |
| Llama 2 Uncensored | Uncensored version of Llama 2 by George Sung and Jarrad Hope. | |
| DeepSeek Coder | Capable coding model trained on two trillion code and natural language tokens. | Code |
| MXBAI Embed Large | State-of-the-art large embedding model by Mixedbread.ai. | Embedding |
| Dolphin Mistral | Uncensored model based on Mistral, excelling at coding tasks, updated to version 2.8. | |
| Zephyr | Fine-tuned versions of Mistral and Mixtral models designed to act as helpful assistants. | |
| StarCoder 2 | Next-generation open code LLMs available in 3B, 7B, and 15B parameter sizes. | Code |
| Dolphin Llama 3 | A new model by Eric Hartford based on Llama 3, available in 8B and 70B sizes. | |
| Orca Mini | General-purpose model ranging from 3B to 70B parameters, suitable for entry-level hardware. | |
| Yi 1.5 | High-performing bilingual language model. | |
| Mistral OpenOrca | Fine-tuned on the OpenOrca dataset, based on the Mistral 7B model. | |
| LLaVA Llama 3 | LLaVA model fine-tuned from Llama 3 Instruct, with better benchmark scores. | Vision |
| StarCoder | Code generation model trained on over 80 programming languages. | Code |
| Vicuna | General-use chat model based on Llama and Llama 2, with context sizes from 2K to 16K. | |
| TinyLlama | An open project to train a compact 1.1B Llama model on 3 trillion tokens. | |
| Llama 2 Chinese | Llama 2 model fine-tuned for improved Chinese dialogue ability. | |
| Codestral | Mistral AI’s first-ever code model designed for code generation tasks. | Code |
| Wizard Vicuna | Uncensored model based on Llama 2, available in 7B, 13B, and 30B parameter sizes. | |
| Nous Hermes 2 | Powerful models by Nous Research, excelling in scientific discussion and coding tasks. | |
| CodeGeeX 4 | Versatile model for AI software development scenarios, including code completion. | Code |
| OpenChat | Open-source models trained on diverse data, surpassing ChatGPT on various benchmarks. | |
| Aya 23 | A state-of-the-art multilingual model supporting 23 languages, released by Cohere. | |
| Granite Code | IBM’s open foundation models for Code Intelligence. | Code |
| WizardLM 2 | Advanced LLM by Microsoft AI, optimized for complex chat, multilingual, reasoning, and agent use cases. | |
| TinyDolphin | An experimental 1.1B parameter model based on TinyLlama and trained on the Dolphin 2.8 dataset by Eric Hartford. | |
| CodeQwen | A large language model pre-trained on extensive code data. | Code |
| WizardCoder | A state-of-the-art model for code generation. | Code |
| Stable Code | A coding model with instruct and code completion variants, competitive with larger models like CodeLlama 7B. | Code |
| All MiniLM | Embedding models trained on very large sentence-level datasets. | Embedding |
| OpenHermes | A 7B model fine-tuned by Teknium on Mistral using fully open datasets. | |
| Stable LM 2 | A multilingual model trained on English, Spanish, German, and other languages, available in 1.6B and 12B sizes. | |
| Wizard Math | Model focused on solving math and logic problems. | |
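
The tags in the list above (Tools, Code, Vision, Embedding) make it easy to shortlist models for a task. The sketch below filters a hand-copied subset of the entries by tag. The `MODELS` list and the `models_with_tag` helper are illustrative assumptions, not an Ollama API.

```python
# A few entries copied from the list above: (model name, tags).
MODELS = [
    ("Llama 3.1", ["Tools"]),
    ("Gemma 2", []),
    ("DeepSeek Coder V2", ["Code"]),
    ("LLaVA", ["Vision"]),
    ("Nomic Embed Text", ["Embedding"]),
    ("StarCoder 2", ["Code"]),
]


def models_with_tag(models, tag):
    """Return the names of all models carrying the given tag, in list order."""
    return [name for name, tags in models if tag in tags]


print(models_with_tag(MODELS, "Code"))    # ['DeepSeek Coder V2', 'StarCoder 2']
print(models_with_tag(MODELS, "Vision"))  # ['LLaVA']
```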