IBM’s updated Granite 3.1 models are here! 4 sizes using two different architectures.

Mixture of expert models are designed for low latency usage.
ollama run granite3-moe:1b
ollama run granite3-moe:3b
Dense models are designed for tool-based use cases.
ollama run granite3.1-dense:2b
ollama run granite3.1-dense:8b
IBM is also releasing their embedding models today!
English only:
ollama pull granite-embedding:30m
Multilingual:
ollama pull granite-embedding:278m