0.3 GB minimum VRAM / Q4_K_M
Model Intelligence
Choose local AI models with real hardware context, not catalog chaos.
VRAM Check turns a raw model registry into a decision surface: what is practical, what is frontier-scale, and what really fits local hardware without guesswork. We do not host or distribute models. We surface trusted technical data and official sources.
These anchors help you decide where to start before you dive into the full catalog.
2026-02-20 / DeepSeek / General chat / Math / reasoning
Registry last verified on 2026-03-20. Official sources and hardware-fit envelopes only.
Last registry update: 2026-03-20
121 models match the current pass
DeepSeek
DeepSeek V3.2 (NEW)
Latest DeepSeek V3.2 profile with improved quality and efficiency.
GPT OSS
GPT OSS (NEW)
Open GPT-style profile in the Ollama catalog for modern local experimentation.
Kimi
Kimi K2.5 (NEW)
Kimi K2.5 refreshed profile with improved instruction quality.
Gemma
Gemma 3N (NEW)
Gemma 3N profile for edge-friendly, modern local assistant workflows.
GLM
GLM-5 (NEW)
GLM-5 profile for modern multilingual and reasoning workloads.
Phi
Phi-4 Reasoning
Phi-4 reasoning profile tuned for analytical and math-heavy prompts.
Mistral
Devstral 2
Coding-oriented Devstral profile for software engineering workflows.
Granite
Granite 4
IBM Granite 4 profile for enterprise assistant and coding tasks.
Llama
Llama 4 Maverick
Llama 4 Maverick profile for frontier-scale deployments.
Llama
Llama 4 Scout
Llama 4 Scout profile focused on balanced frontier quality.
Kimi
Kimi K2
Kimi K2 profile for high-end local reasoning and assistant tasks.
Qwen
Qwen3.5 32B Instruct
Refined Qwen3.5 generation with stronger instruction reliability.