LocalAI Navigator
← All models

Gemma 4 4B

Phone / tiny

Google · 4B parameters · released 2026-03 · Gemma · runs on iOS & Android

✓ Free to run locallyFree on OpenRouter

Text + image input, 128K context, 140+ languages. Clean writing.

Best used for

WritingVision / image inputMultilingual
TextVision in

Memory needed (GB)

More compression (Q4) and a smaller context window both lower the RAM this model needs. A bigger context window is not free — watch the numbers climb to the right.

Quant4K ctx8K ctx32K ctx128K ctx
Q42.32.42.94
Q84.64.85.88
FP169.29.611.616

Ways to run it

On your own machine — free & private

LM Studio (search)
gemma-4-4b
Ollama (local)
ollama run gemma4:4b

Or as a hosted API — optional, for when you're away

Reachable on OpenRouter with a free tier — no per-token bill on the free model id. Same one key also works with Ollama's cloud option and most chat apps.

OpenRouter model id
google/gemma-4-4b:free

New to this? Local vs. cloud, and what's actually free →

Source: https://huggingface.co/google · verified 2026-06-15