Open models
The open-source models worth running on your own machine. Switch between Mobile and Desktop; each list runs lightest to heaviest. The list refreshes automatically every week.
Free to run locally
Open-weight models are free to download and run on your own hardware forever — no account, no API key, no per-use bill. This is the whole point of the app.
Also on the cloud (optional)
Many of these are also reachable as a hosted API via OpenRouter — often with a free tier, for when you're away from your machine. You never have to use it.
Models small enough to run on a phone (8–12 GB RAM) at Q4 (4-bit) quantization, lightest first. Each shows whether it runs on iOS, Android, or both. See "Install on phone" for the apps to use.
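As a rough check on why these sizes fit a phone, here is a minimal sketch of the weight-size arithmetic. It assumes about 4.5 bits per weight for a Q4 file (the 4-bit values plus their scaling data), which is an illustrative figure and not a number from this page. It counts only the weights, so the context cache, the app, and the operating system all need memory on top of this. The function name is hypothetical.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float = 4.5) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes).

    bits_per_weight defaults to 4.5, an assumed average for Q4-style files.
    Weights only: context cache and runtime overhead are not included.
    """
    total_bits = params_billions * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9


# Parameter counts matching the model sizes listed on this page.
for label, size_b in [("1B", 1.0), ("3B", 3.0), ("4B", 4.0)]:
    print(f"{label}: roughly {weight_memory_gb(size_b):.2f} GB of weights")
```

Under these assumptions even the 4B models need only a few GB for weights, which leaves headroom on an 8–12 GB phone for everything else that has to stay in memory.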
Gemma 4 1B
Google: Runs on a phone. Great for on-device drafting and simple chat.
Llama 4 3B
Meta: Broad ecosystem support. Safe default for small hardware.
Qwen3.5 4B
Alibaba: Best small multilingual model. Vision-capable. The mobile sweet spot.
Gemma 4 4B
Google: Text + image input, 128K context, 140+ languages. Clean writing.