Use multiple models

Assign different models to different tasks. For example, Haiku searches your codebase, Sonnet writes the code, and Flash compacts context.

Open the router

/router

Or edit ~/.soulforge/config.json:

{
  "taskRouter": {
    "spark":      "anthropic/claude-haiku-4-5",
    "ember":      "anthropic/claude-sonnet-4-5",
    "webSearch":  "anthropic/claude-haiku-4-5",
    "desloppify": "anthropic/claude-haiku-4-5",
    "verify":     "anthropic/claude-haiku-4-5",
    "compact":    "google/gemini-2.5-flash",
    "semantic":   "anthropic/claude-haiku-4-5",
    "default":    null
  }
}

What each slot does

Slot	Runs when	Good choice
`spark`	Read-only agents explore your code	Fast/cheap (Haiku, Flash)
`ember`	Agents that edit files	Strong coding model (Sonnet, Opus)
`webSearch`	Web search agents	Fast/cheap
`desloppify`	Cleanup pass after code edits	Fast/cheap
`verify`	Adversarial review after code edits	Medium strength
`compact`	Context compaction	Fast/cheap (Flash is ideal)
`semantic`	Repo map one-line symbol summaries	Fast/cheap
`default`	Fallback when no slot matches	—
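
The fallback behaviour in the table can be sketched in a few lines. This is a minimal illustration, not SoulForge's actual code: `resolve_model` and `active_model` are hypothetical names, and it assumes that a `null` default means "use whichever model is currently active".

```python
from typing import Optional


def resolve_model(task_router: dict, slot: str, active_model: str) -> str:
    """Pick a model ID for a slot (hypothetical helper for illustration)."""
    model: Optional[str] = task_router.get(slot)
    if model:
        return model
    fallback: Optional[str] = task_router.get("default")
    # Assumption: a null/missing default falls through to the active model.
    return fallback if fallback else active_model


router = {
    "spark": "anthropic/claude-haiku-4-5",
    "ember": "anthropic/claude-sonnet-4-5",
    "default": None,
}

# "spark" is configured, so its own model is used.
print(resolve_model(router, "spark", "anthropic/claude-sonnet-4-5"))
# "verify" is not configured and default is null, so the active model is used.
print(resolve_model(router, "verify", "anthropic/claude-sonnet-4-5"))
```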

Recommended setups

Balanced

Strong code, cheap everything else.

"spark": "anthropic/claude-haiku-4-5",
"ember": "anthropic/claude-sonnet-4-5",
"compact": "google/gemini-2.5-flash"

Budget

Haiku everywhere. Still very capable for most tasks.

"spark": "anthropic/claude-haiku-4-5",
"ember": "anthropic/claude-haiku-4-5",
"compact": "google/gemini-2.5-flash"

Max quality

Sonnet for code, Sonnet for exploration too.

"spark": "anthropic/claude-sonnet-4-5",
"ember": "anthropic/claude-sonnet-4-5",
"compact": "google/gemini-2.5-flash"

One gateway

Same models, one key via LLM Gateway.

"spark": "llmgateway/claude-haiku-4-5",
"ember": "llmgateway/claude-sonnet-4-5"

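Each setup lists only a few slots, so applying one amounts to overlaying those keys on the existing `taskRouter` object. A rough sketch of that merge follows. `apply_preset` is a hypothetical helper, and it assumes that slots the setup does not mention keep their current values.

```python
import json


def apply_preset(config: dict, preset: dict) -> dict:
    """Return a copy of config with the preset's slots overriding taskRouter."""
    merged = dict(config)
    # Assumption: slots absent from the preset are left as they were.
    merged["taskRouter"] = {**config.get("taskRouter", {}), **preset}
    return merged


config = {
    "taskRouter": {
        "spark": "anthropic/claude-haiku-4-5",
        "ember": "anthropic/claude-sonnet-4-5",
        "verify": "anthropic/claude-haiku-4-5",
        "default": None,
    }
}

# The "Budget" setup from this page.
budget = {
    "spark": "anthropic/claude-haiku-4-5",
    "ember": "anthropic/claude-haiku-4-5",
    "compact": "google/gemini-2.5-flash",
}

print(json.dumps(apply_preset(config, budget), indent=2))
```

The result could then be written back to ~/.soulforge/config.json by hand or with `json.dump`.
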
Why this saves money

Agents spend ~70% of their tokens on exploration (reading files, running greps, navigating the Soul Map). That work doesn’t need an expensive model. Reserve your strong model for the ~30% that’s actually writing code.
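
To make the 70/30 split concrete, here is a back-of-the-envelope calculation. The prices are hypothetical placeholders, not real provider rates, and the sketch simplifies by using a single per-token price for each model. With a cheap model at 1 unit per million tokens and a strong model at 3, routing costs 0.7 × 1 + 0.3 × 3 = 1.6 per million instead of 3, a little under half.

```python
def blended_cost(total_tokens: int, explore_share: float,
                 cheap_price: float, strong_price: float) -> float:
    """Cost when exploration goes to the cheap model and the rest to the strong one.

    Prices are per million tokens.
    """
    explore_tokens = total_tokens * explore_share
    write_tokens = total_tokens - explore_tokens
    return (explore_tokens * cheap_price + write_tokens * strong_price) / 1_000_000


# Hypothetical prices per million tokens; substitute your providers' real rates.
PRICE_CHEAP = 1.0
PRICE_STRONG = 3.0

tokens = 10_000_000
single = tokens * PRICE_STRONG / 1_000_000          # strong model for everything
routed = blended_cost(tokens, 0.70, PRICE_CHEAP, PRICE_STRONG)
saving = 1 - routed / single

print(f"single model: {single:.2f}, routed: {routed:.2f}, saving: {saving:.0%}")
```

The actual saving depends on the price gap between your two models and on how much of your workload is really exploration.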
