Open /provider-settings to tune the thinking mode and budget per model, or edit the config directly:
{
  "thinking": {
    "mode": "adaptive"
  }
}

Thinking modes

Mode                  What it does
off                   No thinking blocks
adaptive (or auto)    Model decides when to think (Claude 4+)
enabled               Fixed-budget thinking: specify tokens
Token budgets available when mode is enabled: 1024, 2048, 5000, 10000, 20000.
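
As a rough illustration of the rules above, the following sketch checks a thinking config before it is saved. It is not SoulForge's own validation: the `budget` key name and the `check_thinking` helper are assumptions made for this example, and it assumes the listed budgets are the only accepted values.

```python
# Sketch only: the "budget" key name is hypothetical; the docs name only "mode".
ALLOWED_MODES = {"off", "adaptive", "auto", "enabled"}
ALLOWED_BUDGETS = (1024, 2048, 5000, 10000, 20000)


def check_thinking(config: dict) -> list[str]:
    """Return a list of problems found in a thinking config (empty if none)."""
    problems = []
    thinking = config.get("thinking", {})
    mode = thinking.get("mode")
    if mode not in ALLOWED_MODES:
        problems.append(f"unknown mode: {mode!r}")
    budget = thinking.get("budget")  # hypothetical key name
    # A budget only matters when the mode is "enabled".
    if mode == "enabled" and budget not in ALLOWED_BUDGETS:
        problems.append(f"budget must be one of {ALLOWED_BUDGETS}, got {budget!r}")
    return problems
```

Calling `check_thinking({"thinking": {"mode": "adaptive"}})` returns an empty list, while an `enabled` mode without a listed budget returns one problem.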

Context management (Anthropic)

Advanced Anthropic features, applied automatically on supported models:
  • Clear Thinking: auto-clears old thinking blocks and keeps the last 5 turns.
  • Clear Tool Uses: drops old tool content when input exceeds 100k tokens.
  • Compact: API-side compaction at 75% of the context window.
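
To make the two thresholds above concrete, here is a minimal sketch of when each would trigger. These features run on the API side, so this is not code you need to write. The function and action names are hypothetical, and it assumes the 100k limit is measured in input tokens and the 75% mark is measured against the model's context window.

```python
CLEAR_TOOL_USES_THRESHOLD = 100_000  # assumed to be input tokens


def context_actions(input_tokens: int, context_window: int) -> list[str]:
    """Return which (hypothetically named) actions the thresholds would trigger."""
    actions = []
    if input_tokens > CLEAR_TOOL_USES_THRESHOLD:
        actions.append("clear_tool_uses")
    # 75% of the context window, written with integers to avoid float rounding.
    if input_tokens * 4 >= context_window * 3:
        actions.append("compact")
    return actions
```

For a 200,000-token context window, 120,000 input tokens would trigger only tool-use clearing, and 150,000 would trigger both.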

Graceful degradation

If a provider rejects an option (unsupported thinking mode, unknown parameter), SoulForge retries with reduced options automatically. You don’t see the error.
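
The retry-with-reduced-options idea can be sketched as follows. This is an illustration of the general pattern, not SoulForge's implementation: `UnsupportedOption`, `call_with_degradation`, and the `send` callback are all hypothetical names.

```python
class UnsupportedOption(Exception):
    """Hypothetical error a provider call raises when it rejects an option."""

    def __init__(self, option: str):
        super().__init__(f"unsupported option: {option}")
        self.option = option


def call_with_degradation(send, options: dict):
    """Call send(options), dropping each rejected option and retrying."""
    remaining = dict(options)  # copy so the caller's dict is not mutated
    while True:
        try:
            return send(remaining)
        except UnsupportedOption as err:
            if err.option not in remaining:
                raise  # nothing left to drop for this error, so surface it
            del remaining[err.option]
```

Each pass either succeeds, removes one option, or re-raises, so the loop always terminates.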

Retries on rate limits

Getting 429s? Bump retry config:
{
  "retry": {
    "maxAttempts": 5,
    "baseDelayMs": 3000
  }
}
With this config, the delay doubles after each failed attempt: 3s, 6s, 12s, 24s, 48s.
Field          Default                       Range
maxAttempts    3                             1–10
baseDelayMs    1000 (chat) / 2000 (agents)   250–60000
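
The doubling schedule can be reproduced with a short calculation. This is a sketch with a hypothetical helper name, `retry_delays`. It only computes the delays, and makes no claim about whether `maxAttempts` counts the first request.

```python
def retry_delays(base_delay_ms: int, count: int) -> list[int]:
    """Return `count` delays in milliseconds, doubling from base_delay_ms."""
    return [base_delay_ms * 2**i for i in range(count)]


# With baseDelayMs = 3000, the first five delays match the sequence above.
print(retry_delays(3000, 5))  # [3000, 6000, 12000, 24000, 48000]
```

The same helper applied to the chat default of 1000 ms gives 1s, 2s, 4s for three delays.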