Skip to main content

Overview

Whim gives you access to 20+ models across three providers. You can set a default model for your workspace and override it on individual tasks. Different models offer different tradeoffs between speed, quality, context window, and CU cost.

Available Models

Claude Models (Anthropic)

Available via CCR + OpenRouter and Claude Subscription.
ModelModel IDStrengthsBest for
Claude Opus 4.6claude-opus-4.6Highest capability, deep reasoningComplex architecture, difficult bugs, nuanced refactors
Claude Sonnet 4.5claude-sonnet-4.5Strong balance of speed and qualityGeneral-purpose coding, most tasks
Claude Sonnet 4.5 (1M)claude-sonnet-4.5-1mExtended 1M token context windowLarge codebases, cross-file analysis
Claude Haiku 4.5claude-haiku-4.5Fastest Claude modelQuick edits, simple tasks, rapid iteration

GPT Models (OpenAI)

ModelModel IDProviderBest for
GPT 5.4gpt-5.4Codex onlyFlagship GPT tasks via native Codex CLI
GPT 5.3gpt-5.3OpenRouterGeneral-purpose GPT coding
GPT 5.3 Codexgpt-5.3-codexOpenRouter, CodexCode-optimized GPT 5.3
GPT 5.3 Codex Spark (Preview)gpt-5.3-codex-sparkOpenRouter, CodexFast, lightweight code tasks
GPT 5.2gpt-5.2OpenRouterBudget-friendly GPT option
GPT 5.2 Codexgpt-5.2-codexOpenRouter, CodexBudget-friendly code-optimized GPT
GPT 5.1 Codex Minigpt-5.1-codex-miniOpenRouter, CodexFastest/cheapest GPT option
GPT OSS 120Bgpt-oss-120bOpenRouterOpen-source GPT variant

Google Models

ModelModel IDProviderBest for
Gemini 3 Progemini-3-pro-previewOpenRouterComplex reasoning, large context
Gemini 2.5 Flashgemini-2.5-flashOpenRouterFast, cost-effective tasks

Other Models

ModelModel IDProviderBest for
Grok Code Fastgrok-code-fast-1OpenRouterRapid code generation
DeepSeek V3.2deepseek-v3.2OpenRouterCost-effective coding
MiniMax M2.5minimax-m2.5OpenRouterGeneral coding tasks
Qwen3 Coder Nextqwen3-coder-nextOpenRouterCode-focused tasks
Kimi K2.5kimi-k2.5OpenRouterGeneral coding tasks

Setting Your Default Model

Your default model determines which model runs when you create new tasks. You can set defaults at two levels:

Workspace Default

Workspace defaults apply to all members. A workspace admin can set these in Settings > Workspace > Defaults.

User Default

Your personal default overrides the workspace default. Set it in Settings > My Defaults or during onboarding. Each provider has its own default model:
  • CCR + OpenRouter: Claude Sonnet 4.5
  • Claude Subscription: Claude Opus 4.6
  • Codex Subscription: GPT 5.4
When you switch providers, the model automatically switches to that provider’s default unless you’ve set a specific model override.

Per-Task Model Override

You can override the default model when creating any task. In the task creation dialog, select a different model from the model dropdown. This override only affects that specific task — your default stays the same.
Per-task overrides are useful when you want a more capable model for a specific complex task, or a faster/cheaper model for a simple one.

CU Cost Considerations

How models affect your CU usage depends on your provider:

CCR + OpenRouter

CU cost has two components:
  1. Container runtime — charged at 1 CU per 30 minutes regardless of model
  2. API tokens — varies by model; more capable models cost more tokens per CU
More capable models (like Claude Opus 4.6) use more tokens per CU than lighter models (like Claude Haiku 4.5 or Gemini 2.5 Flash). If CU efficiency matters, consider using a lighter model for straightforward tasks and reserving premium models for complex work.

Claude Subscription / Codex Subscription

With subscription providers, Whim only charges CU for container runtime (1 CU per 30 minutes). Token costs go through your existing Anthropic or OpenAI subscription. This makes subscription providers more CU-efficient for token-heavy tasks.

Model Selection Tips

The default model for each provider is a good general-purpose choice. Try it first and switch if you need more speed or capability.
Claude Opus 4.6 and GPT 5.4 are best for tasks that require deep reasoning — complex debugging, large refactors, or architectural decisions.
Claude Haiku 4.5, Gemini 2.5 Flash, and GPT 5.1 Codex Mini are fast and cost-effective for simple edits, formatting, boilerplate, and quick fixes.
Claude Sonnet 4.5 (1M context) is ideal when the agent needs to reason across many files simultaneously.

Next Steps

Fast Mode & Reasoning

Fine-tune speed vs quality with fast mode and reasoning effort.

Compute Units

Understand how CU usage is calculated.