
Overview

Whim gives you two controls to tune the speed vs quality tradeoff: fast mode (available for Claude and Codex) and reasoning effort (Codex only). These let you optimize for quick iteration when speed matters, or deep reasoning when accuracy matters.

Fast Mode

Fast mode tells the AI provider to prioritize response speed over depth of reasoning. It’s useful for straightforward tasks where you want rapid output — simple edits, boilerplate, quick fixes, and iterative changes.

How It Works

| Provider | Effect | Supported Models |
| --- | --- | --- |
| Claude | Passes --effort low to the CLI, reducing reasoning depth | All Claude models (Opus 4.6, Sonnet 4.5, Sonnet 4.5 1M, Haiku 4.5) |
| Codex | Sets service_tier=fast in the Codex config | GPT 5.4 only |
| CCR + OpenRouter | Not available | N/A |
Fast mode for Claude uses the same model — it does not switch to a different, smaller model. It reduces reasoning effort to produce faster output.
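
As an illustration, here is what the fast-mode setting from the table above might look like if you edited the Codex config directly. Only the service_tier=fast key/value comes from this page; the file path and the other lines are assumptions.

```toml
# Hypothetical Codex config fragment (~/.codex/config.toml is an assumed path).
# Whim sets this for you when fast mode is enabled; only service_tier = "fast"
# is documented above -- the surrounding keys are illustrative.
model = "gpt-5.4"
service_tier = "fast"
```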

When to Use Fast Mode

Use fast mode when:
  • The task is straightforward (renaming, formatting, simple bug fixes)
  • You’re iterating quickly and want rapid feedback
  • You’re running multiple tasks in parallel and want to conserve time
  • The task doesn’t require deep architectural reasoning

Avoid fast mode when:
  • The task involves complex logic or multi-step reasoning
  • You need the agent to carefully analyze trade-offs
  • The task requires understanding large amounts of context
  • Accuracy is more important than speed

Enabling Fast Mode

You can enable fast mode at three levels:
  1. Per task — Toggle fast mode when creating a task. This overrides all defaults for that task only.
  2. User default — Go to Settings > My Defaults and enable fast mode under the relevant provider section. Applies to all your new tasks.
  3. Workspace default — Go to Settings > Workspace > Defaults and enable fast mode. Applies to all new tasks for all workspace members (unless overridden by a user default).
Codex has fast mode enabled by default for GPT 5.4. Claude has fast mode disabled by default.

Reasoning Effort (Codex Only)

Reasoning effort controls how much compute the Codex model spends thinking through a problem before responding. Higher effort means more thorough analysis but slower output.

Levels

| Level | Best for | Speed |
| --- | --- | --- |
| Low | Simple, well-defined tasks. Minimal reasoning needed. | Fastest |
| Medium | Moderate tasks. Some reasoning helpful but not critical. | Fast |
| High | Complex tasks that benefit from deeper analysis. | Moderate |
| xhigh | Difficult problems requiring extended reasoning: complex debugging, architecture, multi-file refactors. | Slowest |
The default reasoning effort for Codex is xhigh. This gives the best results for most coding tasks. Lower it when you want faster output on simpler work.
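
If you were setting this by hand in a Codex config, it might look like the sketch below. The model_reasoning_effort key name is an assumption; only the level values (low, medium, high, xhigh) come from the table above.

```toml
# Hypothetical Codex config fragment; the key name is an assumption.
# Valid levels per the table above: "low", "medium", "high", "xhigh".
model_reasoning_effort = "xhigh"
```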

Setting Reasoning Effort

Like fast mode, reasoning effort can be set at three levels:
  1. Per task — Choose a reasoning effort level when creating a task
  2. User default — Set in Settings > My Defaults > Codex
  3. Workspace default — Set in Settings > Workspace > Defaults
User defaults override workspace defaults. Per-task settings override both.
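
The precedence rule above can be sketched as a first-match lookup. The function and parameter names here are illustrative, not part of Whim's API:

```python
def resolve_setting(per_task=None, user_default=None, workspace_default=None):
    """Return the effective setting: per-task overrides the user default,
    which overrides the workspace default. Names are illustrative only."""
    for value in (per_task, user_default, workspace_default):
        if value is not None:
            return value
    return None

# A workspace default of "high" is overridden by a user default of "xhigh":
effort = resolve_setting(user_default="xhigh", workspace_default="high")
```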

Speed vs Quality Reference

Here’s how the different controls compare across providers:
| Configuration | Speed | Quality | CU Impact |
| --- | --- | --- | --- |
| Claude Opus 4.6 + fast mode off | Slower | Highest | More container time per task |
| Claude Opus 4.6 + fast mode on | Faster | High | Less container time |
| Claude Sonnet 4.5 + fast mode off | Fast | High | Moderate container time |
| Claude Sonnet 4.5 + fast mode on | Fastest (Claude) | Good | Least container time |
| Claude Haiku 4.5 | Very fast | Good | Low container time |
| Codex GPT 5.4 + xhigh reasoning | Slower | Highest (GPT) | More container time |
| Codex GPT 5.4 + low reasoning | Fastest (Codex) | Lower | Least container time |

CU Impact

Fast mode and lower reasoning effort reduce the time the agent spends on each task, which directly reduces container runtime CU costs. For subscription providers (Claude, Codex), this is the only CU cost. For CCR + OpenRouter, it also reduces token usage since the model generates less reasoning output. The tradeoff is straightforward: faster responses use fewer CU but may produce lower-quality results on complex tasks.
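
As a back-of-the-envelope illustration of that tradeoff, suppose container time is billed per minute. The billing rate and task runtimes below are made-up numbers for the example, not Whim's actual pricing:

```python
# Illustrative only: the rate and runtimes are invented for this example.
CU_PER_MINUTE = 1.0          # assumed billing rate

def container_cu(runtime_minutes, cu_per_minute=CU_PER_MINUTE):
    """CU cost of a task is container runtime times the billing rate."""
    return runtime_minutes * cu_per_minute

normal = container_cu(10)   # task takes 10 min at full reasoning effort
fast = container_cu(6)      # same task finishes in 6 min with fast mode
savings = normal - fast     # fast mode saves 4 CU on this (made-up) task
```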

Next Steps

Model Selection

Choose the right model for your task.

Compute Units

Understand how CU costs work.