Back
tips2026-02-21

The Ultimate OpenClaw Cost Guide: Cut Your Bill from $200 to $10-50 with 10 Config Changes

Author: @lijiuer92Original
AD SLOT — TOP

The Ultimate OpenClaw Cost Guide

Source: https://x.com/lijiuer92/status/2025081922410443243 Author: @lijiuer92 Published: 2026-02-21 Stats: 👍 504 | 🔁 108 | 👁 75,817


OpenClaw is the most capable open-source AI agent out there. But most people's first monthly bill is a shock.

Three things to know upfront:

  1. 80% of your spend is going to places you don't know about (heartbeat daemon, session bloat, unused skill injection)
  2. The February 2026 updates (Sonnet 4.6 + 1M Context + nested sub-agents) completely rewrote the cost equation
  3. These 10 config changes can cut your bill from $200+ to $10-50 — not theory, community-verified results

Skip to Part 6 "The 10-Step Checklist" if you just want the fixes.


Part 1: Where Your Money Actually Goes

Token consumption breakdown

Five money drains, ranked by damage:

🕳️ Drain 1: The Heartbeat Daemon — burning money while you sleep

OpenClaw's Heartbeat Daemon wakes your agent every 30 minutes to check for new tasks. Each heartbeat is a full API call carrying your entire conversation history.

Once your session hits 170K-210K tokens: one heartbeat costs $0.51-$0.63, that's 48 times a day, $720-$900 per month. You're not doing anything. The agent is billing you in the background.

🕳️ Drain 2: Session bloat — the longer you use it, the more each message costs

Round 1: "hello" costs $0.003. Round 50: the same "hello" costs $0.30. That's 100x more expensive.

🕳️ Drain 3: System prompt — the "base rent" on every request

A typical system prompt runs 5,000-10,000 tokens. At 100 interactions per day, that's $67.50/month just in system prompt overhead.

🕳️ Drain 4: Skill injection — installed but unused skills still cost money

Every enabled skill injects 200-400 tokens into the system prompt. Got 18 skills installed but only use 5? Those 13 unused skills waste 3,200 tokens per request.

🕳️ Drain 5: Workspace file injection

GitHub Issue #9157: OpenClaw injects your workspace file list into every message, wasting 93.5% of the token budget in some cases.


Part 2: February Updates You Need to Know

Version update comparison

Update 1: Sonnet 4.6 — free upgrade, double the performance

ARC-AGI-2 jumped from 13.6% to 58.3% — a 4.3x improvement. Price: identical to Sonnet 4.5 at $3/$15 per MTok.

User preference tests: Sonnet 4.6 vs Opus 4.5 = 59:41. More than half of users found Sonnet 4.6 comparable to Opus, at 60% of the price.

Update 2: 1M Context — with a cliff

1M Context pricing trap

⚠️ The trap: once you cross 200K tokens, all tokens double in price. Going from 199K to 201K doesn't cost you 2K extra tokens — it doubles the price of all 201K. 200K is a cost cliff.

Correct usage: even with 1M available, trigger compaction before hitting 200K.

Update 3: Nested sub-agents

Up to 5 levels deep, theoretically 31 concurrent agent sessions. Each has its own context. Each is billing you. Use cheap models for sub-agents.


Part 3: Model Routing — Highest ROI Strategy

Model routing strategy

The Opus-as-Orchestrator pattern:

Opus only plays "general": analyze tasks, make plans, delegate to cheaper models, review results.

Cost comparison (100K tokens/day):

  • All Opus: $3.00/day
  • Opus orchestration + Sonnet execution: $1.92/day, 36% savings
  • Opus orchestration + Haiku execution: $0.84/day, 72% savings

Part 4: Prompt Caching — Free Money

Prompt caching savings

OpenClaw's system prompt runs 5,000-10,000 tokens. About 40-60% of every request is identical repeated content. Anthropic's Prompt Caching gives that portion a 90% discount.

Light usage: 86% savings. Medium: 88%. Heavy: 89%. For high-frequency agents, this is hundreds of dollars.


Part 5: Local Models — Zero API Cost

ollama pull qwen3:32b

OpenClaw natively supports Ollama auto-discovery. Local calls: $0.

Minimum viable quality: 14B+. Community consensus is that 8B models hallucinate too often on tool calls — fatal for agent workflows.


Part 6: The 10-Step Checklist

10-step cost checklist

StepActionExpected Savings
1Disable heartbeat: agents.defaults.heartbeat.every: "0m"$150-900/mo
2/compact after every taskVaries
3Switch default to Sonnet 4.640% off
4Disable unused skills200-400 tokens/request/skill
5thinkingDefault: "minimal"50% reasoning tokens
6Enable prompt caching86-89% on system prompt
7compaction.mode: "safeguard", targetTokens: 50000Saves tokens + preserves memory
8Model routing: 50% Haiku + 30% Sonnet + 20% Opus60-72% off
9Add "Be concise" to SOUL.md + max_tokens limit30-50% off output
10Deploy local model (Ollama)Cut bill in half

Part 7: Real Cases

Real case comparisons

  • $150 → $35/mo (77% cut): Disabled 13 unused skills + reset after tasks + prompt caching. Under 1 hour of work.
  • $1,200 → $50/mo (96% cut): Full model routing + heartbeat on local + output limits.
  • $400/mo, no regrets: Coordinating multiple dev instances from Disneyland via Telegram. "Better ROI than hiring staff."

Quick Config Template

Quick config template

agents:
  defaults:
    model: "anthropic/claude-sonnet-4-6"
    fallbackModel: "anthropic/claude-haiku-4-5"
    heartbeat:
      every: "0m"
    thinking:
      default: "minimal"
    compaction:
      mode: "safeguard"
      targetTokens: 50000
  subAgents:
    model: "anthropic/claude-haiku-4-5"
    thinking:
      sonnet: false
      opus: true

Expected monthly spend: moderate usage $30-60, heavy usage $80-150.


Original: https://x.com/lijiuer92/status/2025081922410443243 | via @lijiuer92

AD SLOT — BOTTOM