← All articles
BudgetKill SwitchTutorialCost

How to Set Budget Limits on AI Agents (Step by Step)

2025-04-15·6 min read·Nova — @NovaShips

Two Types of Budget Limits

Before you set budget limits, it's worth understanding the two types and what each one does.

Provider total caps — Set on OpenAI, Anthropic, or Google. Stops all API calls when your total account spend reaches a threshold. Protects against catastrophic overspend but applies to every agent and workflow equally.

Per-agent budget caps — Set in your monitoring tool. Stops a specific agent when it hits its budget. Other agents keep running. This is what AgentShield adds.

Most teams need both.

Step 1: Set a Provider-Level Safety Net

First, set a hard cap on your provider account. This is your last line of defense.

OpenAI:

1. Go to platform.openai.com → Settings → Limits

2. Set a "Hard limit" — all API calls stop when this is hit

3. Set a "Soft limit" — you get an email warning before the hard limit

Anthropic:

Anthropic uses a prepaid credit model. Buy credits, and usage draws from that balance. Set your initial credit purchase conservatively until you understand your usage patterns.

Google AI:

Set billing alerts in Google Cloud Console at 50%, 80%, and 100% of your expected budget.

Step 2: Identify Your Agents and Their Expected Costs

Before setting per-agent caps, you need baselines. If you don't have them yet, run your agents for a week with tracking enabled and measure.

Typical patterns:

  • Support agents (short context, high volume): $0.05-0.50 per session
  • Research agents (long context, low volume): $0.50-5.00 per session
  • Classifier agents (minimal tokens, very high volume): $0.001-0.01 per session
  • Set your initial caps at 3-5x your expected daily cost. This gives room for legitimate spikes while catching runaway loops.

    Step 3: Set Per-Agent Caps in AgentShield

    Using the Python SDK:

    import agentshield as shield
    
    # Set a daily budget cap with automatic freeze
    shield.set_budget(
        agent_id="research-agent",
        max_usd=50.0,
        period="daily",
        action="freeze"  # or "alert" for email/Slack only
    )
    
    # Set a monthly cap on a cheaper agent
    shield.set_budget(
        agent_id="support-agent",
        max_usd=200.0,
        period="monthly",
        action="freeze"
    )

    Using the REST API (for n8n, Make, Zapier, or any language):

    curl -X POST https://api.agentshield.one/v1/budgets \
      -H "Authorization: Bearer ags_live_xxxxx" \
      -H "Content-Type: application/json" \
      -d '{
        "agent_id": "research-agent",
        "max_usd": 50.0,
        "period": "daily",
        "action": "freeze"
      }'

    Step 4: Configure Alerts

    Set up alerts before the cap is hit:

    shield.set_alert(
        agent_id="research-agent",
        threshold_pct=80,  # Alert at 80% of budget
        channels=["email", "slack"]
    )

    This gives you time to investigate before the agent freezes.

    Step 5: Test Your Caps

    Before going to production, verify your caps work:

    1. Set a very low cap (e.g., $0.01) on a test agent

    2. Make a few API calls through it

    3. Verify the agent freezes and you receive the alert

    4. Reset the cap to your production value

    What Happens When a Cap Is Hit

    When an agent hits its budget cap with action set to "freeze":

  • The agent returns HTTP 402 on the next call
  • An alert is sent to your configured channels
  • The agent stays frozen until you manually reset it or the period resets
  • All other agents continue running normally
  • This is the key difference from provider total caps: one agent's problem doesn't affect the rest.

    Recommended Cap Strategy

    | Agent type | Daily cap | Monthly cap | Action |

    |-----------|-----------|-------------|--------|

    | Support/customer | $10 | $200 | Freeze |

    | Research/long-context | $50 | $500 | Alert at 80%, freeze at 100% |

    | Classifier/cheap | $5 | $50 | Freeze |

    | Experimental/new | $2 | $20 | Freeze |

    Start conservative. You can always raise caps. It's harder to explain a surprise $800 charge.

    Ready to monitor your AI agents?

    Set up AgentShield in 5 minutes. Free plan available.

    Start for Free →