Introduction
Real-time cost observability and guardrails for AI agents.
What is AgentFlare?
AgentFlare is a lightweight observability and safety layer for AI agents. It sits between your agent code and the LLM API, watching every token that flows through. The moment your agent's spend crosses a limit you define, AgentFlare pauses it automatically and fires a Slack alert — before a runaway loop costs you hundreds of dollars.
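Conceptually, the layer is a metered wrapper around your LLM client: each call's cost is added to a running total, and the agent is paused the moment the total crosses the limit. The sketch below is illustrative only; the class and method names (BudgetGuard, record_call) are assumptions, not the real AgentFlare SDK.

```python
# Illustrative sketch only -- names here are hypothetical, not the AgentFlare SDK.

class BudgetExceeded(Exception):
    """Raised when an agent's tracked spend crosses its daily limit."""

class BudgetGuard:
    def __init__(self, agent_id: str, daily_limit_usd: float):
        self.agent_id = agent_id
        self.daily_limit_usd = daily_limit_usd
        self.spend_usd = 0.0
        self.paused = False

    def record_call(self, cost_usd: float) -> None:
        # Called once per LLM request with that request's dollar cost.
        self.spend_usd += cost_usd
        if self.spend_usd >= self.daily_limit_usd:
            self.paused = True  # auto-pause: the agent stops on the next check
            raise BudgetExceeded(
                f"{self.agent_id} hit its ${self.daily_limit_usd:.2f} daily limit"
            )

guard = BudgetGuard("sales-outreach-v2", daily_limit_usd=0.10)
guard.record_call(0.04)      # under the limit
guard.record_call(0.04)      # still under
try:
    guard.record_call(0.04)  # crosses 0.10 -> agent is paused
except BudgetExceeded:
    pass
```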
Why AgentFlare?
AI agents can fail silently in expensive ways. A bug in a loop, a poorly scoped prompt, or an accidental retry storm can rack up hundreds of dollars in API costs in minutes. Existing monitoring tools show you what happened — AgentFlare stops it while it's happening.
| Problem | AgentFlare's answer |
|---|---|
| Runaway agent loops | Auto-pause at your budget threshold |
| No visibility into token spend | Per-call cost tracking in real time |
| Silent failures in production | Slack alert when an agent is paused |
| Hard to integrate | 3-line SDK drop-in for any Python agent |
Core concepts
Agent
An agent is identified by a string agent_id you choose (e.g. "sales-outreach-v2"). Every event, cost total, and config setting is scoped to that ID.
Event
Every LLM call, tool call, agent start, and agent end is an event. Events carry token counts and model name, which AgentFlare uses to calculate dollar cost.
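The dollar cost of an event follows from its token counts and the per-token price of its model. A minimal sketch of that calculation, using a made-up price table (real prices vary by model and provider):

```python
# Hypothetical per-1K-token prices in USD -- illustrative, not real pricing.
PRICES = {
    "gpt-4o":      {"input": 0.0025,  "output": 0.0100},
    "gpt-4o-mini": {"input": 0.00015, "output": 0.0006},
}

def event_cost_usd(model: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of a single LLM-call event."""
    p = PRICES[model]
    return (input_tokens / 1000) * p["input"] + (output_tokens / 1000) * p["output"]

# 2,000 prompt tokens + 500 completion tokens on gpt-4o:
cost = event_cost_usd("gpt-4o", 2000, 500)  # 0.005 + 0.005 = 0.010
```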
Threshold
A daily spending limit (in USD) you set per agent. When the agent's rolling 24-hour total crosses the threshold, AgentFlare marks it as paused. The SDK checks this flag and stops the agent.
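The threshold check amounts to a rolling sum of event costs over the last 24 hours. A self-contained sketch of the logic, with assumed data shapes (a list of timestamp/cost pairs per agent):

```python
from datetime import datetime, timedelta

def is_over_threshold(events, threshold_usd, now):
    """events: list of (timestamp, cost_usd) tuples for one agent.
    Returns True when the last 24 hours of spend meet the threshold."""
    cutoff = now - timedelta(hours=24)
    total = sum(cost for ts, cost in events if ts >= cutoff)
    return total >= threshold_usd

now = datetime(2025, 1, 2, 12, 0)
events = [
    (now - timedelta(hours=30),  5.00),  # outside the 24h window, ignored
    (now - timedelta(hours=2),   3.00),
    (now - timedelta(minutes=5), 2.50),
]
is_over_threshold(events, threshold_usd=5.00, now=now)  # True: 5.50 >= 5.00
```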
Pause / Resume
Agents can be paused automatically (by threshold) or manually (via dashboard or API). Pausing is reversible — resume from the dashboard or via POST /config/pause.
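A pause or resume call is a plain HTTP POST. Only the POST /config/pause path comes from the text above; the base URL and the request body (an agent_id plus a paused flag that toggles the state) are assumptions for illustration. This sketch builds the request without sending it:

```python
import json
from urllib.request import Request

API_BASE = "https://api.example.com"  # hypothetical base URL

def build_pause_request(agent_id: str, paused: bool) -> Request:
    """Build (but don't send) a POST /config/pause request.
    The body fields are assumed: 'paused' toggles pause vs. resume."""
    body = json.dumps({"agent_id": agent_id, "paused": paused}).encode()
    return Request(
        f"{API_BASE}/config/pause",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_pause_request("sales-outreach-v2", paused=False)  # resume the agent
```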