Autopilot
The Autopilot tab shows how Maev is protecting and improving your agents over time.
Stats cards
At the top of the page, four cards summarize Autopilot activity across all your agents:
| Stat | What it means |
|---|---|
| Total Runs | Number of agent runs wrapped with maev.run(..., gateway=True) |
| Retries Saved | LLM calls that failed and were automatically retried successfully |
| Budget Protected | Estimated cost savings from circuit breakers stopping runaway runs |
| Strategies Applied | Times a learned strategy was applied to improve a run |
Run list
Below the stats, each row is one agent run. You can see:
- Agent name and run ID
- Status:
success,failed,circuit_broken,timeout - Cost, duration, and LLM call count for the run
- Number of Autopilot interventions (retries, fallbacks, circuit breaks) that fired
Click any row to see the full session timeline with every LLM call and intervention logged in order.
Strategies
The Strategies section lists every learned recovery pattern Maev has identified for your agents. Each strategy shows:
- The failure pattern it addresses (e.g. empty response on a specific model, rate limit on a specific prompt type)
- The recovery action (temperature nudge, model switch, retry delay)
- Win rate and number of times applied
- A plain English explanation of why it works, generated automatically
Strategies are scoped to the agent, to your organization, or globally (Teams plan). You can disable any strategy or delete it.
Autopilot protections
Every maev.run(..., gateway=True) call activates these protections automatically:
Auto-retry
If an LLM call fails (rate limit, timeout, empty response), Maev retries it up to 3x with exponential backoff. If the same prompt returns an empty response twice in a row, Maev nudges the temperature slightly before retrying.
No configuration required. Works across OpenAI, Anthropic, LangChain, LiteLLM, and Gemini.
Cost circuit breaker
If the accumulated cost of LLM calls in a single run exceeds cost_budget (default $1.00), Maev stops the run immediately and raises CircuitBrokenError. The partial run is saved to your dashboard.
maev.run(agent, gateway=True, cost_budget=0.25) # stop if this run exceeds $0.25Call count limit
If more than max_calls LLM calls fire in a single run (default 50), Maev stops the run. This catches infinite loops before they drain your budget.
maev.run(agent, gateway=True, max_calls=20)Duration limit
If the run takes longer than max_duration_s seconds (default 300), Maev stops it.
maev.run(agent, gateway=True, max_duration_s=60.0)Loop detection
Maev hashes the last 20 prompts in a run. If the same prompt hash appears 3 times, the run is stopped as a detected loop. This catches cases where an agent is stuck in a semantic cycle even if the exact text varies slightly.
Fallback models
If the primary model fails repeatedly, Maev automatically switches to the next model in your fallback list.
maev.run(
agent,
gateway=True,
fallback_models=["gpt-4o-mini", "gpt-3.5-turbo"]
)How Autopilot learns
Autopilot operates at three levels:
Level 1: Immediate (every run)
Mechanical protections that fire instantly: retries, circuit breakers, fallback models, loop detection. These activate from run 1 with no data required.
Level 2: Pattern learning (after 20+ runs)
After 20 runs for an agent, Maev starts analyzing patterns:
- Which retry strategies recovered the most failures
- Which models produce the fewest empty responses for this agent's prompts
- What temperature ranges produce the best outputs
- Which intervention sequences worked vs did not
Winning patterns become strategies that are applied automatically in future runs.
Level 3: Cross-agent strategies (Teams plan)
Strategies that prove effective across many different agents in your organization become global strategies. These apply to new agents before they have accumulated enough runs to have their own patterns.
Viewing interventions
Every Autopilot action is logged to the session timeline. In the Sessions detail view, look for intervention events:
retry— an LLM call was retried and the reasonfallback— switched to a fallback modelcircuit_break— run was stopped and whyloop_detected— prompt loop detectedstrategy_applied— a learned strategy modified a call
Autopilot is available on Dev and Teams plans. Free plan users get full observability and alerts but Autopilot protections require an upgrade.