Agent Reliability Dashboard

Based on 24 documented incidents. Average severity: 5.7/10

Agent Risk Scores

Agent            Incidents  Avg Severity  Critical  High  Risk Level
github-copilot           1          10.0         1     0  HIGH RISK
unknown-agent            1          10.0         1     0  HIGH RISK
amazon-ai-agent          1           8.4         0     1  HIGH RISK
windsurf                 1           7.5         0     1  HIGH RISK
claude-code              4           6.8         1     2  MODERATE
aider                    1           6.3         0     0  MODERATE
devin                   11           5.0         3     0  MODERATE
cursor                   2           3.8         0     0  LOW RISK
autogpt                  1           2.9         0     0  LOW RISK
claude                   1           0.8         0     0  LOW RISK
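
The headline figures can be cross-checked against the per-agent rows in the table above. The sketch below is a minimal illustration, assuming the overall average severity is the incident-weighted mean of the per-agent averages. The dashboard does not state how it computes the figure, and the names `ROWS` and `summarize` are hypothetical.

```python
# Rows copied from the Agent Risk Scores table: (agent, incidents, avg_severity).
ROWS = [
    ("github-copilot", 1, 10.0),
    ("unknown-agent", 1, 10.0),
    ("amazon-ai-agent", 1, 8.4),
    ("windsurf", 1, 7.5),
    ("claude-code", 4, 6.8),
    ("aider", 1, 6.3),
    ("devin", 11, 5.0),
    ("cursor", 2, 3.8),
    ("autogpt", 1, 2.9),
    ("claude", 1, 0.8),
]


def summarize(rows):
    """Return (total incidents, incident-weighted mean severity to 1 decimal).

    Assumption: each agent's average severity is weighted by its incident
    count. This is an illustrative choice, not the dashboard's documented
    method.
    """
    total = sum(incidents for _, incidents, _ in rows)
    weighted_sum = sum(incidents * severity for _, incidents, severity in rows)
    return total, round(weighted_sum / total, 1)


total_incidents, overall_avg = summarize(ROWS)
print(f"{total_incidents} incidents, average severity {overall_avg}/10")
```

Under this assumption the rows reproduce the summary line: 24 incidents and an average severity of 5.7/10. A plain unweighted mean of the ten per-agent averages would give about 6.2 instead, so the weighting choice matters when one agent (here devin, with 11 incidents) dominates the count.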

Failure Mode Distribution

Recent Critical Incidents