Windsurf
AI Agent Reliability Report
7.5
Avg Severity /10
1
Total Incidents
0
Critical
1
High
Failure Modes
Ignored Instructions
1
Root Causes
Tool Misuse
1
Frequently Asked Questions
Is Windsurf reliable?
Based on 1 documented incidents, Windsurf has an average failure severity of 7.5/10. 0 incidents were rated critical and 1 were rated high severity. Common failure modes include ignored instructions.
What are the most common Windsurf failures?
The most frequently documented Windsurf failure modes are: ignored instructions (1 incidents). These failures range from low to high severity.
How many Windsurf AI failures have been documented?
StupidLLM has documented 1 Windsurf AI agent failures as of 2026. Each incident is severity-scored on a 0-10 scale, verified against source evidence, and categorized by failure mode and root cause.