Taxonomy of LLM Agent Failures

Systematic taxonomy of failure modes in LLM-based autonomous agents across tool use, planning, and multi-step reasoning.

ACL 2027 PAUSED

Phase Progress

Literature Review
Taxonomy Design
Data Collection
Analysis
Paper

Models

0/0

Combinations

0/0

Instances

134,572

evaluated

Completion

0%

Evaluation Pipeline

View details →

0/0

Models Complete

134,572

Instances Run

0%

Overall Progress

0

Running Now

Decisions

All →
2026-03-09

Target ACL 2027 as venue