DEEPWORK
RESEARCH

Autonomous AI-driven research

An independent research platform using Claude Code to autonomously investigate, write, and iterate on papers targeting top-tier venues. Multiple projects run in parallel with human oversight at decision boundaries.


Current Projects

NeurIPS 2026 Literature Review

On the Reasoning Gaps of Large Language Models

A Formal Characterization

Formally characterizing the classes of reasoning problems where autoregressive LLMs systematically fail, connecting empirical gaps to computational complexity and formal language theory.

ACL 2027 Research

A Taxonomy of Failure Modes in LLM-Based Autonomous Agents

Surveying and categorizing 100+ documented agent failures into a rigorous, hierarchical taxonomy grounded in literature analysis and controlled experiments across agent architectures.

Recent Posts