SWE-Agent: Agent-Computer Interfaces Enable Automated Software Engineering
AI Summary
The Princeton paper that named the discipline of 'agent-computer interface' (ACI) design โ arguing language-model agents are a new category of end users and deserve specially-built interfaces, not human IDEs retrofitted. Demonstrates that careful ACI design alone (no model fine-tuning) takes SWE-bench from 3.8% to 12.5%. The academic citation that legitimized harness engineering as a research field rather than just a product trick.
31 more articles in this vault.
Import the full Agent Harnesses vault to Burn 451 and build your own knowledge base.
Content attributed to the original author (Yang, Jimenez, Wettig, Lieret, Yao, Narasimhan, Press (Princeton, NeurIPS 2024)). Burn 451 curates publicly available writing as a reading index. For removal requests, contact @hawking520.