Abstract
By early 2026, the practical value of artificial intelligence in software engineering is no longer determined solely by the reliability of the underlying large language model (LLM). Increasingly, productivity depends on an execution harness: the surrounding runtime of prompts, tools, sandboxes, permissions, memory, repositories, connectors, model-routing policies, observability, and user-interface affordances that turn a model into an operational coding or general-purpose agent. Popular systems such as Codex, Claude Code, Gemini CLI, Cursor-like development environments, and general-purpose agents such as Manus demonstrate that the “AI product” is not only a model but a socio-technical toolchain. This paper argues that AI resilience must therefore be extended to harness resilience: the ability of an engineering team to continue safe and effective work when a preferred AI agent harness changes behavior, loses access, degrades, is rate-limited, is withdrawn from a subscription plan, suffers a security incident, or becomes unavailable. Using an illustrative X.com post by Peter Gostev, in which a Claude Code user is humorously shown as opted out after a code-quality review, this paper motivates a broader resilience agenda: portability of context, tool abstraction, reproducible agent workflows, fallback harnesses, audit trails, and human recovery procedures. The paper contributes a reference architecture, a failure-mode taxonomy, measurable resilience objectives, a connector threat model, and an evaluation plan that together turn the concept from a practice guide into a testable engineering framework.


