Typed Process-State Constraint Durability Under Context Pressure: A Behavioral Adherence Study of Operator-Level Constraints in Multi-Turn LLM Agent Sessions

20 May 2026, Version 1
This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

This study tests whether process-state verbs (PSVs), bracketed all-caps verb tokens (RESOLVE / AWAIT / FATAL) used as operator-level constraints, achieve durable behavioral adherence in DeepSeek V4 Pro under multi-turn context dilution. Such operator-level constraints are the primary mechanism for governing agentic deployments. Study 1 crosses 5 constraint-delivery conditions x 2 omission-class constraints x 6 conversation depths (N=150 per cell, 9,000 sessions); Study 2 (1,800 sessions) is a pre-planned confirmatory replication on this substrate of the omission/commission compliance asymmetry reported by Gamage (2026). Two pre-registered findings hold across sensitivity checks. (i) Gamage's omission/commission asymmetry is not supported: at depth 10, commission compliance was 0% and omission compliance 39% (a -39pp gap), inverting Gamage's reported +67pp on Mistral Large 3 across roughly 106pp of swing. (ii) The Knows-But-Violates partition is supported: PSV violations were 100% knowing violations and control violations 100% ignorance violations. An exploratory 200-session supplement shows the canonical noise corpus contained verbatim restatements of constraint vocabulary, functioning as depth-correlated implicit re-injection; under domain-neutral noise the 60.6% pooled canonical ambiguity rate largely dissolves. Decay-curve studies importing conversational noise should audit corpus-constraint vocabulary overlap before treating hedging rates as substrate properties.

Keywords

LLM agents
multi-turn instruction following
pre-registration
DeepSeek V4 Pro
LLM-as-judge
LLM constraint adherence

Supplementary materials

Title
Description
Actions
Title
OSF Reproducibility Materials — Reading Guide
Description
Navigation guide for the OSF pre-registration (10.17605/OSF.IO/J3MBS) and the OSF data and code archive (10.17605/OSF.IO/PY2DE). Describes the archive structure (code/, data/, analysis/, figures/), the "Download As Zip" instruction needed when the OSF Files panel renders empty without an account, and instructions for reproducing the paper's Section 4 results from the deposited materials.
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.