Hostname: page-component-6766d58669-kn6lq Total loading time: 0 Render date: 2026-05-20T11:22:55.180Z Has data issue: false hasContentIssue false

Autonomous Agents and Policy Compliance: A Framework for Reasoning About Penalties

Published online by Cambridge University Press:  02 January 2026

VINEEL TUMMALA
Affiliation:
Miami University, Oxford, OH, USA (e-mails: tummalvs@miamioh.edu, inclezd@miamioh.edu)
DANIELA INCLEZAN
Affiliation:
Miami University, Oxford, OH, USA (e-mails: tummalvs@miamioh.edu, inclezd@miamioh.edu)
Rights & Permissions [Opens in a new window]

Abstract

This paper presents a logic programming-based framework for policy-aware autonomous agents that can reason about potential penalties for noncompliance and act accordingly. While prior work has primarily focused on ensuring compliance, our approach considers scenarios where deviating from policies may be necessary to achieve high-stakes goals. Additionally, modeling noncompliant behavior can assist policymakers by simulating realistic human decision-making. Our framework extends Gelfond and Lobo’s Authorization and Obligation Policy Language ($\mathscr{AOPL}$) to incorporate penalties and integrates Answer Set Programming (ASP) for reasoning. Compared to previous approaches, our method ensures well-formed policies, accounts for policy priorities, and enhances explainability by explicitly identifying rule violations and their consequences. Building on the work of Harders and Inclezan, we introduce penalty-based reasoning to distinguish between noncompliant plans, prioritizing those with minimal repercussions. To support this, we develop an automated translation from the extended $\mathscr{AOPL}$ into ASP and refine ASP-based planning algorithms to account for incurred penalties. Experiments in two domains demonstrate that our framework generates higher-quality plans that avoid harmful actions while, in some cases, also improving computational efficiency. These findings underscore its potential for enhancing autonomous decision-making and informing policy refinement.

Information

Type
Rapid Communication
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Fig 1. Layout of the traffic norms domain.

Figure 1

Fig 2. Policies and penalties for the traffic norms domain.

Figure 2

Fig 3. ASP translation for policy rule 1 from Figure 2.

Figure 3

Fig 4. High-level framework view.

Figure 4

Table 1. Performance results: room domain

Figure 5

Table 2. Performance results: traffic norms domain

Figure 6

Fig 5. Rooms domain – scenario #3. The agent starts in room $r6$ and needs to get to room $r1$. Arrows indicate uni-directional doors. White doors are unlocked, while grey doors are locked. There is an active fire in room $r2$.

Figure 7

Table 3. Impact of the number of distinct speed values on performance in the traffic domain

Figure 8

Table 4. Performance results: traffic norms domain – revisited