Hostname: page-component-77f85d65b8-hzqq2 Total loading time: 0 Render date: 2026-03-28T18:03:02.085Z Has data issue: false hasContentIssue false

Repairing neural network-based control policies with safety preservation

Published online by Cambridge University Press:  07 October 2025

A response to the following question: How to ensure safety of learning-enabled cyber-physical systems?

Pengyuan Lu*
Affiliation:
University of Pennsylvania, Philadelphia, USA
Matthew Cleaveland
Affiliation:
University of Pennsylvania, Philadelphia, USA
Oleg Sokolsky
Affiliation:
University of Pennsylvania, Philadelphia, USA
Insup Lee
Affiliation:
University of Pennsylvania, Philadelphia, USA
Ivan Ruchkin
Affiliation:
University of Florida, Gainesville, USA
*
Corresponding author: Pengyuan Lu; Email: pelu@seas.upenn.edu
Rights & Permissions [Opens in a new window]

Abstract

Neural network (NN)-based control policies have proven their advantages in cyber-physical systems (CPS). When an NN-based policy fails to fulfill a formal specification, engineers leverage NN repair algorithms to fix its behaviors. However, such repair techniques risk breaking the existing correct behaviors, losing not only correctness but also verifiability of initial state subsets. That is, the repair may introduce new risks, previously unaccounted for. In response, we formalize the problem of Repair with Preservation (RwP) and develop Incremental Simulated Annealing Repair (ISAR). ISAR is an NN repair algorithm that aims to preserve correctness and verifiability—while repairing as many failures as possible. Our algorithm leverages simulated annealing on a barriered energy function to safeguard the already-correct initial states while repairing as many additional ones as possible. Moreover, formal verification is utilized to guarantee the repair results. ISAR is compared to a reviewed set of state-of-the-art algorithms, including (1) reinforcement learning-based techniques (STLGym and F-MDP), (2) supervised learning-based techniques (MIQP and minimally deviating repair) and (3) online shielding techniques (tube MPC shielding). Upon evaluation on two standard benchmarks, OpenAI Gym mountain car and an unmanned underwater vehicle, ISAR not only preserves correct behaviors from previously verified initial state regions, but also repairs 81.4% and 23.5% of broken state spaces in the two benchmarks. Moreover, the signal temporal logic (STL) robustness of the ISAR-repaired policies is higher than the baselines.

Information

Type
Results
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. The workflow of incremental simulated annealing repair (ISAR).

Figure 1

Figure 2. The unmanned underwater vehicle.

Figure 2

Figure 3. Repair results of the UUV.

Figure 3

Table 1. Quantitative repair results of the UUV (π′ = π before repair)

Figure 4

Table 2. Repair result of tube MPC shielding in the UUV

Figure 5

Figure 4. The openAI Gym mountain car.

Figure 6

Figure 5. Repair results of MC.

Figure 7

Table 3. Quantitative repair results of MC (π′ = π before repair)

Figure 8

Table 4. Repair result of tube MPC shielding in MC

Figure 9

Table 5. Repair time taken for each method in each case study

Figure 10

Figure 6. Flag states automata designed for (a) UUV and (b) MC. Here, Ydanger = [10, 13], Svalley = [−0.7, 0.3] × [−0.07, 0.07] and Smomentum = [ − 1.2, − 0.7) × [0.01, 0.07].

Author Comment: Repairing neural network-based control policies with safety preservation — R0/PR1

Comments

No accompanying comment.

Recommendation: Repairing neural network-based control policies with safety preservation — R0/PR2

Comments

No accompanying comment.

Author Comment: Repairing neural network-based control policies with safety preservation — R1/PR3

Comments

No accompanying comment.

Recommendation: Repairing neural network-based control policies with safety preservation — R1/PR4

Comments

No accompanying comment.

Author Comment: Repairing neural network-based control policies with safety preservation — R2/PR5

Comments

No accompanying comment.

Decision: Repairing neural network-based control policies with safety preservation — R2/PR6

Comments

No accompanying comment.