Hostname: page-component-6766d58669-r8qmj Total loading time: 0 Render date: 2026-05-15T02:33:16.056Z Has data issue: false hasContentIssue false

Adversarial consistency and the uniqueness of the adversarial bayes classifier

Published online by Cambridge University Press:  04 April 2025

Natalie S. Frank*
Affiliation:
Mathematics, Courant Institute, New York, NY, USA
*
Rights & Permissions [Opens in a new window]

Abstract

Minimizing an adversarial surrogate risk is a common technique for learning robust classifiers. Prior work showed that convex surrogate losses are not statistically consistent in the adversarial context – or in other words, a minimizing sequence of the adversarial surrogate risk will not necessarily minimize the adversarial classification error. We connect the consistency of adversarial surrogate losses to properties of minimizers to the adversarial classification risk, known as adversarial Bayes classifiers. Specifically, under reasonable distributional assumptions, a convex surrogate loss is statistically consistent for adversarial learning iff the adversarial Bayes classifier satisfies a certain notion of uniqueness.

Information

Type
Papers
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Several common loss functions for classification along with the indicator ${\mathbf {1}}_{\alpha \leq 0}$.

Figure 1

Figure 2. The adversarial Bayes classifier for two gaussians with equal variances and differing means. We assume in this figure that $\mu _1\gt \mu _0$. The shaded blue area depicts the region inside the adversarial Bayes classifier. Figure 2a depicts an adversarial Bayes when $\epsilon \leq (\mu _1-\mu _0)/2$ and Figure 2b and 2c depict the adversarial Bayes classifier when $\epsilon \geq (\mu _1-\mu _0)/2$. (See [15, Example 4.1] for a justification of these illustrations.) the adversarial Bayes classifiers in Figure 2b and 2c are not equivalent up to degeneracy.