Hostname: page-component-5db58dd55d-mhzq2 Total loading time: 0 Render date: 2026-05-30T16:24:38.077Z Has data issue: false hasContentIssue false

Strategic information disclosure to classification algorithms: an experiment

Published online by Cambridge University Press:  02 December 2025

Jeanne Hagenbach*
Affiliation:
CNRS, Sciences Po, WZB, CEPR, CESifo, Paris, France
Aurélien Salas
Affiliation:
Sciences Po, Paris, France
*
Corresponding author: Jeanne Hagenbach; Email: jeanne.hagenbach@sciencespo.fr
Rights & Permissions [Opens in a new window]

Abstract

We experimentally study how individuals strategically disclose multidimensional information to a Naive Bayes algorithm trained to guess their characteristics. Subjects’ objective is to minimize the algorithm’s accuracy in guessing a target characteristic. We vary what participants know about the algorithm’s functioning and how obvious are the correlations between the target and other characteristics. Optimal disclosure strategies rely on subjects identifying whether the combination of their characteristics is common or not. Information about the algorithm functioning makes subjects identify correlations they otherwise do not see but also overthink. Overall, this information decreases the frequency of optimal disclosure strategies.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Economic Science Association.
Figure 0

Fig. 1 A screen seen by subjects when they had to make their disclosure choices

Figure 1

Table 1. Summary of data

Figure 2

Table 2. Optimal strategies - all targets

Figure 3

Fig. 2 Hiding 0 to 6 answers, per treatment and pooling all targets

Figure 4

Fig. 3 Frequency of optimal strategy, per treatment and per target

Figure 5

Fig. 4 Hiding 0 to 6 answers for the MAR target, per treatment. (a) Common subjects. (b) Uncommon subjects

Figure 6

Fig. 5 Hiding 0 to 6 answers for the NUC target, per treatment. (a) Common subjects. (b) Uncommon subjects

Figure 7

Fig. 6 Frequency of optimal strategies for NUC, per treatment and type of subject

Figure 8

Table 3. Frequency of consistent strategies, by treatment and target

Figure 9

Table 4. Frequency of correctly identified correlations, by treatment and target

Supplementary material: File

Hagenbach and Salas supplementary material

Hagenbach and Salas supplementary material
Download Hagenbach and Salas supplementary material(File)
File 835.5 KB