In this chapter we describe reverse-engineering attacks (REAs) on classifiers and defenses against them. REAs involve querying (probing) a classifier to discover its decision rules. One primary application of REAs is to enable TTEs. Another is to reveal a private (e.g., proprietary) classifier’s decision-making. For example, an adversary may seek to discover the workings of a military automated target-recognition system. Early work demonstrates that, with a modest number of random queries, made without any knowledge of the nominal data distribution, one can learn a surrogate classifier on a given domain that closely mimics an unknown classifier. A critical weakness of this attack, however, is that random querying makes it easily detectable: randomly selected query patterns typically look nothing like legitimate examples and are likely to be extreme outliers of all the classes. Each such query is thus individually highly suspicious, and thousands or millions of them (as required for accurate reverse-engineering) are even more so. More recent REAs, which are akin to active learning strategies, are stealthier. Here, we use the ADA method (developed in Chapter 4 for TTE detection) to detect REAs. This method is demonstrated to provide significant detection power against stealthy REAs.
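As a rough illustration of the random-query attack described above (and not the chapter's own implementation), the sketch below probes a stand-in black-box classifier with queries drawn without knowledge of the nominal data distribution and fits a surrogate model to the returned labels. The victim model, feature dimension, and query budget are all illustrative assumptions.

```python
# Illustrative sketch of a random-query reverse-engineering attack.
# All models, dimensions, and budgets below are assumptions for the example.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(0)
d = 20            # assumed feature dimension of the victim's input space
n_queries = 5000  # assumed query budget

# Stand-in for the unknown (black-box) classifier the attacker can only query.
X_victim = rng.normal(size=(1000, d))
y_victim = (X_victim[:, 0] + X_victim[:, 1] > 0).astype(int)
victim = RandomForestClassifier(n_estimators=50, random_state=0).fit(X_victim, y_victim)

# Random querying: samples drawn with no knowledge of the nominal data
# distribution -- typically far from legitimate examples, hence detectable.
X_query = rng.uniform(-5.0, 5.0, size=(n_queries, d))
y_query = victim.predict(X_query)  # class labels returned by the black box

# Train a surrogate that mimics the victim's decision rules on the queried region.
surrogate = MLPClassifier(hidden_layer_sizes=(64,), max_iter=500, random_state=0)
surrogate.fit(X_query, y_query)

# Agreement between surrogate and victim on held-out, in-distribution data.
X_test = rng.normal(size=(2000, d))
agreement = (surrogate.predict(X_test) == victim.predict(X_test)).mean()
print(f"surrogate/victim agreement: {agreement:.3f}")
```

In this random-query setting the queries themselves are the attack's weak point: because they are sampled without regard to the legitimate data distribution, a detector (such as the ADA method referenced above) can flag them as outliers; the stealthier, active-learning-style REAs discussed in the chapter are designed to avoid exactly this exposure.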