Hostname: page-component-77f85d65b8-s5zpc Total loading time: 0 Render date: 2026-03-26T11:26:55.542Z Has data issue: false hasContentIssue false

One-class classification: taxonomy of study and review of techniques

Published online by Cambridge University Press:  24 January 2014

Shehroz S. Khan
Affiliation:
David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, Canada N2L 3G1; e-mail: shehroz@gmail.com
Michael G. Madden
Affiliation:
College of Engineering and Informatics, National University of Ireland, Galway, Republic of Ireland; e-mail: michael.madden@nuigalway.ie
Rights & Permissions [Opens in a new window]

Abstract

One-class classification (OCC) algorithms aim to build classification models when the negative class is either absent, poorly sampled or not well defined. This unique situation constrains the learning of efficient classifiers by defining class boundary just with the knowledge of positive class. The OCC problem has been considered and applied under many research themes, such as outlier/novelty detection and concept learning. In this paper, we present a unified view of the general problem of OCC by presenting a taxonomy of study for OCC problems, which is based on the availability of training data, algorithms used and the application domains applied. We further delve into each of the categories of the proposed taxonomy and present a comprehensive literature review of the OCC algorithms, techniques and methodologies with a focus on their significance, limitations and applications. We conclude our paper by discussing some open research problems in the field of OCC and present our vision for future research.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2014 
Figure 0

Table 1 Confusion matrix for OCC

Figure 1

Figure 1 Our taxonomy for the study of OCC techniques. OCC=one-class classification.

Figure 2

Figure 2 The hyper-sphere containing the target data, with center a and radius R. Three objects are on the boundary are the support vectors. One object xi is outlier and has ξ>0. Source: Tax (2001).

Figure 3

Figure 3 Data description trained on a banana-shaped data set. The kernel is a Gaussian kernel with different width sizes s. Support vectors are indicated by the solid circles; the dashed line is the description boundary. Source: Tax (2001).

Figure 4

Figure 4 Outlier SVM Classifier. The origin and small subspaces are the original members of the second class. Source: Manevitz and Yousef (2001). SVM=Support Vector Machines.

Figure 5

Figure 5 Improved OSVM. Source: Li et al. (2003). OSVM=One-class Support Vector Machines.

Figure 6

Figure 6 Boundaries of SVM and OSVM on a synthetic data set: big dots: positive data, small dots: negative data. Source: Yu (2005). SVM=Support Vector Machines.

Figure 7

Figure 7 The nearest neighbor data description. Source: Tax (2001).

Figure 8

Figure 8 Outline of a problem in the relevance feedback documents retrieval. Source: Onoda et al. (2005).