Hostname: page-component-77f85d65b8-2tv5m Total loading time: 0 Render date: 2026-04-19T15:01:23.986Z Has data issue: false hasContentIssue false

EndNote: Feature-based classification of networks

Published online by Cambridge University Press:  23 September 2019

Ian Barnett*
Affiliation:
Department of Biostatistics, University of Pennsylvania, Philadelphia, PA 19104, USA
Nishant Malik*
Affiliation:
Mathematical Sciences, Rochester Institute of Technology, Rochester, NY 14623, USA (Email: nxmsma@rit.edu)
Marieke L. Kuijjer
Affiliation:
Biostatistics and Computational Biology, Dana Farber Cancer Institute, Boston, MA 02115, USA (Email: mkuijjer@jimmy.harvard.edu)
Peter J. Mucha
Affiliation:
Department of Mathematics, University of North Carolina, Chapel Hill, NC 27599, USA (Email: mucha@unc.edu)
Jukka-Pekka Onnela
Affiliation:
Department of Biostatistics, Harvard University, Boston, MA 02115, USA (Email: onnela@hsph.harvard.edu)
*
*Corresponding author. Email: ibarnett@pennmedicine.upenn.edu
*Corresponding author. Email: ibarnett@pennmedicine.upenn.edu

Abstract

Network representations of systems from various scientific and societal domains are neither completely random nor fully regular, but instead appear to contain recurring structural features. These features tend to be shared by networks belonging to the same broad class, such as the class of social networks or the class of biological networks. Within each such class, networks describing similar systems tend to have similar features. This occurs presumably because networks representing similar systems would be expected to be generated by a shared set of domain-specific mechanisms, and it should therefore be possible to classify networks based on their features at various structural levels. Here we describe and demonstrate a new hybrid approach that combines manual selection of network features of potential interest with existing automated classification methods. In particular, selecting well-known network features that have been studied extensively in social network analysis and network science literature, and then classifying networks on the basis of these features using methods such as random forest, which is known to handle the type of feature collinearity that arises in this setting, we find that our approach is able to achieve both higher accuracy and greater interpretability in shorter computation time than other methods.

Information

Type
Original Article
Copyright
© Cambridge University Press 2019 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

Supplementary material: PDF

Barnett et al. supplementary material

Barnett et al. supplementary material

Download Barnett et al. supplementary material(PDF)
PDF 2.1 MB