Hostname: page-component-5db58dd55d-htx7c Total loading time: 0 Render date: 2026-06-02T10:29:15.641Z Has data issue: false hasContentIssue false

Methods for Classifying Nonprofit Organizations According to their Field of Activity: A Report on Semi-automated Methods Based on Text

Published online by Cambridge University Press:  01 January 2026

Julia Litofcenko*
Affiliation:
WU Vienna University of Economics and Business, Vienna, Austria
Dominik Karner
Affiliation:
WU Vienna University of Economics and Business, Vienna, Austria
Florentine Maier
Affiliation:
WU Vienna University of Economics and Business, Vienna, Austria
Rights & Permissions [Opens in a new window]

Abstract

There are various methods for classifying nonprofit organizations (NPOs) according to their field of activity. We report our experiences using two semi-automated methods based on textual data: rule-based classification and machine learning with curated keywords. We use those methods to classify Austrian nonprofit organizations based on the International Classification of Nonprofit Organizations. Those methods can provide a solution to the widespread research problem that quantitative data on the activities of NPOs are needed but not readily available from administrative data, long high-quality texts describing NPOs’ activities are mostly unavailable, and human labor resources are limited. We find that in such a setting, rule-based classification performs about as well as manual human coding in terms of precision and sensitivity, while being much more labor-saving. Hence, we share our insights on how to efficiently implement such a rule-based approach. To address scholars with a background in data analytics as well as those without, we provide non-technical explanations and open-source sample code that is free to use and adapt.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Copyright
Copyright © The Author(s) 2019
Figure 0

Table 1 ICNPO groups and subgroups used

Figure 1

Fig. 1 Flowchart of the research process

Figure 2

Table 2 Performance of individual human coders

Figure 3

Table 3 Overall performance of manual human coding

Figure 4

Fig. 2 Example of rule-based classification in the Austrian case

Figure 5

Table 4 Performance of rule-based classification (column percent; figures are rounded)

Figure 6

Table 5 Examples of curated organization names

Figure 7

Table 6 Performance of decision tree classification with curated organization names (column percent; figures are rounded)