Hostname: page-component-6766d58669-6mz5d Total loading time: 0 Render date: 2026-05-19T20:52:48.101Z Has data issue: false hasContentIssue false

Automated detection and staging of malaria parasites from cytological smears using convolutional neural networks

Published online by Cambridge University Press:  02 August 2021

Mira S. Davidson
Affiliation:
Department of Life Sciences, Imperial College London, London, United Kingdom
Clare Andradi-Brown
Affiliation:
Department of Life Sciences, Imperial College London, London, United Kingdom Department of Infectious Disease, Imperial College London, London, United Kingdom
Sabrina Yahiya
Affiliation:
Department of Life Sciences, Imperial College London, London, United Kingdom
Jill Chmielewski
Affiliation:
Research Centre for Infectious Diseases, School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia
Aidan J. O’Donnell
Affiliation:
Institute of Evolutionary Biology, and Institute of Immunology and Infection Research, School of Biological Sciences, The University of Edinburgh, Edinburgh, United Kingdom
Pratima Gurung
Affiliation:
Division of Infectious Diseases, Boston Children’s Hospital, Boston, Massachusetts, USA Department of Pediatrics, Harvard Medical School, Boston, Massachusetts, USA
Myriam D. Jeninga
Affiliation:
Mikrobiologisches Institut–Klinische Mikrobiologie, Immunologie und Hygiene, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität (FAU) Erlangen-Nürnberg, Erlangen, Germany
Parichat Prommana
Affiliation:
Medical Molecular Biotechnology Research Group National Center for Genetic Engineering and Biotechnology (BIOTEC), Pathum Thani, Thailand
Dean W. Andrew
Affiliation:
Thailand Center of Excellence for Life Sciences, Bangkok, Thailand
Michaela Petter
Affiliation:
Mikrobiologisches Institut–Klinische Mikrobiologie, Immunologie und Hygiene, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität (FAU) Erlangen-Nürnberg, Erlangen, Germany
Chairat Uthaipibull
Affiliation:
Medical Molecular Biotechnology Research Group National Center for Genetic Engineering and Biotechnology (BIOTEC), Pathum Thani, Thailand Thailand Center of Excellence for Life Sciences, Bangkok, Thailand
Michelle J. Boyle
Affiliation:
QIMR Berghofer Medical Research Institute, Herston, Queensland, Australia
George W. Ashdown
Affiliation:
Department of Life Sciences, Imperial College London, London, United Kingdom
Jeffrey D. Dvorin
Affiliation:
Division of Infectious Diseases, Boston Children’s Hospital, Boston, Massachusetts, USA Department of Pediatrics, Harvard Medical School, Boston, Massachusetts, USA
Sarah E. Reece
Affiliation:
Institute of Evolutionary Biology, and Institute of Immunology and Infection Research, School of Biological Sciences, The University of Edinburgh, Edinburgh, United Kingdom
Danny W. Wilson
Affiliation:
Research Centre for Infectious Diseases, School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia Burnet Institute, Melbourne, Victoria, Australia
Kane A. Cunningham
Affiliation:
Google Cloud AI, Sunnyvale, California, USA
D. Michael. Ando
Affiliation:
Applied Science Team, Google Research, Mountain View, California, USA
Michelle Dimon
Affiliation:
Applied Science Team, Google Research, Mountain View, California, USA
Jake Baum*
Affiliation:
Department of Life Sciences, Imperial College London, London, United Kingdom
*
*Corresponding author. Email: jake.baum@imperial.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Microscopic examination of blood smears remains the gold standard for laboratory inspection and diagnosis of malaria. Smear inspection is, however, time-consuming and dependent on trained microscopists with results varying in accuracy. We sought to develop an automated image analysis method to improve accuracy and standardization of smear inspection that retains capacity for expert confirmation and image archiving. Here, we present a machine learning method that achieves red blood cell (RBC) detection, differentiation between infected/uninfected cells, and parasite life stage categorization from unprocessed, heterogeneous smear images. Based on a pretrained Faster Region-Based Convolutional Neural Networks (R-CNN) model for RBC detection, our model performs accurately, with an average precision of 0.99 at an intersection-over-union threshold of 0.5. Application of a residual neural network-50 model to infected cells also performs accurately, with an area under the receiver operating characteristic curve of 0.98. Finally, combining our method with a regression model successfully recapitulates intraerythrocytic developmental cycle with accurate lifecycle stage categorization. Combined with a mobile-friendly web-based interface, called PlasmoCount, our method permits rapid navigation through and review of results for quality assurance. By standardizing assessment of Giemsa smears, our method markedly improves inspection reproducibility and presents a realistic route to both routine lab and future field-based automated malaria diagnosis.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2021. Published by Cambridge University Press
Figure 0

Figure 1. Data collection workflow. (a) Example images of the P. falciparum dataset. Datasets were collected by six different research centers resulting in staining and lightning variations. (b) Model-assisted labeling workflow. At each labeling round, model predictions were imported and loaded as editable annotations on an image. These annotations were corrected by annotators and added to the training data for a new training round. This process was then repeated with the retrained object detection and malaria classification models. (c) Example of disagreement between annotators on parasite stage classification (R, ring; T, trophozoite; S, schizont). Whereas the first labeling round (LR1) only captures the disagreement between two stages, the second round (LR2) reveals that this is due to the parasite existing between these two stages (LT, late trophozoite; ES, early schizont). The final value for the image is calculated by averaging labels across all annotators.

Figure 1

Figure 2. Object detection of red blood cells (RBCs) on thin smear images.Ground truth labels are shown in orange, and predictions are shown in blue. Examples shown for P. vivax (left) and P. falciparum (right) test data. The object detection model detects individual RBCs with high precision despite the presence of dust, residual stain, and overlap with other cells.

Figure 2

Table 1. Performance of ResNet-50 model on malaria detection in images of segmented red blood cells. Metrics were calculated only for images on which all annotators agreed.

Figure 3

Figure 3. Model prediction on parasite intraerythrocytic cycle (IDC) development. (a) Model predictions are marked with a black dot. Labels were converted to a numeric scale (ring = 1, trophozoite = 2, and schizont = 3) and averaged across all annotators to set a ground truth (GT) label. Boxplots show the label distribution across annotators with error bars determined by the outermost data values. Density plot shows the predicted life stage distribution within the sample. Colors represent progression through the IDC as defined by the GT. (b) After learning from the averaged GT labels, the model successfully orders all detected infected RBCs in the independent test set based on its intraerythrocytic cycle (IDC) life stage predictions (left to right; top to bottom). Color bar represents progression through the IDC as predicted by the model. Top and bottom black lines represent the arbitrary cutoff points used by PlasmoCount between the ring and trophozoite (cutoff = 1.5) and trophozoite and schizont (cutoff = 2.5) stages, respectively.

Figure 4

Figure 4. PlasmoCount, a mobile friendly tool for automated assessment of Giemsa-stained cytological smears. In the upload form, users can attach their images of Giemsa-stained thin blood films. The client sends the data to the server to be passed to the models; the results then get sent back and are displayed in a summary section and a table. The summary section is divided into parasitemia pie charts and a histogram of the IDC life stage distribution. The rows in the table correspond to individual images and are selectable; this will display the corresponding image with overlaying model predictions. The user interface is optimized for mobile phone dimensions, allowing users to use a mobile device for both image acquisition and data analysis. Every job has a unique ID associated with it; this allows users to come back to their results from multiple devices at any time.

Supplementary material: File

Davidson et al. supplementary material

Davidson et al. supplementary material

Download Davidson et al. supplementary material(File)
File 17.2 MB