Hostname: page-component-89b8bd64d-ksp62 Total loading time: 0 Render date: 2026-05-11T03:29:45.263Z Has data issue: false hasContentIssue false

Image-based food portion size estimation using a smartphone without a fiducial marker

Published online by Cambridge University Press:  06 April 2018

Yifan Yang
Affiliation:
Departments of Neurosurgery, Electrical & Computer Engineering, and Bioengineering, University of Pittsburgh, Pittsburgh, PA 15260, USA Image Processing Center, School of Astronautics, Beihang University, Beijing, People’s Republic of China
Wenyan Jia
Affiliation:
Departments of Neurosurgery, Electrical & Computer Engineering, and Bioengineering, University of Pittsburgh, Pittsburgh, PA 15260, USA
Tamara Bucher
Affiliation:
Priority Research Center for Physical Activity and Nutrition, Faculty of Health and Medicine, The University of Newcastle, Callaghan, New South Wales, Australia
Hong Zhang
Affiliation:
Image Processing Center, School of Astronautics, Beihang University, Beijing, People’s Republic of China
Mingui Sun*
Affiliation:
Departments of Neurosurgery, Electrical & Computer Engineering, and Bioengineering, University of Pittsburgh, Pittsburgh, PA 15260, USA
*
*Corresponding author: Email drsun@pitt.edu
Rights & Permissions [Opens in a new window]

Abstract

Objective

Current approaches to food volume estimation require the person to carry a fiducial marker (e.g. a checkerboard card), to be placed next to the food before taking a picture. This procedure is inconvenient and post-processing of the food picture is time-consuming and sometimes inaccurate. These problems keep people from using the smartphone for self-administered dietary assessment. The current bioengineering study presents a novel smartphone-based imaging approach to table-side estimation of food volume which overcomes current limitations.

Design

We present a new method for food volume estimation without a fiducial marker. Our mathematical model indicates that, using a special picture-taking strategy, the smartphone-based imaging system can be calibrated adequately if the physical length of the smartphone and the output of the motion sensor within the device are known. We also present and test a new virtual reality method for food volume estimation using the International Food Unit™ and a training process for error control.

Results

Our pilot study, with sixty-nine participants and fifteen foods, indicates that the fiducial-marker-free approach is valid and that the training improves estimation accuracy significantly (P<0·05) for all but one food (egg, P>0·05).

Conclusions

Elimination of a fiducial marker and application of virtual reality, the International Food Unit™ and an automated training allowed quick food volume estimation and control of the estimation error. The estimated volume could be used to search a nutrient database and determine energy and nutrients in the diet.

Information

Type
Research paper
Copyright
© The Authors 2018 
Figure 0

Fig. 1 (colour online) Wireframe method for food volume estimation

Figure 1

Fig. 3 (colour online) Mathematical model for reconstructing the world coordinates of the tabletop (CMOS, complementary metal oxide semiconductor)

Figure 2

Fig. 2 (colour online) Model of the smartphone imaging system

Figure 3

Table 1 Smartphone (iPhone 6 plus) parameters used in Eqs (8) to (12)

Figure 4

Table 2 Calibration parameters used in Eqs (8) to (12) for the smartphone (iPhone 6 plus)

Figure 5

Fig. 4 International Food UnitTM

Figure 6

Fig. 5 (colour online) Virtual reality-based volume estimation using the International Food UnitTM (IFUTM): (a) fiducial-marker-free image; (b) extended image with a virtual grid of 4 cm spacing and an IFUTM; (c) final image in which the volume of the scaled IFUTM cube is visually equivalent to the volume of the milk, yielding an estimate of 4·2 F

Figure 7

Fig. 6 (colour online) (a) A complex real-world fiducial-marker-free image; (b) extended virtual reality image from which any food item can be estimated by moving and scaling the International Food UnitTM

Figure 8

Fig. 7 (colour online) (a) Fifteen food models tested in both Studies 1 and 2; (b) fourteen foods, beverages and non-food objects utilized for generating forty-five training images in different containers for Study 2. Note that, in each image, the two-dimensional barcode records the motion sensor data at the time of image acquisition

Figure 9

Table 3 Experimental results of the fifteen foods, tested with and without training, using the new approach of fiducial-marker-free image-based food portion size estimation using a smartphone

Figure 10

Fig. 8 (colour online) (a) Mean relative error ((estimate – ground truth)/ground truth×100 %) with standard deviation indicated by vertical rules and (b) root mean-square error (RMSE) for each of the fifteen foods tested using the new approach of fiducial-marker-free image-based food portion size estimation using a smartphone: , Study 1, without training; , Study 2, with training