Hostname: page-component-77f85d65b8-t6st2 Total loading time: 0 Render date: 2026-03-29T11:16:33.683Z Has data issue: false hasContentIssue false

Daily activity recognition based on recurrent neural network using multi-modal signals

Published online by Cambridge University Press:  01 January 2018

Akira Tamamori*
Affiliation:
Department of Information Science, Aichi Institute of Technology, 1247 Yachigusa, Yakusa-cho, Toyota, Aichi 470-0392, Japan
Tomoki Hayashi
Affiliation:
Graduate School of Information Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Aichi 464-8601, Japan
Tomoki Toda
Affiliation:
Information Technology Center, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Aichi 464-8601, Japan
Kazuya Takeda
Affiliation:
Graduate School of Information Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Aichi 464-8601, Japan
*
Corresponding author: Akira Tamamori Email: akira-tamamori@aitech.ac.jp

Abstract

Our aim is to develop a smartphone-based life-logging system. Human activity recognition (HAR) is one of the core techniques to realize it. Recent studies reported the effectiveness of feed-forward neural network (FF-NN) and recurrent neural network (RNN) as a classifier for HAR task. However, there are still unresolved problems in those studies: (1) a life-logging system using only a smartphone for recording device has not been developed, (2) only indoor activities have been utilized for evaluation, (3) insufficient investigations/evaluations of RNN. In this study, we address these unresolved problems as follows: (1) we build a prototype system for life-logging and conduct data recording experiment on this system to include both indoor and outdoor activities. The experimental results of HAR on this new dataset showed that RNN-based classifier was still effective. (2) From the results of a HAR experiment, it was demonstrated that a multi-layered Simple Recurrent Unit with a non-linear transform at the bottom layer and a highway-connection was the most effective. (3) We could grasp the reason for the improvement of RNN from FF-NN by observing the posterior probabilities over test data.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Authors, 2018
Figure 0

Fig. 1. A concept image of our social implementation.

Figure 1

Fig. 2. Overview of our target life-logging system [6]; The system sends the recognition result, the subject's activity to their smartphone. The history of the user's activity can be viewed through a graphical user interface on the smartphone. The subject can send a feedback about his/her feeling about the activity. This feedback information can be used to improve the recognition performance.

Figure 2

Table 1. Data recording conditions of Nagoya-COI database [6]

Figure 3

Table 2. Recorded daily activities in Nagoya-COI database [6]

Figure 4

Fig. 3. Performance of daily activity recognition [9]; a comparison with other popular methods. “KNN” and “Tree” represent k-nearest neighbor and a decision tree, respectively.

Figure 5

Fig. 4. Framework of the prototype system.

Figure 6

Table 3. Target activities in subject-closed experiment conducted on Nagoya-COI database [9]

Figure 7

Table 4. Target activities in subject-open experiment conducted on Nagoya-COI database [9]

Figure 8

Fig. 5. Performance of daily activity recognition in comparison with architectures of LSTM-RNN: the number of parameters, i.e., weight matrices and gates. The number of hidden layers was set to 1.

Figure 9

Fig. 6. Performance of daily activity recognition in comparison with architectures of RNN: highway connection and non-linear transform of input. “n layer (s)” means that the number of hidden layer is set to n.

Figure 10

Fig. 7. Performance of daily activity recognition; leave-one-subject-out evaluation.

Figure 11

Fig. 8. Performance of daily activity recognition; subject adaptation.

Figure 12

Fig. 9. Visualization of posterior probability over consecutive frames of test data. (a) FF-NN: From “Cleaning” to “Reading” (b) FF-NN: From “Sleeping” to “Watching-TV” (c) LSTM-RNN: From “Cleaning” to “Reading” (d) LSTM-RNN: From “Sleeping” to “Watching-TV”.

Figure 13

Table 5. Data recording conditions

Figure 14

Table 6. Recorded daily activities using prototype system for each subject; the number in each cell represents the activity length in hours.

Figure 15

Table 7. Target activities of newly constructed dataset

Figure 16

Table 8. Confusion matrix of subject #8. Diagonal elements represent recall, that of the right-end column represent precision, and that of the lower end row represent F1-score.

Figure 17

Fig. 10. Performance of daily activity recognition on newly constructed dataset.