Hostname: page-component-5db58dd55d-d6ndz Total loading time: 0 Render date: 2026-06-03T07:41:27.054Z Has data issue: false hasContentIssue false

A multi-modal embodied robot framework for English as a second language learning in preschoolers: design and evaluation

Published online by Cambridge University Press:  29 October 2025

Anastasiya Rybakova
Affiliation:
Center for Humanoid Research, Korea Institute of Science and Technology, Seoul, South Korea
JongSuk Choi*
Affiliation:
AI&ROBOTICS, University of Science and Technology , Daejeon, South Korea
*
Corresponding author: JongSuk Choi; Email: cjs@kist.re.kr
Rights & Permissions [Opens in a new window]

Abstract

A multi-modal embodied robot framework was developed and evaluated to support English as a Second Language (ESL) learning in preschoolers through physical interaction and adaptive engagement. The system integrates a 4-DOF OpenManipulator-X robot with a tablet-based educational application, forming a unified instructional platform that delivers synchronized auditory, visual, and kinesthetic stimuli. Designed to improve lexical retention and motivation in early learners, the framework enables task-based interaction through pick-and-place vocabulary reinforcement, collaborative drawing, and tablet-mediated language tasks, coupled with a real-time emotion recognition module to adjust instructional cues.

An experimental design within the subject was used with 21 Korean preschool children (ages 4–8), comparing robot-assisted language learning (RALL) with traditional teacher-led language learning (TLLL) in matched tasks involving vocabulary learning, math reasoning, color categorization, and spelling recall. Each session was conducted under controlled classroom conditions and analyzed using both quantitative and qualitative metrics, including engagement frequency, task precision, and structured post-session surveys.

The results demonstrate significantly higher participation and task completion rates in the RALL condition, with vocabulary acquisition outcomes comparable to TLLL (p > 0.05). Children exhibited increased motivation and sustained interaction when guided by the robot and the application, suggesting that embodied adaptive systems can effectively support early second language learning. The study contributes validated design principles for integrating physical embodiment, affective responsiveness, and multi-modal instructional delivery in educational robotics. Implications are discussed for the scalable deployment of robot-assisted systems in preschool contexts, emphasizing child-centered interaction and developmental appropriateness within RALL environments.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. System architecture and interaction flow of the proposed RALL framework integrating robot and tablet modalities.

Figure 1

Figure 2. Robot-executed drawing and vocabulary-based pick-and-place tasks using OpenManipulator-X to reinforce embodied ESL learning.

Figure 2

Figure 3. Tablet-based EduApp interface with vocabulary, math, color-matching, and scramble tasks supporting robot-led instruction.

Figure 3

Figure 4. Emotion recognition process and real-time animated feedback display used to reflect the child’s affective state.

Figure 4

Figure 5. Classroom-based instructional settings for TLLL (left) and RALL (right) conditions during live sessions.

Figure 5

Figure 6. Round 1 experimental sessions comparing TLLL (left) and RALL (right) modalities in real classroom setting.

Figure 6

Figure 7. Round 2 instructional sequence: reversed order of RALL (left) and TLLL (right) conditions for crossover validation.

Figure 7

Figure 8. Structured post-session user experience survey conducted with children following TLLL and RALL sessions.

Figure 8

Figure 9. Statistical comparison of mean task scores under RALL and TLLL conditions across four learning activities.

Figure 9

Table I. Task-based performance metrics (Paired t-test results).

Figure 10

Figure 10. Statistical comparison of learner engagement, task completion confidence, and learning motivation between TLLL and RALL conditions using ANOVA analysis.

Figure 11

Table II. Engagement and motivation metrics (ANOVA results).