Analyzing language samples of Spanish–English bilingual children for the automated prediction of language dominance

T. SOLORIO (a1), M. SHERMAN (a2), Y. LIU (a2), L. M. BEDORE (a3), E. D. PEÑA (a3) and A. IGLESIAS (a4)

In this work we study how features typically used in natural language processing tasks, together with measures of syntactic complexity, can be adapted to the problem of developing language profiles of bilingual children. Our experiments show that these features have high discriminative value for predicting language dominance from story retells in a population of Spanish–English bilingual children. Moreover, some of our proposed features are even more powerful than measures commonly used by clinical researchers and practitioners for analyzing spontaneous language samples of children. This study shows that the field of natural language processing has the potential to make significant contributions to the study of communication disorders and related areas.

