Hostname: page-component-77f85d65b8-t6st2 Total loading time: 0 Render date: 2026-03-29T18:34:01.491Z Has data issue: false hasContentIssue false

Modeling the auxiliary phrase asymmetry in code-switched Spanish–English

Published online by Cambridge University Press:  24 August 2020

Chara Tsoukala
Affiliation:
Centre for Language Studies, Radboud University
Stefan L. Frank*
Affiliation:
Centre for Language Studies, Radboud University
Antal Van Den Bosch
Affiliation:
KNAW Meertens Institute
Jorge Valdés Kroff
Affiliation:
Department of Spanish and Portuguese Studies, University of Florida
Mirjam Broersma
Affiliation:
Centre for Language Studies, Radboud University
*
Address for correspondence: Stefan Frank, E-mail: s.frank@let.ru.nl
Rights & Permissions [Opens in a new window]

Abstract

Spanish–English bilinguals rarely code-switch in the perfect structure between the Spanish auxiliary haber (“to have”) and the participle (e.g., “Ella ha voted”; “She has voted”). However, they are somewhat likely to switch in the progressive structure between the Spanish auxiliary estar (“to be”) and the participle (“Ella está voting”; “She is voting”). This phenomenon is known as the “auxiliary phrase asymmetry”. One hypothesis as to why this occurs is that estar has more semantic weight as it also functions as an independent verb, whereas haber is almost exclusively used as an auxiliary verb. To test this hypothesis, we employed a connectionist model that produces spontaneous code-switches. Through simulation experiments, we showed that i) the asymmetry emerges in the model and that ii) the asymmetry disappears when using haber also as a main verb, which adds semantic weight. Therefore, the lack of semantic weight of haber may indeed cause the asymmetry.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2020. Published by Cambridge University Press
Figure 0

Table 1. Absolute and relative frequencies in the Miami and Gibraltar corpora. Values reported in Guzzardo Tamargo et al. (2016)

Figure 1

Table 2. Absolute frequencies per language in the Miami corpus

Figure 2

Fig. 1. The Bilingual Dual-path model generates sentences word-by-word that express a given message (see Section 2.3 for examples of messages). It is based on a Simple Recurrent Network architecture (the syntactic stream, via the ‘compress’ layers) that is augmented with a semantic stream (upper path) that contains information about concepts and their realization, thematic roles, event semantics, and the target language. The numbers in the parentheses indicate the size of each layer (e.g., 52 concept units); the sizes of the hidden and compress layers vary with each model run (see Section 2.4). The solid arrows denote connections with weights that change during training, whereas the lines between roles, realization, and concepts correspond to connections that are given as part of a message-to-be-expressed (e.g., the AGENT is connected to WOMAN in a particular message). The dotted arrow indicates that once a word is produced, it is given back as input thus contributing to the production of the next word.

Figure 3

Table 3. Parts of speech (POS) in bilingual lexicon (Spanish in italics)

Figure 4

Table 4. Allowed structures for English and Spanish

Figure 5

Table 5. Structure frequencies in the model training sentences

Figure 6

Table 6. Performance (percentage of test sentences with correct meaning) of the three models at the last epoch. The numbers in the brackets show the bootstrapped 95% Confidence Interval.

Figure 7

Fig. 2. Percentage of Spanish-to-English participle switches (computed over 60 network runs per simulation and over the course of network training) of the correctly produced sentences per structure in the three simulations (top: “haber model”; middle: “tener model”; bottom: “synonym model”). Shaded areas show the bootstrapped 95% Confidence Interval.

Figure 8

Fig. 3. Percentage of code-switches at auxiliary and participle for the progressive and perfect structures in the “haber model”, computed over 60 network runs. Shaded areas show the boot-strapped 95% Confidence Interval.