Spoken second language words activate native language orthographic information in late second language learners

OUTI VEIVO; VINCENT PORRETTA; JUKKA HYÖNÄ; JUHANI JÄRVIKIVI

doi:10.1017/S0142716418000103

Spoken second language words activate native language orthographic information in late second language learners

Published online by Cambridge University Press: 11 June 2018

JUKKA HYÖNÄ and

OUTI VEIVO*: Affiliation:
University of Turku
VINCENT PORRETTA: Affiliation:
University of Windsor
JUKKA HYÖNÄ: Affiliation:
University of Turku
JUHANI JÄRVIKIVI: Affiliation:
University of Alberta
*: ADDRESS FOR CORRESPONDENCE Outi Veivo, University of Turku, School of Languages and Translation Studies, 20014 University of Turku, Finland. E-mail: outi.veivo@utu.fi

Article contents

Abstract
CURRENT STUDY
EXPERIMENT 1
EXPERIMENT 2
GENERAL DISCUSSION
Footnotes
References

Rights & Permissions

Abstract

This study investigated the time course of activation of orthographic information in spoken word recognition with two visual world eye-tracking experiments in a task where second language (L2) spoken word forms had to be matched with their printed referents. Participants (n = 64) were native Finnish learners of L2 French ranging from beginners to highly proficient. In Experiment 1, L2 targets (e.g., <cidre> /sidʀ/) were presented with either orthographically overlapping onset competitors (e.g., <cintre> /sɛ̃tʀ/) or phonologically overlapping onset competitors (<cycle> /sikl/). In Experiment 2, L2 targets (e.g., <paume> /pom/) were associated with competitors in Finnish, L1 of the participants, in conditions symmetric to Experiment 1 (<pauhu> /pauhu/ vs. <pommi> /pom:i/). In the within-language experiment (Experiment 1), the difference in target identification between the experimental conditions was not significant. In the between-language experiment (Experiment 2), orthographic information impacted the mapping more in lower proficiency learners, and this effect was observed 600 ms after the target word onset. The influence of proficiency on the matching was nonlinear: proficiency impacted the mapping significantly more in the lower half of the proficiency scale in both experiments. These results are discussed in terms of coactivation of orthographic and phonological information in L2 spoken word recognition.

Keywords

eye tracking first language effects orthography proficiency second language spoken word recognition visual world paradigm

Information

Type: Original Article
Information: Applied Psycholinguistics , Volume 39 , Issue 5 , September 2018 , pp. 1011 - 1032

DOI: https://doi.org/10.1017/S0142716418000103 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2018

There is a growing body of literature showing that just as phonological information is activated during the processing of written language (see, e.g., Frost, Reference Frost1998, for a review), orthographic information is activated during the processing of spoken language (see Frost & Ziegler, Reference Frost, Ziegler and Gaskell2007, for a review). There are, however, fewer studies on the role of orthography in second language (L2) spoken word processing. In this study, we are interested in how L2 learners with a formal instruction background use orthographic information in spoken word recognition.

Late L2 learners differ from native language (L1) speakers because they already use one phonological system that can influence the learning of another system (e.g., Best & Tyler, Reference Best, Tyler, Bohn and Munro2007), and because they are already familiar with the grapheme–phoneme correspondences of their L1, which can have a strong impact on the perception and learning of L2 sounds (Bassetti, Reference Bassetti2006; Escudero, Hayes-Harb, & Mitterer, Reference Escudero, Hayes-Harb and Mitterer2008; Escudero & Wanrooij, Reference Escudero and Wanrooij2010; Hayes-Harb, Nicol, & Barker, Reference Hayes-Harb, Nicol and Barker2010; Showalter & Hayes-Harb, Reference Showalter and Hayes-Harb2015). In addition, unlike L1 speakers who learn orthographic word forms only after the phonological forms have been established, literate L2 learners in formal instruction are exposed to written word forms early on in the learning process. The present study investigated how these L2 learners map L2 spoken words onto their written counterparts, specifically the extent to which this mapping is mediated by orthographic or phonological information, and to which L1 grapheme–phoneme correspondences are activated in this process. For this purpose, we conducted two experiments where spoken word forms had to be matched with their printed referents while participants’ eye movements were monitored. We also set out to evaluate the role of L2 proficiency in this matching process.

Even though orthographic information is not necessary in L1 spoken language processing, it is known to be activated even during (non-metaphonological) language processing tasks such as lexical decision that do not demand a special focus on the phonotactic or orthotactic structure of the word forms (Grainger, Diependaele, Spinelli, Ferrand, & Farioli, Reference Grainger, Diependaele, Spinelli, Ferrand and Farioli2003; Grainger & Ferrand, Reference Grainger and Ferrand1996; Salverda & Tanenhaus, Reference Salverda and Tanenhaus2010; Ventura, Morais, Pattamadilok, & Kolinsky, Reference Ventura, Morais, Pattamadilok and Kolinsky2004; Ziegler & Ferrand, Reference Ziegler and Ferrand1998, but see, e.g., Mitterer & Reinisch, Reference Mitterer and Reinisch2015, for the lack of orthographic effects in the perception of conversational speech). These orthographic effects have been explained by a simultaneous coactivation of phonological and orthographic representations (e.g., Grainger et al., Reference Grainger, Diependaele, Spinelli, Ferrand and Farioli2003) or by an activation of orthographically restructured phonological representations (Taft, Castles, Davis, Lazendic, & Nguyen-Hoan, Reference Taft, Castles, Davis, Lazendic and Nguyen-Hoan2008) during the processing of spoken words.

In the L1, the written forms of words are learned after their spoken forms, but in L2 instructed learning environments, the two modalities are learned in parallel. As a result of this co-structuration of orthographic and phonological information (Veivo & Järvikivi, Reference Veivo and Järvikivi2013), orthography may have a more important role in the L2 lexicon than in the L1 lexicon. Further, there is evidence that if the orthographic system of the L2 is incongruent (i.e., if the phonemes can be represented by several different graphemes or vice versa), parallel acquisition of orthography and phonology can be a hindrance to the acquisition of the L2 phonological system (Escudero, Simon, & Mulak, Reference Escudero, Simon and Mulak2014).

In L2 spoken word processing, the activation of orthographic information has been studied especially from the point of view of the parallel activation of the L1. For example, Bartolotti, Daniel, and Marian (Reference Bartolotti, Daniel and Marian2013) showed that during spoken word recognition in a newly acquired L2, orthographically similar L1 word forms are activated even if they are pronounced differently from the target words. This result is complementary to studies showing that phonologically similar words of both languages of bilingual or second language speakers compete for recognition in parallel (Blumenfeld & Marian, Reference Blumenfeld and Marian2007; Marian & Spivey, Reference Marian and Spivey2003a, Reference Marian and Spivey2003b; Spivey & Marian, Reference Spivey and Marian1999).

The role of orthographic input for the learning of L2 phonology has been widely studied (for reviews, see Bassetti, Reference Bassetti, Thorsten and Young-Scholten2008; Young-Scholten, Reference Young-Scholten, Burmeister, Piske and Rohde2002). There is evidence that orthography can help to acquire new phonemic categories of the L2 (Escudero et al., Reference Escudero, Hayes-Harb and Mitterer2008, Reference Escudero, Simon and Mulak2014; Showalter & Hayes-Harb, Reference Showalter and Hayes-Harb2013; Simon, Chambless, & Kickhöfel Alves, Reference Simon, Chambless and Kickhöfel Alves2010), but can also have a negative impact on the acquisition of L2 phonology (Bassetti, Reference Bassetti, Guder, Jiang and Wan2007; Bassetti & Atkinson, Reference Bassetti and Atkinson2015; Young-Scholten & Langer, Reference Young-Scholten and Langer2015), especially when the grapheme–phoneme relations of the L2 are different from the L1 (Escudero & Wanrooij, Reference Escudero and Wanrooij2010; Hayes-Harb et al., Reference Hayes-Harb, Nicol and Barker2010). Furthermore, there is evidence that late L2 learners in instructed learning environments can have an orthographic bias in their lexical knowledge, especially in the recognition of decontextualized word forms (Veivo, Suomela-Salmi, & Järvikivi, Reference Veivo, Suomela-Salmi and Järvikivi2015). At the same time, words for these learners can have imprecise phonological representations (Cook & Gor, Reference Cook and Gor2015; Cook, Pandža, Lancaster, & Gor, Reference Cook, Pandža, Lancaster and Gor2016), which may lead not only to the activation of false semantic content (Cook et al., Reference Cook, Pandža, Lancaster and Gor2016) but also to increased lexical competition (Broersma & Cutler, Reference Broersma and Cutler2011).

If the phonological representations of L2 words are more imprecise and unstable than those for L1 words, they may also be less well connected to their orthographic counterparts. As proficiency in the L2 increases, phonological representations are likely to become more accurate (Darcy, Daidone, & Kojima, Reference Darcy, Daidone and Kojima2013) and the orthographic bias in accessing semantic content decreases (Veivo et al., Reference Veivo, Suomela-Salmi and Järvikivi2015). Taken together, the lexicon of late L2 learners in instructed learning could be orthographically biased so that orthographic representations may be more robust than phonological representations. Moreover, this relative bias might decrease as proficiency increases. In the present study, we evaluated this orthographic bias hypothesis by examining the flow of information from spoken word forms to written word forms in late L2 learners at different proficiency levels.

Previous studies have shown that proficiency can influence orthographic activation in L2 spoken word processing: orthographic information during spoken word processing is activated more rapidly and more strongly in more proficient than in less proficient L2 learners (Mitsugi, Reference Mitsugi2016; Veivo & Järvikivi, Reference Veivo and Järvikivi2013; Veivo, Järvikivi, Porretta, & Hyönä, Reference Veivo, Järvikivi, Porretta and Hyönä2016). Specifically, Veivo et al. (Reference Veivo, Järvikivi, Porretta and Hyönä2016) used the visual world paradigm with printed referents and observed a significant effect for the degree of orthographic overlap of the vowels in targets and competitors (target: <mince> /mɛ̃s/ “slim” vs. O+ competitor: <mite> /mit/ “moth” or O– competitor: <mythe> /mit/ “mythe”), but only for higher proficiency participants. This suggests that orthographic information modulates L2 spoken word identification at least for higher proficiency learners. However, Veivo et al. (Reference Veivo, Järvikivi, Porretta and Hyönä2016) did not contrast the two types of within-language L2 competitors in the same experiment or investigate the activation of between-language competitors from the participants’ L1 to evaluate the activation of L1 orthography in L2 spoken word processing. The present study was designed to fill this gap.

CURRENT STUDY

In the present study, our main objectives were to investigate the mapping of spoken L2 words onto their written referents, and to evaluate whether this mapping is mediated mainly via orthographic or phonological information. For this purpose, we used the visual world eye-tracking paradigm (Allopenna, Magnuson, & Tanenhaus, Reference Allopenna, Magnuson and Tanenhaus1998; Cooper, Reference Cooper1974; Tanenhaus, Spivey-Knowlton, Eberhard, & Sedivy, Reference Tanenhaus, Spivey-Knowlton, Eberhard and Sedivy1995; for a review, see Huettig, Rommers, & Meyer, Reference Huettig, Rommers and Meyer2011) in a task where spoken words are matched with their written counterparts (cf. Huettig & McQueen, Reference Huettig and McQueen2007, Reference Huettig and McQueen2011; McQueen & Viebahn, Reference McQueen and Viebahn2007). We studied Finnish learners of French with a wide range of proficiency levels. The task in both experiments consisted of listening to spoken instructions in French (“cliquez sur le mot cidre”) and clicking on one of the four words (target, competitor, and two unrelated distractors) that appeared on the computer screen 200 ms before the acoustic onset of the target word. The spoken target words were accompanied by a high orthographic low phonological overlap (OH-PL) competitor (e.g., <cidre> /sidʀ/ “cider” vs. <cintre> /sɛ̃tʀ/ “coat hanger”) or a low orthographic high phonological overlap (OL-PH) competitor (e.g., <cidre> /sidʀ/ vs. <cycle> /sikl/ “cycle”) either in the L2 (Experiment 1) or in the L1 (Experiment 2).

If orthographic input in L2 acquisition leads to an orthographic bias in the lexical knowledge of late L2 learners (e.g., Young-Scholten, Reference Young-Scholten, Burmeister, Piske and Rohde2002; for a review, see Bassetti, Reference Bassetti, Thorsten and Young-Scholten2008), we expect orthographically similar competitors to delay the mapping more than phonologically similar competitors. If the precision of phonological representations depends on proficiency (Darcy et al., Reference Darcy, Daidone and Kojima2013), proficiency will affect the speed of the mapping process. Based on previous results (Veivo & Järvikivi, Reference Veivo and Järvikivi2013), lower proficiency learners might activate sublexical grapheme–phoneme correspondences of the L1, which would show as increased activation of phonologically similar L1 competitors in Experiment 2.

We started by investigating in Experiment 1 the matching of French spoken and written L2 word forms in the presence of within-language orthographic and phonological competitors.

EXPERIMENT 1

Method

Participants

Sixty-four students from the University of Turku participated for course credit or volunteered. They reported no hearing impairment or language deficits and had normal or corrected-to-normal vision. All participants were native speakers of Finnish who had learned French as a foreign language in instructed learning. None of the participants had acquired French or any other language besides Finnish before the age of 3. Their age of onset for L2 French varied between 5 and 45 (median = 14). This means that they were all either literate or had started to acquire literacy in their L1 when they began to learn the L2. The participants represented a wide range of proficiency levels ranging from beginners to highly proficient. They evaluated their proficiency in French for five subskills (listening, reading, spoken interaction, spoken production, and writing) with the CEFR self-assessment grid (2001, pp. 26–27). Each subskill was self-assessed on six levels, which were assigned values from 1 to 6. The maximum score for proficiency for each participant was therefore 30.Footnote ¹ Participant-related background information is summarized in Table 1.Footnote ²

Table 1.

Background information for participants (n = 64) in Experiments 1 and 2.

Materials

The visual displays comprised four words: target, competitor, and two distractors. There were 20 target words (e.g., <cidre>) each associated with either a OH-PL overlap competitor (e.g., <cintre>) or a OL-PH overlap competitor (e.g., <cycle>). In the OH-PL condition, targets and competitors had a word initial orthographic overlap of two letters so that the nucleus vowel of the first syllable was always spelled similarly but pronounced differently (e.g., <cidre> /sidʀ/ “cider” vs. <cintre> /sɛ̃tʀ/ “coat hanger”).Footnote ³ In the OL-PH condition, targets and competitors always had a word-initial phonological overlap of two sounds so that the nucleus vowel of the first syllable was pronounced similarly but spelled differently (e.g., <cidre> /sidʀ/ vs. <cycle> /sikl/ “cycle”). Each target (e.g., <cidre> /sidʀ/) and its competitors (vs. <cintre> /sɛ̃tʀ/ and <cycle> /sikl/) were associated with two distractor words that were orthographically, phonologically, and semantically unrelated.

The two competitors were matched for frequency (Lexique 3; New, Pallier, Ferrand, & Matos, Reference New, Pallier, Ferrand and Matos2001) as well as possible. The mean frequency of the OH-PL competitors was 43.7 per million and of the OL-PH competitors 47.9 per million. In addition, distractors in each display were matched for frequency with the target, 32.6 and 35.3 per million, respectively. Targets, competitors, and distractors were also matched for written length.Footnote ⁴ The 20 target word sets are listed in Appendix A. In addition to the target displays, 50 filler displays were constructed. In order to avoid the participants developing test-taking strategies and recognizing the target displays on the basis of formal similarity between the words, 20 of the filler displays had an overlap between the distractor words. In 10 of these filler displays, the distractors had an OH-PL overlap, and in 10 displays, the distractors had an OL-PH overlap. The remaining 30 filler sets comprised four words with no orthographic, phonological, or semantic overlap. In sum, Experiment 1 consisted of 70 trials (20 target word displays, 20 manipulated filler displays, and 30 filler displays).

Each target word was embedded in a French sentence instructing the participant to click on the target word (e.g., “cliquez sur le mot cidre”). These sentences were recorded digitally using the SANAKO Lab100 hardware in the Learning, Age and Bilingualism laboratory at the University of Turku. A female native speaker of French, unaware of the aims of the study, read the sentences in a randomized order with a brief prosodic break before each target word. The mean duration for target words was 616 ms.

Design and procedure

Each trial consisted of responding to the spoken instruction sentence (e.g., “cliquez sur le mot cidre”), by choosing the target word with a mouse click among the four words appearing on the computer screen. The position of each type of word was randomized for each display. For the target word displays, the competitors in the two experimental conditions were counterbalanced between two lists so that each list contained an equal number of OH-PL (10) and OL-PH (10) overlap competitors. The order for the presentation of the 70 trials was randomized for each participant, and the participants were assigned to the two experimental lists in the order of appearance.

Participants’ eye movements were monitored using a head-mounted SR EyeLink II eye-tracker (www.sr-research.com) sampling at 500 Hz. Each trial started with drift correction where the participants fixated on a small cross appearing in the center of the screen for the experimenter to accept the gaze accuracy. After that, the spoken instruction to click on the target word was given via headphones. The visual display (see Figure 1) appeared on the screen 200 ms before the onset of the target word (cf. Huettig & McQueen, Reference Huettig and McQueen2007; McQueen & Viebahn, Reference McQueen and Viebahn2007; Salverda & Tanenhaus, Reference Salverda and Tanenhaus2010). As it takes about 200 ms to program and launch a saccade after a stimulus is presented (Matin, Shao, & Boff, Reference Matin, Shao and Boff1993), this assured that the participants were not able to read the target words and have access to the phonological form via orthography before hearing the targets. The written words were presented in lowercase Times New Roman font being approximately 3 to 4 degrees wide, with the center of each word appearing approximately 8 degrees from the center of the screen (Figure 1).

Figure 1.

An example of the visual display used in Exp. 1 and Exp. 2 (target: route, competitor: rouva, distractors: kansa & frère).

Before the main experiment, participants were familiarized with the task by presenting a practice block of 10 displays consisting of unrelated words. After that, they were presented with Experiments 1 and 2. The order of the experiments was counterbalanced between participants.

Results and discussion of Experiment 1

Five trials (0.5% of the data) were removed from the analyses because the participants clicked on the competitor word instead of the target word.Footnote ⁵ The proportion of looks to the targets, to the competitors, and to the distractors was determined for each trial and for each participant by calculating the number of looks to each word in 20-ms time bins. Mean proportions of looks to each type of word in the two experimental conditions for a 1200-ms period starting from target word onset are presented in Figure 2.

Figure 2.

Mean proportion of looks to each type of word in the two experimental conditions in Exp. 1.

Proportions of looks to each type of word do not differ at word onset, but as Figure 2 shows, looks to distractors start to diverge from target and competitor looks in both experimental conditions at about 300 ms after the onset of the target word. Looks to competitors increase until around 500 ms, and looks to targets increase until reaching the asymptote around 1000 ms after onset. Therefore, we examined the data more in detail within a time window ending at this latter time point (200–1000 ms after target word onset). The proportions of fixations were logit-transformed for statistical analyses (Fox & Weisberg, Reference Fox and Weisberg2011),Footnote ⁶ providing an unbounded measure in which zero represents 50% of looks (Barr, Reference Barr2008).

Visual world eye-tracking data is inherently time-series data and usually presents nonlinearly over time (see Figure 2). In addition, it is possible that the time course interacts with other continuous variables, such as proficiency (cf. Veivo et al., Reference Veivo, Järvikivi, Porretta and Hyönä2016), which may also be nonlinear. We therefore used generalized additive mixed modeling (GAMM; Baayen, Vasishth, Kliegl, & Bates, Reference Baayen, Vasishth, Kliegl and Bates2017; Hastie & Tibshirani, Reference Hastie and Tibshirani1990; Wood, Reference Wood2006), which does not assume a linear relationship between predictors and the response variable and is capable of modeling interactions between continuous variables (here, time and proficiency; see Baayen et al., Reference Baayen, Vasishth, Kliegl and Bates2017; Baayen, van Rij, de Cat, & Wood, Reference Baayen, van Rij, de Cat, Wood, Speelman, Heylan and Geeraerts2018; Veivo et al., Reference Veivo, Järvikivi, Porretta and Hyönä2016). Furthermore, given the time series nature of the data, GAMM also allows for the control of autocorrelation in the data (see, e.g., Porretta, Kyröläinen, van Rij, & Järvikivi, Reference Porretta, Kyröläinen, van Rij, Järvikivi, Czarnowski, Howlett and Jain2018). Autocorrelation relates to the correlation between data points in a time series; a measurement at time point t is correlated to differing degrees with a measurement at time point t-i, depending on the lag. Autocorrelation is particularly problematic because it can greatly increase overconfidence of the model estimates.

In order to understand how online target word processing is modulated by proficiency and overlap, we modeled logit transformed looks to the target word as a function of time (200–1000 ms after target onset), proficiency (ranging from A1 to C2), and overlap condition (OH-PL vs. OL-PH). In addition, list and trial were included in the analysis as control variables. Finally, to control for individual variation in looking behavior, we created the variable event. Here, event represents the combination of participant and trial, capturing participants’ variable responses to different items in the experiment. Event was included in the model as a random effect, allowing each unique time series to have its own intercept in the model (Baayen et al., Reference Baayen, van Rij, de Cat, Wood, Speelman, Heylan and Geeraerts2016; Nixon, van Rij, Mok, Baayen, & Chen, Reference Nixon, van Rij, Mok, Baayen and Chen2016; Porretta, Tucker, & Järvikivi, Reference Porretta, Tucker and Järvikivi2016).

It is reasonable to expect that proficiency (a continuous variable) may influence the time course of processing nonlinearly. To allow for this, we used a tensor product (Wood, Reference Wood2006) for a nonlinear relationship between time and proficiency. Further, also using a tensor product, a difference surface (Baayen, Reference Baayen and Olson2010; Wood, Reference Wood2006) was included for overlap condition. This approach allows for the evaluation of the significance of the factor relative to the interaction of time and proficiency. In this case, the difference surface informs how and where OH-PL is different from the overall effect by adding an additional smoothing parameter on top of the main trend (Zuur, Ieno, Walker, Saviliev, & Smith, Reference Zuur, Ieno, Walker, Saveliev and Smith2009). Finally, trial order was included as a smooth term, and list was included as a parametric term.

The model was fitted to the data through a series of steps in order to assess the contribution of each variable. First, we fitted a full model (i.e., all the predictors, as described above). Second, autocorrelation was estimated from the data (ρ = 0.895, indicating a fairly high correlation between subsequent time points), and the model was refitted including this parameter to adjust the confidence of the estimates. Third, we evaluated the contribution of the individual predictors in the model. For this, two criteria were used: the p value of the term (indicating whether a given effect is not zero) and maximum likelihood (ML) score comparison between model variants (indicating whether the inclusion of the predictor improved the fit of the model; Zuur et al., Reference Zuur, Ieno, Walker, Saveliev and Smith2009). This process was done iteratively in a backward stepwise fashion until the model contained only predictors that were statistically significant and contributed to the model fit. Trial and the difference surface for overlap condition were removed through the fitting process, indicating that the order of presentation of the targets was not significant, χ² (2) = 1.017, p = .362, nor was the type of overlap between targets and competitors, χ² (5) = 1.182, p = .797.

ML score comparisons with chi-square tests between variant models justified including proficiency as an input variable, χ² (3) = 30.192, p < .001. The resulting model contained the following predictors: event, experimental list, Time × Proficiency, and explained 30.6% of the deviance. The statistics for the parametric and smooth terms of the model with the best fit are summarized in Table 2. The significant effect of proficiency over time is depicted in Figure 3.

Table 2.

Generalized additive mixed model with best fit for target looks in Experiment 1: Parametric coefficients and estimated degrees of freedom (Edf), reference degrees of freedom (Ref. df), F values, and p values for the tensor product

Figure 3.

The effect of Proficiency over Time for target looks in Exp. 1.

In interpreting the GAMM results, visual inspection of the figures is essential, perhaps even more so than in other types of data analysis. Figure 3 presents the interaction between proficiency and time as a regression surface, showing that overall, as time progressed, participants were generally more likely to look at the target. Here darker shades of gray represent fewer looks to the target, whereas lighter shades of gray represent more looks to the target, and the contour lines indicate the rate of change.

More interesting, as proficiency increased, the participants were more likely to look at the target. Lower proficiency learners looked at the targets later than higher proficiency learners. Proficiency especially influenced processing in participants with proficiency scores under 15 (equal to CEFR-levels A1, A2, and B1) and did so in a graded fashion. For example, if we follow the time course for participants with proficiency scores 5 and 20, we find that lower proficiency participants were less likely to fixate the targets (between 400 ms and 600 ms). However, we can also see that the effect of proficiency was not linear along the proficiency continuum. This is evidenced by the shape of the contour lines, which indicate a strong effect of proficiency for participants with scores under 15 and little to no effect for participants with scores over 15.

The results of Experiment 1 fail to provide evidence that the OH-PL overlap between targets and within-language competitors delays the mapping between spoken and written forms more than OL-PH overlap. This suggests that when both orthographic and phonological competitors are present at the same time, both orthographic and phonological information is used in the matching process to a similar degree. We will return to this issue in detail in the General Discussion. However, our results confirm that the speed of target identification depends on L2 proficiency in a nonlinear fashion: more proficient L2 listeners fixate the targets faster than less proficient learners, and the influence of proficiency is more pronounced in the lower half of the proficiency scale.

As we were interested in how the L1 modulates L2 performance, we next moved on to investigate the activation of orthographic and phonological information from the participants’ L1 in the recognition of L2 word forms in Experiment 2. This experiment was designed to examine the impact of L1 orthography on the mapping process of the L2 at different proficiency levels.