Cross-language interactions during novel word learning: The contribution of form similarity and participant characteristics

Abstract The study examined whether false-cognates, overlapping in form but not meaning across languages, are easier to learn due to form overlap, or more difficult to learn due to meaning competition, compared to unambiguous control and cognate words. Fifty-four native Hebrew speakers learned 14 cognates, 14 false-cognates, and 28 control Arabic words in one session. Cognates were learned better than control items. There was no overall difference in learning false-cognates relative to controls, but individuals with higher phonological short-term memory, or with lower L1 verbal fluency, did exhibit a false-cognate learning-advantage. For these individuals, form overlap was more influential than meaning competition. Lexical decisions to Hebrew words following Arabic learning were slower for false-cognates than controls, indicative of backward influences. The findings reveal the influence of prior knowledge on learning and processing, and highlight the importance of jointly considering item-based and learner-based characteristics during the initial stages of vocabulary learning.


Introduction
Among the many challenges associated with learning a new language, learners need to acquire extensive new vocabulary to become proficient users of the language (e.g., Nation, 2013;Perfetti, 2007).Improving initial vocabulary acquisition and identifying the factors that may modulate it are therefore of great importance (Rice & Tokowicz, 2020;Tokowicz & Degani, 2015).In the current study, we focused on understanding how the overlap between the to-be-learned materials and participants' prior knowledge affect foreign-language (FL) learning.We examined how overlap in form and in meaning between the to-be-learned words and learners' first language (L1) affected the learning trajectories of different individuals.Whereas it is known that cognates that overlap in both form and meaning across languages (e.g., /ʔozen/ means 'ear' in both Arabic and Hebrew) are easier to learn compared to other types of words (Lotto & de Groot, 1998), it is less clear how form-overlap in the absence of meaning-overlap affects learning.Therefore, the aim of the current study was to examine how FALSE-COGNATES, which overlap in phonological and/or orthographic form across languages but differ in meaning (e.g., /sˤu:sˤ/ refers to a 'chick' in Arabic but to a 'horse' in Hebrew) are learned.Furthermore, we aimed to answer two additional questions: (1) how individual characteristics (phonological short-term memory and L1 verbal fluency) modulate the learning efficiency of this FL vocabulary, and (2) whether the newly learned words could have an influence on L1 processing.

The role of L1 in novel word learning
A point of departure for many models that aim to explain FL vocabulary learning is that learners rely on prior knowledge during learning.For example, a key feature of the Unified Competition Model (MacWhinney, 2005) is that, due to the interconnections between the L1 and the FL/second language (L2), whatever can transfer from the L1 will transfer to affect learning and processing of the FL/L2.The model proposes that in the lexical domain, the L2 is parasitic to the L1, and relies on the formal structure of the L1, at least at the beginning stages of learning.Similarly, according to Ecke's Parasitic Model (2015), when learning novel words, learners detect form similarities between the to-be-learned word and existing representations, such that the novel word is connected to a host representation in the previously known language(s).The Revised Hierarchical Model (RHM, Kroll & Stewart, 1994) postulates that with low L2 proficiency, L2 words rely on their L1 translations before developing independent links from form to meaning.Further, the Typological/Contrastive approach (Odlin, 1989) similarly suggests that FL learning capitalizes on knowledge from the L1, and postulates that a FL which is structurally similar to the L1 would be easier to learn than a FL which is not similar to it.
Accordingly, the similarity of the to-be-learned language to the L1 is expected to influence the learning outcome.
Of relevance, language similarity may be operationalized as language typological similarity (e.g., English and Italian vs. English and Arabic) resulting in manipulation of this variable at the whole language level.At the same time, language similarity may be manipulated at the item level, with particular items being more or less similar across languages.For instance, in a recent line of studies, the item-based overlap across languages, as reflected in the mapping of specific translations (i.e., translation ambiguity), was found to influence FL vocabulary learning (Degani & Tokowicz, 2010;Degani & Goldberg, 2019;Degani, Tseng & Tokowicz, 2014).Here, we extend this line of research to examine how semantic-form ambiguity (the mapping of a single form to different meanings, i.e., false-cognates) influences learning.In the current study, learners' L1 (Hebrew) and the FL they learned (Arabic) are linked at the whole language level, because both are Semitic languages that share partially similar phonological and morphological properties (Ibrahim, 2006;Saiegh-Haddad & Henkin-Roitfarb, 2014).However, we specifically focused on an item-based similarity manipulation, by including items that do or do not overlap in form across the two languages.
The role of semantic-form overlap in novel word learning

Cognate learning
One can think of the process of novel word learning as incorporating the learning of three dimensions: form, meaning and the mapping between these two (Tokowicz & Degani, 2015).Cognates overlap in all three dimensions across languages and thus may enjoy an advantage in FL vocabulary learning, because transfer from the L1 can occur smoothly without producing errors (MacWhinney, 2005).Indeed, Lotto and de Groot (1998) tested cognate and non-cognate word learning among adult Dutch speakers who learned Italian vocabulary.Participants showed better learning of cognate words, in both accuracy and latency measures, regardless of whether the words were learned in association with a Dutch translation word or a picture.Similarly, in a study conducted by Raboyeau, Marcotte, Adrover-Roig, and Ansaldo (2010), French speakers learned Spanish cognates better than non-cognates.This advantage was reflected in both an early learning phase (spanning over 5 days), and in a later consolidation period (of 2 weeks).A similar cognate facilitation effect was observed among learners of more distant languages (Persian and French, Ghazi-Saidi & Ansaldo, 2017).
Previous studies that examined learning of new meanings to a known form within a language may shed light on how false-cognates are learned.Specifically, learning a new meaning to a known word-form creates an ambiguous word, in that a single form is now mapped onto two different meanings.Storkel et al. (2013) examined such learning amongst children and observed an advantage for learning new meanings to known forms compared to learning completely new words, attributed to better access to the frequent known word-forms.Thus, wordform familiarity may facilitate learning.
At the same time, other research suggests that learning an unrelated meaning to an already known form may hinder learning.For instance, Rodd et al. (2012) taught adult native English monolinguals fictional new meanings for unambiguous English words and showed better learning for meanings that were semantically related to the original meaning (e.g., Hive as a busy household), compared to those that were semantically unrelated meanings (e.g., Hive as a mythical monster).This latter finding indicates that learning a new unrelated meaning may impose a challenge (see also Bracken, Degani, Eddington & Tokowicz, 2017;Floyd & Goldberg, 2020;Maciejewski et al., 2020).
Together, these two lines of research may reveal the operation of non-mutually exclusive processes: namely, form facilitation (Storkel et al., 2013) and meaning competition (Rodd et al., 2012), which may affect learning performance.In support of this possibility, recent work provides evidence for both form facilitation and meaning competition, taking place at different time points in learning.Specifically, Fang et al. (2017) compared learning novel words (new forms and meanings) to learning new meanings to already known forms by examining learning efficiency, indexed as the ability to generate meanings relative to the time spent studying the word.Interestingly, a biphasic effect emerged in that in the early stages of learning participants were more efficient in learning new meanings to already known forms, but over time the pattern reversed with a greater learning efficiency for the novel words.
Therefore, the extent to which overlap in form can facilitate learning when there is no meaning-overlap remains unclear.Moreover, the majority of past research tested learning of new meanings to already existent forms WITHIN a given language (but see Ghazi-Saidi & Ansaldo, 2017;Mulík et al., 2019;Otwinowska, Foryś-Nogala, Kobosko & Szewcyzk, 2020), whereas our study aims to better understand the role of phonological-overlap, with and without a semantic-overlap, ACROSS the language boundary in two different languages.One relevant study conducted recently by Mulík et al. (2019), examined how Spanish-English bilinguals with low and high L2 proficiency learned three types of Slovak words: overlapping in phonology but not in meaning with the L1 (Spanish false-cognates), overlapping in phonology but not in meaning with the L2 (English false-cognates) and control words, not overlapping with either of the languages.The results indicated better learning of false-cognates of both types in comparison to control words in the high proficiency group, and better learning of L1 false-cognates by the low proficiency group.
Contrasting findings, suggesting worse learning of falsecognates relative to control, were also observed.Specifically, in a recent study, Otwinowska and Szewczyk (2019) examined the learnability of different L2 words by asking adult Polish learners of English to provide the translation of a list of English words and rate their confidence in the translation.Interestingly, falsecognates were found to be less learnable than control and cognate words, although learners presumably had comparable opportunities to learn these words in the input.
Other research observed no difference between false-cognates and control words (Ghazi-Saidi & Ansaldo, 2017;Otwinowska et al., 2020).In particular, Ghazi-Saidi and Ansaldo (2017) taught adult Persian speakers cognates, false-cognates and control French words as novel words.Results showed faster naming of cognates in comparison to false-cognates and control words, with no significant difference in naming latencies between falsecognates and control words.At the same time, false-cognates and cognates differed from control words in the recruitment of certain brain areas (basal gangilia and parahippocampal gyrus) thought to reflect the engagement of implicit memory systems, suggesting that the similar learning accuracy for false-cognates and controls may be based on different mechanisms (see also Grant, Fang & Li, 2015).
Given the contradicting findings of the previous studies and the lack of certainty regarding false-cognate learning, we aimed to provide additional insight into the way in which false-cognates are learned in comparison to control words.Therefore, participants in the current study learned Arabic false-cognates with meanings that are semantically unrelated to the original Hebrew words.Based on the results of Mulík et al. (2019) and Storkel et al. (2013), facilitation may be expected for false-cognates over control words due to the overlap in form.In contrast, according to the findings of Otwinowska and Szewczyk (2019) and Rodd et al. (2012), we expected a challenge in learning these words in comparison to unambiguous control words due to the difficulty in mapping unrelated meanings.Further, to better understand the conflicting pattern that emerged from previous research, we suggest that participants' individual characteristics should be considered.We propose that learners' cognitive and linguistic profile may modulate the extent to which they benefit from form facilitation or suffer from meaning competition during learning.Therefore, in the current study, interactions between item-based and learner-based characteristics were examined.

Phonological short-term memory
Our study further aimed to test how individual characteristics modulate novel word learning.Specifically, better phonological short-term memory has been linked to better vocabulary learning of foreign and artificial languages both in adults (Cheung, 1996;Kaushanskaya, 2012;Martin & Ellis, 2012) and in children (Gathercole, Hitch & Martin, 1997;Michas & Henry, 1994).Interestingly, Degani and Goldberg (2019) found that better phonological short-term memory was associated with better overall FL vocabulary learning, yet it was also associated with a greater disadvantage in learning translation-ambiguous words.Presumably, the stronger activation of phonological forms for individuals with higher phonological short-term memory resulted in increased competition in the ambiguous condition in which two different words in the FL (Arabic) corresponded to one form in L1 (Hebrew).In the current study, we expected to observe an overall advantage for individuals with higher phonological short-term memory as measured by a non-word repetition task (Shatil & Share, 2003).We further tested whether it modulated the difference in learning false-cognates vs. control words.We expected individuals with higher phonological short-term memory abilities to capitalize on the phonological overlap of falsecognates, such that for these individuals, false-cognates would be easier to learn compared to control words.

L1 abilities
FL vocabulary learning has also been positively linked with L1 proficiency (Geva, 2014).For instance, Sparks, Patton, Ganschow, Humbach, and Javorsky (2006) showed that literacy measures in the L1 predicted better FL proficiency 10 years later.Mulík et al. (2019) demonstrated that participants with higher L2 proficiency were better learners of false-cognates in an L3 in comparison to participants with lower L2 proficiency.Notably, Degani and Goldberg (2019) found that increased L1 verbal fluency, rather than subjective ratings of proficiency in the language from which participants learned the FL vocabulary, was associated with better learning.Although L1 verbal fluency could be thought of as a subcomponent of the complex construct of language proficiency (Hulstijn, 2011), in the Degani and Goldberg (2019) sample there was no correlation between the subjective ratings and the verbal fluency score.Verbal fluency may reflect cognitive control in addition to lexical access (Carpenter, Rao, Peñaloza & Kiran, 2020;Friesen, Luo, Luk & Bialystok, 2015), but critically it may index the availability of L1 representations at the time of learning (Linck, Kroll & Sunderman, 2009).Therefore, and because self-ratings of proficiency may be of limited range and predictive power in the L1 (Tomoschuk, Ferreira & Gollan, 2019), in the current study we focused on participants' L1 (Hebrew) verbal fluency score, as measured by a semantic fluency task (Kavé, 2005), as an index of the availability of learners' L1 representations.We further examined individual differences in current patterns of L1 (Hebrew) use, as these may similarly affect the availability of L1 representations during learning.
Notably, the link between knowledge of prior languages and novel vocabulary learning may stem from two reasons (see also Hirosh & Degani, 2018).The first is the reliance of individuals on direct transfer from their previous languages, in that individuals with higher abilities in the known languages can more easily access phonological and semantic representations in their lexicon (Gathercole, 2006).The second is that observed L1 verbal fluency in fact reflects individuals' general language-learning capacity (a common proficiency/aptitude construct, e.g., Cummins, 1979Cummins, , 1991)), which may be linked to more general cognitive abilities (Geva, 2014).Under both accounts, we expected learners with better L1 verbal fluency to be overall better at FL vocabulary learning.
Further, the degree to which L1 verbal fluency modulates how different word types are learned may take several forms.To the extent that the advantage of individuals with higher L1 verbal fluency stems from their specific reliance on L1 phonological and semantic representations in their lexicon (Gathercole, 2006), we expected individuals with higher L1 verbal fluency to show a larger advantage in learning novel words with semantic-form overlap (cognates).Reliance on L1 meaning may lead individuals with higher L1 abilities to show a false-cognate disadvantage relative to controls, given the misalignment in form to meaning mapping for false-cognates.However, to the extent that the effect of L1 verbal fluency comes from a common proficiency construct, the difference between individuals with lower and higher abilities may become larger under more demanding learning conditions: namely, false-cognates and controls but not cognates.

Bidirectional cross-language effects
The final goal of the present study was to investigate whether cross-language interactions that are often observed for proficient bilinguals can be found at the initial stages of learning.In particular, proficient bilinguals exhibit cross-language interactions when presented with words with semantic-form overlap across their two languages (Degani, Prior & Hajajra, 2018;Prior, Degani, Awawdy, Yassin & Korem, 2017).For example, Degani et al. (2018) observed that proficient Arabic-Hebrew bilinguals activated the Arabic meaning of Hebrew/Arabic cognates and false-cognates during a visual semantic-relatedness judgment task in Hebrew (see also Prior et al., 2017 for auditory presentation).This crosslanguage influence was present even though bilinguals performed a task in which the other language was not relevant, suggesting that activation of a word-form in one language leads to form and meaning activation in both languages (i.e., NON-SELECTIVE ACCESS, Dijkstra & van Heuven, 2002;Kroll, Bobb & Wodniecka, 2006).Thus, when individuals acquire new meanings for an already existent unambiguous word in their other language, as in the case of false-cognates in the current study, competition might occur between the original dominant meaning in the L1 and the newly learned meaning (see also Dumay & Gaskell, 2007, 2012;Rodd et al., 2012).
This competition may take the form of FORWARD influences across-languages, in that the L1 affects FL vocabulary learning.Notably, the current study further tested for BACKWARD crosslanguage influences, testing whether the newly learned meanings affect processing of the already existent forms in the L1.Such bidirectional cross-language influences have been demonstrated among more proficient bilinguals (Degani, Prior & Tokowicz, 2011;Malt, Jobe, Li, Pavlenko & Ameel, 2016).Further, in the study by Rodd et al. (2012) described above, learners responded more quickly to the words to which they learned a semantically related new meaning in comparison to unrelated new meanings in a semantic decision task following learning (see also Maciejewski et al., 2020).
These findings suggest the possibility that a newly learned meaning would affect processing of the established word form in the L1.To test this, following learning, participants performed a visual lexical decision task in their L1, containing Hebrew words that correspond to the newly learned Arabic cognates and falsecognates.To the extent that the newly learned Arabic cognates and false-cognates were integrated into a unified bilingual lexicon (Dijkstra & van Heuven, 2002), we expected that following learning, the Arabic meaning would be activated when learners were presented with the Hebrew existing form.Thus, we expected learners to respond to Hebrew false-cognates as ambiguous Hebrew words entailing more than one meaning.Conversely, we expected the Hebrew cognates to enjoy an advantage in processing due to the convergence of meanings from both languages.

The current study
To summarize, in the current study, native Hebrew speakers with no previous knowledge of Arabic learned three types of Arabic words in the spoken modality (cognates, false-cognates, and controls).All participants completed a set of training and testing cycles, as well as objective (a non-word repetition task to test phonological short-term memory, and an L1 verbal fluency task) and subjective (language history questionnaire) measures of linguistic and cognitive abilities.The study aimed to answer three main questions.First, we tested how semantic-form overlap affected learning by incorporating translation recognition tests intermitted with learning cycles.Thus, in the course of learning and testing we examined which of the three word types was learned more easily, and specifically whether cognates were learned better than unambiguous controls and whether falsecognates were harder or easier to learn compared to unambiguous control words.Second, we examined how individual differences modulated learning of FL vocabulary, with a focus on phonological short-term memory and L1 verbal fluency.The availability of L1 representations was also examined via ratings of current L1 use.Finally, we tested for backward cross language influences.Thus, we asked whether the newly learned cognates and falsecognates affected the processing of these forms in the L1 by employing an L1 visual lexical decision task post learning.

Participants
Fifty-six native Hebrew speakers, with no or limited knowledge of Arabic, participated in the experiment.Thirty-four of them indicated learning Arabic in high school, but reported minimal proficiency and use of Arabic 1 .All participants had acquired English as an L2 or L3 and use it frequently in their academic or professional life.They were recruited from a large university in Israel, and indicated no learning or attention disabilities.Participants' language profile was assessed via objective (verbal fluency task in Hebrew) and subjective (language history questionnaire) language measures.Participants signed an informed consent as an approval of their participation, and were compensated with class credit or payment.
Data from two participants were excluded, one because Hebrew was not their native language, and the other due to incompletion of the protocol due to extremely long completion time of the learning phase.Analyses of word learning are therefore based on a final set of 54 participants (40 females, 14 males, average age 24.85).See Table 1 for more information.Notably, as discussed below, each cycle of learning was analyzed with the maximum number of participants who completed that cycle (54 in cycle 1; 37 in cycle 2; 22 in cycle 3, and 12 in cycle 4).Further, analysis of backward influences was based on a subset of 43 participants who reached a learning criterion of 80% in the learning phase.

Stimuli
Fifty-six Arabic words were taught to each participant, of which 14 words were cognates (e.g., /ʔozen/ which means 'ear' in both languages), 14 were false-cognates (e.g., /sˤu:sˤ/ which means 'chick' in Arabic but 'horse' in Hebrew) and 28 were unambiguous control words.See Table 2 for mean characteristics and Supplementary Materials (Supplementary Materials, S1) for the full stimuli set.All Arabic stimuli were recorded in standard Arabic by a female native Arabic speaker.
To determine the phonological similarity of the Hebrew and Arabic forms, norming procedures were conducted such that native Hebrew speakers (with no previous knowledge of Arabic) rated the similarity of the aural form of the Arabic word along with the phonological form of the visually presented Hebrew word.Each item was rated by at least 10 participants on a scale 1 To verify that this minimal Arabic exposure did not affect the results we conducted the analyses on the subset of participants who reported learning Arabic in high-school (n = 34).The pattern of results was identical, with better learning of cognates relative to control, and no difference between false-cognates and control in each cycle of learning, and across cycles.
Bilingualism: Language and Cognition of 1-5 (1 = minimum form similarity, 5 = maximum form similarity).All selected cognates and false-cognates were rated higher than 3 in phonological similarity between their Hebrew and Arabic forms, such that cognates (M = 4.23, SD = 0.61) did not differ significantly from false-cognates (M = 4.36, SD = 0.53), t < 1. Note, that although the cognates and false-cognates used in the current study substantially overlap in phonological form across Hebrew and Arabic, their phonological realization is not identical.Further, phonological similarity ratings of the Arabic false-cognates with the form of the appropriate Hebrew translations (e.g., /sˤu:sˤ/ in Arabic vs. /ʔefroaħ/ in Hebrew, both meaning 'chick') were lower than 3 (M = 1.18,SD = .18).

Procedure
Each participant completed the protocol in one session in which they completed up to four computerized learning cycles and translation recognition tests, all using E-prime software (Psychology Software Tools, Pittsburgh, PA).At the beginning of each learning or testing task, participants completed eight practice trials to get familiarized with the task, in the presence of the experimenter.Following the word learning and testing tasks, participants completed additional linguistic and cognitive tasks and a language history questionnaire.

Familiarization learning cycle
In the first cycle, participants were aurally presented with the to-be-learned Arabic words and visually presented with their Hebrew translation to provide their meaning, and were requested to repeat out loud each Arabic word they hear, with no feedback.Specifically, on each trial a 1000 ms fixation cross was presented followed by a 500 ms blank screen.A Hebrew word was then visually presented in its written form for 1000 ms, followed by an aural presentation of the to-be-learned Arabic translation via headphones.Subsequently, a question mark appeared on the screen until the participant's vocal response.A blank screen of 1500 ms was presented before the next trial.
Second and subsequent learning cyclesretrieval attempt In these cycles, participants were presented with the Hebrew words and were requested to attempt to produce their Arabic translations.Retrieval-based learning (Karpicke, 2012) was incorporated because practice testing has been shown to be a more efficient learning strategy in comparison to rehearsal or imitation (Kang, Gollan & Pashler, 2013;Rice & Tokowicz, 2020;Tokowicz & Degani, 2015).On each trial, following the 1000 ms fixation cross and the 500 ms blank screen, participants were visually presented with a Hebrew word for 1000 ms, followed by a question mark that appeared on the screen until the participant's vocal response triggered the voice key.Following this retrieval attempt, a 1000 ms blank screen appeared followed by aural presentation of the correct Arabic translation, regardless of the participant's answer (Kang et al., 2013), to keep the number of presentations of the Arabic word equal for all participants.An additional 1500 ms blank screen then appeared before the next trial.Note: There were no significant differences among the word types based on a one-way Anova in any of the measures (all p s > 0.20).Hebrew frequency is based on heTenTen 2014 corpus via SketchEngine (Kilgarriff et al., 2014;Kilgarriff et al., 2010).Standard deviations appear in parenthesis.

Recognition test
Participants then completed a recognition test, in which a 1000 ms fixation cross appeared on the screen followed by a 500 ms blank screen.Participants then heard an Arabic word via headphones and four Hebrew alternative translations appeared on the screen.Participants selected their response by button press (1 to 4), received feedback (correct or incorrect), and were visually presented with the correct Hebrew response, regardless of their accuracy on the trial, to strengthen learning and keep the number of presentations equal (Butler, Karpicke & Roediger, 2008;Karpicke & Roediger, 2008).The four Hebrew alternative translations were selected randomly on each trial from the pool of Hebrew translations used during the learning cycles.
To ensure that the novel FL words were sufficiently learned before backward influences of FL on L1 were examined, a learning criterion was used (Bartolotti & Marian, 2012).Success rates on the recognition test were calculated by the computer program.If the participant reached 80% success rate on all three types of words (cognate, false-cognate and control), the learning protocol was over.When performance was below 80% success rate in one (or all) of the word types, learning resumed using the retrieval attempt protocol, followed by a recognition test.Learning and testing were repeated until an 80% criterion was met in all 3 word types for a maximum of 4 cycles.

Individual differences measures
Phonological short-term memory task A Hebrew non-word repetition task (Shatil & Share, 2003) was used in order to test participants' phonological short-term memory span.Although this task was developed for school age children (up to 6 years of age), it was successfully used with adult learners in previous studies (e.g., Degani & Goldberg, 2019).
Participants were asked to repeat out loud sets of growing lengths of Hebrew non-words (2 to 8 characters, 2 words from each length) in the same order presented to them by the experimenter.The test ended when participants failed to accurately repeat both sets of the same length.

L1 (Hebrew) verbal fluency task
Participants were asked to name as many words as they can in Hebrew within one minute for each of two different semantic categories (Kavé, 2005; see also Gollan, Montoya & Werner, 2002).Two categories (fruits and vegetables, and vehicles) were each presented on a computer screen, followed by an hour glass marking the time limit for the task (60 seconds per category).
Language History Questionnaire (LHQ) Participants completed an LHQ (adapted from the LEAP-Q questionnaire, Marian, Blumenfeld & Kaushanskaya, 2007) providing details regarding proficiency and use patterns in the languages they know.

FL on L1 influence
In order to test whether the newly learned words affected participants' L1 processing, participants completed a HEBREW VISUAL LEXICAL DECISION TASK.The Hebrew words included the 14 cognate words that were learned during the learning protocol, as well as 14 Hebrew false-cognates for which an Arabic false-cognate had been learned.For instance, in the learning protocol participants learned that /sˤu:sˤ'/ in Arabic means 'chick', whereas in the lexical decision task, the Hebrew word /sus/, meaning 'horse', was presented.In addition, control items learned during the learning phase were included in the lexical decision task as a comparison, but due to matching considerations (in that now the task included the Hebrew false-cognate items), only 24 of the original 28 control items were included.Across the three word types (cognates, falsecognates and control), items were matched on Hebrew length in letters and syllables, in Hebrew frequency and in Hebrew bigram and trigram frequency (all ps>.31, see Supplementary Materials (Supplementary Materials, S2) for summary characteristics).Thirty-two additional filler items (18 unambiguous and 14 ambiguous Hebrew words such as /mapa/ which means both a tablecloth and a map) were included in order to conceal the purpose of this task.In total, the task included 84 words and 84 orthographically legal non-words.The selected non-words were matched to the real words on length in number of letters and syllables, and in bigram and trigram Hebrew frequency (from a corpus of 12 million words from a collection of articles from the Hebrew newspaper Haaretz, see also Peleg, Degani, Raziq & Taha, 2020; see Supplementary Materials, S2 for the full analyses data), to avoid fast and superficial lexical decisions based on orthographic information only (Keuleers & Brysbaert, 2010).On each trial, a 500 ms fixation cross appeared, followed by the letter string.Participants were requested to decide whether the letter string is a real Hebrew word by clicking "yes" or "no" on the response box as quickly and as accurately as possible.The letter string remained on the screen until participants' response, or up to 4 seconds, at which point a fixation cross appeared to signal the next trial.
To verify that the results of the learners reflect the effect of Arabic learning, their performance on this Hebrew lexical decision task was compared to that of a second group of 30 native Hebrew speakers who did not participate in the learning paradigm (see Supplementary Materials (Supplementary Materials, S6) for detailed background information).The two groups were comparable except that the non-learners performed only the lexical decision task and completed the LHQ.

Results
In what follows, we first report on the analyses examining how word type (cognate, false-cognate, control words) affects learning, including (a) the number of cycles participants needed in order to reach an 80% learning criterion in each word type, followed by detailed analyses of the (b) error rates and (c) RTs within each cycle of learning (and across cycles).Next, we report the analyses of how individual differences in linguistic and cognitive abilities modulate learning.Finally, we report how lexical decisions to L1 (Hebrew) words are affected by prior learning of related vocabulary in the FL (Arabic).

Number of cycles to criterion
For each participant, we computed the cycle at which they first reached the 80% criterion for each word type.On average, participants reached the overall learning criterion with M = 1.4 cycles (SD = .74) 2 .Interestingly, a repeated measures Anova with 2 Ten participants did not reach this criterion in one or more of the word types within the four cycles provided.These 10 participants were included in the relevant learning analyses described below, but not in the subsequent backward influences analyses which assume successful learning (i.e., that the learning criterion had been met).

Data analysis and model structure
Results were analyzed using linear mixed-effects models as implemented in the lme4 (Baayen, Davidson & Bates, 2008) and lmerTest packages (Kuznetsova, Brockhoff & Christensen, 2017) in R (Version 4.0.3,R Core Team, 2020).RTs were analyzed for correct responses only.A preliminary inspection of the RT data revealed trials with absurdly long RTs (above 40 seconds) in the translation recognition test, that most likely do not reflect relevant task processes.Thus, we excluded trials with RTs longer that 6.5 seconds, resulting in the exclusion of less than 2.5% of the data.Further, examination of the RT distribution revealed substantial deviation from normality.To improve skew and kurtosis, we explored log and inverse transformations.Based on the QQ plots, the skew (raw data:1.2;log transformation: 0.2; inverse transformation: -1.12), kurtosis (raw data:1.75;log transformation: -0.58; inverse transformation: 1.46), and Anderson-Darling Normality test (raw data: A = 232.94,p < .001;log transformation: A = 18.09, p < .001;inverse transformation: A = 98.63, p < .001), the log transformation provided a better remedy, and was thus adopted.Error rate data were analyzed following the binomial distribution in logistic mixed-effect models.Each cycle was analyzed separately with the maximum number of participants who completed that cycle (54 in cycle 1; 37 in cycle 2; 22 in cycle 3, and 12 in cycle 4), and additional analyses compared performance across cycles for the 22 participants who completed the first three cycles.
Within each cycle and each measure (log RTs; Error rate) the models included the fixed effect of Word Type, dummy (treatment) coded with Control items set as the reference, to allow comparisons of Cognates vs. Controls and False-Cognates vs. Controls 3 .A maximal model including by-participant and by-item intercepts and a by-participant slope for Word Type (Bell, Fairbrother & Jones, 2019;Brauer & Curtin, 2018) was submitted to a buildmer function in the buildmer package (v.1.3, Voeten, 2019, as used in e.g., Johns & Steuck, 2021) in R (version 4.0.3,R Core Team, 2020), which uses the (g)lmer function from the lme4 package (v 1.1.-21,Bates, Maechler, Bolker & Walker, 2015).Starting from the maximal model, and using backwardfitting model selection procedure, the buildmer function systematically simplifies the random slopes until convergence in addition to using likelihood ratio tests (LRTs) to examine the contribution of random slopes to the fit of the model (one of the common methods to test model fit, Matuschek, Kliegl, Vasishth, Baayen & Bates, 2017, p. 308).This systematic procedure is not based on decisions made by the researcher (e.g., avoids the need to determine the cutoff point for component weights selection in a PCA on the random structure, Bates, Kliegl, Vasishth & Baayen, 2015), and is fully replicable from the data.The buildmer function further tests the contribution of each fixed effect to the model fit via a chi-squared test on the residual sum of squares of each model.P-values for all fixed effects were determined based on Satterthwaite degrees of freedom using the lmerTest package (v.3.1-0, Kuznetsova et al., 2017), or the Wald degrees of freedom for binomial distribution.When necessary, to probe interactions and examine pairwise comparisons, the selected model was refitted using (g)lmer, and followed by the testInteractions function from the phia package (v.0.2-1, Martinez, 2015) or with the contrast function from the emmeans package (v. 1. 5. 2-1, Lenth, 2020) with Bonferroni adjustments of multiple comparisons.In what follows, significance of main effects was derived from the anova function for each model (see Supplementary Materials, S3).Detailed models are presented in the Tables following the summary function.Intraclass correlations were computed using the icc function in the performance package (Lüdecke, Makowski, Waggoner & Patil, 2020).

Performance across cycles
Due to the low number of participants who completed the 4 th cycle, we focused on comparisons across the first three cycles with the 22 participants who completed these cycles.To examine modulations in performance across time and practice, the effect of Cycle (1, 2, 3, with 1 as the reference) and its interaction with Word Type were added to the model, as well as by-participant and by-item slopes for Cycle.In both the error rate and the RT data, there was a significant effect of Cycle  5).Pairwise comparisons with Bonferroni corrections for multiple comparisons revealed that all differences among the cycles were significant, except for the error rate difference between Cycle 2 and Cycle 3. Further, there was a significant effect of Word Type

Bilingualism: Language and Cognition 555
Critically, the effects did not interact 4 .Thus, although performance significantly improved from cycle to cycle, the difference between Cognates and Controls, or between False-Cognates and Controls, did not vary by learning cycle.

Individual differences
We examined whether overall FL vocabulary learning was modulated by individual differences and further, whether the effects of semantic-form overlap were modulated by these individual differences.We focused on three key predictors: Phonological Short-Term Memory, L1 Verbal Fluency, and L1 Use ratings.Because these variables were not significantly correlated with each other (see Table 6), they were jointly entered into the analyses, after they have been normalized.

Learning to criterion
We examined the correlation between each of the three individual differences measures and the time it took participants to reach the learning criteria.As can be seen in Table 6, overall performance was correlated with phonological short-term memory and L1 verbal fluency, such that participants with increased phonological short-term memory and L1 verbal fluency reached the learning criterion with fewer cycles.Interestingly, performance on the False-Cognates was specifically modulated by phonological shortterm memory, such that individuals with increased phonological short-term memory reached the learning criterion for these items with fewer cycles.

Performance by cycle
For cycles with enough participants (1 through 3), in each measure (error rate and RT) we specified a model including the effects of the three individual difference predictors and their interactions with Cycle and with Word Type.A maximal model including these interactions as well as by-participant and by-item intercepts, and a by-participant slope for Word Type and by-participant and by-item slopes for Cycle, was submitted to a buildmer function in the buildmer package (v. 1.3, Voeten, 2019) in R (version 3.6.1,R Core Team, 2019).With this function, the contribution of each fixed effect to the model fit is examined in a 'leave-one-out' backward procedure, such that the model is compared to a model without one of the fixed effects via a chi-squared test on the residual sum of squares of each model (similar to an anova function in R for model comparisons). 5See Supplementary Materials (Supplementary Materials, S5) for full details of these selected models from the summary function (note that Control items are set as the reference for the Word Type effect), and Supplementary Materials (Supplementary Materials, S3) for details from the anova function.

Cycle 1
In the error rate analysis, phonological short-term memory predicted performance (F(1) = 16.09,p < .001)such that individuals with higher phonological short-term memory learned better To verify that the backward stepwise selection procedure we used did not inflate other terms (see Sonderegger, Wagner & Torreira, 2018) we repeated the analysis while keeping the Word Type by Cycle interaction in the model.These analyses again showed no significant interaction (Errors: F(4) = 1.69, p = 0.15; RT: F(4,58.38)= 0.38, p = 0.82) with no change in the pattern of significant effects of Word Type and Cycle.

5
Using the forward selection procedure, instead of the backward selection direction for evaluating the contribution of each fixed effect, resulted in the exact same pattern of significant effects, with one exception.In the error data analysis of Cycle 2, the main effect of L1 verbal fluency reached significance (F(1) = 4.73, p = 0.03).
Further, including all predictors in the model, rather than removing terms (in a forward or backward procedure), resulted in the same pattern of significant effects, with two exceptions.First, in the error rate of Cycle 1, the difference between falsecognates and controls was modulated by Phonological short-term memory (b = −0.27,SE = 0.14, z = −1.97,p = 0.049), such that the difference was larger for individuals with lower phonological short-term memory (M Control = 0.29 vs M False-Cognates = 0.36 for individuals with lower phonological short-term memory (Z = −2), but M Control = 0.08 vs M False-Cognates = 0.04 for individuals with higher phonological short-term memory (Z = 2)).This reduced difference with increased phonological short-term memory likely reflects near ceiling performance in error rates for individuals with higher phonological short-term memory.Second, in the error rate analysis of Cycle 2, the effect of L1 verbal fluency reached significance (F(1) = 5.60, p = 0.02).Critically, the modulation of the difference between Cognates and Controls by phonological short-term memory remained significant (b = 1.25, SE = 0.61, z = 2.05, p = 0.041).
than those with lower phonological short-term memory (for Control reference items b = −0.51,SE = 0.13, z = −3.97,p < .001).There was no interaction, however, between phonological shortterm memory and the relevant contrasts.

Cycle 2
In the error rate analysis, individuals with higher L1 use performed with fewer errors than those with lower L1 use (F(1) = 6.54, p = 0.01, for Control items b = −0.48,SE = 0.18, z = −2.65,p = 0.008).In addition, individual differences in phonological short-term memory modulated the difference between Cognates and Controls (b = −1.09,SE = 0.49, z = 2.19, p = 0.03), such that it was larger for individuals with lower phonological short-term memory (see Figure 2).Specifically, whereas performance on Control words improved with higher phonological short-term memory, performance on Cognates remained stable, potentially due to near ceiling performance.
In the RT analysis, the difference between False-Cognates and Controls was modulated by phonological short-term memory (b = −0.07,SE = 0.03, df = 1672, t = −2.57,p = 0.01).Increased phonological short-term memory was associated with a larger advantage for False-Cognates over Control items (see Figure 3).Whereas RTs for False-Cognates improved with phonological shortterm memory, performance on Control items remained unchanged.At the same time, the difference between False-Cognates and Controls was modulated by L1 verbal fluency (b = 0.06, SE = 0.02, df = 1672, t = 2.56, p = 0.01), such that increased L1 verbal fluency was associated with faster processing of Control items but not of False-Cognates, such that the difference between them was reduced (see Figure 4).

Cycle 3
Phonological short-term memory modulated the error rate difference between Cognates and Controls (b = 1.03,SE = 0.39, z = 2.62, p = 0.009).In particular, as in Cycle 2, whereas individuals with lower phonological short-term memory exhibited fewer errors for Cognates relative to Controls (for individuals 1 SD below the mean M Control = 0.15 vs. M Cognate = 0.01), there was no difference for those with higher phonological short-term memory (for individuals 1 SD above the mean M Control = 0.09 vs. M Cognate = 0.04), likely due to near ceiling performance on this measure on Cognate items.The (lack of) difference between False-Cognates and Controls was not modulated by individual differences.

Backward influence analyses
To examine whether learning of Arabic vocabulary influenced processing of Hebrew words, indicative of backward crosslanguage influences, we examined learners' performance in a visual lexical decision task on Hebrew letter strings.Data from ten participants were excluded because they did not reach the 80% learning criteria in one or more of the word types; data from one additional participant were excluded because they reported incorrectly pressing the buttons at the beginning of the lexical decision task.Thus, data from 43 participants that reached the learning criterion (80% success) in all word types were analyzed for the backward influences task.
To verify whether the difference between False-Cognates and Controls for the learners indicate the presence of backward influences, we compared their performance to that of another group of 30 native Hebrew speakers who did not participate in the learning paradigm (see Supplementary Materials, S6 for background characteristics and comparisons to the learner group).These analyses including both groups (see Table 8) revealed no effects in the error rate data, but a main effect of Word Type in the RT data (F(2,50.85)= 5.78, p = 0.005), with faster responses to Control relative to False-Cognate items (b = 0.02, SE = 0.01, df = 50.92,t = 3.12, p = 0.003), and no difference between Control and Cognate items (b = −0.00,SE = 0.02, df = 50.69,t = −0.11,p = 0.91).The learners responded marginally6 more slowly (F (2,76.49)= 3.38, p = 0.07), likely because this was the last task in their learning and testing protocol.Critically, the interaction between the two factors was marginally significant (F(2,49.08)= 3.13, p = 0.05), but the specific difference between False-Cognates and Control items was modulated by group (b = 0.06, SE = 0.03, df = 49.33,t = 2.35, p = 0.023).Pairwise comparisons with Bonferroni corrections revealed that the difference between False-Cognates and Controls was significant for the learner group (value = −0.07,χ 2 = 14.20, p < .001),but not for the non-learner group (value = −0.01,χ 2 = 0.22, p = 1.00, see Figure 5).Thus, learning the Arabic meaning of the phonological forms of False-Cognates to an 80% accuracy criterion resulted in backward influences in L1 processing immediately after learning.A Cognate facilitation effect, however, was not observed in L1 Hebrew.
We further examined whether within the learner group, individual differences modulated the effect.These analyses revealed that participants' overall learning score from the translation recognition test and the three individual difference predictors of interest (phonological short-term memory, L1 verbal fluency, L1 use) did not modulate the difference across word types.

Discussion
In the current study, we investigated FL vocabulary learning, focusing on learning FL words which overlap in form and meaning (cognates), or in form only (false-cognates), with learners' L1.Furthermore, we examined whether individual differences in phonological short-term memory, L1 verbal fluency and L1 use modulate the learning process across the different word types.Finally, we tested for backward influences on learners' L1 following the learning of new meanings to existing forms (falsecognates).The results of the study show that cognates were learned more quickly, such that participants reached the 80% learning criterion with the fewest number of learning and testing cycles.Cognates were also processed more quickly and accurately than controls in the recognition test.The cognate advantage was more pronounced for individuals with lower phonological shortterm memory abilities.Critically, whereas overall performance suggests no difference in learning false-cognates and control items, the individual difference analyses reveal that individuals with higher phonological short-term memory abilities and those with lower L1 verbal fluency abilities exhibited a significant advantage for false-cognates over control items.Finally, there was evidence for backward influences of learning false-cognates, such that false-cognates were responded to more slowly than control words following learning.We discuss each of these findings and their implications in the following sections.

The cognate advantage in learning
In concurrence with previous studies, cognate words were easier and faster to learn compared to control words (e.g., Lotto & De Groot, 1998).As expected, participants reached the learning criterion for cognates with the fewest number of learning and testing cycles and performed more quickly and more accurately in the recognition test on cognate items.Of relevance, whereas previous studies typically focused on cognate learning between languages that share orthography (Lotto & De Groot, 1998;De Groot & Keijzer, 2000;Raboyeau et al., 2010), the current study demonstrated the cognate learning advantage when learning a FL via phonology, with no reliance on a shared orthographic system (see also Ghazi-Saidi & Ansaldo, 2017).As such, the findings  12) 605 ( 13) 599 ( 10) 602 ( 6) 717 ( 7) Note: Standard errors appear in parentheses.underscore the generalizability of the effect and highlight the role of phonological overlap in mediating this advantage.Moreover, although the cognates used in the current study substantially overlap in form across Hebrew and Arabic, their phonological realization is not identical.In fact, none of the 14 cognates used would be considered an identical cognate (see Supplementary Materials, S1).Nonetheless, cognate words were learned substantially better than non-cognates.By examining individual difference modulations of FL learning we observed a greater cognate advantage for individuals with lower phonological short-term memory.However, we believe this is because performance on cognate items at this point was near ceiling for all participants.As seen in Figure 2, increased phonological short-term memory was associated with fewer errors for control items, but no such facilitation was observed for cognate items because performance on these easier items was already at ceiling.

Individual differences in learning of false cognates
The more critical aim of the current study was to test whether false-cognates are easier or harder to learn compared to control words.The results show no overall difference between learning false-cognates and control words, as revealed by the number of cycles required to meet the learning criterion, or by the latency and error rate data in the recognition tests.
The comparable overall performance on false-cognates and control words may reflect the operation of two opposing mechanisms, a form facilitation and a meaning competition.Specifically, the overlap in form across the L1 and the to-be-learned words may facilitate learning compared to control items (Mulík et al., 2019;Marecka, Szewczyk, Otwinowska, Durlik, Foryś-Nogala, Kutyłowska & Wodniecka, 2020).At the same time, because the meanings of false-cognates differ from those of the L1, meaning competition for these items may cause difficulty in learning falsecognates (Rodd et al., 2012).These two mechanisms may both be at play and may cancel each other out, leading to an overall null effect.Indeed, Ghazi-Saidi and Ansaldo (2017) similarly observed no learning accuracy or RT difference between false-cognates and controls, but found differential brain recruitment when processing the two types of items.Thus, the comparable performance on false-cognates and control words may not be supported by the same learning mechanism.Furthermore, Fang et al. (2017) exemplified differences in the time course of these opposing mechanisms within a language, such that initial form facilitation was followed by subsequent meaning interference.It is possible that the current study taps a point in time at which the two processes balance out.This claim is further supported by the individual differences results.Specifically, we observed that processing of false-cognates was modulated by phonological short-term memory.First, individuals with higher phonological short-term memory required fewer learning cycles to learn false-cognates compared to those with lower phonological short-term memory.Second, individuals with higher phonological short-term memory were able to recognize false-cognate words significantly more quickly than control items in the 2 nd cycle of learning.This finding can be explained by considering the suggestion that learning of false-cognates is affected both by facilitation due to the overlap in phonological form and by competition between the different semantic representations (Fang et al., 2017).We propose that individuals with higher phonological short-term memory relied to a greater extent on the phonological form, and thus exhibited an advantage for false-cognates over control words.
At the same time, we observed learning modulations with L1 verbal fluency.Specifically, increased L1 verbal fluency was expected to be associated with better learning, as seen in Figure 4 for control items.Critically, this improved learning was not observed for false-cognates.We propose that learning of falsecognates is not facilitated by increased verbal fluency because of increased meaning competition for individuals with higher verbal fluency.This sensitivity to competition for individuals with higher verbal fluency is consistent with the findings of Degani and Goldberg (2019, Figure 4), where increased verbal fluency was associated with better learning of unambiguous (control) items, but not of ambiguous items.As a result, in that study the ambiguity disadvantage in learning was more pronounced for individuals with higher verbal fluency.Here, we observed that individuals with lower verbal abilities exhibited a falsecognate advantage over control words, as they were able to benefit from form facilitation, and did not rely on meaning enough to suffer from competition.Individuals with higher verbal abilities did not show a difference because for them, meaning competition offset the form facilitation.
This finding is important for two reasons.First, it shows that the lack of overall difference in learning false-cognates and controls likely obscures a substantial difference in the processes involved in learning these different types of items (Ghazi-Saidi & Ansaldo, 2017).Learners with different cognitive and linguistic resources may be differentially affected by the overlap in form and not in meaning.Second, the finding underscores the importance of considering individual differences in learners' prior knowledge and abilities and shows the merits in jointly considering item and learner characteristics (Degani & Goldberg, 2019).
Our findings demonstrate that the advantage for learning false-cognates over control words may become apparent under some conditions, but not others, but critically does not reverse into a false-cognate disadvantage in learning.Converging evidence comes from a recently published paper by Marecka et al. (2020), who examined learning of Polish non-words in association with pictures.These non-words could resemble the Polish word for the associated picture (i.e., cognates), resemble a different Polish word (i.e., false-cognates) or be different from existing Polish words (i.e., non-cognate controls).Of relevance, their findings show no difference in learning false-cognates versus noncognates in a recognition task, but a false-cognate advantage in a production task.The authors argue that the interference with L1 meaning was either nonexistent or outweighed by the form overlap facilitation, when production of the form was examined.

Individual differences in FL learning
Furthermore, the current findings show an important overall relation between FL vocabulary learning on the one hand, and phonological short-term memory and L1 verbal fluency on the other.Consistent with previous work (Degani & Goldberg, 2019), we found that phonological short-term memory was positively correlated with overall learning, such that individuals with higher phonological short-term memory required fewer learning cycles to reach the learning criterion and performed the translation recognition tests more accurately across cycles.
We further found that participants' verbal fluency in their L1 was positively linked to learning, as reflected by both the number of learning cycles to criterion, and the translation recognition RT data across cycles.This finding is consistent with previous research where verbal fluency in the language through which learning took place was positively linked to learning (Degani & Goldberg, 2019).Notably, however, the verbal fluency measure may not capture only linguistic abilities, but also domain general abilities such as executive functions (Jurado & Rosselli, 2007;Friesen et al., 2015).Indeed, in the current study, we observed only low correlations between the L1 verbal fluency score and the subjective L1 and L2 proficiency and use measures (see also Degani & Goldberg, 2019), but this may be due to limitations of the self-report measure (Tomoschuk et al., 2019).Future studies in which additional measures of executive control abilities are collected would shed light on the mechanisms underlying the relation between verbal fluency and FL vocabulary learning.Moreover, executive functions may be of special relevance in the case of false-cognate learning.In particular, false-cognates require the management of meaning competition between the two languages.Therefore, learning and processing of these items may recruit executive control to a greater degree than control words.Indeed, Grant et al. (2015) tested for the role of cognitive control abilities amongst English speakers learning false-cognates (interlingual homographs) and unambiguous Spanish (L2) words throughout an academic year.The findings showed that inhibitory control was more critical in early rather than advanced stages of L2 learning (consistent with this, Prior et al., 2017 did not observe executive control modulations of falsecognate processing among proficient Arabic-Hebrew bilinguals).Thus, future studies in which individual differences in executive control are collected may be of special interest in the case of false-cognate learning.
To the extent that L1 verbal fluency is taken to reflect linguistic abilities, the observed relation between verbal fluency and learning is informative with respect to the debate over whether L1 verbal fluency is linked to FL learning due to a common proficiency construct, or due to direct transfer.If L1 verbal fluency reflects a common proficiency construct (e.g., Cummins, 1979Cummins, , 1991; for a cognitive variant, see Geva, 2014), the difference between individuals with lower and higher abilities was predicted to be larger under more demanding learning conditions: namely, falsecognates and controls, but not cognates.The data shows that although L1 verbal fluency did modulate the difference between false-cognates and controls, it was mostly the control items that were learned better with increased L1 verbal fluency.According to transfer accounts, by which the advantage of individuals with higher L1 verbal fluency stems from their specific reliance on L1 phonological and semantic representations in their lexicon (Gathercole, 2006), we expected individuals with higher L1 verbal fluency to show a larger advantage for cognates because these items overlap in both form and meaning across languages.We further expected these individuals to show a decrement in processing false-cognates because of the misalignment in meaning across languages.The findings support the second of these predictions, but not the first, because processing of cognates does not appear to vary by L1 verbal fluency in the current study.Our results highlight the possibility that both aspects are in fact at play: prior language abilities may facilitate FL learning both because they allow for direct transfer, and because they indirectly reflect a more general enhanced language ability of the individual (for further discussion see e.g., Hirosh & Degani, 2018).
Finally, the current findings exemplify not only that FL learning is modulated by individual differences, but also that these individual differences may differentially affect learning of different types of words.Nonetheless, limitations in power may have obscured additional variability of interest.Thus, future studies incorporating larger samples may prove useful in examining such interactions, especially because the number of items that 562 Mariana Elias and Tamar Degani learners can successfully learn may be limited in single session learning protocols of the type implemented here.

Backward influence
The third and final aim of this study was to test whether learning new meanings to already established forms (false-cognates) would result in backward influences, such that the newly learned meanings would affect processing of the word form in the L1.Previous findings demonstrated backward influences as a result of altering L1 mappings following novel word learning within a language (Davis, Di Betta, Macdonald & Gaskell, 2009;Fang & Perfetti, 2019;Malt et al., 2016;Rodd et al., 2012;Maciejewski et al., 2020), and thus, we expected learners to respond to false-cognates as ambiguous Hebrew words entailing more than one meaning after having learned new meanings to these words.At the same time, we expected cognates to enjoy an advantage in processing due to the convergence of meanings from both languages.The results of the current study provide initial evidence for such backward influences, in that lexical decision latencies to false-cognate Hebrew words were slower immediately after learning compared to control items.This increased processing time was not present for Hebrew speakers who did not learn the Arabic vocabulary.Cognate words, in contrast, were processed similarly by the two groups, suggesting no cognate facilitation immediately following learning.Successful learning of false-cognate words to an 80% criterion was sufficient to lead to a false-cognate interference in the L1.Thus, having learned a competing meaning in a FL to the same phonological form made it slightly more difficult for native Hebrew speakers to deem these words as Hebrew words in a lexical decision task.Notably, the current study may have underestimated the presence of backward influences because these were examined immediately following learning.Previous work suggests that knowledge consolidation may be necessary for backward influences of the type examined here to appear.For instance, in a study conducted by Davis et al. (2009), adult participants learned non-words as novel words (e.g., alcohin) and completed a lexical decision task on the day of learning and on the following day.Results showed competition between these novel non-words and similar-sounding established English words (e.g., alcohol) only in the day following learning, but not on the same day as learning.Similarly, Dumay and Gaskell (2007) demonstrated that a consolidation period including a night sleep is necessary in order to observe lexical competition effects of newly learned spoken forms (see also Dumay & Gaskell, 2012).Moreover, consolidation in the L2 has been suggested to require more time than consolidation of L1 novel words (Lindsay & Gaskell, 2010).Future studies in which participants will be given the opportunity for overnight consolidation will test the possibility that false-cognates and cognates learned in a FL can similarly impact processing of L1 words to a greater extent after consolidation.

Conclusion
The current study shows that the overlap of form and meaning across the to-be-learned FL vocabulary and existing linguistic knowledge impacts vocabulary learning, and critically that these effects vary with learner characteristics.While all learners exhibited an advantage in learning cognates, these effects were larger for individuals with lower phonological short-term memory abilities, who were further from ceiling performance.Further, while there was no overall difference in learning false-cognates vs. control words, likely reflecting the operation of opposing mechanisms of form facilitation and meaning competition, individuals with higher phonological short-term memory and those with lower L1 verbal fluency exhibited a learning advantage for falsecognates over control words.Hence, the current findings carry important implications for educational settings.Specifically, although this study focused on learning in laboratory rather than natural settings (see Tokowicz & Degani, 2015 for discussion), in light of the variability in the learning process observed here, vocabulary learning in educational settings should aim to take into account each individual's linguistic and cognitive abilities in order to provide them with optimal learning opportunities.Further, the importance of considering each word type separately is highlighted by recent work showing that some instructional conditions (e.g., language of instructionnamely, learning via the L1 or the L2 among bilinguals learning a FL) affects learning of false cognates and control words, but not cognate words (Hirosh & Degani, 2021).Together, these findings highlight the relevance of jointly taking into account item-based and learner-based characteristics and differences in both linguistic and cognitive abilities in the early stages of FL learning.Finally, the study provides initial novel evidence for backward influences, in that processing of L1 words was affected immediately after learning competing FL meanings to L1 forms, although no substantial consolidation phase has taken place.The findings thus highlight the importance of cross-language mapping in the processes involved in vocabulary learning and the emergence of the bilingual lexicon.

Fig. 1 .
Fig. 1.The effect of Word Type on error rate (a) and RT (b) in cycle 1 (estimated means, with error bars representing SE).

Fig. 2 .
Fig. 2. Estimated difference between Cognate and Control items as a function of Phonological Short-Term memory in the error rate data of Cycle 2.

Fig. 3 .
Fig. 3.Estimated difference between False-Cognates and Control items as a function of Phonological Short-Term memory in the RT data of Cycle 2.

Fig. 4 .
Fig. 4. Estimated difference between False-Cognates and Control items as a function of L1 Verbal Fluency in the RT data of Cycle 2.

Fig. 5 .
Fig. 5.Estimated difference between False Cognates and Control items in the RT data of the Lexical Decision Task as a function of Group.

Table 1 .
Participants background information.
Note: L2* is the second most proficient language reported by the participants, following Hebrew.It varied across participants, but was never Arabic, and was English for 47 of the participants.a Language proficiency reflects the average proficiency ratings in reading, writing, talking, and speech comprehension rated on a scale of 0 (lowest) to 10 (highest).b Language use reflects the average use in reading, writing, conversation, internet, listening, and TV watching rated on a scale from 0 (lowest) to 10 (highest).

Table 2 .
Mean item characteristics as a function of word type.
Table 3 provides mean Error Rate and Response Times (RTs) for correct responses by Word Type (Cognate, False-Cognate, and Control) per Cycle.

Table 3 .
Error rate and RT for each word type per cycle.
Note: N indicates the number of participants per cycle.Standard errors appear in parentheses.

Table 4 .
Summary of the Translation Recognition test as a function of Word Type in Cycle 1 (n=54).See Supplementary Materials (S4) for the results of other cycles.

Table 5 .
Summary of the Translation Recognition test as a function of Word Type and Cycle.
Note: ± p < 0.1 * p < .05;** p < 0.01; *** p < 0.001.Intraclass correlations may not be accurate in the presence of random slopes.The overall ICC for the error rate model (adjusted ICC) is 0.47 and for the RT model it is 0.44.FC refers to false-cognates.

Table 6 .
Pearson correlations among the individual difference measures and learning to criterion.

Table 7 .
Error Rate and RT on correct responses in the Hebrew lexical decision task as a function of Lexicality, Word Type and Group.

Table 8 .
Summary of the Lexical Decision task as a function of Word Type and Group.Note: ± p < 0.1 * p < .05;** p < 0.01; *** p < 0.001.Intraclass correlations may not be accurate in the presence of random slopes.The overall ICC for the RT model (adjusted ICC) is 0.25.FC refers to false-cognates.