Hostname: page-component-7dd5485656-glrdx Total loading time: 0 Render date: 2025-10-25T16:15:56.535Z Has data issue: false hasContentIssue false

What makes a cognate? Implications for research on bilingualism

Published online by Cambridge University Press:  14 May 2024

Tanja C. Roembke*
Affiliation:
Cognitive and Experimental Psychology, Institute of Psychology, RWTH Aachen University, Aachen, Germany
Iring Koch
Affiliation:
Cognitive and Experimental Psychology, Institute of Psychology, RWTH Aachen University, Aachen, Germany
Andrea M. Philipp
Affiliation:
Cognitive and Experimental Psychology, Institute of Psychology, RWTH Aachen University, Aachen, Germany
*
Corresponding author: Tanja C. Roembke; Email: tanja.roembke@psych.rwth-aachen.de
Rights & Permissions [Opens in a new window]

Abstract

Cognates are studied in many psychological studies of bilingual language processing. Despite their frequent use, there is no clear operationalized definition of what constitutes a cognate. We conducted a literature search in three major journals to better understand how cognate status is typically defined and operationalized. In these journals, we analyzed similarity of cognate and non-cognate stimuli. We found that approximately 60% of the reviewed studies operationalized cognate status empirically. Stimulus analyses revealed a similarity continuum between cognates and non-cognates without a consistent cut-off. Based on these results, we make recommendations for future research.

Information

Type
Research Notes
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2024. Published by Cambridge University Press

According to Crystal's (Reference Crystal2008) “A Handbook of Linguistics and Phonetics,” cognates are words that are “historically derived from the same source as another […] form” (p. 83). Due to this shared etymological background, cognates are often similar in how they are pronounced and written across languages (A. Costa et al., Reference Costa, Caramazza and Sebastián-Gallés2000). Examples for cognates are GRAVE (/greiv/Footnote 1) and GRAB (/graːp/) or FISH (/fɪʃ/) and FISCH (/fɪʃ/) (English/German). Cognates can range from being identical across languages to having more limited overlap (Schepens et al., Reference Schepens, Dijkstra and Grootjen2012; Vanlangendonck et al., Reference Vanlangendonck, Peeters, Rueschemeyer and Dijkstra2020). Importantly, studying cognates can help us understand how languages are represented and interact with each other during language acquisition and processing (Dijkstra et al., Reference Dijkstra, Miwa, Brummelhuis, Sappelli and Baayen2010; Guediche et al., Reference Guediche, Baart and Samuel2020; Schepens et al., Reference Schepens, Dijkstra and Grootjen2012). In this context, cognates are both studied as the focal point of interest (e.g., are cognates represented differently in the bilingual mind than other types of words?; Dijkstra et al., Reference Dijkstra, Miwa, Brummelhuis, Sappelli and Baayen2010) as well as a proxy to better understand the impact of cross-language similarity of words on bilingual language processing (e.g., does overlap of translation-equivalent words facilitate language naming and switching?; A. Costa et al., Reference Costa, Caramazza and Sebastián-Gallés2000; Declerck et al., Reference Declerck, Koch and Philipp2012; Hoshino & Kroll, Reference Hoshino and Kroll2008; Li & Gollan, Reference Li and Gollan2018; Muylle et al., Reference Muylle, Van Assche and Hartsuiker2022). Despite a continued strong interest in cognates and a general consensus that cognates are words from two different languages that share meaning and form (orthographic and/or phonological), the exact definition of what a cognate is varies widely within the literature. Thus, it is critical to have a closer look at how cognates – and non-cognates – are defined across different studies. The goal of this research note is to raise awareness of how differently cognate status has been operationalized across psycholinguistic studies and to help establish guidelines for future research on cognates.

1. Cognate definitions and cross-language similarity for cognate stimuli within the literature

To better understand how cognates are conceptualized within the field of psychology, we conducted a literature review and analysis. We did a Scopus searchFootnote 2 (www.scopus.com) to find articles that included the word “cognate” in its title, abstract or keywords. We further limited our search to the categories “social sciences” and “journal articles” to have a clear focus on experimental research. These restrictions resulted in over 1,700 articles, with a substantial portion being clearly not relevant. Thus, we decided to limit our analysis to three journals that commonly publish high-quality psycholinguistic and experimental research on bilinguals (Bilingualism: Language and Cognition, Journal of Memory and Language and Journal of Experimental Psychology: Learning, Memory and Cognition). By doing so, we were able to guarantee that the results presented here are representative for experimental, cognitive and psycholinguistic research on bilingualism. These restrictions resulted in 91Footnote 3 articles (published between 1989 and 2022), from which we then extracted how cognates/non-cognates were defined and, if the study was experimental, operationalized as well as other more general information (e.g., task performed by participants, languages investigated). When available, we entered stimuli into a common document and calculated cross-language similarity of word pairs (see more on this later). The results of the literature search and analysis are publicly available at: https://osf.io/x9ur3/?view_only=.

To summarize, the majority of reviewed articles was experimental, including tasks such as picture naming, lexical decision or sentence reading. The articles studied a relatively diverse set of languages, even as a high proportion focused on Western languages with English, Spanish and Dutch being the most frequent. Most, but not all, articles included a definition for cognates in their introduction. In contrast to the linguistic definition included earlier (Crystal, Reference Crystal2008), these definitions commonly focused on the shared meaning and form overlap of cognate words, sometimes highlighting phonological similarities (e.g., Colomé & Miozzo, Reference Colomé and Miozzo2010; Ramon-Casas et al., Reference Ramon-Casas, Fennell and Bosch2017; Sudarshan & Baum, Reference Sudarshan and Baum2019), orthographic similarities (e.g., Cop et al., Reference Cop, Dirix, Van Assche, Drieghe and Duyck2017; Dijkstra et al., Reference Dijkstra, Miwa, Brummelhuis, Sappelli and Baayen2010, Reference Dijkstra, Van Hell and Brenders2015) or both (e.g., Comesaña et al., Reference Comesaña, Ferré, Romero, Guasch, Soares and García-Chico2015; Miwa et al., Reference Miwa, Dijkstra, Bolger and Baayen2014; Pureza et al., Reference Pureza, Soares and Comesaña2016; Robinson Anthony & Blumenfeld, Reference Robinson Anthony and Blumenfeld2019), often depending on the task that participants had to perform. That is, a definition was more likely to highlight cognates' orthographic overlap if words were presented visually/had to be written during the study's experiment(s) and their phonological overlap if words were presented auditorily/had to be spoken. Dijkstra et al. (Reference Dijkstra, Grainger and Van Heuven1999) note that most studies define cognates as having identical orthographic forms. Based on our review, this is not true anymore with most definitions allowing for some mismatches in form. Variations in the definitions of cognates make it harder to compare experimental results across studies (Dijkstra et al., Reference Dijkstra, Grainger and Van Heuven1999).

We then looked at how many articles operationalized cognate status (i.e., the extent to which translation-equivalent word pairs shared orthographic and/or phonological form). Here, we defined operationalization broadly, meaning that authors verified cognate status with an empirical procedure (e.g., by collecting similarity ratings for word pairs) and reported this as part of the manuscript. Interestingly, shared overlap was operationalized only in approximately 60% of the experimental articles that used cognates as stimuli (43/72 manuscripts). The remaining studies did not specify how cognates were selected. Most frequently, the latter meant that it was simply stated that there would be two different stimulus groups with no further reference to cross-language similarity (e.g., Declerck et al., Reference Declerck, Koch and Philipp2012; Li & Gollan, Reference Li and Gollan2018; Vorwerg et al., Reference Vorwerg, Suntharam and Morand2019).

Operationalizations of cognate status varied greatly across studies, though three methods were most common: (1) similarity was quantified by using some kind of norming procedure to obtain similarity ratings (i.e., people either classified word pairs as cognate/non-cognate or rated them on a similarity scale; N = 13), (2) (normalized) Levenshtein distance (LD) (N = 14; Levenshtein, Reference Levenshtein1966; Schepens et al., Reference Schepens, Dijkstra and Grootjen2012) or (3) Van Orden's (Reference Van Orden1987) graphemic similarity algorithm (N = 11; four studies used more than one method).

LD is a measure of string similarity denoting the minimum number of operations (substitution, insertion or deletion) that need to be performed to turn one word into another (Levenshtein, Reference Levenshtein1966). It can be used to quantify both orthographic as well as phonological similarity (Schepens et al., Reference Schepens, Dijkstra and Grootjen2012, Reference Schepens, Dijkstra, Grootjen and van Heuven2013). For example, LOUSE (English; /laus/) and LAUS (German; /laus/) have an orthographic LD of two because one letter has to be substituted and another subtracted/added. Similarly, its phonological LD is zero because they are pronounced identically. LD can be normalized with a simple formula that takes the maximum length of the words into account (Schepens et al., Reference Schepens, Dijkstra and Grootjen2012). Van Orden's (Reference Van Orden1987) graphemic similarity algorithm is calculated based on whether two words share the first and last letter or not, the number of pairs of letters shared (both forward and in reverse order), number of single letters shared across words as well as word length measures. Possible values for the normalized LD and the graphemic similarity algorithm range from 0 to 1, and an arbitrary cut-off point of .5 to classify cognate status was used for each (Arêas Da Luz Fontes & Schwartz, Reference Arêas Da Luz Fontes and Schwartz2015; Gullifer & Titone, Reference Gullifer and Titone2019; Schepens et al., Reference Schepens, Dijkstra and Grootjen2012) with some exceptions that implemented different thresholds (Schwartz & Tarin, Reference Schwartz and Tarin2021; Van Assche et al., Reference Van Assche, Drieghe, Duyck, Welvaert and Hartsuiker2011).

When an operationalization was included, articles often lacked the detail needed to understand the degree of similarity needed for word pairs in order to qualify as cognates or not. For example, studies reported that cognate status was verified by raters that did not participate in the main study (e.g., Ghazi-Saidi & Ansaldo, Reference Ghazi-Saidi and Ansaldo2017; Libben & Titone, Reference Libben and Titone2009; Sudarshan & Baum, Reference Sudarshan and Baum2019), but did not specify what exact ratings were necessary for a word to qualify as a cognate. Similarly, some studies listed average cross-language similarity values per stimulus category, but did not report a range or what similarity cut-off was used to determine group membership or referred to an earlier study for more details on stimulus selection. About 25% of the reviewed experimental articles included a continuous measure of word pair similarity in at least a subset of their analyses.

The findings that (1) many studies did not operationalize cognate status, (2) if they did, a range of measures plus cut-off points was used, (3) studies only reported mean similarity values but no range or cut-off for group membership, make it hard to draw any definite conclusions on how uniformly cognate status is conceptualized across studies. Thus, in a next step, we calculated the similarity of cognates and non-cognates. To operationalize orthographic similarity for studies that used a visual task or written response, we used the normalized LD (Schepens et al., Reference Schepens, Dijkstra and Grootjen2012). Normalized LD scores have been found to correlate highly with similarity ratings and can be calculated automatically for a large set of translation-equivalent word pairs (Schepens et al., Reference Schepens, Dijkstra and Grootjen2012). While normalized LD is well established as a measure of orthographic similarity, there is no such comparable index for phonological similarity (A. S. Costa et al., Reference Costa, Comesaña and Soares2022). Thus, to operationalize phonological similarity for studies that used an auditory task or spoken response, we had to focus on a smaller subset of studies for which phonological similarity values could be extracted from the recently published PHOR-in-One database (A. S. Costa et al., Reference Costa, Comesaña and Soares2022). This database includes phonological similarity scores for European Portuguese, Spanish, English and German based on the phoneme distance algorithm introduced by Schepens (Reference Schepens2010; see also Schepens et al., Reference Schepens, Dijkstra, Grootjen and van Heuven2013). We were not able to calculate phonological similarity based on published stimuli because only five studies that used an auditory task/spoken response included stimuli's phonological forms (and almost never reported its source).

We were able to extract stimuli from 47 studies that used distinct stimulus categories (approx. 25% of experimental studies did not make their stimuli publicly available). On further inspection, we had to focus our analysis on 23 studies for which we calculated stimuli's orthographic similarity and six studies for which we calculated stimuli's phonological similarity. We were not able to use all studies, as not all provided the word form for both languages, languages used different scripts (which makes it e.g., impossible to use normalized LD to quantify orthographic similarity) or the language combination examined was not included in the PHOR-in-One database.

As summarized in Table 1, median orthographic and phonological similarity was higher for cognates than for non-cognates. Even if not stated explicitly by researchers, cognate status is often operationalized by whether word pairs overall “look/sound similarly.” Likewise, non-cognates are operationalized as word pairs that “do not look/sound similarly.” Nevertheless, in practice, this means that translation-equivalent word pairs that are classified as cognates and non-cognates can vary significantly in sound and spelling similarity.

Table 1. Overview of the results for the stimulus analysis. For visual/written experiments, orthographic similarity (operationalized as normalized LD) was analyzed. For auditory/spoken experiments, phonological similarity (operationalized as normalized LD as published in the PHOR-in-One database) was analyzed

Note: Each cognate and non-cognate pair (within each study) was treated as its own data point. Cognates that share form but no meaning were included in the cognate category. There were nine studies that had both visual/written and auditory/spoken components; these were included in both average calculations (if possible). For example, reading sentences aloud would be considered to have both visual/written (visual presentation) as well as auditory/spoken (spoken production) components. The reason why the orthographic similarity range for cognates includes 0 is that FIRE/VUUR (English/Dutch; orthographic normalized LD = 0) was used once as a cognate pair in a visual task (de Groot & Nas, Reference de Groot and Nas1991). Phonological transcriptions are retrieved from the PHOR-in-One database (A. S. Costa et al., Reference Costa, Comesaña and Soares2022).

To visualize the range of similarity within each stimulus category (cognate or non-cognate), we plotted normalized LD for each word pair and task type (visual/written or auditory/spoken) across studies as histograms (see Figure 1). For visual/written tasks, it can be seen that many of the studies included highly similar cognates, whereas not all studies included non-cognates (leading to lower overall frequency counts). In addition, there was a lot of diversity in the cross-language orthographic similarity within each category, resulting in the histograms to overlap around a normalized LD of .4, with some studies including translation pairs as cognates with orthographic normalized LDs between .1 and .4 (see Figure 1A). That is, there were stimuli with the same orthographic similarity score that were in some studies classified as cognates and in others as non-cognates. For example, SHIP/SCHIFF (English/German) has an orthographic normalized LD score of .5 and was classified as a cognate in a vocabulary learning task where participants were presented with both the written and spoken forms of words (Salomé et al., Reference Salomé, Casalis and Commissaire2022); KORREL/KORN (Dutch/German) has the same score but was considered a non-cognate in a written lexical decision and spoken production task (Lemhöfer et al., Reference Lemhöfer, Spalek and Schriefers2008). Similarly, the word pair KING/KONING (English/Dutch; orthographic normalized LD = .67) was classified as a cognate (de Groot & Nas, Reference de Groot and Nas1991) and a non-cognate (Muylle et al., Reference Muylle, Van Assche and Hartsuiker2022; Poort et al., Reference Poort, Warren and Rodd2016). For auditory/spoken tasks, a similar pattern can be observed (though it should be noted that the overall contributing number of stimuli was much smaller than for visual/written tasks), with an overlap of the two categories around a phonological normalized LD of .5.

Figure 1. Histograms of normalized LD for stimuli categorized as cognates (dark gray) or non-cognates (light gray) for visual/written tasks (orthographic normalized LD; panel A) and auditory/spoken tasks (phonological normalized LD; panel B). Violin plots of difference score (cognates minus non-cognates) of normalized LD in studies that included both cognates and non-cognates for visual/written tasks (panel C) and auditory/spoken tasks (panel D). Higher, more extreme values represent larger differences in normalized LD scores between cognates and non-cognates.

We then calculated average orthographic and phonological similarity per stimulus category and a difference score (average similarity for cognates minus non-cognates) for each study that included both cognates and non-cognates (visual/written tasks: N = 14; auditory/spoken tasks: N = 6). As can be seen in Figures 1C and 1D, there was again a fair amount of variability across studies, with some contrasting word pairs that are, on average, at the extreme ends of the similarity scores (resulting in a very high difference score), whereas others concentrated on more similar ones. This variability may impact how likely it is that a study finds a difference between how cognates and non-cognates are processed by bilinguals.

To summarize, while it is clear that cognates and non-cognates differed, on average, in their cross-language similarity, our analysis also highlights that there was a considerable, potentially meaningful range in orthographic and phonological overlap across both categories. For both task types, similarity was more varied for cognates than for non-cognates, likely because cognates constitute for many of the studied language combinations the minority of translation-equivalents, leaving a smaller pool of stimuli to choose from.

2. Theoretical and practical implications for research on bilingualism

Using variable criteria and thresholds for cognates adds noise to our data and may make it harder to understand how cognates and non-cognates are processed by bilinguals (Dijkstra et al., Reference Dijkstra, Grainger and Van Heuven1999). The impact of this noise may be considerable, as cognate effects are nuanced and depend on several characteristics (e.g., list composition, bilinguals' proficiency, the context in which a stimulus is presented in; Guediche et al., Reference Guediche, Baart and Samuel2020). Effects of noisy definitions may be compounded when interactions with other variables are the research focus (e.g., when investigating how cognate status impacts language switch costs; Declerck et al., Reference Declerck, Koch and Philipp2012; Li & Gollan, Reference Li and Gollan2018). In other words, ignoring this impreciseness in cognate definitions may lead to conflicting results across studies and make it harder to develop theories of bilingual language processing.

When the focus of a study is on how cognates are processed, researchers need to actually consider if words are etymologically related given this is the linguistically accurate definition (Crystal, Reference Crystal2008). When, however, the goal of a study is to use cognates and non-cognates as a proxy of cross-language similarity (which may in turn mediate how activation flows within languages), we need to be more precise as to what we mean by these categories. Thus, we urge researchers to be more explicit in their methods as to how they determined cognate status (i.e., what criteria were used to determine a word was a cognate). In addition, a list of the items used should always be openly available (in the form of an appendix or supplementary material/an open science deposit). Finally, researchers should describe the different sets continuously (e.g., by using normalized LD; Schepens et al., Reference Schepens, Dijkstra and Grootjen2012) in the methods. If phonological similarity of words is quantified, phonological transcriptions should be reported and it should be specified where they were retrieved, given there are considerable differences in how words are pronounced and thus described phonologically in databases. For example, the English word TIGER is transcribed as /ˈtaigə(r)/ in the Langenscheidt Dictionary (commonly used by German speakers; Langenscheidt, n.d.) and as /ˈtʌɪɡə/ (British English) and /ˈtaɪɡər/ (U.S. American English) in the Oxford English Dictionary (Oxford English Dictionary, n.d.). These differences may seem subtle but would result in different similarity values as calculated by normalized LD. Table 2 provides a summary of our suggestions.

Table 2. Overview of practical suggestions for studying cognates in research on bilingualism

In general, the way forward may be to step away from using categories such as “cognates” and “non-cognates” when the real goal is to investigate the impact of phonological and orthographic similarity across words from different languages and to quantify similarity continuously instead (Broersma et al., Reference Broersma, Carter, Donnelly and Konopka2020), especially when analyzing data and/or when languages do not share many morphemes (Miwa et al., Reference Miwa, Dijkstra, Bolger and Baayen2014). In fact, analyzing cognateness as a continuous variable was not uncommon in the articles we screened (even if only in a sub-analysis). As in other areas of research, information gets lost when a continuous variable is dichotomized, sacrificing statistical power and yielding potentially misleading results (Baayen & Milin, Reference Baayen and Milin2010; Cohen, Reference Cohen1983; MacCallum et al., Reference MacCallum, Zhang, Preacher and Rucker2002). Consistent with this, Miwa et al. (Reference Miwa, Dijkstra, Bolger and Baayen2014) for example found that participants' eye movements were co-determined by phonological and semantic similarities (among other factors) but not by the dichotomous cognate status of word pairs (see for more detailed discussions on how continuous measures of cross-language similarity may enrich research on bilingual processing elsewhere, e.g., Comesaña et al., Reference Comesaña, Ferré, Romero, Guasch, Soares and García-Chico2015; A. S. Costa et al., Reference Costa, Comesaña and Soares2022; Fahey, Reference Fahey2021).

Having said that, the last suggestions assume that words' relevant similarity can be captured (easily) continuously. While (normalized) LD is a straightforward way to do so, its applicability is unfortunately narrow: it only captures string similarity of the same scripts. Additionally, even as phonological similarity can be quantified by using LD, it is less straightforward to do so (A. S. Costa et al., Reference Costa, Comesaña and Soares2022; Schepens et al., Reference Schepens, Dijkstra, Grootjen and van Heuven2013). Moreover, it may be harder to assess cross-language similarity for verbs than for nouns, as it is less clear which form should be compared (Cop et al., Reference Cop, Dirix, Van Assche, Drieghe and Duyck2017), and for words embedded in text than for isolated ones (Balling, Reference Balling2013).

One way to quantify similarity that works for all language combinations and similarity dimensions is having a separate group of people with the same language backgroundFootnote 4 as the participants of the main study rate a word pair's semantic, phonological and orthographic similarity. As mentioned earlier, this is indeed a method that has been used repeatedly (e.g., Cai et al., Reference Cai, Pickering, Yan and Branigan2011; de Groot & Nas, Reference de Groot and Nas1991; Dijkstra et al., Reference Dijkstra, Miwa, Brummelhuis, Sappelli and Baayen2010; Ghazi-Saidi & Ansaldo, Reference Ghazi-Saidi and Ansaldo2017; Titone et al., Reference Titone, Libben, Mercier, Whitford and Pivneva2011). However, there appears to be little agreement on how many independent raters are necessary for such a similarity measure to be valid (e.g., in our review, stimuli were rated by between 2 [the authors of the specific article] and 60 people with a median of 10 raters). In addition, such an approach, while universally applicable, can unfortunately be labor- and time-intensive if a high number of stimuli has to be rated (Schepens et al., Reference Schepens, Dijkstra and Grootjen2012) and/or access to the relevant participant population is limited.

3. Conclusions

Cognates are a pivotal tool when investigating how multiple known languages interact during language processing. Despite this, there is no consensus on how cognateness should be measured. As a result, a significant subset of reviewed experimental studies failed to include an operationalization of cognate status in their methods. Moreover, our analysis revealed that cognates and non-cognates differed widely in their similarity across studies, likely adding noise to the data collected and impeding our understanding of bilingual language processing. We make practical suggestions for researchers who want to study cognates in the future.

Data availability statement

Results of the literature search (annotated table and stimulus analysis; results from database search) are located at: https://osf.io/x9ur3/?view_only=

Acknowledgments

We thank Maria Bruggaier for helping with the categorization and annotation of articles that investigated cognates. Also, we thank Dr. Sarah Colby and Dr. Marc Brysbaert for helpful discussions on how to quantify phonological similarity of words. We appreciate Dr. Montserrat Comesaña and her PhD student for helpful feedback on previous versions of this manuscript.

Competing interests

None.

Footnotes

1. Pronunciation is given in International Phonological Alphabet (IPA) denotation. IPA denotations for both English and German words were always retrieved from the Langenscheidt Dictionary (Langenscheidt, n.d.).

2. The literature search was conducted on November 30, 2022 and December 1, 2022.

3. Despite the filters we used, a small number of articles was not directly relevant to the review (e.g., a corrigendum; see OSF link for details).

4. In the study by Miwa et al. (Reference Miwa, Dijkstra, Bolger and Baayen2014), phonological similarity ratings behaved differently in analyses depending on whether Japanese–English word pairs were rated by late Japanese–English bilinguals or native speakers of English.

References

Arêas Da Luz Fontes, A. B., & Schwartz, A. I. (2015). Bilingual access of homonym meanings: Individual differences in bilingual access of homonym meanings. Bilingualism: Language and Cognition, 18(4), 639656. https://doi.org/10.1017/S1366728914000509CrossRefGoogle Scholar
Baayen, R. H., & Milin, P. (2010). Analyzing reaction times. International Journal of Psychological Research, 3(2), 1228. https://doi.org/10.21500/20112084.807CrossRefGoogle Scholar
Balling, L. W. (2013). Reading authentic texts: What counts as cognate? Bilingualism: Language and Cognition, 16(3), 637653. https://doi.org/10.1017/S1366728911000733CrossRefGoogle Scholar
Broersma, M., Carter, D., Donnelly, K., & Konopka, A. (2020). Triggered codeswitching: Lexical processing and conversational dynamics. Bilingualism: Language and Cognition, 23(2), 295308. https://doi.org/10.1017/S1366728919000014CrossRefGoogle Scholar
Cai, Z. G., Pickering, M. J., Yan, H., & Branigan, H. P. (2011). Lexical and syntactic representations in closely related languages: Evidence from Cantonese–Mandarin bilinguals. Journal of Memory and Language, 65(4), 431445. https://doi.org/10.1016/j.jml.2011.05.003CrossRefGoogle Scholar
Cohen, J. (1983). The cost of dichotomization. Applied Psychological Measurement, 7(3), 249253. https://doi.org/10.1177/014662168300700301CrossRefGoogle Scholar
Colomé, À., & Miozzo, M. (2010). Which words are activated during bilingual word production? Journal of Experimental Psychology: Learning Memory and Cognition, 36(1), 96109. https://doi.org/10.1037/a0017677Google ScholarPubMed
Comesaña, M., Ferré, P., Romero, J., Guasch, M., Soares, A. P., & García-Chico, T. (2015). Facilitative effect of cognate words vanishes when reducing the orthographic overlap: The role of stimuli list composition. Journal of Experimental Psychology: Learning Memory and Cognition, 41(3), 614635. https://doi.org/10.1037/xlm0000065Google ScholarPubMed
Cop, U., Dirix, N., Van Assche, E., Drieghe, D., & Duyck, W. (2017). Reading a book in one or two languages? An eye movement study of cognate facilitation in L1 and L2 reading. Bilingualism: Language and Cognition, 20(4), 747769. https://doi.org/10.1017/S1366728916000213CrossRefGoogle Scholar
Costa, A., Caramazza, A., & Sebastián-Gallés, N. (2000). The cognate facilitation effect: Implications for models of lexical access. Journal of Experimental Psychology: Learning Memory and Cognition, 26, 12831296. https://doi.org/10.1037/0278-7393.26.5.1283Google ScholarPubMed
Costa, A. S., Comesaña, M., & Soares, A. P. (2022). PHOR-in-One: A multilingual lexical database with PHonological, ORthographic and PHonographic word similarity estimates in four languages. Behavior Research Methods, 55(7), 36993725. https://doi.org/10.3758/s13428-022-01985-3CrossRefGoogle ScholarPubMed
Crystal, D. (2008). A dictionary of linguistics and phonetics (6th ed.). Blackwell Publishing Ltd. https://doi.org/10.1002/9781444302776CrossRefGoogle Scholar
de Groot, A. M. B., & Nas, G. L. J. (1991). Lexical representation of cognates and noncognates in compound bilinguals. Journal of Memory and Language, 30(1), 90123. https://doi.org/10.1016/0749-596X(91)90012-9CrossRefGoogle Scholar
Declerck, M., Koch, I., & Philipp, A. M. (2012). Digits vs. pictures: The influence of stimulus type on language switching. Bilingualism: Language and Cognition, 15(4), 896904. https://doi.org/10.1017/S1366728912000193CrossRefGoogle Scholar
Dijkstra, T., Grainger, J., & Van Heuven, W. J. B. (1999). Recognition of cognates and interlingual homographs: The neglected role of phonology. Journal of Memory and Language, 41(4), 496518. https://doi.org/10.1006/jmla.1999.2654CrossRefGoogle Scholar
Dijkstra, T., Miwa, K., Brummelhuis, B., Sappelli, M., & Baayen, R. H. (2010). How cross-language similarity and task demands affect cognate recognition. Journal of Memory and Language, 62(3), 284301. https://doi.org/10.1016/j.jml.2009.12.003CrossRefGoogle Scholar
Dijkstra, T., Van Hell, J. G., & Brenders, P. (2015). Sentence context effects in bilingual word recognition: Cognate status, sentence language, and semantic constraint. Bilingualism: Language and Cognition, 18(4), 597613. https://doi.org/10.1017/S1366728914000388CrossRefGoogle Scholar
Fahey, D. K. (2021). The shape of the bilingual mental lexicon: Testing the cognate continuum [Doctoral dissertation]. Retrieved from https://scholarcommons.sc.edu/etd/6399Google Scholar
Ghazi-Saidi, L., & Ansaldo, A. I. (2017). The neural correlates of semantic and phonological transfer effects: Language distance matters. Bilingualism: Language and Cognition, 20(5), 10801094. https://doi.org/10.1017/S136672891600064XCrossRefGoogle Scholar
Guediche, S., Baart, M., & Samuel, A. G. (2020). Semantic priming effects can be modulated by crosslinguistic interactions during second-language auditory word recognition. Bilingualism: Language and Cognition, 23(5), 10821092. https://doi.org/10.1017/S1366728920000164CrossRefGoogle Scholar
Gullifer, J. W., & Titone, D. (2019). The impact of a momentary language switch on bilingual reading: Intense at the switch but merciful downstream for L2 but not L1 readers. Journal of Experimental Psychology: Learning Memory and Cognition, 45(11), 20362050. https://doi.org/10.1037/xlm0000695Google Scholar
Hoshino, N., & Kroll, J. F. (2008). Cognate effects in picture naming: Does cross-language activation survive a change of script? Cognition, 106, 501511. https://doi.org/10.1016/j.cognition.2007.02.001CrossRefGoogle ScholarPubMed
Lemhöfer, K., Spalek, K., & Schriefers, H. (2008). Cross-language effects of grammatical gender in bilingual word recognition and production. Journal of Memory and Language, 59(3), 312330. https://doi.org/10.1016/j.jml.2008.06.005CrossRefGoogle Scholar
Levenshtein, V. I. (1966). Binary codes capable of correcting deletions, insertions and reversals. Soviet Physics Doklady, 10(8), 707710.Google Scholar
Li, C., & Gollan, T. H. (2018). Cognates facilitate switches and then confusion: Contrasting effects of cascade versus feedback on language selection. Journal of Experimental Psychology: Learning Memory and Cognition, 44(6), 974991. https://doi.org/10.1037/xlm0000497Google ScholarPubMed
Libben, M. R., & Titone, D. A. (2009). Bilingual lexical access in context: Evidence from eye movements during reading. Journal of Experimental Psychology: Learning Memory and Cognition, 35(2), 381390. https://doi.org/10.1037/a0014875Google ScholarPubMed
MacCallum, R. C., Zhang, S., Preacher, K. J., & Rucker, D. D. (2002). On the practice of dichotomization of quantitative variables. Psychological Methods, 7(1), 1940. https://doi.org/10.1037/1082-989X.7.1.19CrossRefGoogle ScholarPubMed
Miwa, K., Dijkstra, T., Bolger, P., & Baayen, R. H. (2014). Reading English with Japanese in mind: Effects of frequency, phonology, and meaning in different-script bilinguals. Bilingualism: Language and Cognition, 17(3), 445463. https://doi.org/10.1017/S1366728913000576CrossRefGoogle Scholar
Muylle, M., Van Assche, E., & Hartsuiker, R. J. (2022). Comparing the cognate effect in spoken and written second language word production. Bilingualism: Language and Cognition, 25(1), 93107. https://doi.org/10.1017/S1366728921000444CrossRefGoogle Scholar
Poort, E. D., Warren, J. E., & Rodd, J. M. (2016). Recent experience with cognates and interlingual homographs in one language affects subsequent processing in another language. Bilingualism: Language and Cognition, 19(1), 206212. https://doi.org/10.1017/S1366728915000395CrossRefGoogle Scholar
Pureza, R., Soares, A. P., & Comesaña, M. (2016). Cognate status, syllable position and word length on bilingual tip-of-the-tongue states induction and resolution. Bilingualism: Language and Cognition, 19(3), 533549. https://doi.org/10.1017/S1366728915000206CrossRefGoogle Scholar
Ramon-Casas, M., Fennell, C. T., & Bosch, L. (2017). Minimal-pair word learning by bilingual toddlers: The Catalan /e/-/ɛ/ contrast revisited. Bilingualism: Language and Cognition, 20(3), 649656. https://doi.org/10.1017/S1366728916001115CrossRefGoogle Scholar
Robinson Anthony, J. J. D., & Blumenfeld, H. K. (2019). Language dominance predicts cognate effects and inhibitory control in young adult bilinguals. Bilingualism: Language and Cognition, 22(5), 10681084. https://doi.org/10.1017/S1366728918001013CrossRefGoogle Scholar
Salomé, F., Casalis, S., & Commissaire, E. (2022). Bilingual advantage in L3 vocabulary acquisition: Evidence of a generalized learning benefit among classroom-immersion children. Bilingualism: Language and Cognition, 25(2), 242255. https://doi.org/10.1017/S1366728921000687CrossRefGoogle Scholar
Schepens, J. (2010). Cross-language distributions of high frequency and phonetically similar cognates [Unpublished master's thesis]. Radboud University, Nijmegen, The Netherlands.Google Scholar
Schepens, J., Dijkstra, T., & Grootjen, F. (2012). Distributions of cognates in Europe as based on Levenshtein distance. Bilingualism: Language and Cognition, 15(1), 157166. https://doi.org/10.1017/S1366728910000623CrossRefGoogle Scholar
Schepens, J., Dijkstra, T., Grootjen, F., & van Heuven, W. J. B. (2013). Cross-language distributions of high frequency and phonetically similar cognates. PLoS ONE, 8(5), e63006. https://doi.org/10.1371/journal.pone.0063006CrossRefGoogle ScholarPubMed
Schwartz, A. I., & Tarin, K. S. (2021). The impact of a discourse context on bilingual cross-language lexical activation. Bilingualism: Language and Cognition, 24(5), 879890. https://doi.org/10.1017/S136672892100016XCrossRefGoogle Scholar
Sudarshan, A., & Baum, S. R. (2019). Bilingual lexical access: A dynamic operation modulated by word-status and individual differences in inhibitory control. Bilingualism: Language and Cognition, 22(3), 537554. https://doi.org/10.1017/S1366728918000111CrossRefGoogle Scholar
Titone, D., Libben, M., Mercier, J., Whitford, V., & Pivneva, I. (2011). Bilingual lexical access during L1 sentence reading: The effects of L2 knowledge, semantic constraint, and L1–L2 intermixing. Journal of Experimental Psychology: Learning Memory and Cognition, 37(6), 14121431. https://doi.org/10.1037/a0024492Google ScholarPubMed
Van Assche, E., Drieghe, D., Duyck, W., Welvaert, M., & Hartsuiker, R. J. (2011). The influence of semantic constraints on bilingual word recognition during sentence reading. Journal of Memory and Language, 64(1), 88107. https://doi.org/10.1016/j.jml.2010.08.006CrossRefGoogle Scholar
Vanlangendonck, F., Peeters, D., Rueschemeyer, S. A., & Dijkstra, T. (2020). Mixing the stimulus list in bilingual lexical decision turns cognate facilitation effects into mirrored inhibition effects. Bilingualism: Language and Cognition, 23(4), 836844. https://doi.org/10.1017/S1366728919000531CrossRefGoogle Scholar
Van Orden, G. C. (1987). A ROWS is a ROSE: Spelling, sound, and reading. Memory & Cognition, 15(3), 181198. https://doi.org/https://doi.org/10.3758/BF03197716CrossRefGoogle ScholarPubMed
Vorwerg, C. C., Suntharam, S., & Morand, M.-A. (2019). Language control and lexical access in diglossic speech production: Evidence from variety switching in speakers of Swiss German. Journal of Memory and Language, 107, 4053. https://doi.org/10.1016/j.jml.2019.03.007CrossRefGoogle Scholar
Figure 0

Table 1. Overview of the results for the stimulus analysis. For visual/written experiments, orthographic similarity (operationalized as normalized LD) was analyzed. For auditory/spoken experiments, phonological similarity (operationalized as normalized LD as published in the PHOR-in-One database) was analyzed

Figure 1

Figure 1. Histograms of normalized LD for stimuli categorized as cognates (dark gray) or non-cognates (light gray) for visual/written tasks (orthographic normalized LD; panel A) and auditory/spoken tasks (phonological normalized LD; panel B). Violin plots of difference score (cognates minus non-cognates) of normalized LD in studies that included both cognates and non-cognates for visual/written tasks (panel C) and auditory/spoken tasks (panel D). Higher, more extreme values represent larger differences in normalized LD scores between cognates and non-cognates.

Figure 2

Table 2. Overview of practical suggestions for studying cognates in research on bilingualism