The study presented in this book focuses on the acquisition, maintenance, and change of Differential Object Marking (DOM) in Spanish, English, and Romanian in the United States. Differential Object Marking (DOM) is the overt marking of some direct objects and is widespread among languages of the world (Bossong, Reference Bossong, Wanner and Kibbee1991). DOM is an iconic procedure because the arguments that are overtly marked morphologically are more salient/prominent semantically or pragmatically than unmarked objects. In this chapter, I describe how DOM is manifested in Spanish, Romanian, and Hindi and I present current syntactic synchronic analyses of the phenomenon in these languages. I also discuss the diachronic evolution of DOM in language contact situations.
3.1 The Phenomenon
DOM is found in many languages and language families, including Turkish, Finnish, Hebrew, Spanish, Romanian, Hindi, Mongolian, Guaraní, and Amharic (Bossong, Reference Bossong, Wanner and Kibbee1991; Sinnemäki, Reference Sinnemäki2014). Many other languages, such as English, German, Japanese, and French, do not have DOM. Languages that exhibit DOM split objects into two types: those that are morphologically unmarked and those that are morphologically marked. The term “differential” indicates that marked objects alternate or co-exist with unmarked objects. The overt marking of some direct objects obeys semantic and pragmatic principles that emphasize syntactic, semantic, or pragmatic prominence. The most natural type of transitive construction is one where the subject is high in animacy and definiteness, and the object is lower in animacy and definiteness (Comrie, Reference Comrie1989). Consider the sentence in (1a).
| (1) | a. | Peter kicked a pebble. |
| b. | A pebble hit Peter. |
Typical objects like a pebble in (1a) are inanimate and non-specific while typical subjects, in our example, Peter, are human (volitional), animate, specific. Conversely, human animate, specific objects and inanimate, non-agentive, non-specific subjects, as in Peter and a pebble in (1b) are atypical (Aissen, Reference Aissen2003). In general, typical objects are unmarked in DOM-languages; atypical objects, which are more semantically and pragmatically prominent, are marked. The linguistic factors that determine DOM are animacy, referentiality, topicality, the affectedness of the object, and, to a lesser extent, the lexical semantics of the verb (Von Heusinger and Gáspár, Reference Von Heusinger and Gáspár2008), as illustrated in Figure 3.1.

Figure 3.1 Object split (marked/unmarked) and factors that determine the prominence of marked objects in different languages
In languages with unidimensional systems, only one semantic factor triggers DOM. For example, specificity triggers DOM in Amharic, Hebrew, Turkish, and Persian, while in Malayalam DOM is triggered by animacy. The languages that are the focus of this book – Spanish, Hindi, and Romanian – are considered two-dimensional systems because DOM is triggered by animacy and referentiality (definiteness and specificity) (Aissen, Reference Aissen2003; De Hoop and Malchukov, Reference De Hoop and Malchukov2007), although, as we will see, the weight of these factors can also differ in these languages.
Depending on the language, the overt morphological marking of special objects can be achieved through case particles (as in in Persian and Turkish), object agreement on the verb (as in Swahili and Ostyak), prepositions that have other functions in the language (as in Spanish and Romanian), or postpositions (as in Hindi). Some languages also use clitic doubling (such as Romanian, Macedonian, Catalan). In Spanish, Romanian, and Hindi, human specific objects are marked; inanimate non-specific objects are not marked, as shown in (2)–(4).
| (2) | a. | Maria vio a Juan. | Spanish |
| Maria saw DOM Juan | |||
| ‘Maria saw Juan.’ | |||
| b. | Maria vio una ambulancia. | ||
| ‘Maria saw an ambulance.’ |
| (3) | a. | L- | am văzut pe Mihai. | Romanian |
| Cl.3sg.masc | have seen DOM Michael | |||
| ‘I have seen Michael.’ | ||||
| b. | Claudia a văzut o casă. | |||
| Claudia has seen a house | ||||
| ‘Claudia saw a house.’ | ||||
| (4) | a. | Mira-ne Ramesh-ko dekhaa. | Hindi |
| Mira-erg Ramesh-DOM saw | |||
| ‘Mira saw Ramesh.’ | |||
| b. | Mira-ne ghar dekhaa. | ||
| Mira-erg house saw | |||
| ‘Mira saw a house.’ |
In Turkish, a language with a unidimensional system whose main trigger is specificity, all definite NPs (names, pronouns, definite descriptions, and demonstrative NPs) are obligatorily marked with accusative case –(y)I irrespective of animacy (Enç, Reference Enç1991: p. 9), as shown in (5a). Inanimate direct objects may appear bare or with the accusative case marker –(y)I depending on the specificity/definiteness of the noun, as in (5a,b,d).
| (5) | a. | Zeynep | Ali-yi | o-nu | adam-ı | o | masa-yı | gör-dü. |
| Zeynep | Ali-acc | s/he-acc | man-acc | that | table-acc | see-past | ||
| ‘Zeynep saw Ali/her-him/ the man/ that table.’ | ||||||||
| b. | (Ben) | kitab-ı | oku-du-m. | (definite) | ||||
| I | book-acc | read-past-1sg | ||||||
| ‘I read the book.’ | ||||||||
| c. | (Ben) | bir | kitap | oku-du-m. | (non-specific, indefinite) | |||
| I | a | book | read-past-1sg | |||||
| ‘I read a book.’ | ||||||||
| d. | (Ben) | bir | kitab-ı | oku-du-m. | (specific, indefinite) | |||
| I | a | book-acc | read-past-1sg | |||||
| ‘I read a certain book.’ | ||||||||
In Basque, which seems to be acquiring DOM through contact with Spanish (Rodríguez-Ordóñez, Reference Rodríguez-Ordóñez2017), DOM appears as dative marking (-ri) on the object NP and dative agreement (-a) on the verb, as in (6a). Inanimate objects, as in (6b), are not marked with dative case and there is no dative object agreement on the verb:
| (6) | a. | Ni-k | Mikel-e-rii | ikusi | d-o-ts-ai-t |
| I-erg | Mikel-epenth-dat | see | Aux-3sg.pr.df-3sg.dat-1sg-erg | ||
| ‘I have seen Mikel.’ | |||||
| b. | Ni-k | etxi-e-Øi | ikusi di-o-t | ||
| I-erg | house-the-abs | see | Aux-3sg.pr. -1sg.erg | ||
| ‘I have the house.’ | |||||
Since DOM is mainly conditioned by semantic and pragmatic factors, syntactic and morphological analyses of this phenomenon abound (Aissen, Reference Aissen2003; De Swart and De Hoop, Reference De Swart and De Hoop2007; Keine, Reference Keine, Trommer and Opitz2007; Von Heusinger and Chiriacescu, Reference Von Heusinger and Chiriacescu2013). Aissen (Reference Aissen2003) discusses the relevance of animacy and specificity (or definiteness) for DOM, as determined by the animacy and definiteness scales (Woolford, Reference Woolford1999) shown in Tables 3.1 and 3.2, with their corresponding feature distribution (Von Heusinger and Kaiser, Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005). In the animacy scale, human animate is at the top of the scale (left side of the table); in the definiteness scale personal pronouns are the highest category.
Table 3.1 The animacy hierarchy (Silverstein, Reference Silverstein and Dixon1976)
human > animate > inanimate
| human | animate | inanimate |
|---|---|---|
| + human | – human | |
| + animate | – inanimate |
Table 3.2 The definiteness scale (Aissen, Reference Aissen2003; Croft, Reference Carreira, Montrul and Polinsky1988)
Personal pronoun> Proper name > Definite NP > indefinite specific NP > Non-specific NP.
| Pronoun > | PN > | Def > | Spec > | Non-Spec |
|---|---|---|---|---|
| + definite | – definite | |||
| + specific | – specific |
The extent to which the animacy and definiteness scales determine DOM vary from language to language. In many languages, DOM emerged, developed, and expanded over time. Empirical evidence from Spanish, Romanian, Persian, and Afrikaans indicates that the diachronic expansion of DOM in these languages followed the animacy and definiteness scales (Aissen, Reference Aissen2003; Manoliu, Reference Manoliu1993; Ponelis and Ponelis, Reference Ponelis and Ponelis1993). DOM is also heavily determined by information structure. In both Spanish and Romanian, topicality was a contributing factor in DOM expansion along their histories. Even in Romance languages that do not usually display DOM, such as Italian and Catalan, the dislocation of a first or second person object pronoun triggers DOM (D’Alessandro, Reference D’Alessandro2021; Pineda, Reference Pineda, Kabatek, Obrist and Wall2021).
Within languages, there is also variable expression of DOM. Some languages have a very restrictive DOM system and others a full-fledged system. Within the Romance languages, those languages that have very restrictive DOM, such as Catalan, Sardinian, and Portuguese, only exhibit DOM with personal pronouns, which are high on the animacy and prominence scales (Table 3.2). Spanish and Romanian, the two Romance languages with a more extensive DOM system, exhibit DOM with inanimate and indefinite DPs. The definiteness scale establishes implicational relationships and makes very direct predictions: if a language exhibits DOM with categories on the right side of the scale (the lower end), it will have DOM with all the items on the left. To date, we know of no languages that mark DOM with personal pronouns and definite indefinite DPs, omitting names and definite DPs, for example. Yet, different languages demarcate the use of DOM at different points in the scale. Understanding the apparent variability of DOM, its emergence and diffusion as well as its retraction in different languages and language contact situations, with particular attention to the semantic and syntactic parameters that regulate DOM, has been the focus of intense research from both synchronic and diachronic perspectives (Kalin, Reference Kalin2018; Mardale and Karatsareas, Reference Mardale and Karatsareas2020; Von Heusinger and Kaiser, Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005). The evolution and current characterization of DOM systems in different related and unrelated languages have implications for linguistic theory and theories of case (Fábregas, Reference Fábregas2013; Kalin, Reference Kalin2018; Keine, Reference Keine, Trommer and Opitz2007).
To summarize, cross linguistically, three main parameters determine whether an object will be marked: animacy, referentiality, and topicality. DOM languages rank these parameters differently. For example, DOM in Spanish is triggered by animacy and specificity, but in Hindi and Turkish specificity outranks animacy (Aissen, Reference Aissen2003). However, even within a language DOM may exhibit variability with respect to its application among monolingually raised native speakers. The study presented in this book focuses on Spanish, Hindi, and Romanian, and the rest of the chapter describes how DOM evolved and is manifested at present in these languages.
3.1.1 Spanish
The diachronic evolution of DOM in Spanish has received significant attention (Company, Reference Company2002; Laca, Reference Laca and Pensado Ruiz1995, Reference Laca and Company2006; Von Heusinger and Kaiser, Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005) and there is significant research as well on how it is currently expressed in different varieties (Brugè and Brugger, Reference Brugè and Brugger1996; Fábregas, Reference Fábregas2013; Leonetti, Reference Leonetti2004, Reference Leonetti2008; López, Reference López2012; Pensado, Reference Pensado2005; Rodríguez-Mondoñedo, Reference Rodríguez-Mondoñedo2007; Torrego, Reference Torrego1998; Von Heusinger and Kaiser, Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005; Weissenrieder, Reference Weissenrieder1985, Reference Weissenrieder1990). Descriptively, human and specific (definite) direct objects are obligatorily marked with the preposition a. In (7), which is also the preposition that marks goals in ditransitive constructions (dative case, e.g., Roberto le dio un libro a Patricia “Roberto gave Patricia a ring”), the preposition that appears with psych predicates (dative experiencers, e.g., A Juan le gusta el helado “Juan likes ice cream”), and the directional preposition used with locatives (e.g., Fueron a la tienda “They went to the store”). Human non-specific objects, as in (8), are optionally marked (depending on context). Inanimate objects – specific in (9) and non-specific in (10) – are not typically marked. The most succinct generalization in Spanish is that if the object is animate and specific it must be marked. Other objects are typically unmarked.
| (7) | Marina vio a/*Ø | la mujer. | [animate, specific] |
| ‘Marina saw the woman.’ | |||
| (8) | Marina vio a/Ø | una mujer. | [animate, non-specific] |
| ‘Marina saw a woman.’ | |||
| (9) | Marina vio *a/Ø | el barco. | [inanimate, specific] |
| ‘Marina saw the boat.’ | |||
| (10) | Marina vio *a/Ø | un barco. | [inanimate, non-specific] |
| ‘Marina saw a boat.’ | |||
There is substantial variability in the use of DOM with inanimate objects (Fábregas, Reference Fábregas2013; López, Reference López2012). Some verbs like llamar “call” describir “describe” and seguir “follow” require DOM with inanimate objects (e.g., llamar a una ambulancia “call an ambulance,” seguir a un auto “follow a car”); inanimate proper nouns can be optionally marked by DOM, visitar a Brasil “visit Brasil” (Laca, Reference Laca and Company2006); the use of DOM with animals (non-human animate) is variable and depends on the size and or domesticity of the animal (matar un mosquito “kill a mosquito,” pasear al/ a un perro “walk the/a dog”); DOM can be used to disambiguate subject and object when both are inanimate (El submarino hundió al barco “The submarine sank the ship”); and when inanimate objects are topicalized in clitic left dislocations, they tend to favor DOM (A la película la terminé de ver ayer “The movie I finished watching yesterday”) in some varieties of Spanish (Montrul, Reference Montrul2013a).
For Torrego (Reference Torrego1998), definiteness, specificity, aspect, topicality, agentivity, and affectedness determine when objects are marked (see also Leonetti, Reference Leonetti2004 and Zubizarreta, Reference Zubizarreta1998). However, Von Heusinger and Kaiser’s (Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005) diachronic analysis suggests that as far as properties of the object are concerned, animacy and specificity are the most relevant parameters for DOM in Spanish and definiteness less relevant. All other factors (agentivity, affectedness, aspect, etc.) are contributing but not deterministic factors. López (Reference López2012) argues that within the object properties, animacy is the overriding feature that controls DOM in Spanish, at least with respect to the structure of nouns. The empirical evidence for his claim comes from the behavior of (non-specific) quantifiers like alguien “somebody,” nadie “nobody,” ninguno “no,” muchos “many,” pocos “few,” which require DOM when referring to animate objects (Vi a alguien “I saw somebody”/No vi a nadie “I saw somebody/I did not see anybody” vs. Vi algo “I saw something”/No vi nada “I did not see anything”).
Because our study is concerned with language change in the DOM system of Spanish in a language contact situation, an important question is whether the changes in DOM expression present in heritage language grammars are similar to or different from the changes already in progress in different Spanish varieties. I discuss next how DOM emerged along the Animacy and Definiteness scales, and the current variation attested in different monolingual or homeland varieties, before outlining the synchronic analyses of DOM assumed in this study.
Synchronic studies of regional varieties of Spanish (Argentina, Cuba, Mexico, Spain, US Spanish, Andean Spanish) from both sociolinguistic and experimental psycholinguistic perspectives have uncovered important variation with uses of DOM by monolingual (Arechabaleta Regulez, Reference Arechabaleta Regulez2019; Arechabaleta Regulez and Montrul, Reference Arechabaleta Regulez and Montrul2021; Bautista-Maldonado and Montrul, Reference Bautista-Maldonado and Montrul2019; Montrul, Reference Montrul2013a; Tippets, Reference Tippets2010) and bilingual speakers (Arechabaleta Regulez, Reference Arechabaleta Regulez2019, Reference Arechabaleta Regulez, Mardale and Montrul2020; Hur, Reference Hur, Mardale and Montrul2020; Montrul, Reference Montrul2004, Reference Montrul2014; Montrul and Bowles, Reference Montrul and Bowles2009; Montrul and Sánchez-Walker, Reference Montrul and Sánchez-Walker2013; Montrul et al., Reference Montrul, Bhatt and Girju2015), and it is important to understand how the current variation relates to diachronic DOM expansion. DOM did not exist in Latin: Latin direct objects were marked with accusative case but with no preposition. Among the Romance languages, only Spanish and Romanian developed full-fledged DOM systems, and the current tendency in most monolingual varieties of Spanish has been the gradual expansion of DOM along the Animacy and Definiteness scales, to inanimate definite objects. However, in bilingual varieties in contact with non-DOM languages we typically see the opposite of expansion, i.e., retraction through omission of DOM in core cases (animate, specific objects), as we confirm in this study. An exception to this pattern is a variety of Italian that exhibits DOM (Sicilian) in contact with Québec French that appears to retain DOM (D’Alessandro, Reference D’Alessandro2021).
Aissen (Reference Aissen2003) analyzed instances of DOM in the Medieval text Poema del Mio Cid. She found that a-marking with direct objects was less widely distributed then than it is in present day Spanish. In Medieval Spanish, common noun phrases referring to humans, both definite and indefinite, were not always marked, as shown in (11), whereas today they must be marked, as in (12).
| (11) | … quando dexaron mis fijas en el rrobredo de Corpes CMC 3151 |
| when they left my daughters in the oak-forest of Corpes |
| (12) | … cuando dejaron a mis hijas en el robledo de Corpes. |
Other diachronic studies (Company, Reference Company2001, Reference Company2002; Laca, Reference Laca and Company2006) have shown that a-marking is also advancing with inanimate objects in some Spanish varieties (see Table 3.3), such as Mexican Spanish, as in (13) and (14) (examples from Company, Reference Company2001: p. 147).
| (13) | Después de conocer mucho a la vida, ya no me interesa el teatro. |
| (Proceso, May 1999) | |
| ‘After knowing life too much, I am no longer interested in theater.’ |
| (14) | Para que no nos peleemos, puse a la silla en el medio. |
| (Mexico, spoken Spanish) | |
| ‘So that we do not fight, I put the chair in the middle.’ |
Table 3.3 Corpus-based analysis of the diachronic advance of a-marking with animate and inanimate objects in Spanish (adapted from Company, Reference Company2001). Percentage of DOM by object and by century.
| Object | XII century | XIV century | XV century | XVI century | XX century |
|---|---|---|---|---|---|
| animate | 56% (420/752) | 52% (440/849) | 52% (377/720) | 60% (847/1416) | 74% (168/228) |
| inanimate | 1% (2/300) | 0% (1/300) | 3% (8/300) | 8% (954/641) | 17% (64/373) |
The evolution of DOM with inanimate objects was further confirmed by Von Heusinger and Kaiser’s (Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005) diachronic and synchronic investigation of Spanish DOM, using corpora from the Poema de Mio Cid and its contemporary translations, and synchronic analyses of different varieties of Spanish (Argentina, Uruguay, Peru, and Mexico) collected from written and oral corpora (informal interviews, short stories, and email messages to a newspaper). The purpose of their investigation was to trace whether the evolution of DOM in Old Spanish and in different Latin American varieties followed the definiteness scale and the animacy scale. Tables 3.4 and 3.5 show the distribution of DOM along the two scales (columns for definiteness and rows for animacy). Grey scale shows expansion.
Table 3.4 DOM in Old Spanish: Cross-classification of animacy hierarchy and definiteness scale (Von Heusinger and Kaiser, Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005: p. 42)
| Animacy | Strong Pro> | PN > | Definite > | + Spec > | – Spec |
|---|---|---|---|---|---|
| human | + | + | ± | – | – |
| animate | + | + | ± | – | – |
| inanimate | 0 | ± | – | – | – |
Table 3.5 DOM in Modern Spanish: Cross-classification of animacy hierarchy and definiteness scale (Von Heusinger and Kaiser, Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005: p. 42)
| Animacy | Strong Pro> | PN > | Definite > | + Spec > | – Spec |
|---|---|---|---|---|---|
| human | + | + | + | + | ± |
| animate | + | + | + | + | – |
| inanimate | 0 | ± | – | – | – |
Comparing Tables 3.4 and 3.5, we see that DOM extended from marking animate pronouns and proper names to marking animate and specific noun phrases. This transition was mediated by topicality because at first only topicalized specific noun phrases were marked, while non-topicalized ones were not. The expansion later included indefinite NPs with the features [± specific]. So, in the transition from Old to Modern Spanish, topicality and specificity were transitional triggers affecting definite and indefinite NPs, respectively.
The evolution of DOM in Latin American Spanish in Table 3.6 shows expansion along the Animacy scale, since inanimate definite objects are specific and can receive DOM marking.
Table 3.6 DOM in American Spanish: Cross-classification of animacy hierarchy and definiteness scale (Von Heusinger and Kaiser, Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005: p. 46)
| Definite > | Indefinite | |||||
|---|---|---|---|---|---|---|
| Animacy | Strong Pro> | PN > | + Spec > | – Spec | + Spec > | – Spec |
| human | + | + | + | + | + | ± |
| animate | + | + | + | ± | + | ± |
| inanimate | 0 | ± | ± | – | – | – |
Several semantic and syntactic factors facilitated the marking of specific inanimate objects. These include the lexical nature of the verb (verbs that seem to be followed by the directional preposition “a,” like seguir a or perseguir a “follow”), secondary predication (Consideran a las facturas fraudulentas “They consider the bills fraudulent”), the preverbal or topicalized position of the objects (A esta película ya la vi “This movie I already saw”), and clitic doubling (Lo vi a Juan “ I saw Juan”). Von Heusinger and Kaiser (Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005) also suggest that there is synchronic variation in the parameters that regulate DOM in Latin American Spanish. For example, while definiteness and specificity trigger DOM in the Spanish spoken in Argentina and Uruguay, animacy and specificity are the main triggers in other varieties.
Several studies have investigated the factors that may underlie variable uses of DOM in different varieties, such as peninsular Spanish and Argentinian Spanish (Dumitrescu, Reference Dumitrescu1997), and between Mexican, Argentinian, and Peninsular Spanish (Arechabaleta Regulez, Reference Arechabaleta Regulez2019; Arechabaleta Regulez and Montrul, Reference Arechabaleta Regulez and Montrul2021; Balasch, Reference Balasch2011; Tippets, Reference Tippets2010), to enable understanding of the evolutionary path of DOM. There are also studies of DOM variation within varieties, such as in the Spanish of Cuba (Alfaraz, Reference Alfaraz2011), of Argentina (Montrul, Reference Montrul, Rodríguez Louro and Colantoni2013b), of Mexico (Arechabaleta Regulez, Reference Arechabaleta Regulez2019; Arechabaleta Regulez and Montrul, Reference Arechabaleta Regulez and Montrul2021; Bautista-Maldonado and Montrul, Reference Bautista-Maldonado and Montrul2019), and of the United States (Arechabaleta Regulez, Reference Arechabaleta Regulez, Mardale and Montrul2020; Montrul, Reference Montrul2004, Reference Montrul2014; Montrul and Bowles, Reference Montrul and Bowles2009; Montrul and Sánchez-Walker, Reference Montrul and Sánchez-Walker2013; Montrul et al., Reference Montrul, Bhatt and Girju2015). Topicalization turns out to be a very significant driving factor in this evolution, as mentioned earlier. In Mexico and Argentina, definite inanimate objects tend to appear with DOM in production (e.g., choqué al auto “I crashed the car,” cocecharon al maíz “They harvested corn”), especially when these are topicalized (A esta plaza la patrocinan Aerolineas Argentinas y Ud. “This park is sponsored by Aerolineas Argentinas and you”) (Company, Reference Company2001; Dumitrescu, Reference Dumitrescu1997; Von Heusinger and Kaiser, Reference Von Heusinger and Kaiser2011). The acceptance of topicalized inanimate objects with DOM was confirmed by Montrul’s (Reference Montrul, Rodríguez Louro and Colantoni2013b) experimental study with 26 native speakers in Argentina. The native speakers completed a written grammatical judgment task testing, in simple clauses, animate and inanimate objects with and without DOM, with and without accusative clitic doubling,and with topicalizations. Montrul investigated whether topicalizations and accusative clitic doubling, which are common in Argentine Spanish, contributed to the extension of DOM to inanimate objects. The results showed categorical acceptance of DOM with animate objects and acceptablity to some degree with inanimate specific and non-specific objects, especially when these were clitic doubled and topicalized (A la casa la compró Mariela “DOM the house it Mariela bought”). Montrul’s experimental findings confirmed Dumitrescu’s (Reference Dumitrescu1997) observation of DOM in speech and street signs as well as the evolutionary paths described by Company (Reference Company2001) and Von Heusinger and Kaiser (Reference Von Heusinger, Kaiser, Von Heusinger, Kaiser and Stark2005).
Bautista-Maldonado and Montrul (Reference Bautista-Maldonado and Montrul2019) investigated DOM in native speakers of Mexican Spanish using an elicited production task and a written acceptability judgment task. They found that DOM with animate, specific direct objects is produced consistently, and is categorically accepted as grammatical. Except for non-specific NPs, all other NP types along the definiteness scale showed categorical DOM marking (Pronouns, Names, specific NPs, and quantifiers). They also confirmed that there is a slight tendency toward DOM marking with specific inanimate objects, although the expansion of DOM to inanimate objects with topicalizations was not examined in this study. Non-specific inanimate objects were preferred without DOM in the acceptability judgment task. An unexpected finding of this study, which goes against what has been reported in the literature, is that native speakers of Mexican Spanish did not produce or accept clitic-doubled personal pronouns with DOM, such as Lo vi a él “I saw him,” preferring vi a él, without the clitic, or lo vi, without the strong pronoun, instead). (Arechabaleta Regulez, Reference Arechabaleta Regulez2019; Arechabaleta Regulez and Montrul, Reference Arechabaleta Regulez and Montrul2021) tested young native speakers of Mexican Spanish with two oral production tasks, an acceptability judgment task, and a sentence processing task with eye tracking during reading to corroborate the expansion of DOM to inanimate objects in Mexican Spanish. Although the native speakers did not produce inanimate objects with DOM in the two production tasks, they rated them more acceptable (*Cocecharon al maíz “They harvested corn”) than unmarked animate objects (*Vieron Juan “They saw Juan”) in the acceptability judgment task. In the eye tracking processing task, the Mexican speakers showed sensitivity (measured by longer latencies) to DOM omission in sentences with animate objects (*El actor liberó el compañero con su llave “The actor freed his partner with is key”) but less sensitivity (shorter latencies) to the ungrammaticality of sentences with marked inanimate objects (*El joven movió al sofá a la calle para dormir “The young man moved the sofa to the street to sleep”). Since DOM innovations were detected in online processing more than in grammaticality judgments and oral production, Arechabaleta Regulez‘s (Reference Arechabaleta Regulez2019) and Arechabaleta Regulez and Montrul’s (Reference Arechabaleta Regulez and Montrul2021) results support the hypothesis that language variation and change may start with comprehension before it reaches production (Lundquist et al., Reference Lundquist, Rodina, Sekerina and Westergaard2016).
As we will cover in more depth in the rest of this book, retraction (omission) of DOM has also been reported in other varieties, particularly in Spanish in contact with English in the United States (Montrul, Reference Montrul2014; Montrul and Bowles, Reference Montrul and Bowles2009) and in Spanish in contact with Quechua in Perú (Sánchez, Reference Sánchez2003), both are bilingual varieties where Spanish is in contact with a non-DOM language. Yet, omission of DOM with animate and specific objects has also been attested in monolingual homeland varieties, as in Cuban (Alfaraz, Reference Alfaraz2011) and Dominican Spanish (Lunn, Reference Lunn, Lee, Geeslin and Clemens2002). For example, Alfaraz (Reference Alfaraz2011) reports significant omission of DOM with animate specific direct objects in spontaneous production, as (15) and (16) illustrate. The data came from younger and older speakers, and the younger speakers omitted DOM with animate objects almost 80% of the time compared to the older speakers, who omitted about 40% of the time. If these trends are replicated in other studies, they would indicate diachronic change in the monolingual variety.
| (15) | Nosotros no conocimos Julio Sánchez (Alfaraz, Reference Alfaraz2011) |
| ‘We did not meet/know Julio Sánchez.’ |
| (16) | Para que ellos fueran a buscar un familiar de Juanito (Alfaraz, Reference Alfaraz2011) |
| ‘So that they could go find a relative of Juanito.’ |
In sum, leaving aside bilingual varieties for the moment, it seems that recent studies of monolingual varieties of Spanish have uncovered important variation in the expression of DOM with animate and inanimate objects.
3.1.2 Romanian
DOM in Romanian is marked with the preposition pe, which is homophonous with the locative preposition pe, as in (17), and the preposition appearing with some indirect objects, as in (18).
| (17) | Cartea este pe masǎ. |
| book-def is on table | |
| “The book is on the table.” |
| (18) | Claudia se bazează foarte mult pe mama sa. |
| Claudia refl. relies very much on mother-def her. | |
| “Claudia relies very much on her mother.” |
The two main parameters that regulate DOM in Romanian are animacy and referential stability (Ciovârnache and Avram, Reference Ciovârnache and Avram2013; Farkas and Von Heusinger, Reference Farkas and von Heusinger2003; Mardale, Reference Mardale2008, Reference Mardale2010a). Referential stability is related to how the specificity value of a given DP can change depending on discourse properties (Farkas and Von Heusinger, Reference Farkas and von Heusinger2003). Farkas and Von Heusinger (Reference Farkas and von Heusinger2003) proposed the referentiality stability scale, as in (19).
| (19) | Referentiality Stability Scale (Farkas and Von Heusinger, Reference Farkas and von Heusinger2003) |
| Proper nouns, definite pronouns > definite DPs > partitives >indefinite DPs |
Following the referentiality scale in (19), pe-marking is obligatory with personal pronouns and proper names, as in (20), including those referring to personified animals (Farkas and Von Heusinger, Reference Farkas and von Heusinger2003; Române, Reference Române2005; Von Heusinger and Chiriacescu, Reference Von Heusinger, Chiriacescu, Espinal, Leonetti and McNally2009; Von Heusinger and Gáspár, Reference Von Heusinger and Gáspár2008). Names of cities (i.e. Chicago) are not pe-marked. Pe-marking is optional with referentially stable objects, as in modified definite animate DPs in (21) and indefinite animate DPs, in (22).
| (20) | Raluca a văzut-o | pe | Beatrice/ea. | (*Raluca a văzut Beatrice/ea.) |
| Raluca has seen-Cl.3sgFem | DOM | Beatrice/her | ||
| ‘Raluca saw Beatrice/her.’ | ||||
| (21) | (L)-am văzut | (pe) | băiatul înalt. |
| CL-have seen | DOM | boy-def tall | |
| ‘I/we saw the tall boy.’ | |||
| (22) | Roxana a vizitat (pe) un prieten. |
| Roxana has visited (DOM) a friend | |
| ‘Roxana visited a friend.’ |
Pe-marking is generally ungrammatical with inanimate objects, both definite specific and indefinite DPs, as in (23) and (24).
| (23) | Angelica a văzut casa. | (*Angelica a văzut pe casa.) |
| Angelica has seen house-def | ||
| ‘Angelica saw the house.’ |
| (24) | Luminița a văzut o casă. | (*Luminița a văzut pe o casă.) |
| Luminița has seen a house | ||
| ‘Luminița saw a house.’ |
In Romanian, animacy is less deterministic as a trigger of DOM than it is in Spanish because some specific inanimate objects can be DOM-marked. DOM (pe-marking) is also sensitive to definiteness, specificity, and topicalization (Cornilescu, Reference Cornilescu2000; Mardale, Reference Mardale2010b; Tigău, Reference Tigău2010). Strong pronominal direct objects are obligatorily marked with pe irrespective of animacy, as in (25b). Inanimate objects are usually not acceptable with DOM (Irimia, Reference Irimia2018); however, specific inanimate objects can also be differentially marked with pe, as is (26), and particularly when dislocated, as in (27a). Some speakers find that using DOM with inanimate objects in a post verbal position creates a semantic upgrading effect (27b) (Ticio and Avram, Reference Ticio and Avram2015: p. 387).
| (25) | a. | L-am | cumpărat | pe | acesta | / | pe | celălat. |
| it.Cl.masc.3sg-have.1bought | DOM | this.one | DOM | the.other | ||||
| ‘I have bought this one / the other.’ | ||||||||
| b. | Ai | luat-o | pe | aceea. | ||||
| have.2sg | taken-it.Cl.fem.3sg | DOM | that.one | |||||
| ‘You have taken that one.’ | ||||||||
| (26) | L-ai | uitat | pe | A | din | text. |
| it.Cl.masc.3sg-have.2sg | forgot | DOM | A | from | text | |
| ‘You have forgot the (letter) A in the text.’ | ||||||
| (27) | a. | Pe | trandafir | l-a | lăsat | albina | la | urmă. |
| DOM | rose | it.Cl.masc.3sg-have.3sg | left | bee.the | at | end | ||
| ‘The rose, the bee left it for the end/for last.’ | ||||||||
| b. | Albina l-a | lăsat | pe | trandafir | la | urmă. | ||
| bee.the it.Cl.masc.3sg | have.3sg left | DOM | rose | at | end | |||
| ‘The bee left the rose for last/the end.’ | ||||||||
Summarizing, animate specific direct objects are marked with the preposition pe in Romanian, especially if they are expressed with names or pronouns. Definite animate objects (definite DPs) are optionally pe-marked because, semantically, they are less referentially stable, and inanimate specific objects can be pe-marked if dislocated. In addition, there are other syntactic constraints that determine the optionality of pe-marking with definite DPs. Romanian has post-nominal definite articles that are an enclitic suffix (e.g. băiat-ul ‘boy-the’). Definite DPs behave differently depending on the presence or absence of the definite enclitic article, and this is the result of a syntactic constraint independent of DOM that applies to all prepositions in Romanian (except for cu ‘with’). Therefore, constructions with a preposition + noun with a definite article suffix are ungrammatical, as the sentence in (28) shows. If the noun is suffixed with a definite article (băiat-ul ‘boy-the’), it cannot be pe-marked unless it is further modified with an adnominal expression, as in băiatul înalt (boy-the tall) (Dobrovie-Sorin, Reference Dobrovie-Sorin1994; Mardale, Reference Mardale2010b; Von Heusinger and Chiriacescu, Reference Von Heusinger and Chiriacescu2013). Some Romanian speakers use pe and drop the definite article leaving a bare noun, as in (29), while others drop pe and use the definite article, as in (30) (Von Heusinger and Chiriacescu, Reference Von Heusinger, Chiriacescu, Espinal, Leonetti and McNally2009; Von Heusinger and Onea Gáspar, Reference Von Heusinger and Gáspár2008).
| (28) | *L- | am văzut | pe | băiatul. |
| Cl.3sg.masc | have seen | DOM | boy-def.masc | |
| ‘I have seen the boy.’ | ||||
| (29) | L- | am văzut | pe | băiat. | (*Am văzut băiat.) |
| Cl.3sg.masc | have seen | DOM | boy | ||
| ‘I have seen the boy.’ | |||||
| (30) | Am | văzut | băiatul. |
| I have | seen | boy-def.masc | |
| ‘I have seen the boy.’ | |||
When DOM is present with definite DPs, the nominal expression is referentially stable because DOM provides referential persistence in the discourse (Chiriacescu and Von Heusinger, Reference Chiriacescu and von Heusinger2010); when DOM is omitted, the expression has a role reading (e.g., the director, the king, the president), where the referent is not stable (Hill, Reference Hill2013). Some familiar functional expressions such as the mother, the teacher, the priest, or the boss are exceptions to the syntactic constraint on prepositions and may appear with pe (at least in spoken Romanian) (Von Heusinger and Gáspár, Reference Von Heusinger and Gáspár2008). Example (31) is acceptable under the reading that the boss is granted contextual uniqueness to its referent (role reading), though such expressions are rare:
| (31) | L- | am | văzut | pe | şeful. |
| Cl.3sg.masc | have | seen | DOM | boss-def.masc | |
| ‘I have seen the boss.’ | |||||
Another distinctive syntactic characteristic of Romanian is the presence of accusative clitic doubling (CD), by which the object DP is doubled by an accusative clitic. Whenever a direct object is doubled by an accusative clitic, DOM (pe-marking) is required, as in (32) and (33). Clitic doubled objects without pe-marking are typically ungrammatical (Farkas and Von Heusinger, Reference Farkas and von Heusinger2003), but there is also some variation in native speakers of Romanian, as we discuss below.
| (32) | Angelica a văzut-o | pe Madonna/ea. (*Angelica a văzut-o Madonna/ea.) |
| Angelica has seen-Cl.3sg.fem | DOM Madonna/her | |
| ‘Angelica saw Madona/her.’ | ||
| (33) | Elisabeta a văzut-(o) | pe o femeie. (*Elisabeta a văzut-o o femeie.) |
| Elisabeta has seen-Cl.3sg.fem | DOM a woman | |
| ‘Elisabeta saw a woman.’ (specific reading) | ||
The distribution of CD with postverbal objects also follows the definiteness and referentiality scales: while pronouns and proper names always occur with CD, there is a strong preference for CD with pe-marked definite direct objects as in (34), and some preference for CD with pe-marked animate indefinite direct objects, as in (35) (Von Heusinger and Gáspár, Reference Von Heusinger and Gáspár2008). Inanimate objects are ungrammatical with CD because CD requires DOM, andnon-specific inanimate objects cannot be pe-marked, as in (36).
| (34) | (L)-am | văzut | pe | bărbatul | înalt. |
| Cl.3sg.masc.have | seen | DOM | man | the tall. | |
| ‘I saw the tall man.’ | |||||
| (35) | *Ioana | a | văzut-o | pe | o | casă. |
| Ioana | has | seen-Cl.3sg.fem | DOM | a | house | |
| ‘Ioana saw a house.’ | ||||||
There is disagreement regarding the use of CD in some contexts, particularly for definite modified objects, and this is likely due to register (higher registers prefer CD). In general, Gramatica Limbii Române (Reference Române2005), Von Heusinger and Chiriacescu (Reference Von Heusinger, Chiriacescu, Espinal, Leonetti and McNally2009), Von Heusinger and Gáspár (Reference Von Heusinger and Gáspár2008) all indicate that CD is obligatory with proper names and personal pronouns, regardless of whether the pronouns refer to animate or inanimate objects, and thus sentences as in (36) are ungrammatical:
| (36) | Maria *(l)-a desenat *(pe) Matei / el. |
| Maria *(Cl) has drawn *(DOM) Matei /him | |
| ‘Maria drew Matei/him.’ |
With modified animate DPs, CD and DOM are optional (Farkas and Von Heusinger, Reference Farkas and von Heusinger2003). Gramatica Limbii Române (Reference Române2005) notes that CD is obligatory with modified animate DPs with strong quantifiers, while definite modified DPs are reported to be marginal for some speakers (Von Heusinger and Chiriacescu, Reference Von Heusinger, Chiriacescu, Espinal, Leonetti and McNally2009). For the purposes of the present study, I assume that CD and DOM are both required for proper names and personal pronouns, and optional with definite DPs, despite the variation that exists in the language. DOM and CD are ungrammatical with inanimate objects, especially if these are not specific.
David (Reference David2015) proposes that the clitic-doubled pe construction is prototypically employed in clauses involving direct objects with human referents but can be extended to clauses containing non-human referents, as the constructional polysemy branches out. CD with inanimate objects and pe was common in Old Romanian (Hill and Mardale, Reference Hill and Mardale2017). Unlike other Balkan languages, Romanian does not have a clitic doubling construction independent of DOM (all clitic-doubled constructions must have pe-marking), although this was possible in Old Romanian. Some pe-marked direct objects without CD are heard in present-day spoken Romanian spoken, but these are dispreferred in the higher registers. It appears that in the historical development of Romanian, CD and DOM were different constructions that developed independently (Cornilescu, Reference Cornilescu2000; Von Heusinger and Onea Gáspár, Reference Von Heusinger and Gáspár2008, but cf., Hill and Mardale, Reference Hill and Mardale2017). Because both CD and DOM mark discourse prominence, the two constructions became entrenched in present day Romanian (David, Reference David2015). Diachronically, object topicalization (fronting) and clitic left dislocation (CLLD) facilitated the merger of CD and DOM with animate objects into what today can be considered a merged construction. In this study, we examine the extent to which different speakers of Romanian, including heritage speakers, immigrants, and Romanian speakers in Romania, exhibit this close relationship between DOM and CD in their grammars.
3.1.3 Hindi
Hindi/Urdu (henceforth Hindi) is a SOV split ergative language that also exhibits DOM marked with the post-position –ko, as shown in (36). Hindi does not have definite articles, unlike Spanish and Romanian. Hence, when –ko appears with direct objects, it signals human animacy and specificity. –ko is obligatory with pronouns and human personal names; it is optional with animals (Butt, Reference Butt1993; Junghare, Reference Junghare1983; Magier, Reference Magier1987; Masica, Reference Masica1982; Mohanan, Reference Mohanan1993, Reference Mohanan1994a, Reference Mohanan, Butt, Holloway King and Ramchand1994b; Singh, Reference Singh1994).
Human, specific direct objects must be overtly marked with –ko, as in (37), and are ungrammatical without –ko marking. Non-human animate, specific direct objects may be optionally marked with –ko, as in (38).
| (37) | Mira-ne | Ramesh-ko | dekhaa. | (*Mira-ne Ramesh dekhaa). |
| Mira-erg | Ramesh-DOM | saw | ||
| ‘Mira saw Ramesh.’ | ||||
| (38) | Mira-ne | us | kutte-ko | dekhaa. | (Mira-ne vo kuttaa dekhaa) |
| Mira-erg | that | dog-DOM | saw | ||
| ‘Mira saw that dog.’ | |||||
Ko-marking is obligatory with specific indefinites. In general, non-specific human objects are marginally acceptable without -ko marking, as in (39).
| (39) | Neha | kii | maaN-ne | us-ke liye | laRke | dekhe |
| Neha | of | mother.fem.sg-erg | her-for | boys.masc.pl | saw.masc.pl | |
| ‘Neha’s mother saw boys for her (for marriage).’ | ||||||
Animate non-human non-specific objects are unmarked, as in (40a), and they are unacceptable with overt –ko marking (40b).
| (40) | a. | tum | vahaaN | ek saaNp | dekhoge. |
| b. | *tum | vahaaN | ek saaNp -ko | dekhoge. | |
| you | there | one snake-DOM | see-fut. | ||
| ‘You will see a (non-specific) snake there.’ | |||||
Inanimate, specific direct objects can also be optionally marked with –ko, as in (41):
| (41) | Mira-ne | ghar(-ko) | dekhaa. |
| Mira-erg | house (DOM) | saw | |
| ‘Mira saw the house.’ | |||
Overt marking is optional with inanimate objects, but only when these are specific. Thus, both (42a,b) are possible, and while the object in (42a) can be interpreted as either indefinite or definite, the ko-marked one in (42b) must refer to an already mentioned banana:
| (42) | a. | Ravii-ne | kaccaa | kelaa | kaat!aa. |
| Ravi-erg | unripe | banana | cut | ||
| ‘Ravi cut the/an unripe banana.’ | |||||
| b. | Ravii-ne | kacce | kele-ko | kaat!aa. | |
| Ravi-erg | unripe | banana-DOM | cut.perf | ||
| ‘Ravi cut the unripe banana.’ | (Mohanan, Reference Mohanan1994a: pp. 87–88) | ||||
Inanimate, non-specific objects, as in (43), are ungrammatical with –ko-marking. However, specificity has to be evaluated in context.
| (43) | *Mira-ne | ek | ghar-ko | dekhaa. |
| Mira-erg | a | house-DOM | saw | |
| ‘Mira saw a (non-specific) house.’ | ||||
Object marking in Hindi has quite a different function for inanimate objects than it does for human objects: in the case of inanimate objects only, -ko marks definiteness. The claim that -ko is possible with indefinite animate objects, but not with indefinite inanimate objects, is supported by Singh’s (Reference Singh1994: p. 227) and Mohanan’s (Reference Mohanan1994a: p. 79) observations that the indefinite marker ek may occur with ko-marked objects if they refer to animate objects, but not if they refer to inanimate objects. However, Hindi sources in general vary considerably on the status of overt object marking on inanimate objects. Butt (Reference Butt1993) claims that ko-marking is possible for specific indefinite inanimate objects, whereas Singh (Reference Singh1994) mentions that some speakers disallow -ko with inanimate objects altogether. It is unclear whether this variation is due to dialect variation or to other linguistic factors, since context, word order, and aspect (in addition to overt case) play roles in fixing the specificity of inanimate direct objects in Hindi (Butt, Reference Butt1993; Mahajan, Reference Mahajan1990; Mohanan, Reference Mohanan1993, Reference Mohanan1994a; Singh, Reference Singh1994). To summarize, following Mohanan (Reference Mohanan1994a, Reference Mohanan, Butt, Holloway King and Ramchandb), Dayal (Reference Dayal2011), Kulikov et al. (Reference Kulikov, Malchukov and de Swart2006), and López (Reference López2012), the following generalizations for –ko marking as DOM are assumed in this study:
1) pronouns, names and definite human DPs in object position are always marked with -ko
2) -ko is obligatory with indefinite specific animate objects
3) -ko is optional with inanimate objects depending on context and specificity
Thus, specificity more than definiteness and animacy is the main trigger for DOM in Hindi.
3.2 Syntactic Analyses
There are several syntactic accounts of DOM within generative grammar (Hill and Mardale, Reference Hill and Mardale2021; Kalin, Reference Kalin2018; Lidz, Reference Lidz2006; López, Reference López2012; Rodríguez-Mondoñedo, Reference Rodríguez-Mondoñedo2007; Torrego, Reference Torrego1998) that capture the prominence of marked objects in the syntax. One of the important results of generative work on this topic is the idea that DOM is related to the phenomenon of object shift. In many languages, local movement of the object to a higher position has some of the same interpretive effects as DOM, particularly with respect to the dimension of definiteness. The fact that object shift makes some of the same distinctions as DOM has been considered as evidence that DOM is the visible marking of case, which is assigned in the higher, shifted position, and is associated with particular interpretive effects. Case-marked objects are associated with a relatively high phrase structural position and non-case-marked ones with a lower one (Woolford, Reference Woolford1999). DOM objects check specificity or accusative case, depending on the particular analysis, in a higher functional projection that is different from the projection for unmarked objects. Hence, overtly marked objects are structurally more complex than unmarked objects because they involve movement and PF realization of morphology (Lidz, Reference Lidz2006; Rodríguez-Mondoñedo, Reference Rodríguez-Mondoñedo2007; Torrego, Reference Torrego1998; Woolford, Reference Wray and Grace2006).
López (Reference López2012) provides a syntactic analysis of Spanish, Hindi, Romanian, and other languages that tries to capture the syntax, morphology, and semantics of DOM. His analysis relies on scrambling to account for the fact that small clauses, which are not DPs, names or pronouns, among other syntactic phenomena, are DOM-marked. Marked objects undergo short scrambling out of the VP to a vP-internal position, and there is an indirect connection between configuration and interpretation. Assuming Distributed Morphology (Halle and Marantz, Reference Halle, Marantz, Carnie and Harley1994), López also argues that the actual realization of marked morphology depends on environmental conditions surrounding the objects (animate noun, telic predicate, object affecting verb, etc.), giving the appearance of prominence scales. In Distributed Morphology, there is separation of syntax and morphology such that morphology is added post-syntactically. Affixes carry abstract syntactic and semantic features that can be more or less specified, and compete for lexical insertion post-syntactically, depending on how well they match the formal semantic and syntactic features of the stem.
For López (Reference López2012), there are three positions for objects in Spanish in the clause structure, as shown in the tree in Figure 3.2: the lowest in the VP (P1) is for unmarked objects (typically non-specific), the next one higher above the IO (indirect object) (P2) is for direct objects marked for DOM. These objects may or may not be specific. The highest position above the external argument (P3) is for left dislocated objects. So, the critical positions for marked and unmarked objects are P2 and P1.

Figure 3.2 Syntactic representation for DOM in Spanish and other languages
Marked nominals have case (represented in a case phrase or KP) that dominates DPs. Unmarked nominals are DP, NP, or P. When the marked object moves to the second higher position (P2 [Spec α]) within a KP, a vocabulary insertion rule inserts the DOM marker (the preposition a in Spanish, the preposition pe in Romanian, and the postposition -ko in Hindi), as in Figure 3.3.

Figure 3.3 Vocabulary insertion of DOM in Spanish and Hindi
The spell-out of these lexical items can be influenced by features of the nominal phrases, the external argument, the event structure of the predicate, or the verb itself. In the case of Spanish, the nominal feature [animate] triggers the insertion of a, and [humanness], which subsumes [animate], triggers insertion of pe in Romanian. As mentioned earlier, the fact that quantifiers, which are non-specific, trigger DOM, suggests that animacy trumps specificity for DOM-marking in Spanish. In Romanian, all object wh-phrases must have pe. In Hindi, definiteness and specificity trigger -ko insertion, and -ko is optional with wh-phrases. Keine (Reference Keine, Trommer and Opitz2007) captures the overt/non-overt -ko alternation by means of morphological impoverishment rules within Distributed Morphology (Halle and Marantz, Reference Halle, Marantz, Carnie and Harley1994). The impoverishment rule states that if a direct object is inanimate and non-specific, the morphological case is realized as zero instead of -ko:
[+ oblique] ➔ ø / [– human, – specific, + α]
The empirical evidence supporting López’s analysis comes from the behavior of DOM with bare plurals in Spanish, as in (44), small clauses in Spanish, Hindi, and Romanian, as in (45), causative, perception, and control verbs, as in (46), among others, all requiring DOM.
| (44) | a. | *Yo contrato a traductores. |
| b. | Yo contrato traductores. | |
| ‘I hire translators.’ |
| (45) | a. | El profesor consideró a/*Ø Juan inteligente. |
| ‘The professor considered Juan intelligent.’ | ||
| b. | Ion (îl) considera pe/*Ø student tîmpit. | |
| Ion Cl.acc.3sg considers DOM student stupid. | ||
| ‘Ion considers a/the student stupid.’ | ||
| c. | Saadhu-ne ciRiyaa-ko/*Ø raanii samjh-yaa. | |
| Holy.man-erg bird-DOM queen consider-perf | ||
| ‘The holy man considers the bird to be a queen.’ |
| (46) | a. | María hizo llegar tarde a/*Ø un niño. |
| ‘Maria made a boy arrive late.’ | ||
| b. | Ion l-a lasat pe//*Ø un copil sa joace Nintendo. | |
| Ion Cl.acc.3sg-aux let DOM a child play Nintendo | ||
| ‘Ion let a/the child play Nintendo.’ | ||
| c. | Anju-ne ciRiyaa-ko//*Ø jaa-ne di-yaa. | |
| Anjum-erg bird-DOM go-inf give-perf | ||
| ‘Anjum let the/a bird go.’ |
Bare plurals do not require DOM because they are of category #P and their case assignment is realized via incorporation (López, Reference López2012: p. 52). However, small clauses without bare plurals receive case, and so do objects of causative constructions, which are all KPs and therefore marked.
Syntactic analyses of DOM objects proposed on the basis of Spanish (Bernstein et al., Reference Bernstein, Ordóñez and Roca2018; López, Reference López2012) also assume Kayne’s generalization (Jaeggli, Reference Jaeggli1982: 20), which takes place in the verbal domain: an object NP must be doubled by a clitic (through clitic doubling) only if the NP is preceded by a preposition. Although similar analyses have been extended to Romanian (Dobrovie-Sorin, Reference Dobrovie-Sorin1994; López, Reference López2012), Hill and Mardale (Reference Hill and Mardale2019, Reference Hill and Mardale2021) argue, on the basis of diachronic and typological evidence, that such analyses do not easily extend to Romanian and propose, instead, micro-parametric differences in DOM representations between Spanish and Romanian. In Hill and Mardale’s (Reference Hill and Mardale2021) analysis, Romanian DOM is executed in the DP (or KP) and not in the vP.
Hill and Mardale (Reference Hill and Mardale2017, Reference Hill and Mardale2019, Reference Hill and Mardale2021) propose that Romanian DOM obligatorily involves an activated bundle of discourse features in K. DOM-marked direct objects stay in situ in the VP, in a split DP structure (Bernstein et al., Reference Bernstein, Ordóñez and Roca2018) with two D positions (D1 and D2), and a DDOM position in between. The object marker pe spells out the discourse feature bundles of K, although it is not a functional preposition in Modern Romanian (unlike the Spanish DOM a). The bundle includes the formal feature [particularized], which maps the semantic noun features animacy, reference, and specificity, and the feature [Fmark], which maps the speaker’s intention on the noun for pragmatic effects (e.g., contrastive vs. familiar effects). Diachronically, these DOM features remained in the nominal domain at all times, but changes occurred in their bundling, distribution, and spell out over time (Hill and Mardale, Reference Hill and Mardale2021). So, Modern Romanian has two ways to express DOM; one with CD (the default marking option), which favors [+ human] nouns, and one without CD. In Old Romanian, the DOM marker pe could check both [particularized] and [Fmark], but in Modern Romanian these features are split. The proposed structure for CD with DOM is an ApplP, as in Figure 3.4. (ApplP is equivalent to αP in López’s analysis in Figure 3.2). The feature [Fmark] is in the D of the DP and is checked by pe, whereas the feature [particularized] is checked by the accusative clitic in Appl.
Personal pronouns and names of animate entities systematically undergo DOM through ApplP, whereas animate DPs (nouns) are only preferably derived through ApplP. Categories at the end of the referentiality scale in (1) still show traces of Old Romanian. Unmodified nouns and specific indefinites project to DP with pe merged in D, as in Figure 3.5: the presence of pe forces the specific reading of the noun. Both [particularized] and [Fmark] bundled under D and mapped to pe in D.
The difference between DOM pe with unmodified versus modified nouns is that, for the former, pe merges in a collapsed D, whereas in the latter pe merges in a split D field (Bernstein et al., Reference Bernstein, Ordóñez and Roca2018). This analysis implies that Romanian pe does not involve Kayne’s generalization, while Spanish a does: Romanian pe is a DP-internal marker (Hill and Mardale, Reference Hill and Mardale2019), Spanish a is a vP-internal marker. Evidence for the different status of pe and DOM in Romanian compared to Spanish a comes from the fact that pe and the definite article are not allowed when the DP is unmodified (either the article or pe must be present but not both, as in (28) and (29)). Furthermore, Hill and Mardale (Reference Hill and Mardale2021) argue that Romanian clitic pronouns are not arguments as in Spanish, and do not affect the theta-role and case checking between V and the DP object.
To summarize, Modern Romanian has two mechanisms to mark DOM: with clitic doubling and pe-marking in an ApplP, as in Figure 3.4, or in a lower D position in a split DP for unmodified DPs, as in Figure 3.5. In the former, the features [particularized] and [Fmark] are spelled out in separate projections by the accusative clitic and the marker pe, respectively, whereas in the latter, both features are bundled and overtly expressed by the marker pe. I assume that native speakers of Romanian have and activate these two representations for DOM for comprehension and production.
3.3 A Note on Dative Subjects
Spanish and Hindi mark dative experiencer subjects with the same preposition/postposition marking DOM, (47) and (48), and apparently this is not coincidental but another crosslinguistic tendency in DOM languages (Bossong, Reference Bossong, Wanner and Kibbee1991). Romanian also uses pe-marking with some verbs, as in (49), and dative case with others, as in (50). These dative experiencer subjects lack volitional control, and express a range of thematic roles (experiencer, obligation, possession) that can be categorized broadly under the thematic role of experiencer.
| (47) | A | Juan | le | gusta | el | helado. |
| dat | Juan | Cl.dat | likes | ice | cream | |
| ‘Juan likes ice cream.’ | ||||||
| (48) | Ramesh-ko | kulfi | pasand | haiN |
| Ramesh-dat | icecream | like | aux-pres-pl | |
| ‘Ramesh likes icecream.’ | ||||
| (49) | Pe Camelia o interesează medicina. |
| pe Camelia Cl.acc. interest medicine | |
| ‘Camelia is interesed in medicine.’ |
| (50) | Lui Filip îi place fotbalul. |
| dat Filip Cl.dat. likes soccer | |
| ‘Filip likes soccer.’ |
Dative experiencer subjects can be considered instances of Differential Subject Marking (DSM) (Woolford, Reference Woolford2003, Reference Woolford2009) which, according to Aissen (Reference Aissen2003), Rodríguez-Mondoñedo (Reference Rodríguez-Mondoñedo2007) and Silverstein (Reference Silverstein and Dixon1976), is related to DOM. Differential Subject Marking (DSM) refers to subjects that have different case, agreement patterns, or occupy a different position in the syntax compared to canonical subjects (Aranovich, Reference Aranovich2009). These subjects are also lower in prominence compared to canonical subjects.
Like DOM, DSM is also conditioned by different parameters, having to do with the lexical semantics of particular verbs, the argument structure of the verb and the positions of the arguments, transitivity alternations, phonological constraints on the morphological realization of certain case features, and person/animacy features (Aranovich, Reference Aranovich2009). DSM can appear as a lexical or inherent case, such as ergative, genitive, or dative, depending on the language. Hindi is a split ergative language and ergativity is conditioned by perfectivity. Ergative marking can only appear on subjects of transitive-perfective verbs as in (51), which shows the ergative particle -ne on the subject Nikhil.
| (51) | Nikhil-ne | akhbaar | paRh-ii | hai. |
| Nikhil.masc.sg-Erg | newspaper.masc.sg.nom | read-perf.masc.sg | be.Pres.3Sg | |
| ‘Nikhil has read the newspaper.’ | ||||
The example in (52) shows a genitive subject in Icelandic:
| (52) | Jóns | nýtur | ekki | lengur | vid |
| Jons-gen | enjoys | not | longer | at | |
| ‘John is no longer available.’ | |||||
Aissen (Reference Aissen2003) and Silverstein (Reference Silverstein and Dixon1976) suggested that DOM and DSM may even be considered mirror images of each other. Woolford (Reference Aranovich2009), however, disagrees with this generalization, showing that DSM involving case is not a unified phenomenon. Woolford (Reference Woolford2003: p. 535) suggests that DSM is not related to DOM, citing the fact that the former does not make reference to multiple dimensions, i.e. to both animacy and definiteness. DOM is primarily triggered by features of the object NP, while DSM is regulated by transitivity, aspect, argument structure, and other features of the VP. The specific case of DSM discussed in this book is dative subject of psychological predicates (e.g. the verb like), which involve a non-structural lexical case associated with experiencers (non-agentive) at the level of argument structure (in the vP).
Standard generative theory divides case into two types, structural andnon-structural, which differ in their behavior and manner of licensing (Chomsky, Reference Chomsky1981). Nominative and accusative case are typical instances of structural case because they are licensed structurally by T or V, are often morphologically unmarked (i.e., phonologically null), and are independent of the thematic role of the DPs. Woolford (Reference Wray and Grace2006) proposes that non-structural case further subdivides into two types, lexical and inherent case, which also differ in their manner of licensing. The dative case of dative experiencers or dative subjects with psych predicates has been treated as lexical case because it is idiosyncratic; that is, related to a semantically coherent group of verbs with a particular thematic role (experiencer). The genitive is another example of lexical case.
Inherent case is also non-structural and semantically based (often linked to semantic roles), but unlike lexical case, it is relatively predictable and less idiosyncratic. Woolford (Reference Wray and Grace2006) argues that indirect objects and ergative subjects are marked with inherent case. The dative case of indirect objects is typically linked to goals and beneficiaries and considered by many to be an inherent case rather than structural case. Butt (Reference Butt2006), however, claims that crosslinguistically, goals and beneficiaries are realized as datives by default, and there is regularity and predictability in the realization of dative with indirect objects. In this study, I follow Butt and assume that indirect objects have structural dative case and that dative experiencers have lexical case.
The theoretical distinction between structural and lexical case may be relevant to predict and interpret differential effects of erosion of case marking in heritage speakers. In this study, we focus on the loss and erosion differential object marking (DOM) but some of our experimental materials also included stimuli with psych verbs, an instance of lexical case. In some of the instruments used to test knowledge of DOM in the studies presented in Chapters 6–8, some of the control sentences in the acceptability judgment task and in the elicited oral and written production tasks included dative experiencer subjects. We will see that lexical case is significantly more affected than structural case, as with indirect objects, in first- and second-generation Spanish speakers in the United States. We also show that incipient loss of dative case with psych verbs – manifested with the omission of the a in Spanish – is also evident in the monolingual variety of Spanish spoken in Mexico. This result is consistent with the view that, in addition to being perhaps partially explained by transfer from the majority language and incomplete language development or attrition in childhood, some aspects of heritage language development also find parallels in natural diachronic processes (Poplack and Levey, Reference Poplack, Levey, Auer and Schmidt2010), and that the language contact situation may accelerate natural processes of development already present in the monolingual variety (Silva-Corvalán, Reference Silva-Corvalán1994).
3.4 Summary
Table 3.7 summarizes the distribution of the element that marks DOM in Spanish, Hindi, and Romanian.
Table 3.7 Distribution of the Spanish preposition a, the Hindi postposition ko and the Romanian preposition pe in the syntax
| Spanish | Hindi | Romanian |
|---|---|---|
| preposition a | postposition -ko | preposition pe |
| animate specific direct objects | human specific direct objects | animate specific direct objects |
| dative (and locative) | dative | locative (and dative) |
| experiencer subjects | experiencer subjects | experiencer subjects |
In Spanish and Romanian DOM is sensitive to animacy (animate non-specific objects can be marked with a or pe), whereas in Hindi specificity is more relevant than animacy, since specific inanimate can be marked with -ko. The DOM markers in the three languages are also the dative case markers appearing with indirect objects and dative experiencer subjects regardless of the animacy, definiteness or specificity of the argument. In the case of Romanian, pe is also a locative preposition.
Before presenting the methodology and results of the studies, Chapter 4 considers the acquisition of DOM in monolingual and bilingual grammars, and the relationship of language acquisition and language change.






