Linguistic typology is the study of universals and diversity in human language. In seeking to determine the range of variation across languages, typologists chart the possibilities of human language. Various ‘types’ of language and linguistic structure have been proposed, and typologists are becoming more confident in their statements about what is expected or usual and what is exceptional in human language (Song Reference Song2011, Aikhenvald and Dixon Reference Aikhenvald and Dixon2017). Like in any area of the world, languages of mainland Southeast Asia are in some ways ordinary, having properties shared by many languages around the world, but are in other ways unusual, displaying rare or unexpected features. What makes any area unique in typological terms is not any single feature but a characteristic constellation of features.
2.1 Summary Points
The chapters of this book describe some of the important structural features of MSEA languages, defining an array of features found in the area. In this overview chapter, we begin with a summary list of some of the typological features that are found in the MSEA area.Footnote 1
Overview summary of sound system features
1. Vowel systems are large, showing many distinctions (it is sometimes difficult to determine how many vowels a system has; there are alternative analyses of features such as diphthongs and phonation splits).
2. Vowel phoneme systems often show a common underlying structure (canonically, nine-place, symmetrical, hi-mid-low by front-central-back).
3. Vowel systems often show a long versus short vowel distinction.
4. Many more consonants are possible in initial position than in final position.Footnote 2
5. There is a preference for one (major) syllable per word, with many languages featuring minor syllables or pre-syllables in an iambic pattern (see Pittayaporn Reference Pittayaporn, Enfield and Comrie2015, Butler Reference Butler, Enfield and Comrie2015, Post Reference Post, Enfield and Comrie2015, and Brunelle and Kirby Reference Brunelle, Kirby, Enfield and Comrie2015).
6. Lexical contrast is marked by laryngeal features, including pitch and phonation type, often in combination; lexically contrastive pitch and phonation type are strongly correlated in functional and historical terms.
7. Tone systems are complex (often with around six distinct tones, with tone counts for a language differing depending on the analysis chosen).
8. Phonation type systems usually distinguish two phonation registers, e.g., ‘clear’ versus ‘breathy’ or similar.
9. There is often a gap in voiced stop series at velar place of articulation (no voiced ‘g’).
10. Implosive consonants are found more frequently than elsewhere in the world.
Overview summary of morphosyntax-semantics system features
1. There is no inflectional morphology (no case, gender, number, or definiteness marked on noun phrases, no agreement or tense-marking on verbs); note that derivational morphology is widespread and sometimes highly productive in Austroasiatic languages of MSEA (see Alves Reference Alves, Enfield and Comrie2015, Jenny et al. Reference Jenny, Weber, Weymuth, Jenny and Sidwell2015).
2. Open-class items – mostly nouns and verbs – often serve functions that are expressed by dedicated functional morphemes (including bound morphology) in other languages, e.g., nominals as prepositions, verbs as aspect markers, comparative markers, adversative passive markers, and valence-changing devices (Clark and Prasithrathsint Reference Clark, Prasithrathsint, Ratanakul and Suwilai1985, Kölver Reference Kölver, Seiler and Premper1991, Ansaldo Reference Ansaldo1999).
3. Verb serialization (by which I informally mean a range of different kinds of predicative structures that use combinations of verbs) is used extensively, with a rich array of types and functions in each language (Bisang Reference Bisang, Seiler and Premper1991).
4. Order of major constituents of the clause tends to be relatively flexible within languages, sensitive to pragmatic factors (though verb–object constituent order is dominant); noun phrases tend to be head-initial, and may have discontinuous constituents, especially when classifiers are involved (‘Constituent order at the clause level is fluid’; Kruspe Reference Kruspe2004: 5).
5. Zero anaphora: noun phrases may often be elided when their referents are contextually retrievable (this, combined with flexibility in constituent order, results in quite variable surface options; for a case study see Enfield Reference Enfield2007: 271–284).
6. Topic–comment structure is used extensively at the clause level.
7. Languages often have a large set of labile or ambitransitive verbs, especially of the S=O type (sometimes called causative/inchoative or unaccusative; e.g., Lao hak2 can mean transitive ‘snap’ or intransitive ‘is/has been snapped’).
8. There are often rich inventories of sentence-final particles that make subtle distinctions in sentence type, stance, evidentiality, and combinations thereof.
9. There are often rich inventories of ideophones (or ‘expressives’) and other expressive forms, including rhyming four-syllable expressions, and productive elaborative rhyming devices.
10. Many languages feature numeral classifiers and related systems of nominal classification (see Blench Reference Blench, Enfield and Comrie2015).
11. A number of languages feature extremely complex pronominal systems, with multi-level social-deictic meaning distinctions.
Some of the most noteworthy commonalities among MSEA languages have to do with their lack of marking of certain semantico-grammatical categories. Most notably, as remarked upon in the list above, the languages lack inflectional morphology in the usual sense of that term (i.e., including agreement, case, gender/number/definiteness on noun phrases, tense-marking on verbs). It may be said that a lack of a feature should not be treated as a feature; for example, that we would not group English with Swahili on the basis that they both lack a category of dual pronouns, or verbal marking of evidentiality. True, but categories like dual pronouns and evidentiality, while common in the world’s languages, are in a sense marked categories, while the things that MSEA languages happen to lack are much more general and unmarked types of language structure, in particular the absence of any form of inflectional morphology. We can proceed with caution.
2.2 Morphological Profile
Morphological typology is concerned with how languages put words together (see Payne Reference Payne, Aikhenvald and Dixon2017). Every language has an ‘index of synthesis’, defined as the average number of morphemes per word in that language (Sapir Reference Sapir1921: 128, Payne Reference Payne, Aikhenvald and Dixon2017). At the far end of the scale of morphological synthesis are polysynthetic languages such as those of the Inuit-Aleut family, which have a morpheme-to-word ratio of ‘well over 3’ (Payne Reference Payne, Aikhenvald and Dixon2017: 87). English has a synthesis index of ‘approximately 1.7’ morphemes per word (Payne Reference Payne, Aikhenvald and Dixon2017: 86). At the low end of the scale is the logical extreme: a morpheme-to-word ratio of exactly 1, in other words, where all morphemes are stand-alone words. Languages that are positioned toward this end of the scale are referred to as isolating in type, and languages of mainland Southeast Asia are usually put forward as classic examples of languages with this low index of synthesis.
We shall see in later chapters that MSEA languages do not, in fact, quite show the logical extreme low ratio of one morpheme per word. One reason for this is that the MSEA languages that are typically cited in this regard include Vietnamese, Thai, and Chinese, and these languages are in a minority in MSEA in this respect; Austroasiatic languages, which make up nearly half of the languages in core MSEA, clearly show a higher index of synthesis, with their widespread use of derivational morphology (see Chapter 4). And even in oft-cited extreme isolating languages such as Vietnamese and Thai, certain rich and productive morphological processes are often overlooked in typological work.
When typologists think of complex morphology, they tend to think of markings for categories such as case (e.g., accusative versus nominative), person (e.g., first, second, or third person), or number (e.g., plural versus singular). MSEA languages notably lack morphological marking for these categories, but they make up for it with marking of meanings in other domains, such as the compounding and elaborate forms of reduplication described in Chapter 4.
In his famous characterization of morphological type, Sapir (Reference Sapir1921) used Vietnamese (which he termed Annamese) as the paradigm case of a language in which words are not morphologically complex. Many people still think this (but see Chapter 4 below for a case study of a complex derivational morphological system in Vietnamese). It is probably true that these languages are as far as you get on that scale, but they do not approach the logical extreme – i.e., the situation in which there are no words in the language with more than one morpheme. No language shows a pure isolating profile.
Some scholars acknowledge that there is significant morphological complexity in MSEA, along the same lines as Chinese compounding (Kratochvíl Reference Kratochvíl1968), but there is a domain of morphological complexity that is unappreciated in MSEA. This is what we may call elaborative reduplication. It is found in Tai languages like Lao, as well as in Austroasiatic languages like Vietnamese and Semelai. Another type of morphological complexity is the derivation morphology found in languages like Khmer (though unproductive) and Semelai and Khmu (fairly productive) and Kri (somewhere in between). The real generalization that can be made is that there is no obligatory inflectional morphology in MSEA languages, specifically no case-marking, no agreement marking, and no grammatically obligatory tense or aspect marking.
Another important point to register is that MSEA languages do not rely in a straightforward way on a simple constituent-order rule to distinguish between basic grammatical functions A, S, and O. The widespread idea that they do is, I think, based on the presumption that it must be the case, since there is no head-marking or dependent-marking to rely on for distinguishing between who and whom. It is assumed that hearers of isolating/analytic languages have no choice but to rely on strict constituent order to maintain informational coherence in predicate–argument relations. This claim is, however, weak at best, as (1) a language may have an array of different argument-structure constructions, with variation in the mapping of functional role to syntactic position, (2) extensive ellipsis (including zero anaphora) and movement create widespread surface ambiguity, even within examples of a single construction, as to the mapping of functional role to syntactic position, and (3) this variation and ambiguity does not appear to compromise communication. Accordingly, for Mandarin, Li and Thompson (Reference Li and Thompson1981: 26) state that ‘no basic word order can be established’. On Riau Indonesian, Gil (Reference Gil, Carnie, Harley and Dooley2005) argues that observed word-order patterns in the language are epiphenomenal (cf. also LaPolla Reference LaPolla1993).
Writing about constituent order in Khmu clauses, Suwilai (Reference Suwilai1987: 60–61) writes that A-V-O is ‘the normal order of transitive clauses’. She adds, however, that ‘for emphasis’ the object can often occur clause initially, resulting in O-A-V order being ‘also common’. See the following examples:Footnote 3
| rpàːng | nɔ̀ː | tám |
| gong | they | beat |
| ‘The gong, they beat (it).’ | ||
| Khmu | AA | Suwilai Reference Suwilai1987: 60 | ||
| kɔ́ːn | màˀ | nòːk |
| child | mother | beat |
| ‘The child was beaten by the mother.’ | ||
| Khmu | AA | Suwilai Reference Suwilai1987: 61 | ||
Suwilai adds that the ‘reversed’ order of the unmarked A-V-O – i.e., O-V-A – may be used ‘for special emphasis’. Systematic analysis of corpus data is needed in order to understand what ‘special emphasis’ means. Here are two examples with O-V-A order:
| sˀɔ̀ːŋ | pɛ́ŋ | ˀòˀ |
| tree | cut.down | 1 |
| ‘I cut down a tree.’ | ||
| Khmu | AA | Suwilai Reference Suwilai1987: 61 | ||
| réˀ | rìəm | ˀòˀ |
| farm | clear | 1 |
| ‘I clear the land (for crops).’ | ||
| Khmu | AA | Suwilai Reference Suwilai1987: 61 | ||
See Enfield (Reference Enfield2007) for details of a similar situation in Lao.
Having said that there is no case-marking in MSEA languages, I will note here that Kruspe describes a system in Semelai in which transitive subject arguments are marked with a proclitic (Kruspe Reference Kruspe2004: 259). Here is an example:
| ki=ɡoŋ | la=knlək=hn | bantal |
| 3a=bring | A=husband=3poss | pillow |
| ‘Her husband brought the pillow.’ | ||
| Semelai | AA | Kruspe Reference Kruspe2004: 259 | ||
It appears that no other MSEA system has this kind of marking. If it is a case system, it is a simple one. Note also that the relation between the proclitic and the A function in clausal syntax is not straightforward: ‘Not all post-verbal NPs marked with la= are coding the A; la= also expresses causation or reason’ (Kruspe Reference Kruspe2004: 259). This seems to suggest that la= marks the participant that is responsible for the occurrence of an event, regardless of whether it is core or peripheral.Footnote 4
The terminology of case-marking has also been used in LaPolla’s (Reference LaPolla, Thurgood and LaPolla2003) description of an ‘anti-ergative’ marker thàʔ in Lahu. The function of this marker is ‘to mark an animate argument that might otherwise be interpreted as an actor as being something other than an actor’ (LaPolla Reference LaPolla, Thurgood and LaPolla2003: 34). Here are two examples:
| ŋà | thàʔ | tâ | dɔ̂ʔ |
| 1sg | obj | neg.imp | hit |
| ‘Don’t hit me.’ | |||
| Lahu | TB | Matisoff Reference Matisoff1973a: 156; glossed by LaPolla (Reference LaPolla, Thurgood and LaPolla2003: 34) | |||
| liʔ | chi | ŋà | thà | pîʔ |
| book | that | 1sg | obj | give |
| ‘Give me that book.’ | ||||
| Lahu | TB | Matisoff Reference Matisoff1973a: 157; glossed by LaPolla (Reference LaPolla, Thurgood and LaPolla2003: 34). | ||||
The morpheme thàʔ marks ŋà as a patient in the first example, and as a recipient in the second.
And finally, in Chinese, the morpheme ba has sometimes been described as a special kind of case marker. Examples (2.8) and (2.9) have the same verbal and nominal elements. In example (2.8) the compound verb qi si serves as a high-effect transitive verb. Example (2.9) shows a rephrase of this using ba as a direct marker of the effected undergoer in a reshuffled constructional variant:
| Wo | qi | si | ta | le |
| I | angry | dead | 3SG | asp |
| ‘I made him/her angry to death.’ | ||||
| Chinese | SN | Liu Reference Liu2013: 2230 | ||||
| Wo | ba | ta | qi | si | le |
| I | ba | him/her | angry | dead | asp |
| ‘I made him/her angry to death.’ | |||||
| Chinese | SN | Liu Reference Liu2013: 2230 | |||||
Li and Yip (Reference Li, Yip and Plank1979) argue that ba is an absolutive case marker, but Liu (Reference Liu2013) argues against this position, and treats it as an object marker. Many other things have been said about ba, including that it is not a case marker at all but a not-so-vestigial serial verb (see Chapter 6).
2.3 Basic Phrasal Constituent-Order Patterns
We will examine in later chapters many of the typological features listed above. The rest of this chapter presents a preliminary overview of some of the basic features of constituent order in the organization of phrases and clauses in MSEA languages. Austroasiatic and Tai languages are generally head-initial, while Chinese, Tibeto-Burman, and Hmong-Mien exhibit a mix of head-initial and head-final structures. Languages to the north and west of the core MSEA area are generally head-final (Japanese, Mongolian, Uralic, Indo-Iranian, Dravidian). Here we see that the area is firmly split in terms of the direction of headedness in phrases (Comrie Reference Comrie and Kullavanijaya2007: 23). Tibeto-Burman languages, like East Asian languages (e.g., Japanese, Korean, Mongolian, but not Sinitic) and Indic languages, tend to show object–verb order in main clauses, and are, accordingly, head-final in other areas of phrasal grammar, such as in various nominal constructions, including relative clause constructions and possessive constructions. By contrast, languages of core MSEA, such as Khmer, Brou, or Semelai, show verb–object order in main clauses, and are similarly head-initial in other areas of phrasal grammar. We shall see that not only does the MSEA region show a differential distribution of head-initial and head-final structures, but such a differential distribution is observed within languages as well, resulting in systems whose mixed appearance is often attributed to language contact effects.
For example, while Tibeto-Burman languages are almost exclusively verb final in main clauses, Karenic languages depart from this pattern, in having verb–object clausal constituent order. Here are examples from Sgaw Karen, Pwo Karen, and Eastern Kayah Li:
| jə¹ | tɔ² | jə¹ | Ɉɔ¹ |
| 1.sg | hit | 1.sg | older brother |
| ‘I hit my older brother.’ | |||
| Sgaw Karen | TB | Lucey-Weinhold Reference Lucey-Weinhold and Brunelle2011: 79 | |||
| θa_ʔwa | thɛ_ | thwiː |
| Thawa | kick | dog |
| ‘Thawa kicked the dog.’ | ||
| Pwo Karen | TB | Kato Reference Kato, Thurgood and LaPolla2003: 639 | ||
| ʔa | chɯ̄ | ʔa | thwi |
| 3 | stab | 3 | dog |
| ‘He stabbed his own dog.’ | |||
| Eastern Kayah Li | TB | Solnit Reference Solnit1997: 149 | |||
This distinct profile of Karenic languages is suggested to be an effect of language contact, given that these languages have long been in close contact with Mon and Tai languages (Matisoff Reference Matisoff1991b).
Another example of diversity in constituent ordering within a language family comes from the Tai-Kadai languages. There are differences in the basic organization of phrases in Tai-Kadai languages spoken in China versus those spoken further south. While Tai-Kadai languages are verb–object in clause structure, there is variation in headedness of other types of structure, including relative clause constructions and classifier constructions (see cases below). Again, language contact effects are often posited as the likely cause (though specific causal hypotheses are not put forward).
Now we turn to some examples illustrating the differences in direction of headedness in some basic grammatical structures.
2.3.1 Relative Position of Verb and Object
In the order of constituents in a basic clausal predicate, languages of most MSEA language families show verb–object order, the only exception being Tibeto-Burman (except for the Karenic branch).
V-O
Following are examples showing V-O ordering in a range of MSEA languages.
| kñom | sɔm’aat | ptěah |
| 1sg | clean | house |
| ‘I clean the house.’ | ||
| Khmer | AA | Ehrman Reference Ehrman1972: 29 | ||
| Tus | dev | tom | tus | npua |
| clf | dog | bite | clf | pig |
| ‘The dog bit the pig.’ | ||||
| Hmong | HM | Clark Reference Clark and Bradley1989: 178 | ||||
| kéuih | ngoi | ngóh |
| 3sg | love | 1sg |
| ‘S/he loves me.’ | ||
| Cantonese | SN | Matthews and Yip Reference Matthews and Yip1994: 68 | ||
| tsɛ2 | mɔ6 | chi1 | fja1 |
| sister | he | comb | hair |
| ‘His sister was combing her hair.’ | |||
| Mulao | TK | Wang and Zheng Reference Wang and Zheng1993: 79 | |||
O-V
Less common is O-V ordering, as shown in these Tibeto-Burman language examples:
| yɔ̂ | yɛ̀ | te | chɛ̂ ve |
| 3sg | house | make | prog |
| ‘He is building a house.’ | |||
| Lahu | TB | Matisoff Reference Matisoff, Closs Traugott and Heine1991a: 404 | |||
2.3.2 Relative Position of Modifier and Head in Noun Phrases
In constructions that connect an adjective or equivalent property concept word to a head noun, again most languages show head-initial order. Notable exceptions are Sinitic languages, which show head-final order in many modifier–modified phrases; this is also the case in some non-Sinitic languages that are spoken in China.
N-A
The following examples show the predominant ordering of modifier and head in noun phrases:
| rapyā | kạriạ̄r |
| girl | beautiful |
| ‘the beautiful girl’ | |
| Paluang | AA | Milne Reference Milne1921: 38 | |
| hɔe' | hnòk |
| house | large |
| ‘a large house’ | |
| Mon | AA | Bauer Reference Bauer1982: 333 | |
| tsən1 | ɣaːi3 |
| hair | long |
| ‘long hair’ | |
| Mulao | TK | Wang and Zheng Reference Wang and Zheng1993: 82 | |
| yei | eì |
| water | cold |
| ‘cold water’ | |
| Burmese | TB | Okell Reference Okell1969: 43 | |
A-N
Less common in MSEA is adjective–noun order in noun phrases. This pattern is standard in Sinitic languages, such as in this example from the Sinitic language Taiwanese, in which the adjectival modifier precedes its nominal head:
| chit | tâi | sin | chhia |
| this | clf | new | car |
| ‘this new car’ | |||
| Taiwanese | SN | Lin Reference Lin2015: 150 | |||
This sort of structure is also found closer to the core MSEA area, often involving a particle that links the modifier and the head. In this example from Cantonese, the linking particle ge is used between the nominal head and its preposed modifier:
| dākyi | ge | gáu-jái |
| cute | lp | puppy |
| ‘cute puppy’ | ||
| Cantonese | SN | Matthews and Yip Reference Matthews and Yip1994: 88 | ||
Similar structures are found in Kam-Tai languages that are spoken in contact with Sinitic languages. Here is an example from Mulao (a Kam-Sui language spoken in China):
| səp8 | at7 | hən3 | lo4 | kɔ | ku3 |
| ten | clf | very | big | pcl | drum |
| ‘ten very big drums’ | |||||
| Mulao | TK | Wang and Zheng Reference Wang and Zheng1993: 87 | |||||
There are cases in which speakers of a single language can be observed using both head-final and head-initial structures for the same type of grammatical function. See Section 5.1 for further discussion.
Among Tibeto-Burman languages, head-initial N-A order in noun phrases is the usual pattern in the MSEA area. Tibeto-Burman languages spoken to the west of Northern Myanmar (considered here as part of greater MSEA but outside of core MSEA), do show A-N order. Map 2.1 (reproduced from Dryer Reference Dryer, Thurgood and LaPolla2003: 46) shows that A-N order in Tibeto-Burman languages is found only to the west of the Myanmar/India border (Dryer Reference Dryer, Thurgood and LaPolla2003: 45).

In Idu, for example (spoken in Arunachal Pradesh, Northeast India), ‘Adjectives generally precede the noun qualified’ (Pulu Reference Pulu1978: 16):
| icia | mazi |
| small | goat |
| ‘small goat’ | |
| Idu | TB | Pulu Reference Pulu1978: 16 | |
In Tshangla (spoken in Eastern Bhutan), going just beyond the perimeter of MSEA, adjectives ‘may either precede or follow the head noun’. ‘Pre-nominal adjectives are restrictive; post-nominal adjectives are non-restrictive and descriptive of the head’ (Andvik Reference Andvik, Thurgood and LaPolla2003: 441). Here are examples showing the two structures, respectively:
| dukpu | waktsa | khepa |
| poor | child | top |
| ‘the poor child’ (identification) | ||
| Tshangla | TB | Andvik Reference Andvik, Thurgood and LaPolla2003: 441 | ||
| waktsa | dukpu | khepa |
| child | poor | top |
| ‘the child, who is poor’ (description) | ||
| Tshangla | TB | Andvik Reference Andvik, Thurgood and LaPolla2003: 442 | ||
2.3.3 Relative Position of Noun and Relative Clause
The placement of relative clauses in relation to the head nouns that they modify generally shows the same pattern as head–modifier relations in noun phrases, discussed in the last section. In some languages, however, the Sinitic head-final pattern is followed.
N-RC
Following are examples from MSEA languages in which a relative clause or similar structure follows the head that it modifies:
| sách | mà | anh | mua | hôm-qua | ở | dâu |
| book | alp | 2sg | buy | yesterday | stay | where |
| ‘Where is the book you bought yesterday?’ | ||||||
| Vietnamese | AA | Capell Reference Capell and Liem1979: 10 | ||||||
| ib | tug | dev | tom | npua |
| one | clf | dog | bite | pig |
| ‘a dog who bites pigs’ | ||||
| Hmong | HM | Clark Reference Clark and Bradley1989: 183 | ||||
| n̥gwa1 | lik8 | naːn4 |
| dog | chase | deer |
| ‘a (deer-)hunting dog’ | ||
| Mulao | TK | Wang and Zheng Reference Wang and Zheng1993: 29 | ||
RC-N
In a subset of MSEA languages, mostly Sinitic and Tibeto-Burman, relative clauses precede their heads. Here are some examples:
| ngóh | chéng | ge | gūngyàhn |
| 1sg | hire | lp | maid |
| ‘the maid I hire’ | |||
| Cantonese | SN | Matthews and Yip Reference Matthews and Yip1994: 88 | |||
| mə-`hma-tɛ' | `sheshəya |
| not-mistake-makes | doctor |
| ‘a doctor who makes no mistakes’ | |
| Burmese | TB | Capell Reference Capell and Liem1979: 11 | |
| n̥aːu3 | twa3 | jəu6 | foŋ1 | jəu6 | kɣaːŋ1 | kɔ | ɣaːn2 | ŋwa4 |
| one | clf | both | tall | and | bright | pcl | house | tile |
| ‘a tiled house which is both tall and bright’ | ||||||||
| Mulao | TK | Wang and Zheng Reference Wang and Zheng1993: 87 | ||||||||
Note that Mulao, a Kam-Sui language spoken in Guangxi, appears in both lists, showing a mixed profile. See Section 5.1 for further discussion.
2.3.4 Relative Position of Possessor and Possessed
In possessive phrases, such as John’s house, possessors and possessed entities are combined in a single phrase. (We notate these as Pr and Pe respectively; Aikhenvald and Dixon Reference Aikhenvald and Dixon2013.) These structures are also used when expressing part–whole relations (cf. the arrow’s tip). Combining a possessor and a possessed entity can be done by morphological means, for example using an affix (such as English ’s in John’s house) or a dedicated pronoun (such as English his in his house). Or it may be done by syntactic/constructional means (such as English of in the roof of the house). Languages vary in their relative ordering of possessor and possessed in these phrases.
Pe–Pr
Following are some examples of possessive phrases in MSEA languages in which the possessed precedes the possessee, following the head-initial structure:
| pu4 | niu2 |
| father | 1sg |
| ‘my father’ | |
| Mulao | TK | Wang and Zheng Reference Wang and Zheng1993: 59 | |
Pr–Pe
And here are some languages that use the opposite ordering; these include not only Sinitic and Tibeto-Burman languages, but also Hmong-Mien languages and those Tai languages that are in intensive contact with Sinitic:
| koj | (lub) | tsev |
| 2sg | clf | house |
| ‘your house’ | ||
| Hmong | HM | Clark Reference Clark and Bradley1989: 185 | ||
| ngóh | sailóu |
| 1sg | YBro |
| ‘my younger brother’ | |
| Cantonese | SN | Matthews and Yip Reference Matthews and Yip1994: 107 | |
| hohksāang | ge | gājéung |
| student | poss | parents |
| ‘the student’s parents’ | ||
| Cantonese | SN | Matthews and Yip Reference Matthews and Yip1994: 107 | ||
| ɲa2 | təm1 tau2 |
| 2sg | chest |
| ‘your chest’ | |
| Mulao | TK | Wang and Zheng Reference Wang and Zheng1993: 62 | |
| ù tháñ-yé | ǎmyiñ |
| U-poss | view |
| ‘U Thant’s view’ | |
| Burmese | TB | Okell Reference Okell1969: 169 | |
2.3.5 Relative Position of Adposition and Noun
Non-core phrases express elements of a situation such as time or place. When added to the core of a clause, these are often marked by an adposition or equivalent. English non-core phrases are marked by prepositions such as at or in (e.g., John left at three o’clock or They cooked dinner in a hut). Languages differ with respect to whether their adpositions appear before or after the phrase that they link to the clausal core (hence, prepositions versus postpositions).
Preposition-N
Numerous MSEA languages follow the head-initial principle in prepositional (or equivalent) phrases, as shown in the following examples:Footnote 5
| sui53 | ȵau33 | te323 | mɐi31 | ȶa33 |
| sit | at | below | tree | that |
| ‘Sit underneath that tree.’ | ||||
| Dong | TK | Long and Zheng Reference Long and Zheng1998: 141 | ||||
N-Postposition
Other languages – notably, in the Sinitic and Tibeto-Burman language families – feature postpositions (though not necessarily exclusively, with varieties of Chinese combining noun-based postpositions with verb-based prepositions). Here are two examples:
| kéuih | hái | jáudim | chēutbihn | dáng | ngóh |
| 3sg | be.at | hotel | outside | wait | 1sg |
| ‘S/he is waiting for me outside the hotel.’ | |||||
| Cantonese | SN | Matthews and Yip Reference Matthews and Yip1994: 118 | |||||
| chò | kàɁ |
| here | loc |
| ‘here, hither, hence’ | |
| Lahu | TB | Matisoff Reference Matisoff1973a: 164 | |
2.3.6 Relative Position of Adjective and Standard of Comparison
In languages with comparative constructions (e.g., English this is old → this is older than that), the adjective (or equivalent property concept word) may be placed either before, or after, the standard of comparison.
Adj-SComp
The following examples show MSEA languages in which the standard of comparison follows the property concept word:Footnote 6
| foŋ1 | ta6 | ɲa2 |
| tall | over | 2sg |
| ‘taller than you’ | ||
| Mulao | TK | Wang and Zheng Reference Wang and Zheng1993: 56 | ||
| A-wàhn | leng | gwo | kéuih | mùihmúi |
| A-W | pretty | than | 3sg | sister |
| ‘Wan is prettier than her sister.’ | ||||
| Cantonese | SN | Matthews and Yip Reference Matthews and Yip1994 | ||||
SComp-Adj
As the following examples show, the opposite ordering is found in Sino-Tibetan languages, and in some languages that are in intense contact with them, such as Dong, a Tai language spoken in China:
| A-wàhn | béi | kéuih | mùihmúi | leng |
| A-W | than | 3sg | sister | pretty |
| ‘Wan is prettier than her (younger) sister.’ (formal) | ||||
| Cantonese | SN | Matthews and Yip Reference Matthews and Yip1994: 167 | ||||
| mau33 | pi323 | ȵa212 | phaŋ35 |
| 3sg | compare | 2sg | tall |
| ‘He is taller than you.’ | |||
| Dong | TK | Long and Zheng Reference Long and Zheng1998: 141 | |||
| ŋà | a-kɛ́ | N | V |
| I | more.than | n | v |
| ‘n is more v than me.’ | |||
| Lahu | TB | Matisoff Reference Matisoff1973a: 131 | |||
2.4 Sentence-Type Distinctions
All languages need to distinguish between the basic speech acts of asserting, questioning, and commanding. These speech-act types correlate with the grammatical sentence types declarative, interrogative, and imperative (Sadock and Zwicky Reference Sadock, Zwicky and Shopen1985). The languages of MSEA are fairly uniform in terms of how they signal these basic distinctions in sentence type. Interrogatives tend to be marked by in situ question words (for content questions) and sentence-final particles (for polar questions). Both imperatives and declaratives tend to be marked by sentence-final particles.
Among interrogatives, there are two major types: polar questions (‘yes–no’ questions) and content questions (‘WH’ questions). Polar questions in MSEA tend to be formed using final particles (which are sometimes clearly related to forms such as negators and existential markers). Here are examples from all of the major language families, with the final particle highlighted:
| noŋ[13] | tɕhi | vəʔ[3–4] |
| you | go | q? |
| ‘Did you go?’ | ||
| Shanghainese | SN | Zee and Xu Reference Zee, Xu, Thurgood and LaPolla2003: 142 | ||
| dzɑ33 | dzɑ21 | lɑ21 |
| rice | eat | q |
| ‘Have you eaten?’ | ||
| Lisu | TB | Bradley Reference Bradley, Thurgood and LaPolla2003: 234 | ||
| Nin | daih | nza? |
| nin33 | tai31 | dza33 |
| 3sg | come | ptc |
| ‘Is he coming?’ | ||
| Ruyuan Mien | HM | Liu Reference Liu2016: 130 | ||
| pɔ̀ | kùːɲ | plàːŋ | rɔ̀t | kì | há |
| 2pl | see | pn | come | prox | q |
| ‘Have you seen Plàaŋ coming here?’ | |||||
| Kammu | AA | Svantesson and Holmer Reference Svantesson, Holmer, Jenny and Sidwell2015: 971 | |||||
| ʔuŋ21khiu21 | sat24 | zaːjʔ33 | teʔ24 |
| doctor | true | come | q |
| ‘Is the doctor really coming?’ | |||
| Hainan Cham | AN | Thurgood et al. Reference Thurgood, Thurgood and Li2014: 222 | |||
| Chĭm | kung | rang | chuh | nơn | rang | mơk | mai |
| animal | that | 3.INDEF | to.hunt | that | 3.INDEF | to.bring | toward |
| ƀơng | hu | mĭn | |||||
| to.eat | to.be.able.to | yn.qm | |||||
| ‘Can they eat animals that they hunted?’ | |||||||
| Western (Cambodian) Cham | AN | Baumgartner Reference Baumgartner1998: 17 | |||||||
| ȵi2 | pə1 | ta6 | pak7kiŋ1 | ma6 |
| 2sg | go | asp | Beijing | part |
| ‘Have you been to Beijing?’ | ||||
| Chadong | TK | Li Reference Li, Diller, Edmondson and Luo2008: 610 | ||||
There are other means of forming polar questions in these languages. For example, in a number of Hmongic languages, a polar question is formed by placing a question word after the subject and before the verb, as in this White Hmong example:
| koj | puas | nqis | los | hauv | no | na? |
| 2sg | q | descend | come | inside | here | pcl |
| ‘Are you coming down here?’ | ||||||
| White Hmong | HM | Jarkey Reference Jarkey2015: 57 | ||||||
Another option found in Hmongic languages is a ‘V-not-V’ strategy, widely known from Sinitic languages (Clark Reference Clark and Bradley1989: 209). Here is an example from Miao:
| moŋ55 | moŋ11 | a55 | moŋ11 | nen35? |
| 2sg | go | not | go | sfp |
| ‘Do you want to go or not?’ | ||||
| Miao | HM | Clark Reference Clark and Bradley1989: 209 | ||||
We turn now to content questions. English has a typologically unusual pattern of forming content questions. This involves a construction in which question words such as ‘who’ or ‘what’ occupy a distinct syntactic position: We tend not to say ‘You ate what?’ but rather ‘What did you eat?’. In MSEA languages, by contrast, content questions are formed by means of in situ question words, that is, question words that appear in the same slot that their corresponding referring expressions would appear in. Here are some examples of MSEA languages content questions with in situ question words:
| noŋ[13] | ma | tsɹ̙[1–3] | saməʔzr̙[3-5-1] |
| you | bought | asp | what? |
| ‘What did you buy?’ | |||
| Shanghainese | SN | Zee and Xu Reference Zee, Xu, Thurgood and LaPolla2003: 142 | |||
| /ɑ44le44 | (thi21) | mɑ44 | lɑ33 | ɑ44 | (ɑ21)/[ɑ41] |
| which? | one | cl | come | impfv/ q | |
| ‘Which one is coming?’ | |||||
| Lisu | TB | Bradley Reference Bradley, Thurgood and LaPolla2003: 234 | |||||
| mouj2 | kɔŋ1 | haj4kaj1 |
| 2sg | say | what |
| ‘What did you say?’ | ||
| Vietnamese Mien | HM | Doan and Mai Reference Doan and Mai1992: 98 | ||
| ʔɑ̀ʔ | cə̀ | pə̀ʔ | mə́h |
| 1du | irr | eat | what |
| ‘What shall we eat?’ | |||
| Kammu | AA | Svantesson and Holmer Reference Svantesson, Holmer, Jenny and Sidwell2015: 972 | |||
| man2 | van1ʔȵuŋ1 | ȵaːu6 | ju1kaːi1 | tshjeŋ5 | ni4nam2 | ʔnaːk7 | se1 | ʔni3 |
| 3sg | yesterday | at | town | sing | what | to | 2pl | listen |
| ‘What was s/he singing to you in town yesterday?’ | ||||||||
| Maonan | TK | Lu Reference Lu2008: 179 | ||||||||
| kaw33 | ʔa33zaːŋʔ33 | lə21 |
| 1 | who | pfv |
| ‘Who am I?’ | ||
| Hainan Cham | AN | Thurgood et al. Reference Thurgood, Thurgood and Li2014: 226 | ||
2.5 Sentence-Final Particles
All languages make use, to a greater or lesser extent, of sentence-final elements, including tag questions and particles of various kinds, to mark sentence types or to define the pragmatic meaning of utterances. Speakers of MSEA languages rely heavily on sets of sentence-final particles in language usage, especially in dialogue. The above examples of polar questions all illustrated the use of sentence-final particles for grammatical marking.
Inventories of sentence-final particles in MSEA languages are large and structured systems. In Lao, for example, more than twenty-five particles are divided into three main categories depending on the type of sentence function they are associated with – interrogative, factive, imperative – showing differences in meaning within those categories, invoking subtleties of aspect, evidentiality, and illocutionary force (Enfield Reference Enfield2007: ch. 4). In Cantonese, there are some thirty basic particles, with over seventy possible combinations of two or three of these (Kwok Reference Kwok1984, Luke Reference Luke1990). In their summary of the system, Matthews and Yip (Reference Matthews and Yip1994: 338–356) identify a set of features that are typical of these systems across MSEA:
Features of Cantonese sentence-final particles (SFPs; Matthews and Yip Reference Matthews and Yip1994: 338)
SFPs indicate speech-act type distinctions (interrogative, declarative, imperative), as well as distinctions in evidentiality, and affective and emotional colouring.
SFPs convey the kind of information that is conveyed by intonation in other languages (such as English) (sometimes said to compensate for lack of freedom to intone, due to lexical tone).
SFPs are very frequent and important in colloquial speech.
Analysts find it difficult to agree on whether SFPs belong in semantics, pragmatics, or discourse.
SFPs sometimes have a non-arbitrary relationship to tone (some have claimed that there are ‘tonal particles’ – this has been claimed for Mandarin which has only about seven particles, which can’t stack); many of the Cantonese particles have ‘tonal variants’; ‘Typically, the high-tone variants are more tentative, the low-tone ones assertive and the mid-tone ones neutral’.
Here are further examples, also including other sentence types such as declaratives and directives.
| husb: | léih | mhóu | joi | máaih | yéh | la | ||
| you | don’t | again | buy | stuff | pcl | |||
| ‘Don’t you go buying more stuff.’ | ||||||||
| wife: | ngóh | tái-háh | jē | ma | ||||
| I | look-del | pcl(playing:down) | pcl(of course) | |||||
| ‘I’m just going to take a look.’ | ||||||||
| Cantonese | SN | Matthews and Yip Reference Matthews and Yip1994: 344 | ||||||||
| Kuv | tsis | hnov | koj | hais | os |
| I | not | hear | you | say | pcl(emph) |
| ‘I didn’t hear what you said!’ | |||||
| Hmong | HM | Clark Reference Clark and Bradley1989: 182 | |||||
| Ngày mai | anh | có | đi | không? | Đi | chứ |
| tomorrow | you | have | go | pcl(q) | go | pcl(sure) |
| ‘Are you going tomorrow? Of course I’m going.’ | ||||||
| Vietnamese | AA | Clark Reference Clark and Bradley1989: 182 | ||||||
| sam1 | naen1 | seic | mans | nyac | jas | dung1 | xogx | laeuc | bac |
| three | clf | cake | yellow | those | Det. | cook | good | sfp1 | sfp2 |
| ‘I guess those three yellow cakes should have been cooked through (now)?’ | |||||||||
| Kam | TK | Yang and Edmonson Reference Yang, Edmonson, Diller, Edmondson and Luo2008: 570 | |||||||||
2.6 Expressive Language
All languages have words, structures, and other resources for expressing aspects of first-person experience, such as sensory and emotional experience. Here we survey the phenomena of (1) ideophones and (2) poetic idioms.
2.6.1 Ideophones/Expressives
Many Southeast Asian languages have a distinct word class of ideophones, which have their own syntactic behaviour. For example, although they are semantically verb-like, because they encode information about actions, properties, and states of affairs, they are often very limited in the kinds of tense/aspect/modality marking (if any) they can take. Another term for these words is expressives. This term is widely used in MSEA linguistics, following Diffloth’s pioneering work (Diffloth Reference Diffloth1972).
Dingemanse (Reference Dingemanse2012) defines ideophones as ‘marked words that depict sensory imagery’. He notes that they are found in languages around the world: ‘Many natural languages have words depictive of sensory perceptions like Japanese nyoro nyoro ‘wriggling motion’ and tsuru tsuru ‘smooth surface’ (Gomi Reference Gomi1989), Semelai rɔ̃prãp ‘something large walking through twigs’ and cərãlãp ‘sound of someone/something entering the undergrowth’ (Kruspe Reference Kruspe2004), and Gbeya ɛlɛlɛ ‘hair waving gently in a breeze’ and ɓakat ɓakat ‘sound of sandal flapping’ (Samarin Reference Samarin1970). As these examples suggest, ideophones often have highly specific semantic values, associated with modification of very particular property concepts (e.g., colours, states) or events. Ideophones are often separate lexical items (usually with two syllables, showing a rhyming or alliterative pattern), which function as adverbials, often with highly evocative meanings.Footnote 7 Writing of Cantonese, Bodomo states that an ideophone ‘mainly serves to help express the vivaciousness of one’s feeling towards the entity one wants to describe’ (Bodomo Reference Bodomo2006: 209, original emphasis).
Table 2.1 supplies data from Sgaw Karen, illustrating two of the quintessential properties of ideophones: (1) highly specific imagistic meanings and (2) rhyming and alliterative sound structures.
| bla2bla2 | to have no taste such as in badly prepared soup |
| səli2sə1la1 | to spread out in different directions |
| kəʔ1di2kəʔda2 | to flee from danger in a hasty and unorganized fashion |
| shɣɛʔ3shɣɨ2shɛ2daʔ3 | (of sound or actions that are) loud and swift |
| kiʔ1kiʔ | moving (like the surface of water) |
| pəʔ1cɨ1pəʔ1te2 | (to wait/to see with) impatience |
| brəʔ1briʔ3 | to have the inside out (such as shirt, bag) |
| ko1lo1 | (to come) in groups |
| po1liʔpo1ɣɔ2 | worn out (like old shirts or rags) |
| lɔ2swɛʔ3lɔ2swɔ2 | serious wound that was torn out (as by a tiger) |
| do2mo2do2khlo2 | bumpy (road) |
| kəʔ1jɔ2kəʔ1jɔ2 | (to cry with) low voice |
| kə1lɔʔkə1laʔ1 | to do things in a sloppy way |
| shɛʔ3ʔɛʔ3 | dishonestly clever |
| kho2do2 | (to stay) quietly without care nor enthusiasm |
| kə1bɛʔ3kə1bɛʔ3 | of actions that are slow |
| kəʔphu1li2lu2 | describing actions/situations that are unorderly |
| cɔʔ1klɔʔ1 | describing actions that are clumsy |
| kəʔ1swe2kəʔ1swɔ2 | describing soil that can let water pass freely |
| shiʔ3kɛ2 | for a little moment |
| pə1diʔ | (to do things) bit by bit |
| tɛ2xɛ2 | see-through (describing paper, cloth etc.) |
| sə1xiʔ1diʔ1 | (to do things) with care/attention to details |
Turning to a Tai-Kadai language, some examples from Lao show ways in which ideophones can convey fine distinctions in meaning in specific semantic fields. The following examples show distinct ideophones describing patterns of baldness (as modifiers of the verb/adjective laan4 ‘bald’):
| laan4 | qùù4-khùù4 |
| bald | expr |
| ‘completely bald’ | |
| Lao | TK | |
| laan4 | khim1-mim1 |
| bald | expr |
| ‘lightly balding’ (‘with a frog by the pond’) | |
| Lao | TK | |
| laan4 | sameng4-kheng4 |
| bald | expr |
| ‘bald across the whole dome’ (‘swidden in the jungle’) | |
| Lao | TK | |
| laan4 | samook5-khook5 |
| bald | expr |
| ‘bald, with fuzz all over, and a protruding forehead and back of head’ | |
| Lao | TK | |
The next two examples describe contrasting ways in which a person can be seen standing:
| jùùn3 | tòk2-pòk2 |
| stand | expr |
| ‘of old lady, “skinnily” standing, angular.’ | |
| Lao | TK | |
| jùùn3 | sêê1-lêê1 |
| stand | expr |
| ‘of someone standing, stooped over “all alone, no-one talking to them”.’ | |
| Lao | TK | |
Here are two examples describing manners in which a person can be seen lying down:
| nòòn2 | khòò5-lòò5 |
| lie | expr |
| ‘lying down like a child asleep’ | |
| Lao | TK | |
| nòòn2 | qêêk1-lêêk4 |
| lie | expr |
| ‘lying down lazily, not doing anything’ | |
| Lao | TK | |
The following three examples describe some ways a person’s hair may appear (note the use of hêt1 ‘do/make’ as a dummy verb that the ideophone can modify):
| phom3 | hêt1 | keng4-ceng4 |
| hair | do/make | expr |
| ‘of messed up hair, “uncombed, not flat, going this way and that”’ | ||
| Lao | TK | ||
| phom3 | hêt1 | phùk1-vùk1 |
| hair | do/make | expr |
| ‘of hair bent from being slept on’ | ||
| Lao | TK | ||
| phom3 | hêt1 | khuk1-ñuk1 |
| hair | do/make | expr |
| ‘of curly, flowing, hair’ | ||
| Lao | TK | ||
The next two examples describe different perceptions of the colour white:
| khaaw3 | pen4-ven4 |
| white | expr |
| ‘white, of face or body, pale, like too much powder applied’ | |
| Lao | TK | |
| khaaw3 | còn1-phòn1 |
| white | expr |
| ‘white, of an old person’s completely white hair’ | |
| Lao | TK | |
And finally, here are some ideophones describing different possible sizes of a hole or perforation:
| huu2 | hêt1 | cing1-ping1 |
| hole | do/make | expr |
| ‘of a hole, tiny, cannot put finger in’ | ||
| Lao | TK | ||
| huu2 | hêt1 | còng1-pòng1 |
| hole | do/make | expr |
| ‘of a hole, like one from snails in a lettuce, can put a finger in’ | ||
| Lao | TK | ||
| huu2 | hêt1 | côông-pôông1 |
| hole | do/make | expr |
| ‘of a hole, can put hand in’ | ||
| Lao | TK | ||
As these examples should suggest, ideophones are poetic and evocative in nature. Accordingly, their usage is appreciated as a matter of style. Thomas (Reference Thomas1971: 155ff) notes that ‘the judicious use’ of ideophones in Chrau (AA) ‘is a mark of an expert story teller.’ Table 2.2 lists some Chrau examples.
| hǔl hǔl | sitting quietly |
| khǔch khǔch | many fish |
| khyơng khyơng | walking stiffly |
| klơ klơ | darkly, unclear |
| lǎq lǎq | sitting still, sick |
| caprǎh caprǎng | scattered |
| ravênh rawai | dizzy |
| mbǎq mban | unskillfully |
| mlǔq mlǎq | dirty |
Following are further examples from languages across the MSEA area:
| plig plawg ‘bird rising from a nest on the ground’, as in: | |||||
| ib | pab | noog | ya | plig | plawg |
| one | group | bird | fly | ideo | |
| ‘a flock of birds fly by making a great flapping’ | |||||
| White Hmong | HM | Ratliff Reference Ratliff1992: 220 | |||||
| paːi3 | n̥iu5n̥iu5 |
| wag | ideo |
| ‘to wag (the tail)’ | |
| Chadong | TK | Li Reference Li, Diller, Edmondson and Luo2008: 608 | |
| ʔia33 | lio21 | phia21 | phia21 |
| water | flow | gurgle | gurgle |
| ‘Water flows gurgling.’ | |||
| Hainan Cham | AN | Thurgood et al. Reference Thurgood, Thurgood and Li2014: 190 | |||
| cʰŋis |
| ‘the smell of ammonia, urine’ |
| Semelai | AA | Kruspe Reference Kruspe2004: 396 |
| crãlãp |
| ‘the sound of something entering the undergrowth’ |
| Semelai | AA | Kruspe Reference Kruspe2004: 396 |
| cnayur |
| ‘with the appearance of bushiness in several places’ |
| Semai | AA | Kruspe Reference Kruspe2004: 396 |
| kəralɔɡ lɔɡ |
| ‘with the sound of heavy footsteps’ |
| Temiar | AA | Kruspe Reference Kruspe2004: 397 |
2.6.2 Poetic Idioms (Four-Syllable Expressions)
Another form of expressive language that is widespread in MSEA languages involves four-syllable expressions which combine patterns of rhyming or alliteration with patterns of semantic interrelations such as parallelism, association, and combinations of metaphor and metonymy. These are well known and extensively documented in Sinitic languages (in Mandarin, these four-syllable expressions are called cheng yu).
Liem (Reference Liem1970) lists over 800 four-syllable expressions in Vietnamese, dividing them into a range of formal categories. For example, one set is of the form verb–object verb–object, two-verb phrases with meanings that together point to the higher-level meaning of the expression as a whole. Here are two examples:
| buôn | mây | bán | gió |
| buy | cloud | sell | wind |
| ‘engage in risky commerce’ | |||
| Vietnamese | AA | Liem Reference Liem1970: 9 | |||
| cày | mây | cuốc | gió |
| plough | cloud | hoe | wind |
| ‘have a free life in the open air’ | |||
| Vietnamese | AA | Liem Reference Liem1970: 9 | |||
Another set takes the form noun-adjective noun-adjective, two noun phrases with meanings that together point to the higher-level meaning of the expression as a whole. Here are two examples:
| ma | thiêng | nước | độc |
| ghost | powerful | water | poison |
| ‘of the highlands’ | |||
| Vietnamese | AA | Liem Reference Liem1970: 21 | |||
| mặt | to | tai | lớn |
| face | big | ear | long |
| ‘with power, influence’ | |||
| Vietnamese | AA | Liem Reference Liem1970: 21 | |||
The following examples show patterns in another language family, Tai-Kadai:
| khiaw3 sot2 ngot1 ngaam2 |
| green fresh euph beautiful |
| ‘luscious, fresh and green’ (as the Lao countryside in the rainy season) |
| Lao | TK | Enfield Reference Enfield2007: 305 |
| weŋ 212 | kau 323 | weŋ 212 | kha 35 |
| horizontal | head | horizontal | ear |
| ‘do not listen to reason’ | |||
| Dong | TK | Long and Zheng Reference Long and Zheng1998: 162 | |||
