Hostname: page-component-6766d58669-nf276 Total loading time: 0 Render date: 2026-05-20T16:35:24.366Z Has data issue: false hasContentIssue false

Baima

Published online by Cambridge University Press:  18 November 2021

Katia Chirkova
Affiliation:
Centre National de la Recherche Scientifique, Centre de Recherches Linguistiques sur l’Asie Orientale & Nankai University, School of Literature katia.chirkova@gmail.com
Tanja Kocjančič Antolík
Affiliation:
Charles University, Faculty of Arts, Institute of Phonetics tkocjancic@gmail.com
Angélique Amelot
Affiliation:
Centre National de la Recherche Scientifique, Laboratoire de Phonétique et Phonologie angelique.amelot@gmail.com
Rights & Permissions [Opens in a new window]

Extract

The Baima language (/pêkê/, Chinese 白马语 báimăyŭ, ISO-639 code bqh) is a little-studied Tibeto-Burman (Bodic or Himalayish) language spoken in the southwestern part of the People’s Republic of China. Approximately 10,000 people, who traditionally reside in three counties in Sichuan Province (Pingwu 平武, Songpan 松潘 (in Written Tibetan, hereafter WT, zung chu), and Jiuzhaigou 九寨沟 (WT gzi rtsa sde dgu), and in one county in Gansu Province (Wenxian 文县), speak the Baima language (see Sichuan Sheng Minzu Yanjiusuo 1980, Zeng & Xiao 1987), see Figure 1.1 The largest concentrations of Baima speakers are in Baima Township (白马乡, Baima /tôpû/) of Pingwu County, and in Tielou Township (铁楼乡) of Wenxian County.

Information

Type
Illustration of the IPA
Copyright
© The Author(s), 2021. Published by Cambridge University Press on behalf of the International Phonetic Association
Figure 0

Figure 1 Distribution of the Baima language (adapted from Sichuan Sheng Minzu Yanjiusuo 1980). Black dots with white centers indicate county seats, black dots indicate townships, shaded areas indicate Baima villages.

Figure 1

Table 1 Phonation types associated with different syllable onsets and tones. T = voiceless unaspirated obstruent, Tʰ = voiceless aspirated obstruent, D = (non-nasalized) voiced obstruent, ND = prenasalized stop or affricate, S = sonorant, V = vowel. Shaded cells show non-occurring combinations.

Figure 2

Figure 2 Waveforms, spectrograms, and pitch tracks for the two repetitions of the words ‘horn’ (2a); ‘mountain peak’ (2b), and ‘to filter’ (2c).

Figure 3

Figure 3 Time courses of HNR05 (dB) (3a), H1*–H2* (dB) (3b), SoE (3c), F1(Hz) (3d), and f0 (Hz) (3e) for vowels in the three minimal pairs with voiced stop onsets (in black) and voiceless unaspirated stop onsets (in gray) at the alveolar and velar places of articulation before the vowels and /ɑ/, uttered in isolation. The x-axis represents vowel duration divided into five regions: 0–20$\%$, 20–40$\%$, 40–60$\%$, 60–80$\%$, 80–100$\%$. The number of analyzed tokens is 18.

Figure 4

Figure 4 Time courses of HNR05 (dB) (4a), H1–H2 (dB) (4b left), H1*–H2* (dB) (4b right), SoE (4c), F1 (Hz) (d), and f0 (Hz) (4e) in three (near) minimal pairs with epiglottalized nasals (in black) and non-epiglottalized nasals (in gray). The x-axis represents onset and vowel duration divided, respectively, into five regions: 0–20$\%$, 20–40$\%$, 40–60$\%$, 60–80$\%$, 80–100$\%$. Results for HNR05 (dB), SoE, and f0 (Hz) are presented in the same plot. Results for the difference in amplitude between the first and second harmonics are presented separately for the onset (uncorrected measures, H1–H2) and for the vowel (corrected measures, H1*–H2*). F1 (Hz) measures are only taken over the vowel. Based on three (near) minimal pairs at the bilabial, alveolar, and velar places of articulation before the vowels and /ɑ/, uttered in isolation. The total number of tokens is 18.

Figure 5

Figure 5 Waveforms and spectrograms for the words //ʈ͡ɽē/ ‘mule’, /ʈ͡ɽʰê/ ‘to meet (PFV/IMP)’, and /ɳɖ͡ɽē/ ‘rice’.

Figure 6

Figure 6 Waveform and spectrogram for the word /ɖ͡ɽì/ ‘to roll’. Arrows indicate contacts between the tip of the tongue and the roof of the mouth.

Figure 7

Table 2 COG values (Hz) for the spectral slices taken from the middle of the frication portion of [t͡s], [t͡ʃ], [t͡ɕ], [ʈ͡ɽ]. Based on 89 tokens.

Figure 8

Figure 7 Mean F3 values of the vowel /a̠/, following voiceless unaspirated alveolar, postalveolar, alveolo-palatal, and retroflex affricates (left), and mean F3 values of the vowel /e/, following voiceless aspirated alveolar, postalveolar, alveolo-palatal, and retroflex affricates (right). Based on two near-minimal sets: (1) /t͡sâ̠/ ‘grass’ – /t͡ʃâ̠/ ‘tea’ – /t͡ɕâ̠/ ‘hair’ – /ʈ͡ɽâ̠/ ‘to sever, cut’ (three repetitions per word, a total of 12 tokens), and (2) /t͡sʰê/ ‘winnowing fork’ – /t͡ʃʰê/ ‘to be tired’ – /ʈ͡ɽʰê/ ‘to meet (PFV/IMP)’ (three repetitions per word, a total of 9 tokens). The x-axis represents vowel duration divided into five regions: 0–20$\%$, 20–40$\%$, 40–60$\%$, 60–80$\%$, 80–100$\%$. /t͡s/ in black dotted line, /t͡ʃ/ in gray solid line, /t͡ɕ/ in gray dashed line, /ʈ͡ɽ/ in black solid line.

Figure 9

Figure 8 Waveforms and spectrograms for the minimal set /sâ̠/ ‘to itch’, /sʰâ̠/ ‘earth, soil’, /zâ̠/ ‘to filter’. The dashed lines mark the onset of frication noise and the onset of voicing, the solid line marks the beginning of the aspiration period in the voiceless aspirated fricative /sʰ/.

Figure 10

Table 3 Number of tokens analyzed, mean duration and mean standard deviation (in ms) of voiceless unaspirated, voiceless aspirated, and voiced fricatives at the alveolar, postalveolar, and alveolo-palatal places of articulation.

Figure 11

Figure 9 Waveforms and spectrograms for the words /rē/ ‘cloth’, /rô/ ‘corpse’, and /râ̠/ ‘goat’ with arrows indicating contacts between the tip of the tongue and the roof of the mouth.

Figure 12

Figure 10 Waveforms, spectrograms, and formant tracks for the words [rr̩ɘ̂] ‘snake’ and [zz̩ɘ̂] ‘leopard’.

Figure 13

Figure 11 Acoustic vowel plots showing vowel phonemes with ellipses fit to the data and average F1–F2 values (left). Acoustic vowel plots showing vowel phonemes with ellipses fit to the data and average F2–F3 values (right). Formants were sampled at the token midpoint using the website Visible Vowels (https://www.visiblevowels.org). Based on 245 tokens across different initial consonants and tones from the Baima recordings of one male speaker in this Illustration.

Figure 14

Table 4 Examples of Baima monophthongs in words with alveolar stop initial consonants in the three tones.

Figure 15

Figure 12 Average F1–F2 values for native Baima diphthongs. Based on 24 tokens.

Figure 16

Table 5 Examples of diphthongs in words with alveolar onsets in the three tones.

Figure 17

Figure 13 Pitch contours and vowel duration in the three tones. Based on the minimal triplet /ɲɛ̂/ ‘fire’, /ɲɛ̄/ ‘to sleep’, and /ɲɛ̀/ ‘mountain god’ (mean of three repetitions per word, a total of 9 tokens), uttered in isolation by our male speaker. High falling tone (HF) in solid line, mid tone (M) in dotted line, low tone (L) in dashed line.

Figure 18

Figure 14 Mean duration and mean standard deviation (in s) of Baima monophthongs, based on 245 tokens across different initial consonants and tones from the Baima recordings of one male speaker in this Illustration.

Figure 19

Table 6 Examples of tone contrasts on words with sonorant and prenasalized stop and affricate onsets.

Figure 20

Figure 15 Time courses of HNR05 (dB) (left) and H1*–H2* (dB) (right) in two minimal triplets for tone: (i) /ɲɛ̂/ ‘fire’, /ɲɛ̄/ ‘to sleep’, /ɲɛ̀/ ‘mountain god’, and (ii) /nô/ ‘inside’, /nō/ ‘heaven, sky’, /nò/ ‘to exist’ (with three repetitions per word, a total of 18 tokens). The x-axis represents vowel duration divided into five regions: 0–20$\%$, 20–40$\%$, 40–60$\%$, 60–80$\%$, 80–100$\%$. High falling tone in black solid line, mid tone in light gray dotted line, low tone in gray dashed line.

Figure 21

Table 7 Examples of disyllabic words with nine possible combinations of tones.

Supplementary material: File

Chirkova et al. supplementary material

Chirkova et al. supplementary material

Download Chirkova et al. supplementary material(File)
File 22.6 MB