Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-07T15:13:14.332Z Has data issue: false hasContentIssue false

Acoustic Correlates of Mizo Tones

Published online by Cambridge University Press:  09 December 2025

Wendy Lalhminghlui
Affiliation:
Department of Humanities and Social Sciences, Indian Institute of Technology Guwahati, Assam, India
Priyankoo Sarmah*
Affiliation:
Department of Humanities and Social Sciences, Indian Institute of Technology Guwahati, Assam, India
*
*Corresponding author. Email: priyankoo@iitg.ac.in
Rights & Permissions [Opens in a new window]

Abstract

Mizo (ISO 639-3 code: lus) is a Tibeto-Burman tone language spoken in Mizoram, India. This work provides an acoustic-phonetic description of Mizo tones spoken in Aizawl. The acoustic features of Mizo tones are modelled after the four tones in the language. The patterns of the f0 contours of the four Mizo tones in this study indicate that three have dynamic f0 contours. The analysis also shows that the f0 slope is crucial in distinguishing the four Mizo tones. Discrete Cosine Transform is used to obtain the average f0 and the f0 slope features of the Mizo tone contours represented by the first three Discrete Cosine Transform coefficients. The first three coefficients of the Discrete Cosine Transform, which are associated with the average f0 and the f0 slope of the four Mizo tone f0 contours, along with the tonal duration, can automatically classify the Mizo tones with an average accuracy of 87.12% using a quadratic discriminant analysis.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The International Phonetic Association
Figure 0

Table 1. Mizo tonal inventories reported in the previous studies

Figure 1

Table 2. Tonal minimal and near minimal sets in Mizo

Figure 2

Figure 1. Average raw f0 contours of the four Mizo tones in all contexts, plotted by gender.

Figure 3

Figure 2. Average normalized f0 contours of Mizo tones in four words in isolation.

Figure 4

Figure 3. Average normalized f0 contours of Mizo tones in four words in sentences.

Figure 5

Figure 4. Average normalized f0 contours of Mizo tones in four words in semantic context.

Figure 6

Figure 5. Tone contours of the four Mizo tones in semitones by gender.

Figure 7

Figure 6. Mean and standard error of raw f0 contours of four Mizo tones produced in isolation.

Figure 8

Figure 7. Duration of Mizo tones in the three contexts excluding tokens with long vowels.

Figure 9

Table 3. Average duration of Mizo tones (in milliseconds) for each word and context, with standard deviations in parentheses

Figure 10

Figure 8. The normalized f0 contours of the four Mizo tones averaged from three contexts.

Figure 11

Table 4. Wald χ2 tests on duration model on the entire database. Asterisk indicates p < .001.

Figure 12

Table 5. Summary of comparison of tone duration in short vowels. Statistical significance is indicated by an asterisk while non-significance is indicated by n.s.

Figure 13

Table 6. Average C1 and C2 values for Mizo tones in the three contexts

Figure 14

Figure 9. Reconstructed f0 contours from DCT coefficients of Mizo tones.

Figure 15

Table 7. Accuracy of classification of tones using QDA (values in %)

Figure 16

Table 8. Wald χ2 tests on C0, C1 and C2 models

Figure 17

Figure 10. Interaction plot for C2 showing the interaction of Tone × Context × Gender.

Figure 18

Figure 11. Tone f0 contours produced in isolation, reconstructed using DCT coefficients.

Figure 19

Table 9. Comparison of Mizo tones in terms of three DCT coefficients

Figure 20

Table 10. The four Mizo tones represented in Chaos five-point tone scale number

Figure 21

Figure 12. The f0 contours of four Mizo tones in isolation plotted with Chaos tone numerals.

Figure 22

Table A1. Words and sentences recorded from Mizo speakers

Figure 23

Table B1. Wald χ2 tests on duration model excluding the tokens with long vowels. Asterisk indicates, p < .001

Figure 24

Table B2. Comparison of tone duration excluding the tokens with long vowels.

Figure 25

Figure B1. Mean and standard error of the raw f0 contours of four Mizo tones in semantic context, plotted speaker-wise.

Figure 26

Figure B2. Mean and standard error of the raw f0 contours of four Mizo tones in sentence context, plotted speaker-wise.

Figure 27

Figure B3. Duration of Mizo tones in all three contexts.

Figure 28

Figure B4. f0 contours of four Mizo tones in semantic context with Chaos tone numerals.

Figure 29

Figure B5. f0 contours of four Mizo tones in sentence context with Chaos tone numerals.

Figure 30

Figure B6. Average raw f0 contours with average duration of the four Mizo tones in all contexts, plotted by gender.

Figure 31

Figure B7. Average normalized f0 contours with average duration of Mizo tones in four words in isolation.

Figure 32

Figure B8. Average normalized f0 contours with average duration of Mizo tones in four words in sentences.

Figure 33

Figure B9. Average normalized f0 contours with average duration of Mizo tones in four words in semantic context.

Figure 34

Figure B10. Mean and standard error of raw f0 contours with average duration of four Mizo tones produced in isolation by female speakers.

Figure 35

Figure B11. Mean and standard error of raw f0 contours with average duration of four Mizo tones produced in isolation by male speakers.

Figure 36

Figure B12. Reconstructed f0 contours with average duration from DCT coefficients of Mizo tones.

Figure 37

Figure B13. Tone f0 contours with average duration produced in isolation by female speakers, reconstructed using DCT coefficients.

Figure 38

Figure B14. Tone f0 contours with average duration produced in isolation by male speakers, reconstructed using DCT coefficients.