Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-07T05:04:10.376Z Has data issue: false hasContentIssue false

The acoustic correlates of word-level stress and focus-related prominence in Mankiyali

Published online by Cambridge University Press:  11 September 2025

Jonathan Charles Paramore*
Affiliation:
University of California, Santa Cruz, Linguistics, United States
Aurangzeb Mankiyali
Affiliation:
Independent Researcher
*
*Corresponding author: Email: jcparamo@ucsc.edu
Rights & Permissions [Opens in a new window]

Abstract

This paper investigates the acoustic correlates of word-level stress and phrase-level focus-related prominence in Mankiyali, a highly endangered Indo-Aryan language spoken in Northwest Pakistan that utilizes a weight-sensitive stress system. Of the acoustic properties measured (duration, f0, intensity, spectral tilt, and vowel quality), duration was the only feature found to robustly and consistently correlate with word-level stress across syllable types. In contrast, phrase-level focus-related prominence corresponded to an amplification of all five acoustic features measured. Given that vowel duration serves a vital role in preserving lexical contrast in Mankiyali, these findings present difficulties for a strong version of the Functional Load Hypothesis, which claims that acoustic properties bearing a high functional load in a language will not be used to mark prominence. In addition, results support an analysis of Mankiyali’s stress system as having five distinct levels of weight, a pattern which is extremely rare, if not unattested, elsewhere in the world’s languages.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The International Phonetic Association
Figure 0

Table 1. Mankiyali vocalic inventory

Figure 1

Table 2. Tokens consist of 25 sets of near minimal pairs (five sets for each syllable type) differing in stress position. The target syllable for each token appears in bold.2

Figure 2

Figure 1. f0 tracks overlaid on the waveforms of sentence 1 (left) and sentence 3 (right) for the token [mʌzˈdɑːr] spoken in the second session by AZ1. In sentence 1, [mʌzˈdɑːr] is new information and marked with focus-related prominence, and in sentence 3, [t͡soːr] ‘four’ is marked with focus, and [mʌzˈdɑːr] is not. Breaks in f0 tracks correspond to obstruents in the signal that did not produce a reliable f0. The black arrow points to the vowel of the word-final stressed syllable in [mʌzˈdɑːr].

Figure 3

Figure 2. Representative image of vowel segmentation in Praat for /peːkiːz/ extracted from sentence 1 of the mini-monologue from the second session of speaker MS1.

Figure 4

Table 3. Number of outliers removed from each model

Figure 5

Table 4. Number of tokens for each syllable type used in the analysis of word-level duration3

Figure 6

Table 5. z-scored lmer model showing the effect of stress and syllable type on vowel duration: Duration∼stress*syllable type + (1 + stress|speaker) + (1|target.syllable)

Figure 7

Table 6. Pairwise comparisons showing the effect of stress on vowel duration for each of the five syllable types in the experiment: emmeans(lmer, ∼ stress | syllable type)

Figure 8

Figure 3. Boxplots grouped by syllable type depicting z-scored vowel duration differences between word-level stressed and unstressed syllables.

Figure 9

Table 7. z-scored lmer model showing the effect of stress and syllable type on intensity. PeakIntensity∼stress*syllable type + (1 + stress|speaker) + (1|target.syllable)

Figure 10

Table 8. Pairwise comparisons showing the effect of stress on peak vowel intensity for each of the five syllable types in the experiment: emmeans(lmer, ∼ stress | syllable type)

Figure 11

Figure 4. Boxplots grouped by syllable type depicting vowel intensity differences between word-level stressed and unstressed syllables.

Figure 12

Table 9. z-scored lmer model summary showing the effect of stress and syllable type on spectral tilt: SpectralTilt∼stress*syllable type + (1 + stress|speaker) + (1|target.syllable)

Figure 13

Table 10. Pairwise comparisons showing the effect of stress on spectral tilt for each of the five syllable types in the experiment: emmeans(lmer, ∼ stress | syllable type)

Figure 14

Figure 5. Boxplots grouped by syllable type depicting spectral tilt differences between word-level stressed and unstressed syllables.

Figure 15

Figure 6. GAM curves separated into word-level stressed (dark blue) and unstressed (yellow) conditions showing mean vowel f0 across 10 normalized timesteps for all non-focused conditions.

Figure 16

Table 11. z-scored lmer model summary showing the effect of stress and step on f0: f0∼stress*step + (1 + stress|speaker) + (1|target.syllable)

Figure 17

Table 12. z-scored lmer model summary showing the effect of stress and syllable type on Meanf0. Meanf0∼stress*syllable type + (1 + stress|speaker) + (1|target.syllable)

Figure 18

Table 13. Pairwise comparisons showing effect of stress on mean f0 for each of the five syllable types in the experiment

Figure 19

Table 14. Pairwise comparisons showing the effect of stress on Euclidean distance from a neutral schwa for each vowel quality in the study. Euclidean distance is calculated in an F1/F3–F2/F3 space, so estimates are not in Hertz but with respect to the first two formants as ratios of F3. EucDist∼stress + (1|speaker) + (1|target.syllable)

Figure 20

Figure 7. F1/F3–F2/F3 vowel space showing the mean F1/F3 and F2/F3 ratio values of all 30 speakers for stressed (dark blue) and unstressed (yellow) word-level tokens with the mean hypothetical neutral schwa (gray) across all speakers.

Figure 21

Figure 8. GAM curves separated into focused (dark blue) and non-focused (yellow) conditions showing mean vowel f0 across 10 normalized timesteps for all tokens.

Figure 22

Table 15. z-scored lmer model summary showing the effect of focus and step on f0: f0∼focus*step + (1 + focus|speaker) + (1|target.syllable)

Figure 23

Table 16. Pairwise comparisons showing the effect of focus on f0 for three timesteps, corresponding to the beginning, middle, and final third of the vowel tokens: f0∼focus + (1 + focus|speaker) + (1|target.syllable)

Figure 24

Figure 9. Boxplots depicting peak intensity differences between non-focused and focused syllables.

Figure 25

Figure 10. Boxplots depicting spectral tilt differences between non-focused and focused syllables.

Figure 26

Table 17. z-scored lmer model showing the effect of focus and syllable type on duration. Duration ∼ focus*syllable type + (1 + focus|speaker) + (1|target.syllable)

Figure 27

Table 18. Pairwise comparisons showing the effect of focus on vowel duration for each of the five syllable types in the experiment

Figure 28

Figure 11. Boxplots grouped by syllable type depicting vowel duration differences between focused and non-focused syllables.

Figure 29

Table 19. Pairwise comparisons showing the effect of focus on Euclidean distance from a neutral schwa for each vowel quality. Euclidean distance is calculated in an F1/F3–F2/F3 space, so estimates are not in Hertz but with respect to the first two formants as ratios of F3. EucDist∼focus + (1|speaker) + (1|target.syllable)

Figure 30

Figure 12. F1/F3–F2/F3 vowel space of the mean formant ratio values of all 30 speakers for stressed focused (dark blue) and non-focused (yellow) tokens compared to the mean formant ratio value of the hypothetical neutral schwa (gray) across all speakers.

Figure 31

Table 20. Results from the present study on word-level stress showing the mean duration of unstressed vowels in each of the five syllable types examined, the predicted JND based on those durations, and the actual durational change correlating with word-level stress

Figure 32

Table 21. Simple pairwise comparison lmer model comparing the effect of stress on duration between unstressed penultimate [.ˈt͡sʌɽ] and stressed penultimate [ˈpʌtʰ.re]: Duration~stress + (1|speaker)

Figure 33

Table A1. Experiment tokens with glosses

Figure 34

Table A2. lmer model showing the effect of sentence type on the mean f0 of target word vowels for speaker AZ1: token_Meanf0 ∼ sʌŋɡi_Meanf0 + Focus + (1 | target_token)

Figure 35

Table A3. z-scored pairwise comparison lmer model showing the effect focus on Peak Intensity: PeakIntensity∼focus + (1 + focus|speaker) + (1|target.syllable)

Figure 36

Table A4. z-scored pairwise comparison lmer model showing the effect focus on Spectral Tilt: SpectralTilt∼focus + (1 + focus|speaker) + (1|target.syllable)

Figure 37

Table A5. Twenty-five sets of near-minimal pairs. The target syllable for each token is in bold, and the notation used in the below plots and lmers for each set of near-minimal pairs is placed to the left of each set

Figure 38

Figure A1. Boxplots comparing stressed and unstressed CV syllables within near-minimal pairs.

Figure 39

Table A6. Pairwise comparisons showing the effect of stress on CV syllables within near-minimal pairs.

Figure 40

Figure A2. Boxplots comparing stressed and unstressed CVC syllables within near-minimal pairs.

Figure 41

Table A7. Pairwise comparisons showing the effect of stress on CVC syllables within near-minimal pairs

Figure 42

Figure A3. Boxplots comparing stressed and unstressed CVCC syllables within near-minimal pairs.

Figure 43

Table A8. Pairwise comparisons showing the effect of stress on CVCC syllables within near-minimal pairs

Figure 44

Figure A4. Boxplots comparing stressed and unstressed CVː syllables within near-minimal pairs.

Figure 45

Table A9. Pairwise comparisons showing the effect of stress on CVː syllables within near-minimal pairs

Figure 46

Figure A5. Boxplots comparing stressed and unstressed CVːC syllables within near-minimal pairs.

Figure 47

Table A10. Pairwise comparisons showing the effect of stress on CVːC syllables within near-minimal pairs