Hostname: page-component-6766d58669-6mz5d Total loading time: 0 Render date: 2026-05-23T13:50:42.102Z Has data issue: false hasContentIssue false

Acoustic properties of Seoul Korean stops in phrase-initial and word-medial positions

Published online by Cambridge University Press:  11 May 2026

Daniel Schweizer
Affiliation:
Department of Linguistics, University of Ottawa, Canada
Suzy Ahn
Affiliation:
Department of Linguistics, University of Ottawa, Canada
Harim Kwon*
Affiliation:
Department of English Language and Literature, Seoul National University, Republic of Korea
*
*Corresponding author. Email: harimkwon@snu.ac.kr
Rights & Permissions [Opens in a new window]

Abstract

Phonetic implementations of Seoul Korean stops have been examined mostly in phrase-initial positions, primarily focusing on the tonogenesis-like sound change which involves a shift in the primary acoustic property for differentiating aspirated and lax categories. Word-medial stops have been much less discussed, except in the context of inter-sonorant voicing of lax stops. To address this gap, the current study provides a comprehensive analysis of aspirated, lax, and tense stops in Seoul Korean across three prosodic positions, Intonational Phrase (IP)-initial, Accentual Phrase (AP)-initial (inter-sonorant), and word-medial (also inter-sonorant), considering various acoustic properties, namely, stop burst duration, closure duration, post-stop f0 (fundamental frequency), F1, H1*-H2*, and voicing during closure. Based on experimental and corpus data, we report that Seoul Korean stops show distinct acoustic patterns in phrase-initial versus word-medial positions. In IP- and AP-initial positions, burst duration and f0 are the primary acoustic properties that distinguish the three categories, with H1*-H2* further differentiating tense from non-tense stops. In word-medial position, however, closure duration, voicing during closure, and burst duration emerge as the main distinguishing properties, though some f0 differences between lax and aspirated stops are observed. This f0 difference is especially noticeable in non-high vowel contexts, suggesting that the tonogenesis-like change may be spreading beyond phrase-initial positions. Overall, the acoustic implementation of the Seoul Korean three-way laryngeal contrast is highly dependent on the prosodic position of the stops, with this positional variation being most noticeable for lax stops.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of The International Phonetic Association
Figure 0

Table 1. Summary of results for each acoustic property based on previous literature.

Figure 1

Table 2. Target words and phrases produced in Study 1.

Figure 2

Figure 1. F0 across laryngeal categories, prosodic positions, and speaker sex. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 3

Figure 2. F0 in word-medial stops across laryngeal categories, AP-initial boundary tones, and speaker sex. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 4

Figure 3. F1 across laryngeal categories, prosodic positions, and speaker sex. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 5

Figure 4. H1*-H2* across laryngeal categories, prosodic positions, and speaker sex. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 6

Table 3. The number of lenited lax stops in Study 1, excluded from duration analyses

Figure 7

Figure 5. Example of word-medial /k/ realized as an approximant, without closure or burst.

Figure 8

Figure 6. Example of AP-initial /k/ realized as a fricative, without closure or burst.

Figure 9

Figure 7. Example of word-medial /p/ that is fully voiced and produced without discernible burst.

Figure 10

Figure 8. Burst duration across laryngeal categories, prosodic positions, and speaker sex. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 11

Figure 9. Voicing during closure across laryngeal categories, prosodic positions, and speaker sex. Model predictions are in Appendix D (Figures D1–D3).

Figure 12

Table 4. Summary of the findings from Study 1 (A = aspirated, L = lax, T = tense)

Figure 13

Figure 10. F0 across laryngeal categories, prosodic positions, and vowel height. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 14

Figure 11. F0 in word-medial stops across laryngeal categories, AP-initial boundary tones, and vowel height. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 15

Figure 12. F1 across laryngeal categories, prosodic positions, and vowel quality. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 16

Figure 13. H1*-H2* across laryngeal categories, prosodic positions, and vowel height. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 17

Table 5. The number of lenited lax stops in Study 2, excluded from duration analyses

Figure 18

Figure 14. Burst duration across laryngeal categories, prosodic positions, and vowel height. Boxplots represent empirical distributions; triangles and error bars indicate model predictions and 95% confidence intervals.

Figure 19

Figure 15. Duration of closure and voicing across laryngeal categories, positions, and vowel height. Model predictions are in Appendix D (Figures D4–D6).

Figure 20

Figure 16. Duration of closure and voicing of word-medial stops across laryngeal categories and preceding segments. Model predictions are in Appendix D (Figures D7–D9).

Figure 21

Table 6. Summary of findings from Study 2. (A = aspirated, L = lax, T = tense; vowel contexts are indicated in parentheses where applicable; the results that differ from Study 1 are bolded.)

Figure 22

Table A1. Mixed linear regression on normalized f0. f0 ∼ POSITION* LARYNGEAL* SEX* PLACE + (1 + POSITION| SUBJECT)

Figure 23

Table A2. Mixed linear regression on normalized f0 for Wm aspirated and lax Laryngeals. f0 ∼ LARYNGEAL * SEX * AP–INITIALTONE + (1 + AP–INITIALTONE | SUBJECT)

Figure 24

Table A3. Mixed linear regression on normalized F1. F1 ∼ POSITION * LARYNGEAL * SEX + POSITION * PLACE + (1 + POSITION | SUBJECT)

Figure 25

Table A4. Mixed linear regression on normalized H1*–H2* H1*–H2* ∼ POSITION * LARYNGEAL + SEX * LARYNGEAL + PLACE * POSITION +(1 + POSITION | SUBJECT)

Figure 26

Table A5. Mixed linear regression on burst duration. Burst duration ∼ POSITION * LARYNGEAL * SEX * PLACE + (1 + LARYNGEAL | SUBJECT)

Figure 27

Table A6. Mixed linear regression on closure duration. Closure duration ∼ POSITION * LARYNGEAL * PLACE * SEX + (1 + LARYNGEAL | SUBJECT)

Figure 28

Table A7. Mixed linear regression on voicing duration Voicing duration ∼ POSITION * LARYNGEAL * PLACE * SEX + (1 + POSITION | SUBJECT)

Figure 29

Table A8. Mixed linear regression on proportion voiced. Proportion voiced ∼ POSITION * LARYNGEAL * PLACE * SEX + (1 + POSITION | SUBJECT)

Figure 30

Table B1. Stop tokens extracted from the Speech Corpus of Reading-Style Seoul Korean (NIKL 2005).

Figure 31

Table C1. Mixed linear regression on normalized f0. F0 ∼ POSITION * LARYNGEAL * VOWEL HEIGHT + (1 + POSITION | SUBJECT) + (1 | ITEM)

Figure 32

Table C2. Mixed linear regression on normalized f0 for Word-medial stops. f0 ∼ LARYNGEAL * VOWEL HEIGHT * AP–INITIALTONE + (1 + AP–INITIALTONE | SUBJECT) + (1 | ITEM)

Figure 33

Table C3. Model 2: Mixed linear regression on normalized F1. F1 ∼ POSITION * LARYNGEAL * VOWEL + (1 + POSITION | SUBJECT) + (1 | ITEM)

Figure 34

Table C4. Mixed linear regression on normalized H1*–H2*. H1*–H2* ∼ POSITION * LARYNGEAL * VOWEL HEIGHT + POSITION * LARYNGEAL * PLACE + POSITION * PLACE * VOWEL HEIGHT + (1 + LARYNGEAL | SUBJECT) + (1 | ITEM)

Figure 35

Table C5. Mixed linear regression on burst duration. Burst duration ∼ POSITION * LARYNGEAL * VOWEL HEIGHT + LARYNGEAL * VOWEL HEIGHT * PLACE + (1 + LARYNGEAL | SUBJECT) + (1 | ITEM)

Figure 36

Table C6. Mixed linear regression on closure duration. Closure duration ∼ POSITION * LARYNGEAL * VOWEL HEIGHT * PLACE + (1 + POSITION | SUBJECT) + (1 | ITEM)

Figure 37

Table C7. Mixed linear regression on voicing duration. Voicing duration ∼ POSITION * LARYNGEAL * VOWEL HEIGHT + (1 + LARYNGEAL | SUBJECT) + (1 | ITEM)

Figure 38

Table C8. Mixed linear regression on proportion voiced. Proportion voiced ∼ POSITION * LARYNGEAL * VOWEL HEIGHT + (1 + LARYNGEAL | SUBJECT) + (1 | ITEM)

Figure 39

Table C9. Mixed linear regression on closure duration by preceding segment. Closure duration ∼ PREVIOUS SEGMENT * LARYNGEAL * POSITION + (1 + POSITION | SUBJECT) + (1 | ITEM)

Figure 40

Table C10. Mixed linear regression on voicing duration by preceding segment. Voicing duration ∼ PREVIOUS SEGMENT * LARYNGEAL * POSITION + (1 + LARYNGEAL | SUBJECT) + (1 | ITEM)

Figure 41

Table C11. Mixed linear regression on proportion voiced duration by preceding segment. Proportion voiced ∼ PREVIOUS SEGMENT * LARYNGEAL * POSITION + (1 + POSITION | SUBJECT) + (1 | ITEM)

Figure 42

Figure D1. Predicted values from mixed effects model on closure duration for Study 1.

Figure 43

Figure D2. Predicted values from mixed effects model on voicing duration for Study 1.

Figure 44

Figure D3. Predicted values from mixed effects model on proportion voiced for Study 1.

Figure 45

Figure D4. Predicted values from mixed effects model on closure duration for Study 2.

Figure 46

Figure D5. Predicted values from mixed effects model on voicing duration for Study 2.

Figure 47

Figure D6. Predicted values from mixed effects model on proportion voiced for Study 2.

Figure 48

Figure D7. Predicted values from mixed effects model on closure duration by preceding segment for word-medial stops in Study 2.

Figure 49

Figure D8. Predicted values from mixed effects model on voicing duration by preceding segment for word-medial stops in Study 2.

Figure 50

Figure D9. Predicted values from mixed effects model on proportion voiced by preceding segment for word-medial stops in Study 2.