Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-07T07:15:13.783Z Has data issue: false hasContentIssue false

Probabilistic underspecification in nasal place assimilation*

Published online by Cambridge University Press:  16 January 2017

John Coleman*
Affiliation:
University of Oxford
Margaret E. L. Renwick*
Affiliation:
University of Georgia
Rosalind A. M. Temple*
Affiliation:
University of Oxford

Abstract

According to many works on English phonology, word-final alveolar consonants – and only alveolar consonants – assimilate to following word-initial consonants, e.g. ran quicklyra[ŋ] quickly. Some phonologists explain the readiness of alveolar consonants to assimilate (vs. the resistance of velar and labial articulations) by proposing that they have underspecified place of articulation (e.g. Avery & Rice 1989). Labial or dorsal nasals do not undergo assimilation because their place nodes are specified. There are reports that velar and labial consonants sometimes assimilate in English, but these are anecdotal observations, with no available audio and no statistics on their occurrence. We find evidence of assimilation of labial and velar nasals in the Audio British National Corpus, motivating a new, quantitative phonological framework: a statistical model of underspecification and variation which captures typical as well as less common but systematic patterns seen in non-coronal assimilation.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2017 
Figure 0

Figure 1 Illustration of how larger sample sizes can yield smoother, single-peaked, distributions. (a) Histograms of F2 of /ŋ/ before /k/ or /g/ (male speakers: N=79, mean F2 (μ)=1299 Hz, standard deviation (σ)=336 Hz; female speakers: N=33, μ=1752 Hz, σ=371 Hz); (b) histograms of F2 of /m/ in from the (male speakers: N=736, μ=1214 Hz, σ=218 Hz; female speakers: N=328, μ=1554 Hz, σ=302 Hz). The vertical axis is relative incidence in the corpus sample (count/N).

Figure 1

Figure 2 Wide-band spectrograms of (a) come down, with final nasal in come pronounced [n] (cf. [n] in down); (b) seem to, with final nasal in seem pronounced [n] (cf. [n] in down); (c) some cream, with final nasal in some pronounced [ŋ] (cf. [m] in cream); (d) coming back, with final nasal in coming pronounced [m] (cf. [m] in some-); (e) something but, with final nasal in something pronounced [m] (cf. [m] in some-). The transcriptions were added manually, not automatically. Arrows indicate the F2 of nasals.

Figure 2

Table I Selection of the word-pairs analysed in this study, with the number of tokens that are well aligned and for which the speaker sex is recorded. Shaded cells contain non-assimilation control conditions, and cells with heavy borders are cases in which assimilation of /m/ or /ŋ/ to a following consonant might occur. Items discussed in detail below are italicised.

Figure 3

Table II Summary of best mixed-effects model for /m/ (N=4441, 870 speakers). The reference level for this model is the control condition, i.e. /m/ followed by another labial consonant, with Sex=female and Preceding V=[ǝ]. Random factor=(1|Speaker).

Figure 4

Table III Summary of best mixed-effects model for /n/ (N=6991, 987 speakers). The reference level for this model is the control condition, i.e. /n/ followed by another alveolar consonant, with Sex=female and Preceding V=[ˈɑ]. Random factor=(1|Speaker).

Figure 5

Table IV Summary of best mixed-effects model for /ŋ/ (N=2970, 768 speakers). The reference level for this model is the control condition, i.e. /ŋ/ followed by another velar consonant, with Sex=female and Preceding V=[ǝ]. Random factor=(1|Speaker).

Figure 6

Figure 3 (a) Histograms of F2 of nasals (female speakers). Labial: /m/ before /p, b/ (N=365); velar: /ŋ/ in all contexts (N=933); alveolar: /n/ before /t, d/ (N=135). (b) Gaussian standard normal distributions (probability density functions) fitted to those histograms. Labial: model of /m/ (a=36,088, μ=1554 Hz, σ=302 Hz); velar: model of /ŋ/ (a=86,742, μ=1694 Hz, σ=293 Hz); alveolar: model of /n/ (a=10,067, μ=1838 Hz, σ=271 Hz).

Figure 7

Figure 4 (a) Histograms of F2 of nasals (male speakers). Labial: /m/ before /p, b/ (N=364); velar: /ŋ/ in all contexts (N=1211); alveolar: /n/ before /t, d/ (N=203). (b) Gaussian standard normal distributions (probability density functions) fitted to those histograms. Labial: model of /m/ (a=39,967, μ=1214 Hz, σ=218 Hz); velar: model of /ŋ/ (a=12,231, μ=1349 Hz, σ=283 Hz); alveolar: model of /n/, a=20,367, μ=1404 Hz, σ=287 Hz).

Figure 8

Table V Means and standard deviations (in Hz) of F2 frequencies of canonical (unassimilated) /m, n, ŋ/ in non-assimilating environments across a wide range of speakers.

Figure 9

Figure 5 Histograms of F2 of word-final /n/ followed by bilabials and velars. (a) Female speakers (N=233 and 109); (b) male speakers (N=336 and 142).

Figure 10

Table VI Means and standard deviations (in Hz) of F2 frequencies of nasals. ‘A~B’ indicates ‘A tending towards B’.

Figure 11

Figure 6 F2 variation in /m/ of seem to vs. control /n/. (a) Female speakers; (b) male speakers.

Figure 12

Figure 7 Histograms of F2 frequency of /m/ in all 1670 tokens produced by speaker ‘Fred’. There were 235 tokens of /m/ before coronal consonants.

Figure 13

Table VII Means and standard deviations (in Hz) of F2 frequencies of nasals.

Figure 14

Table VIII Size and significance of mean F2 differences (δ, in Hz) for male and female speakers.

Figure 15

Figure 8 F2 variation in /m/ of come back and /ŋ/ of coming back and coming down. (a) Female speakers; (b) male speakers.

Figure 16

Figure 9 Unassimilated (left) and assimilated (right) forms of nothing and something in the speech of three male speakers: (a) speaker PS0S4; (b) speaker PS3KY; (c) speaker PS0LU. Arrows indicate the F2 of word-final nasals; tracks superimposed on the spectrograms show the lowest four formant frequencies, as estimated by Praat.

Figure 17

Figure 10 (a) Simulated articulator trajectory (solid line), using the window model of coarticulation (from Blackburn & Young 2000, after Keating 1990). The trajectory is constrained to pass through the ‘windows’ indicated by the dashed horizontal lines. (b) Simulated articulator trajectory (dashed line) using a probabilistic coarticulation model (from Blackburn & Young 2000). The midpoints of successive phonemes are indicated by the dotted vertical lines, and associated with each midpoint is a probability distribution, defining the probability that the articulator will take particular positions at the midpoints.

Figure 18

Figure 11 Gaussian models of /m/ and /n/, together with Gaussian mixture models (weighted sums of the Gaussian models of /m/ and /n/).

Figure 19

Figure 12 Modelling a combination of unassimilated and assimilated /m/ tokens, using Gaussian mixtures: (a) female speakers; (b) male speakers. Gaussian models of /m/ and /n/ are given for reference. Squares are Gaussian mixtures of /m/ and /n/ (female speakers: a1=1739, a2=7453, μm=1554 Hz, σm=302 Hz, μn=1838 Hz, σn=271 Hz; male speakers: a1=−3705, a2=18,183, μm=1214 Hz, σm=218 Hz, μn=1404 Hz, σn=287 Hz).

Figure 20

Figure 13 Modelling a combination of unassimilated and assimilated /ŋ/ tokens in coming back vs. coming down; (a) female speakers; (b) male speakers. Gaussian mixture models for /ŋ/, of the form a1 probdf(f11)+a2 probdf(f22), using the means and standard deviations of /ŋ, m, n/. Black circles are Gaussian mixtures of /ŋ/ and /m/ (female speakers: a1=3990, a2=2979, μŋ=1751 Hz, σŋ=371 Hz, μm=1554 Hz, σm=302 Hz; male speakers: a1=3028, a2=3701, μŋ=1299 Hz, σŋ=336 Hz, μm=1214 Hz, σm=218 Hz). Grey squares represent a Gaussian mixture of /ŋ/ and /n/ (female speakers: a1=−359, a2=5695, μn=1838 Hz, σn=271 Hz; male speakers: a1=−2916, a2=7791, μn=1404 Hz, σn=287 Hz). Grey circles represent simple (unmixed) Gaussian models of /ŋ/ for reference.

Figure 21

Figure 14 Image of President Obama midway through saying I'm in I'm gonna convince. (Source: PBS Newshour/YouTube; http://www.youtube.com/watch?v=s4OwubYrL2c#t=9m24s). An AVI file of the I'm gonna clip is available in the online version of the paper, and at http://www.phon.ox.ac.uk/jcoleman/Obama_I_m_gonna.avi.