Hostname: page-component-6766d58669-r8qmj Total loading time: 0 Render date: 2026-05-23T15:41:01.448Z Has data issue: false hasContentIssue false

The acoustic realization of intervocalic /v/ and /v:/ in Italian: evidence of articulatory variability

Published online by Cambridge University Press:  08 May 2026

Angelo Dian*
Affiliation:
University of Oxford, Faculty of Linguistics, Philology and Phonetics, United Kingdom
John Hajek
Affiliation:
School of Languages and Linguistics, University of Melbourne, Australia
Janet Fletcher
Affiliation:
The University of Melbourne Faculty of Arts, Australia
*
*Corresponding author. Email: angelo.dian@ling-phil.ox.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

This study examines the variation in the acoustic properties of the allophones of intervocalic /v/ and /vː/ in Italian, a typologically unusual case where voiced labiodental fricatives contrast phonemically in length. Through an acoustic analysis of read speech from 18 speakers across three regional varieties (Veneto, Roman, Calabrian), we investigate whether realization patterns are shaped by prosodic (phase position and stress placement) and indexical factors (regional variety and speaker sex). Results indicate that /v/ frequently (47%) surfaces as an approximant [ʋ]. In contrast with the principle of geminate ‘inalterability’, /vː/ also exhibits great articulatory variability, with 36% of tokens showing increased constriction including 17% of all tokens realized as a previously undocumented plosive-like labiodental [b̪ː]. Bayesian modelling reveals that approximant variants of /v/ occur more frequently phrase-finally than phrase-medially and that male speakers are more likely to produce more constricted allophones of /vː/. No consistent effects of regional variety were observed, suggesting that the variation may be widespread across Italy. Furthermore, phonemic length alone significantly predicts duration. Therefore, notwithstanding the enhanced phonetic differentiation between /v/ and /vː/, it seems most likely speakers rely primarily on duration to distinguish the pair.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of The International Phonetic Association
Figure 0

Figure 1. Airflow (top) and intraoral pressure (bottom) of [afa] measured in Aeroview using a Rothenberg mask, a MS-110 transducer electronic unit, and PT-2E and PT-25 pressure transducers, all by Glottal Enterprises, Inc. The displayed time window is 640 ms.

Figure 1

Figure 2. Airflow (top) and intraoral pressure (bottom) of [ava] measured in Aeroview using a Rothenberg mask, a MS-110 transducer electronic unit, and PT-2E and PT-25 pressure transducers, all by Glottal Enterprises, Inc. The displayed time window is 640 ms.

Figure 2

Figure 3. Annotated examples of allophonic variants of /v/ embedded in /ˈbeve/ (‘s/he drinks’) from a VI (left and middle panel) and a CI (right panel) speaker.

Figure 3

Figure 4. Annotated examples of allophonic variants of /vː/ embedded in /daˈvːero/ (‘really’), each from a different RI speaker.

Figure 4

Table 1. Overall counts and rates of occurrence (%) of allophonic variants for /v/ and /vː/ in the corpus.

Figure 5

Table 2. Distribution of allophones by strength category for /v/ and /vː/.

Figure 6

Figure 5. Raw IntDiff distributions for each allophone, overlaid with model-predicted means (black diamonds) and corresponding 95% CrIs (error bars).

Figure 7

Figure 6. Raw MaxVel distributions for each allophone, overlaid with model-predicted means (black diamonds) and corresponding 95% CrIs (error bars).

Figure 8

Table 3. Bayesian multinomial and Bernoulli logistic regression (models M2v and M2vv) results. Effects whose 95% CrI exclude zero are shown in bold.

Figure 9

Figure 7. Distribution of allophonic variants of intervocalic /v/ and /vː/ across phrase-final (PF) and phrase-medial (PM) positions, expressed as percentages. All other conditions are pooled.

Figure 10

Figure 8. Distribution of allophonic variants of intervocalic /v/ and /vː/ across female and male speakers, expressed as percentages. All other conditions are pooled.

Figure 11

Table 4. Bayesian linear mixed-effects regression (model M3) results. Effects whose 95% credible intervals exclude zero are shown in bold.

Figure 12

Figure 9. C duration distributions by consonantal length and strength. All other conditions are pooled.

Figure 13

Table A1. Pairwise comparisons of mean IntDiff values between levels for Model M1a. Statistically credible differences are indicated by 95% credible intervals that do not include zero.

Figure 14

Table A2. Pairwise comparisons of mean MaxVel values between levels for Model M1b. Statistically credible differences are indicated by 95% credible intervals that do not include zero.