The Asymmetrical Stop Inventory of Witzapan Nawat

Abstract The stop inventory of Witzapan Nawat, a critically endangered indigenous language of El Salvador, has traditionally been described as consisting only of a voiceless series /p t k kʷ/. In this paper, I measure the voice onset time, consonant duration, and percent voicing in stops produced by five L1 Witzapan Nawat speakers. I find that, while /p t kʷ/ have acoustic characteristics of voiceless stops in practically all contexts, the velar stop in this language is better analyzed as a voiced velar stop /ɡ/ rather than /k/. This results in an asymmetrical and unusual stop system that is not predicted by some theories of phonemic inventory structure. For instance, markedness-based theories propose that /ɡ/ is more marked than /b d/ and predict that, if a language has one voiced stop, it would be /b/ or /d/ rather than /ɡ/. On the other hand, feature-systemic models predict that, if a language has a stop with the [+voice] feature at a given place of articulation, it will also tend to have this feature in stops at other points of articulation to maximize feature economy. The phonemic inventory of Witzapan Nawat contradicts these predictions. I explain the asymmetrical stop inventory of this language as the result of diachronic developments involving sound change and analogy, concluding that language change does not necessarily advance towards symmetry and that phonemic inventories are the reflection of their diachrony, as proposed by Evolutionary Phonology.


Introduction
Phonologists have long observed that phonological features tend to be evenly distributed across phonemes in phonemic inventories, a tendency that is referred to as SYMMETRY (de Groot 1941; Martinet 1955). For example, in French, the feature [±voice] is spread across obstruents so that each obstruent specified as [−voice] has a [+voice] counterpart (Clements 2003). Phonemic inventories that do not display this pattern are considered ASYMMETRICAL. In what is perhaps the most familiar case of consonant asymmetry, many languages with a [±voice] distinction in bilabial and coronal stops lack a corresponding voiced velar stop /g/, a phenomenon known as the g-gap (Blevins 2004: 283).
In this paper, I present evidence for a rare asymmetry in the stop inventory of Witzapan Nawat, a critically endangered Uto-Aztecan language spoken in El Salvador. While other closely related Nahuan languages feature only a voiceless stop series /p t k kʷ/, I argue that the stop inventory of Witzapan Nawat is better analyzed as having voiceless stops /p t kʷ/ and a VOICED stop /g/ at the velar place of articulation. This asymmetry is unusual because it is not predicted by the main theories of phonemic inventory structure. For instance, proposals based on markedness posit that /g/ is more marked than /b d/ due to aerodynamic factors (Gamkrelidze 1975). Under this framework, the prediction is that, if a language has a voiced stop in its inventory, it will be /b/ or /d/ rather than /g/. On the other hand, theories based on feature economy predict that, if the feature [+voice] is present in one phoneme, it will also be present in others to maximize economy (Clements 2003). This is not the case for Witzapan Nawat, given that the [+voice] feature occurs only in /g/ among all obstruents. In addition, I explain this asymmetrical phonemic inventory as a result of diachronic developments involving sound change and analogy, likely aided by frequency effects. This is consistent with Evolutionary Phonology (Blevins 2004), a framework in which asymmetrical phonemic inventories are predicted because sound and language change are not teleological, that is, they operate with no regard for symmetry.
© The Author(s), 2024. Published by Cambridge University Press on behalf of The International Phonetic Association. This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Besides its theoretical goals, this paper is also meant as a contribution to the documentation of Witzapan Nawat phonology, which has only been briefly addressed in a few impressionistic works (e.g., Campbell 1985; Lemus 1997). This contribution is critical when considered in the context of the ongoing Nawat revitalization movement that is taking place in El Salvador (Lemus 2018). Witzapan Nawat is the Nawat dialect with the highest number of speakers who learned it as a first language (L1) and has established itself as the standard that most second-language (L2) Nawat learners are acquiring all over the country. As such, a thorough understanding of Witzapan Nawat phonology/phonetics is required to effectively teach pronunciation to new L2 speakers.
This paper is structured as follows: Section 2 introduces Witzapan Nawat and the velar/labiovelar stops in Nahuan languages. The methods of this study are presented in Section 3, while the results are shown in Section 4. Section 5 deals with the discussion and Section 6 follows with the conclusions.

Witzapan Nawat and voicing of velar/labiovelar stops in Nahuan
The Nahuan languages form a distinct branch of the Uto-Aztecan language family and are divided into three subgroupings: Pochutec, Western Nahuan, and Eastern Nahuan (Canger 1988; Pharao Hansen 2014). Nawat belongs to the Eastern Nahuan group and is closely related to varieties spoken along the Isthmus of Tehuantepec in Mexico. It is the only indigenous language still spoken in El Salvador, where fewer than 200 elders speak it as an L1 in a few towns in the western part of the country (Campbell 1985). Despite its high level of endangerment, the last twenty years have seen the emergence of an important Nawat revitalization movement led by L2 learners who neither identify as indigenous nor live in traditionally Nawat-speaking communities (Boitel 2018; Lemus 2018). Owing to a strong social media presence, these activists have succeeded in bringing Nawat and its speakers to public awareness and, as a result, the language is now being learned as an L2 by hundreds of people all over El Salvador and even abroad.
Nawat phonology has been described briefly in impressionistic works by Campbell (1985), Lemus (1997) and King (2014) without major discrepancies as to the phonemic inventory or the main phonological features of the language. Table 1 presents the Nawat phonemes according to Campbell (1985) and King (2014) as well as their corresponding letters in the official Nawat alphabet in bold font.
[Table 1: Nawat phonemes, after Campbell (1985), Lemus (1997), and King (2014)]
All Nawat non-initial syllables must have an onset and, barring a few exceptions, consonant clusters occur only word-internally when the consonants belong to different syllables, as in musta [ˈmus.ta] 'tomorrow'. Stress is predictable in nearly all native words, falling on the penultimate syllable. The phonemic inventory of Nawat described by previous works is quite similar to that of most Nahuan languages of Mexico, such as Classical Nahuatl (Andrews 2003: 28), Mecayapan Nahuatl (Wolgemuth 2002), Tetelcingo Nahuatl (Pittman 1961), and Milpa Alta Nahuatl (Whorf 1946). Two features of the Nawat stop subsystem, also prevalent in Nahuan, are of special interest: Nawat distinguishes between a velar /k/ and a labiovelar stop /kʷ/ and has an inventory made of a single voiceless series /p t k kʷ/.
However, descriptions of specific Nawat varieties document a number of voiced allophones of the velar stop /k/ in certain contexts. These are particularly frequent in Witzapan Nawat, the focus of this study (Campbell 1985; Lemus 1997).
Witzapan Nawat, spoken in the town of Witzapan (Santo Domingo de Guzmán in Spanish), is currently the Nawat variety with the highest number of L1 speakers (Campbell 1985; Lemus 2009). This variety has established itself as the standard learned by most L2 Nawat speakers, but there is very little work done on its phonology and phonetics. Nevertheless, all available descriptions of the language agree that the feature that sets Witzapan Nawat apart from all other Nawat varieties is found in the voicing and spirantization of the velar stop phoneme /k/ in different word positions and phonological contexts.
For instance, Campbell (1985: 14) reports that the velar stop /k/ in Witzapan Nawat is categorically produced as [g] in 'initial position' (without further definition of the term 'initial'), intervocalically, and after a nasal. Likewise, Lemus (1997: 16) and King et al. (2003: 18) identify three allophonic variants of the velar stop phoneme: a voiced stop [g] at the beginning of a word and after a voiced consonant, an approximant [ɣ̞] intervocalically, and a voiceless stop [k] in syllable-final position and after a voiceless consonant. Most recently, King (2014: 391) observes that 'g-like sounds' in Witzapan Nawat are found categorically at the beginning of a word, after 'some consonants' (which are not specified), and between vowels, adding that the velar stop is always voiceless [k] at the end of the syllable. In contrast to /k/, voiced allophones of the other stops, including the labiovelar /kʷ/, are quite rare and only sporadically documented in high frequency words such as the verb -ita [ˈi.da] 'to see something' (Campbell 1985: 56). [...] Nawat, /k/ is voiced in the 3rd person singular object prefix ki-, between two /a/, and between any two vowels provided that the first one is long, remaining [k] in all other contexts (Campbell 1985: 27). In Izalco Nawat, the velar is voiced only in /k/-initial unstressed particles in the intervocalic position, such as ka 'that' (Schultze-Jena 1935).
Voicing of /k/ and of the labiovelar /kʷ/ is a feature of various Nahuan languages of Mexico as well. As an illustration, in Chicontepec Nahuatl (Aguilar 2020: 33), Sierra Norte de Puebla Nahuatl (Kakadelis 2018: 206), and varieties of the Alto Balsas region (Flores Farfán 1992: 55), [g] has been reported as a sporadic allophone of the velar phoneme /k/ in intervocalic contexts. Likewise, in varieties spoken in the Sierra de Zongolica (Monzón 1990: 31), the voiced allophone [g] is found after a nasal while [g] and [ɣ] can occur intervocalically. In some Nahuan languages, it is claimed that a phonemic distinction between /k/ and /g/ has entered the stop subsystem, citing a small number of minimal pairs as evidence. For instance, in Pajapan Nahuat (García de León 1976: 57), the velars are contrastive word-initially in katka 'lark' vs. gatka 'it was' and, in varieties of the Malinche Volcano region (Hill & Hill 1986: 65) and the Sierra Zacapoaxtla (Key & Key 1953: 53), they contrast intervocalically in -maga 'to hit something' vs. -maka 'to give something to someone'.
A smaller number of Nahuan languages of Mexico also feature voiced allophones of the labiovelar /kʷ/. In varieties of the Sierra de Zongolica (Monzón 1990: 38), [gʷ] is reported after a nasal while [ɣʷ] and [w] are found intervocalically. Moreover, a phonemic shift has taken place in some languages, where /kʷ/ is now /b/ in all contexts, as in /bawit/ 'tree' while most varieties have /kʷawit/ (García de León 1976: 41; Monzón & Roth-Seneff 1984).
Although more research is needed, it seems that no implicational hierarchy of velar stop voicing is at play: there are varieties that have voicing of /kʷ/, either at the allophonic or phonemic level, without voicing of /k/, as in Ixquihuacan Nahuatl (Sasaki 2014: 145). Other languages, such as Chicontepec Nahuatl (Aguilar 2020: 33), have allophonic or phonemic voicing of /k/ without voicing of /kʷ/, while varieties like Pajapan Nawat have voicing of both stops (García de León 1976: 41).
The voicing of the velar and/or labiovelar stops in Nahuan languages is likely not due to the influence of Spanish, a language with phonemic /g/ and /k/ that is dominant in most Nahuan-speaking communities. Evidence for this is the fact that /g/ is the least common stop phoneme in Spanish, amounting to 1% of all phoneme occurrences according to various corpora (Pérez 2003), and it would be counterintuitive for Nahuan speakers to acquire voicing in velars rather than in dental or bilabial stops when they rarely hear it or use it when they speak Spanish. Similarly, voicing of the velars cannot be linked to language attrition because it is present in Nahuan varieties that enjoy relative vitality, such as Chicontepec Nahuatl.
Regardless of its presence in numerous Nahuan languages, the phonetics of voicing in velars/labiovelars has yet to be addressed in detail (see Kakadelis 2018 for an exception), and little is known about the factors that condition it or that led to its development. For instance, previous descriptions of Witzapan Nawat find that voiced allophones of /k/ occur in 'initial position'. However, it is unclear if that is the case when the word-initial /k/ is at the beginning of the utterance or following a voiceless obstruent, contexts that are crosslinguistically not conducive to voicing (Flege & Brown 1982; Westbury & Keating 1986; Wetzels & Mascaro 2001; Hayes 2004; Beckman et al. 2013). Also unresolved is whether spirantization affects word-initial /k/ across word boundaries and whether coda /k/ is subject to positional allophony in the same way as onset /k/. By addressing these issues using original acoustic data from Witzapan Nawat speakers, this article hopes to contribute not only to language documentation efforts but also to the literature on stop voicing and phonemic inventory structure in general.

Participants and materials
One male and four female L1 Nawat speakers from the town of Witzapan participated in this study, recruited among personal acquaintances of the researcher. At the time of the recordings, the male participant (M1) was sixty years old and the female participants were respectively sixty-five (F1), fifty-seven (F2), sixty-three (F3) and sixty-three (F4) years old. All participants are balanced bilinguals, grew up speaking the Nawat variety of Witzapan in their household, and learned Spanish in their teenage years.
High-quality recordings of the L1 Nawat speakers were made in the field in 2018 and 2019 in quiet environments in the participants' homes using a Plantronics USB head-mounted microphone connected to a laptop computer and using the Audacity® recording and editing software. Tokens were extracted from open-ended interviews dealing with Witzapan history as well as elicitation tasks dealing with spatial constructions, that is, goals unrelated to this project. All interviews, ranging in length from thirty to ninety minutes, were conducted in Nawat and transcribed.

Instrumental analysis
To have a comparative perspective of the phonetic properties of /k/ versus the other Witzapan Nawat stops, all the realizations of /k/ and /p t kʷ/ found within the first thirty minutes of each recording were segmented, manually extracted, and analyzed in Praat (Boersma 2001). Three acoustic correlates associated with voicing in oral stops and obstruents were measured: voice onset time (VOT; Lisker & Abramson 1964), consonant duration (Ladefoged 2006; Johnson 2012), and percent voicing (Flege & Brown 1982).
VOT is defined as the time from the stop release to the onset of voicing (Lisker & Abramson 1964). Voicing was identified by the presence of periodic waves in the waveform and a voicing bar in the spectrogram. For voiced stops, VOT values are negative because voicing begins before the stop release. In contrast, in voiceless stops, voicing begins with the vowel following the stop release, which yields positive VOT values. Figure 1 shows the production of a word-initial onset /k/ in utterance-initial context. The presence of periodic waves in the waveform and a voicing bar in the spectrogram immediately BEFORE the stop release means that this realization has a negative VOT. In this case, the velar stop phoneme is produced as a voiced stop [g] rather than voiceless stop [k]. VOT was not measured in unreleased stops and tokens produced as approximants given the absence of a visible stop release. Likewise, tokens of coda /k/ in the pre-obstruent context were not measured for VOT because there is no voicing after the release.
Consonant duration, measured as the time between the onset and the ending point of a consonant, is one of the strongest correlates of voicing in obstruents, as voiced obstruents have significantly shorter durations than their voiceless counterparts (Cooper et al. 1952). In this paper, the landmarks of duration depend on whether a stop phoneme is realized as a stop or an approximant, the latter being the most frequent production of /k/ in the intervocalic context (see Section 5.1). For stop realizations, the onset of the consonant was established at either the end of formant structure of a previous vowel, approximant, or nasal, the end of frication noise of a previous fricative or affricate, or after the release of a previous stop, while its ending point was placed before the release of the stop. Unreleased stops were not measured for duration because no landmark for the ending point of the stop can be located in these cases. In the case of approximant realizations, their duration was established following the procedures in Hualde et al. (2011): the onset of the consonant was located at the moment the previous vowel showed a decrease in intensity, as assessed in the waveform and spectrogram, and its ending point was set at the increase of intensity signaling the following vowel.
Finally, percent voicing is the ratio of voicing during the production of the consonant to the total consonant duration. Voicing was identified by the presence of periodic waves in the waveform and a voicing bar in the spectrogram during the production of the consonant.
Figure 2 shows the landmarks of consonant duration for a stop production of /t/ as well as the period of voicing that is used for measuring the percent voicing.
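The three measures defined above reduce to simple arithmetic over hand-marked landmarks. The following Python sketch is illustrative only and is not the procedure used in the study, which relied on manual segmentation in Praat. The `StopToken` record, its field names, and the example times are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class StopToken:
    """Hand-marked landmarks for one token, in seconds (hypothetical layout)."""
    onset: float                    # start of the consonant
    offset: float                   # end of the consonant
    release: Optional[float]        # stop release; None if unreleased or an approximant
    voicing_onset: Optional[float]  # start of voicing associated with the token
    voiced_time: float              # total voiced time between onset and offset


def vot_ms(tok: StopToken) -> Optional[float]:
    """VOT: time from release to voicing onset; negative if voicing leads the release."""
    if tok.release is None or tok.voicing_onset is None:
        return None  # no visible release, so VOT is not measured
    return (tok.voicing_onset - tok.release) * 1000.0


def duration_ms(tok: StopToken) -> float:
    """Consonant duration: time between the onset and ending-point landmarks."""
    return (tok.offset - tok.onset) * 1000.0


def percent_voicing(tok: StopToken) -> float:
    """Voiced portion of the consonant as a percentage of its total duration."""
    total = tok.offset - tok.onset
    if total <= 0:
        raise ValueError("offset must come after onset")
    return 100.0 * tok.voiced_time / total


# Hypothetical prevoiced token: voicing starts 40 ms before the release.
prevoiced = StopToken(onset=1.000, offset=1.040, release=1.040,
                      voicing_onset=1.000, voiced_time=0.040)
# Hypothetical short-lag token: voicing starts 20 ms after the release.
short_lag = StopToken(onset=2.000, offset=2.100, release=2.100,
                      voicing_onset=2.120, voiced_time=0.025)
```

For stop realizations the ending point coincides with the release, as in both example tokens; for approximants `release` would be `None` and only duration and percent voicing would be computed.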
All tokens of /p t kʷ/ and /k/ were coded according to their place of articulation and their position within the word and syllable into four categories: word-initial onsets, word-medial onsets, word-internal codas, and word-final codas. They were also coded for their phonological contexts: onsets were classified as utterance-initial, intervocalic, post-nasal and post-obstruent. The only stop phoneme that occurred frequently in coda position was /k/. The contexts of coda /k/ were classified as pre-nasal, pre-obstruent and utterance-final. Only vowels can occur before a coda consonant in Witzapan Nawat, which is why the segment before the coda stop is not coded for.
Examples of these categories are shown in Table 2, highlighting the relevant phoneme in bold. Notice that the intervocalic category includes tokens between vowels and glides /j w/.
Phonemes classified as obstruents in the 'post-obstruent' and 'pre-obstruent' categories are /p t k kʷ t͡s t͡ʃ s ʃ h/. In all, 2,024 tokens were collected from the recordings of the five Witzapan Nawat speakers, corresponding to 293 tokens of /p/, 543 of /t/, 180 of /kʷ/ and 1,008 tokens of /k/. A total of twenty tokens were discarded for being unreleased stops or because of background noise. As seen in the following section, the descriptive statistics reveal categorical differences between /p t kʷ/ and /k/ in their VOT, duration, and percent voicing. For this reason, it was deemed that there was no need to perform further inferential analyses.
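The coding scheme described above can be sketched as a small lookup on the neighbouring segment. This is a minimal illustration under stated assumptions, not the author's coding procedure: the function names, the plain-text segment labels, and the vowel and nasal sets are hypothetical, while the obstruent set and the per-phoneme token counts are taken from the text.

```python
# Segment classes. The obstruent set follows the text; the vowel/glide and
# nasal sets are assumptions made for this illustration.
VOWELS_AND_GLIDES = {"a", "e", "i", "u", "j", "w"}
NASALS = {"m", "n"}
OBSTRUENTS = {"p", "t", "k", "kʷ", "ts", "tʃ", "s", "ʃ", "h"}


def onset_context(previous_segment):
    """Classify an onset stop by the segment before it (None = start of utterance)."""
    if previous_segment is None:
        return "utterance-initial"
    if previous_segment in VOWELS_AND_GLIDES:
        return "intervocalic"  # the category includes glides /j w/
    if previous_segment in NASALS:
        return "post-nasal"
    if previous_segment in OBSTRUENTS:
        return "post-obstruent"
    return "other"


def coda_context(next_segment):
    """Classify a coda stop by the segment after it (None = end of utterance)."""
    if next_segment is None:
        return "utterance-final"
    if next_segment in NASALS:
        return "pre-nasal"
    if next_segment in OBSTRUENTS:
        return "pre-obstruent"
    return "other"


# Token counts per phoneme as reported in the text.
TOKEN_COUNTS = {"p": 293, "t": 543, "kʷ": 180, "k": 1008}
TOTAL_TOKENS = sum(TOKEN_COUNTS.values())
```

Summing the reported per-phoneme counts reproduces the stated total of 2,024 tokens.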

VOT
Word-initial onset tokens of /p t kʷ/ and /k/ in the utterance-initial context lack acoustic cues to establish their duration and percent voicing. For this reason, VOT is the only correlate of voicing measured for utterance-initial tokens. A total of forty-three utterance-initial tokens of /p/, seventy-one of /t/, fifty-nine of /kʷ/ and 132 of /k/ were collected. Their VOT distribution is shown in Figure 3. Each box in the plot represents a stop phoneme, while the black horizontal lines within the boxes stand for the median VOT value. The height of the boxes represents the difference between the first and third quartile, known as the INTERQUARTILE RANGE. Whiskers stand for the extreme data points within 0.5 times the length of the box, large dots represent mean values, and the small black dots outside the boxes represent outliers. The horizontal dotted line at 0 ms separates positive from negative VOT values.
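The quantities displayed in such a box plot can be computed directly, as in the sketch below. It is an illustration, not the plotting code behind the figures: the function name `box_stats` and the sample values are hypothetical, and the quartile method (Python's "inclusive" interpolation) is an assumption, since plotting tools differ on this point. The whisker multiplier is left as the parameter `k`, so that either the value stated in the text (0.5) or the 1.5 that many plotting libraries use by default can be passed.

```python
import statistics


def box_stats(values, k):
    """Summary statistics of a box plot with whiskers at k times the box length."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1                       # height of the box
    low_fence = q1 - k * iqr
    high_fence = q3 + k * iqr
    inside = [v for v in values if low_fence <= v <= high_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "iqr": iqr,
        "mean": statistics.mean(values),
        "whisker_low": min(inside),     # most extreme points within the fences
        "whisker_high": max(inside),
        "outliers": [v for v in values if v < low_fence or v > high_fence],
    }


# Hypothetical values, not data from the study.
stats = box_stats([1, 2, 3, 4, 5, 100], k=1.5)
```

With these made-up values the single point at 100 falls outside the upper fence and is reported as an outlier, while the whiskers extend to 1 and 5.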
A marked difference between /k/ and the other stop phonemes is patent in the utterance-initial context: while /p t kʷ/ have a mean VOT of around 13-24 ms, /k/ has a negative mean of −37 ms, which indicates the presence of voicing before the release of the consonant. A similar trend is presented in Figure 4, showing the VOT of sixty-seven word-initial onset tokens of /p/, 126 of /t/, seventy-nine of /kʷ/ and 188 of /k/ in the intervocalic, post-nasal and post-obstruent contexts. Word-initial /p t kʷ/ in these contexts have a mean VOT of around 11-26 ms. In stark contrast, word-initial onset tokens of /k/ after a nasal and after an obstruent have mean VOT values of around −25 ms and −23 ms.
VOT could not be measured for word-initial intervocalic /k/ because tokens were almost categorically produced as velar approximants [ɣ̞], which have neither stop closure nor release. Only 10% (12/114) of tokens of word-initial, intervocalic /k/ showed other types of realization, either voiced stops or elisions.
[Figure: An approximant realization of word-initial, intervocalic ...]
Next, Figure 6 presents the VOT of 183 word-medial onset tokens of /p/, 250 of /t/, 110 of /kʷ/, and 375 of /k/. Phonemes /p t/ display similar positive VOT between 12-13 ms in the three contexts considered. In contrast, word-medial /kʷ/ tokens show a mean VOT [...]
In the case of codas, the only phoneme that occurs frequently in this position is /k/. However, since most collected coda tokens are followed by obstruents or pauses, the onset of voicing cannot be established and VOT could not be measured.

Consonant duration
The duration of word-initial onset tokens is shown in Figure 7, pointing again at important differences between /k/ and /p t kʷ/. While the mean duration of word-initial onset /p t kʷ/ in the considered contexts ranges between 89-103 ms, tokens of /k/ are roughly half as long or shorter, showing a mean duration of 50 ms in the intervocalic context, 30 ms post-nasally and 48 ms after an obstruent.
Figure 8 presents the duration of the Witzapan Nawat stop phonemes in word-medial onset position. Again, tokens of /p t kʷ/ follow similar patterns: they have a mean duration close to 100 ms in the intervocalic and post-obstruent contexts. After a nasal, /p t/ and especially /kʷ/ are considerably shorter, having a mean length of 57 ms, 59 ms and 30 ms respectively. In comparison, word-medial /k/ is considerably shorter than the other stops in the intervocalic and post-nasal contexts, with a mean duration of 47 ms and 29 ms.
Tokens of word-medial onset /k/ are longest after an obstruent, in which case they have a mean duration of 90 ms, comparable to those of the other stop phonemes in the same context.
A total of 109 tokens of coda /k/ in word-internal position and 203 in word-final position were collected. Their duration according to their phonological context (pre-nasal, pre-obstruent or utterance-final) is shown in Figure 9. While the mean duration of coda [...]

Percent voicing
The percent voicing of stops in word-initial onset position is presented in Figure 10. Once again, /p t kʷ/ seem to pattern together, as they exhibit a mean percent voicing close to 27% or less in the considered contexts. In sharp contrast, tokens of word-initial onset /k/ in the intervocalic, post-nasal and post-obstruent contexts categorically show a mean percent voicing close to 100%, as represented by its median of 100% and lack of interquartile range.
The percent voicing of word-medial onset stops is shown in Figure 11. Stops /p t kʷ/ have a similar mean percent voicing of around 25% in the intervocalic context and, after an obstruent, their voicing is reduced to almost 0%. In the post-nasal context, /p t/ report similar mean percent voicing of 23% and 26%, but post-nasal /kʷ/ follows a clearly different trend in that its tokens have a mean percent voicing close to 100%. In contrast, tokens of word-medial onset /k/ are fully voiced intervocalically and after a nasal but, like the other stops, /k/ tokens show a mean percent voicing close to 0% when they occur after an obstruent.
The percent voicing of coda /k/ by position and phonological context is shown in Figure 12. In general, voicing of coda /k/ shows greater variation than onset /k/. The mean percent voicing of word-internal coda /k/ is 60% in the pre-nasal context and 45% in the pre-obstruent context. On the other hand, word-final coda /k/ shows mean percent voicing between 75% in the pre-nasal context and 45% when the following consonant is an obstruent. In the utterance-final context, coda /k/ has a considerably lower percent voicing averaging 17%.
To summarize, the descriptive statistics of VOT, consonant duration, and percent voicing reveal striking and categorical acoustic differences between /p t kʷ/ and /k/ in Witzapan Nawat. These differences and their implications for the analysis of the phonemic inventory of Witzapan Nawat will be discussed in the next section.

Discussion
In this section, based on the analyses of the acoustic data, I propose that the stop inventory of Witzapan Nawat is asymmetrical: it consists of voiceless stops /p t kʷ/ and the VOICED velar stop /g/. I show how this is an unusual asymmetry because it is not predicted by markedness or feature-economy theories of phonemic inventory structure. Finally, I propose a series of diachronic developments that led to the creation of this rare system in line with Evolutionary Phonology (Blevins 2004).

/g/ as the velar stop phoneme in Witzapan Nawat
In word-initial onset position, /p t kʷ/ in the utterance-initial, intervocalic, post-nasal, and post-obstruent contexts categorically have positive VOT values. Their mean duration varies between 89-103 ms, and their mean percent voicing fluctuates between 0% in the post-obstruent context and 27% in the intervocalic and post-nasal contexts. As word-medial onsets, /p t/ tokens also display positive VOT values in all contexts, mean durations of around 100 ms, and percent voicing between 0-27%. On the other hand, word-medial onset /kʷ/ follows the pattern of /p t/ in the intervocalic and post-obstruent contexts, displaying positive VOT, a mean duration close to 100 ms, and a percent voicing between 0-27%, but shows negative VOT, a mean duration of 27 ms, and a mean percent voicing close to 100% in the post-nasal context. Thus, Witzapan Nawat /p t kʷ/ in almost all positions and contexts have acoustic characteristics (positive VOT, longer duration, and relatively low percent voicing) that are comparable to voiceless stops in other languages, that is, stops specified as [−voice] (Lisker & Abramson 1964). The only exception is /kʷ/, which shows characteristics of a voiced stop [gʷ] word-medially in the post-nasal context, as in kitankwa [gi.ˈtaŋ.gʷa] 'it (i.e., a coyote) bites it (a rabbit)'.
In stark contrast to the other stops, word-initial onset /k/ shows mean negative VOT in the utterance-initial, post-nasal and post-obstruent contexts. In the intervocalic context, it is produced categorically as an approximant. Duration-wise, word-initial /k/ is shorter than the other stops, averaging 38-50 ms in length, and its percent voicing is close to 100% in all contexts. As a word-medial onset, /k/ has negative VOT post-nasally but positive VOT in the post-obstruent context. Its mean duration and percent voicing are 29 ms and 100% in the post-nasal context and 90 ms and close to 0% in the post-obstruent context. As was the case with word-initial onset /k/, practically all intervocalic tokens of word-medial /k/ are produced as approximants.
As for coda /k/, it has a duration of 90 ms in the utterance-final context and is longer than in pre-nasal and pre-obstruent contexts, where it averages 50-55 ms. Likewise, utterance-final coda /k/ has an average percent voicing of 17%, which is lower than in pre-nasal and pre-obstruent contexts. On the other hand, percent voicing of coda /k/ is higher in the pre-nasal context, where it averages 55-75%, compared to the pre-obstruent context, where it averages 30%. Another important finding is that there is more variation in the percent voicing values of coda /k/ compared to onset /k/.
It is clear from the acoustic analyses that, unlike /p t kʷ/, onset /k/ has three allophones in Witzapan Nawat: voiceless stops, approximants and voiced stops. Notably, the only context in which onset /k/ consistently has the characteristics of a voiceless stop (positive VOT, longer duration, and relatively low percent voicing) is word-medially in the post-obstruent context. Conversely, in the intervocalic context, /k/ is produced as an approximant in word-initial and word-medial positions. In all other contexts and positions, onset /k/ shows negative VOT, shorter duration, and percent voicing close to 100%, features that are representative of voiced stops (Lisker & Abramson 1964; Beckman et al. 2013).
Coda /k/ follows different patterns from onset /k/. In the utterance-final context, coda /k/ shows duration and percent voicing comparable to voiceless stops. Nevertheless, pre-nasal and pre-obstruent coda /k/ is longer than onset /k/ in contexts where the latter is produced as a voiced stop but shorter than all other stops. Moreover, the percent voicing of coda /k/ shows more variability than onset /k/. In fact, even in the pre-nasal context, the mean percent voicing of coda /k/ is not 100%. This is evidence that pre-nasal and pre-obstruent coda /k/ shows acoustic characteristics intermediate between voiced and voiceless stops. Similar asymmetries between onset and coda segments are not uncommon and are often analyzed as instances of voice underspecification (Archangeli 1988; Inkelas 1994; Ernestus 2000; Bale et al. 2014). In this case, I suggest that coda /k/ in pre-nasal and pre-obstruent contexts is not specified for the [±voice] feature, and therefore does not display the percent voicing and duration of segments specified for [±voice].
To summarize, the allophones of /k/ and the positions and phonological contexts in which they are found are presented in Table 3, with the relevant segment highlighted in bold. Audio files of these words and phrases from the L1 Witzapan Nawat speakers are available as supplementary materials.
This allophonic distribution prompts the re-evaluation of /k/ as the velar stop phoneme in Witzapan Nawat for one main reason: if /k/ is underlying, it is difficult to account for the presence of the voiced allophone [g] in contexts that are not conducive to voicing in obstruents. For instance, if /k/ is taken as the velar stop phoneme, the voiced allophone [g] after nasals can be readily justified given the propensity of voiceless stops to gain the [+voice] feature in that context (Pater 1999). However, the acoustic data show that [g] also occurs when word-initial /k/ is in the utterance-initial and post-obstruent contexts, which crosslinguistically disfavor voicing in stops (Flege & Brown 1982; Westbury & Keating 1986; Wetzels & Mascaro 2001; Hayes 2004; Beckman et al. 2013). In order to account for the voicing of /k/ in these cases, it would be necessary to invoke 'quirky' rules or markedness constraints that give /k/ the [+voice] feature without any phonetic motivation.
Consequently, an interpretation that better accounts for the phonological facts is proposed, in which the velar stop phoneme is /g/, a phoneme specified for [+voice].[4] This way, the allophones of the velar stop in different contexts can be readily explained by common phonological processes. More precisely, the voiceless stop allophone [k] that is found in the word-medial post-obstruent context, as in wej[k]a 'far', is accounted for as a case of progressive voicing assimilation, in which /g/ acquires the [−voice] specification of the previous voiceless obstruent. As seen in Table 3, voicing assimilation to a previous obstruent is blocked when /g/ is word-initial, as in tikuyat [g]a né 'we shell it over there'. This can be explained by appealing to the status of the beginning of a word as a phonologically strong position. Evidence from a variety of languages shows that segments in the word-initial position are resistant to assimilatory processes that affect the same segment elsewhere in the word, perhaps due to their psycholinguistic prominence (McCarthy & Prince 1995; Beckman 2004). Because of this, spreading of the [−voice] feature from a previous obstruent is blocked when /g/ is in word-initial position.
[4] Another possible analysis is that the underlying velar phoneme is the velar approximant /ɣ̞/. The problem with this proposal is one of opacity. If /ɣ̞/ is underlying, the voiced stop allophone [g] found word-initially in the utterance-initial, post-obstruent, and post-nasal contexts can be explained as positional strengthening, that is, the production of a segment with increased constriction in a prominent position (Lavoie 2015). However, the question arises of how the voiceless stop allophone [k] in the word-medial post-obstruent context comes to be, as in wej[k]a 'far'. If /ɣ̞/ is underlying, it is not clear whether the feature [−voice] or [+continuant] spreads first from the previous voiceless obstruent. On the other hand, if the velar stop phoneme is taken as /g/, the voiceless stop allophone is simply accounted for by spreading of the feature [−voice] from the previous obstruent.
Similarly, the velar approximant allophone [ɣ̞] found in the intervocalic position, as in mutu[ɣ̞]a 'it is planted', is the result of a process of spirantization of the voiced velar stop /g/, which can be analyzed as the spreading of the [+continuant] feature from the adjacent vowels. Voiced stop/approximant alternations of this type are robustly documented in numerous languages (Kirchner 2004; Martínez Celdrán & Regueira 2008). Finally, in all other contexts, including utterance-initially, onset /g/ maintains its [+voice] and [−continuant] specifications and surfaces as a voiced stop.
As for the velar stop in coda position, the acoustic analyses reveal that it displays characteristics that are intermediate between voiced and voiceless stops when it occurs in the pre-nasal and pre-obstruent contexts. For this reason, I propose that the velar stop is unspecified for [±voice] in these contexts. However, when a coda velar stop occurs in the utterance-final context, it consistently shows acoustic properties of voiceless stops, as in kuchilti[k]## 'orange (color)'. To account for these systematic differences in behavior between utterance-final velars and coda velars in pre-nasal and pre-obstruent contexts, I propose that utterance-final velars acquire the feature [−voice] because this environment is not phonetically conducive to voicing: in anticipation of the end of the utterance, the vocal folds begin to spread to reach their resting position. This inhibits voicing, especially in obstruents (Hock 1991: 80; Blevins 2004: 104).
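The allophonic distribution proposed in this section can be restated as a short series of context checks. The following Python sketch is only an illustrative summary of the analysis, not part of the original study: the function name `velar_allophone`, its parameters, and the context labels are hypothetical, and the categorical outputs abstract away from the gradient acoustic measurements.

```python
def velar_allophone(syllable_position, word_initial, environment):
    """Return the proposed surface realization of underlying /g/.

    syllable_position: "onset" or "coda"
    word_initial: True if the velar is the first segment of the word
    environment: one of "utterance-initial", "intervocalic", "post-nasal",
        "post-obstruent", "pre-nasal", "pre-obstruent", "utterance-final"
    All labels are illustrative assumptions, not the study's coding scheme.
    """
    if syllable_position == "coda":
        # Utterance-final codas are voiceless; other codas are intermediate.
        if environment == "utterance-final":
            return "voiceless stop [k]"
        return "stop unspecified for [voice]"
    if environment == "intervocalic":
        # Spirantization: [+continuant] spreads from the adjacent vowels.
        return "velar approximant"
    if environment == "post-obstruent" and not word_initial:
        # Progressive assimilation of [-voice]; blocked word-initially.
        return "voiceless stop [k]"
    # Elsewhere, onset /g/ keeps [+voice] and [-continuant].
    return "voiced stop [g]"


# Glosses are taken from the examples cited in the text.
examples = [
    ("'far'", "onset", False, "post-obstruent"),
    ("'we shell it over there'", "onset", True, "post-obstruent"),
    ("'it is planted'", "onset", False, "intervocalic"),
    ("'orange (color)'", "coda", False, "utterance-final"),
]
for gloss, position, initial, env in examples:
    print(gloss, "->", velar_allophone(position, initial, env))
```

The only place where word position matters in this sketch is the post-obstruent onset, which mirrors the claim that word-initial position blocks the spreading of [−voice].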
In contrast to /g/, the Witzapan Nawat labiovelar stop is specified as [−voice]. The acoustic analysis revealed that this phoneme had a voiced stop allophone [gʷ] word-medially in the post-nasal context, as in kitankwa [gi.ˈtaŋ.gʷa] 'it bites it'. However, in this case, it cannot be claimed that the [+voice] feature is underlying for the labiovelar phoneme. Rather, it acquires this feature via spreading from the previous nasal. Further evidence of this comes from the fact that voiced allophones of the labiovelar do not occur when /kʷ/ is in word-initial position, even after a nasal, as in wan kwilin [ˈwaŋ ˈkʷi.lin] 'and worms'. This can be analyzed as another effect of the word-initial position, as described earlier. In the case of word-initial /kʷ/, voicing assimilation is blocked and the [−voice] specification of the stop is retained in the post-nasal context. Conversely, the same effect results in word-initial /g/ maintaining its [+voice] specification after an obstruent.
To summarize, based on phonetic and phonological data, I propose that the stop inventory of Witzapan Nawat consists of voiceless /p t kʷ/ and voiced /g/.5 This inventory is asymmetrical because the feature [+voice] is present in only one of the members of the system. Moreover, it is an unusual inventory because it is not predicted by most theories of phonemic inventory structure. For instance, theories based on markedness posit that, due to their phonetic properties, some segments are more 'marked' than others (Gamkrelidze 1975). Consequently, asymmetries in phonemic inventories are caused by the absence of the most marked segments. To illustrate, it is widely acknowledged that, in languages with a phonological voicing contrast in stops, the voiced velar stop /g/ is more likely to be missing than stops at other points of articulation, the so-called 'g-gap'. This is the case in Helong (Balle 2017), Setswana (Boyer & Zsiga 2013), some varieties of Galician (Martínez-Gil 2003), and Dutch (Booij 1999), among many other languages. Under the markedness approach, the g-gap is explained by the fact that /g/ is more marked than other voiced stops for aerodynamic reasons: the supraglottal cavity is smaller in velar stops than in stops at other points of articulation, and this makes it more difficult to sustain the air pressure differential that is necessary for voicing in stops (Ohala 1983: 40). For this reason, this model predicts that, if a language has a voiced stop, it will be unmarked /b d/ rather than marked /g/, which is the opposite of what is observed in Witzapan Nawat.
5 Unlike some of the Nahuan languages mentioned in Section 2, there is no evidence of two velar stop phonemes /k/ and /g/, as I have not been able to identify any minimal pairs.
On the other hand, feature-systemic models propose that phonemic inventories tend to maximize the number of segments that bear a feature that is already present in the system (Clements 2003). Thus, the prediction is that, if a language has a stop with the [+voice] feature at a given place of articulation, it will also tend to have this feature in stops at other points of articulation to maximize feature economy. Nevertheless, in the case of Witzapan Nawat, the [+voice] feature in obstruents is only present in /g/ and is not maximally or economically distributed across other manners and points of articulation.
A different approach to phonemic inventory structure is offered by Evolutionary Phonology (Blevins 2004). In this framework, synchronic sound patterns are understood in terms of their diachronic origin. Departing from theories that see sound change as a symmetry-inducing factor in phonemic inventories, Evolutionary Phonology posits that, in fact, sound change leads to asymmetry just as often. To cite one example, the g-gap that many languages develop independently is an asymmetry that is introduced via phonetically motivated sound change. Accordingly, since sound change and language change in general do not necessarily lead to symmetry, the prediction is that asymmetrical phonemic inventories are a natural, and indeed common, consequence of diachrony. In the following subsection, I explain the Witzapan Nawat inventory as the result of a series of historical developments involving sound change and analogy, in line with Evolutionary Phonology.

The origin of an asymmetrical stop subsystem
Although numerous studies point to the incompatibility of velar stops and voicing for aerodynamic reasons (Ohala 1983; Maddieson 1999), there is also diachronic and synchronic evidence from several languages suggesting that /k/ and /kʷ/ are more likely to become voiced than other voiceless stops, especially in the intervocalic context. To illustrate, in the development of several Romance languages, voicing of Latin /k/ to /g/ was more frequent and occurred earlier than voicing of stops at other points of articulation (deGorog 1962; Recasens 2002). In fact, in modern Spanish, intervocalic /k/ is more likely to be fully voiced than /p t/ (Hualde et al. 2011), a pattern that is particularly frequent in Chilean Spanish (Bolyanatz & Rogers 2019). Preferential voicing of /k/ over other voiceless stops is also found in Basque (Hualde et al. 2019), the Papuan languages Kaeti and Wambon (Healey 1970), Honduran Lenca (King 2017), Emberá Katío (Greenfield Vélez 2012), and Q'anjob'al (Lichtman et al. 2010). Moreover, as seen in Section 2, some Nahuan languages of Mexico are reported to have undergone a shift /kʷ/ > /b/ while all other stops remained voiceless. An identical change is reported in several Muskogean languages (Booker 1993).
Previous studies have proposed that the crosslinguistic propensity for the voicing of velars in the intervocalic position is due to their articulatory characteristics (Hualde et al. 2011: 326, fn. 9; Recasens 2002; Shaw et al. 2020: 610). Unlike bilabial and coronal stops, voiceless velar stops and vowels share the tongue body as their active articulator. Thus, in the articulation of a Vowel+[k]+Vowel sequence, the tongue body must move quickly from the open gesture of the first vowel to the velar closure gesture of the stop and then to the open gesture of the following vowel. The opposing articulatory targets imposed on the tongue body by the velar stop and the surrounding vowels can result in a reduced duration of the closure gesture at the velum. In turn, shorter obstruent segments have been shown to have greater proportions of voicing and to be more likely to be perceived as voiced (Cole & Cooper 1975; Ohala & Riordan 1979; Summerfield 1981; Westbury & Keating 1986).
In support of this proposition, a number of languages report shorter duration of voiceless velar stops compared to other places of articulation, especially in the intervocalic context (Maddieson 1999). This is the case of /k/ in Spanish (Hualde et al. 2011: 318), American English (Umeda 1977),6 Hungarian (Neuberger 2015), and Oaxaca Chontal (Maddieson et al. 2009). In Sierra Norte de Puebla Nahuatl, a Nahuan language described with sporadic voicing of /k/, the velar stop in the intervocalic position is shorter than other stops (Kakadelis 2018: 215).
Considering these facts, I propose the following diachronic pathway that led to the asymmetrical phonemic inventory of modern Witzapan Nawat. Originally, the stop system of this variety consisted of /p t k kʷ/, like most Nahuan languages. However, at some point in time, a voiced stop allophone [g] of the velar stop phoneme /k/ started to appear in the intervocalic context.7 This scenario is further supported by the other Nahuan and Nawat varieties that have sporadic [g] only in the intervocalic position, as seen in Section 2.8 In a subsequent development, voiced allophones of /k/ spread to the post-nasal context, which is also conducive to voicing in stops (Pater 1999).
It is likely that contexts in which word-initial /k/ was produced as [g] were more frequent than contexts in which it was produced as [k]. Modern Witzapan Nawat offers evidence in favor of this scenario, since in the recordings of the Witzapan Nawat speakers, the most frequent contexts of the velar stop in word-initial position are those conducive to voicing: 156 tokens were collected in the intervocalic context and 20 in the post-nasal context. In comparison, 132 tokens of the velar stop in word-initial position were found in the utterance-initial context and 12 in the post-obstruent context, which do not favor voicing. If this distribution is reflective of past stages of the language, when there was an alternation between voiceless and voiced stop allophones of the velar stop, that would mean that productions of the word-initial velar stop as [g] were more frequent than productions as [k]. Previous studies find that, in similar situations of allophonic alternation of word-initial segments, the most frequent variant tends to be generalized over the others (Raymond & Brown 2012; Bybee 2017).
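The frequency argument rests on a simple tally of the word-initial token counts reported above. As a quick check of that arithmetic, the following Python sketch sums the four counts and computes the share found in voicing-favoring contexts. The variable and function names are illustrative assumptions, and the counts are copied directly from the text.

```python
# Token counts for the word-initial velar stop, as reported in the text.
TOKENS = {
    "intervocalic": 156,
    "post-nasal": 20,
    "utterance-initial": 132,
    "post-obstruent": 12,
}
# Contexts described in the text as conducive to voicing.
VOICING_FAVORING = {"intervocalic", "post-nasal"}


def favoring_share(tokens, favoring):
    """Return (favoring count, total count, favoring proportion)."""
    total = sum(tokens.values())
    favored = sum(n for ctx, n in tokens.items() if ctx in favoring)
    return favored, total, favored / total


favored, total, share = favoring_share(TOKENS, VOICING_FAVORING)
print(f"{favored} of {total} tokens ({share:.0%}) are in voicing-favoring contexts")
```

With these counts, 176 of the 320 word-initial tokens (55%) occur in contexts that favor voicing, against 144 (45%) in contexts that do not, so the majority on which the analogical scenario relies is real but modest.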
Given this scenario, I propose that, at some point in time, Witzapan Nawat speakers analogically generalized the voiced stop allophone [g] of word-initial /k/ to all contexts, even to those that were not conducive to voicing, such as after an obstruent, as in yejemet kisat [je.ˈhe.met ˈgi.sat] 'they leave', and utterance-initially, as in ##kisa [ˈgi.sa] 'she leaves'. In contrast, in the word-medial post-obstruent context and utterance-finally, in which there was never alternation between voiced and voiceless allophones, the velar stop continued to be produced as [k], as in wejka [ˈweh.ka] 'far' and kuchiltik## [gu.ˈt͡ʃil.tik] 'color orange'.
Finally, at some later time, voiced stop allophones spirantized to velar approximants [ɣ̞] in the intervocalic context. This is the modern allophonic distribution of the velar stop phoneme in Witzapan Nawat, which is now better analyzed as a voiced velar stop /g/, a segment specified as [+voice], rather than /k/.
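The proposed pathway can be laid out as an ordered sequence of changes applied to the inherited /k/. The sketch below is a deliberately simplified illustration under stated assumptions: the function names, the ordering encoded in `STAGES`, and the context labels are hypothetical conveniences, and each change is treated as categorical even though the actual developments were presumably gradual and variable.

```python
def intervocalic_voicing(seg, word_initial, env):
    # Stage 1: [k] > [g] between vowels.
    return "g" if seg == "k" and env == "intervocalic" else seg


def postnasal_voicing(seg, word_initial, env):
    # Stage 2: [k] > [g] after a nasal, regardless of word position.
    return "g" if seg == "k" and env == "post-nasal" else seg


def analogical_generalization(seg, word_initial, env):
    # Stage 3: the more frequent variant [g] is extended to every
    # word-initial token, including contexts that disfavor voicing.
    return "g" if seg == "k" and word_initial else seg


def spirantization(seg, word_initial, env):
    # Stage 4: voiced stops become approximants between vowels.
    return "approximant" if seg == "g" and env == "intervocalic" else seg


STAGES = [
    intervocalic_voicing,
    postnasal_voicing,
    analogical_generalization,
    spirantization,
]


def derive(word_initial, env):
    """Return the realization of inherited /k/ before and after each stage."""
    seg = "k"
    history = [seg]
    for change in STAGES:
        seg = change(seg, word_initial, env)
        history.append(seg)
    return history


contexts = [
    ("word-initial, utterance-initial", True, "utterance-initial"),
    ("word-initial, post-obstruent", True, "post-obstruent"),
    ("word-initial, intervocalic", True, "intervocalic"),
    ("word-medial, post-nasal", False, "post-nasal"),
    ("word-medial, post-obstruent", False, "post-obstruent"),
    ("utterance-final coda", False, "utterance-final"),
]
for label, initial, env in contexts:
    print(f"{label:32s}", " > ".join(derive(initial, env)))
```

In this sketch only the third stage refers to word position, which reflects the claim that the phonetically motivated changes applied regardless of position while analogy targeted word-initial tokens specifically.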
This proposed diachrony of the phonemic inventory of Witzapan Nawat is in line with the tenets of Evolutionary Phonology, according to which synchronic sound patterns are a reflection of their diachrony. As such, the prediction is that asymmetrical phonemic inventories are not only natural, but expected, because sound change, and language change in general, are not teleological: they operate with no regard for symmetry or articulatory/perceptual ease. Moreover, Evolutionary Phonology posits that common sound patterns are the result of common, phonetically motivated sound changes. On the other hand, rare sound patterns, including rare phonemic inventories, arise from the application of non-phonetic processes, such as analogy (Blevins 2004: 192). The diachrony of phonemic /g/ that I advance is in agreement with this claim. In the development of the modern Witzapan Nawat phonemic inventory, an arguably common sound change led to the allophonic voicing of intervocalic and post-nasal /k/. However, it is only through the action of analogy, likely facilitated by frequency effects, that the shift /k/ > /g/ took place, leading to the creation of an asymmetrical and rare stop inventory: /p t g kʷ/.

Conclusions
With this first instrumental study on the phonology/phonetics of Witzapan Nawat, I hope to contribute to the Nawat revitalization movement. This is because, in order to effectively teach the pronunciation of this language to L2 learners, a thorough understanding of its sound patterns is needed. This paper also leaves ample space for future research. For instance, it stresses the need for more research on Nahuan languages in which /k/ voicing has not advanced to the extent of Witzapan Nawat. Doing so will complete the picture of the origin of this phenomenon, namely whether it is common to all of Nahuan or an innovation of the Isthmus dialects that spread, and whether /k/ voicing diffuses gradually through the lexicon from high-frequency words or morphemes. Moreover, it also brings out the need to assess the relationship between /k/ and /kʷ/ voicing in Nahuan to establish whether the /kʷ/ > /b/ shift found in some Nahuan languages can be accounted for by the same diachronic mechanisms that I proposed for the /k/ > /g/ change in Witzapan Nawat. Finally, this study highlights yet again the need for the documentation of highly endangered languages, not only for the sake of linguistic sciences, but also for its potential contributions to revitalization initiatives.

Figure 1. Production of utterance-initial /k/ in the word kuchiltik 'color orange' showing a negative VOT value. Production by speaker F3.

Figure 2. Production of word-initial intervocalic /t/ in the phrase taja tikwalani 'you get mad' (F2). The onset of the consonant is marked at the end of the formant structure of the previous vowel and its ending point is set before the stop release. This example shows a period of voicing in the consonant, used to calculate the percent voicing.

Figure 3. VOT of word-initial onset /p t kʷ/ and /k/ in the utterance-initial context.

Figure 4. VOT of word-initial onset /p t kʷ/ and /k/ by phonological context.

Figure 5. Production of word-initial intervocalic /k/ as an approximant in the phrase kenha keman pewa 'it is the same when it begins' (M1).
/k/ is presented in Figure 5. The waveform and spectrogram corresponding to /k/ show decreased intensity, lack of a stop closure and release, and presence of voicing and formant structure throughout the consonant. These acoustic characteristics are consistent with approximants in languages such as Iwaidja (Shaw et al. 2020), Galician (Martínez Celdrán & Regueira 2008), and Spanish (Martínez Celdrán 1991).

Figure 6. VOT of word-medial onset /p t kʷ/ and /k/ by phonological context.
of 24 ms in the intervocalic and post-obstruent contexts and a negative mean of −27 ms post-nasally. Word-medial /k/ follows a similar trend, as post-nasal tokens have a negative mean of −25 ms while post-obstruent tokens show a positive VOT mean of 25 ms. As was the case with word-initial onsets, almost all intervocalic tokens of word-medial onset /k/ were produced as approximants and were not measured for VOT. Only 10% (23/213) of tokens of word-medial /k/ in the intervocalic context were produced as voiced stops or voiceless stops, or were deleted.

Figure 7. Duration of word-initial onset /p t kʷ/ and /k/ by phonological context.

Figure 8. Duration of word-medial onset /p t kʷ/ and /k/ by phonological context.

Figure 10. Percent voicing of word-initial onset /p t kʷ/ and /k/ by phonological context.

Figure 11. Percent voicing of word-medial onset /p t kʷ/ and /k/ by phonological context.

Figure 12. Percent voicing of coda /k/ by position within word and phonological context.

Table 1. Nawat phonemic inventory, based on . The corresponding letters in the official Nawat alphabet are in bold font.
The allophony of /k/ in Witzapan Nawat according to the previous literature is summarized as follows: [g] at the beginning of a word or after a voiced consonant: kal [ˈ

Table 2. Positions within the word/syllable and phonological contexts used for coding tokens of /p t kʷ/ and /k/
Pre-obstruent: kwak temu 'when it descends'
Utterance-final: ishtapachijtuk## 'crooked'

Table 3. Allophones of /k/ in Witzapan Nawat by word position and phonological context based on the analyses of correlates of voicing
Type of production of /k/
piʃ.kat] 'we harvest it'. At this point, voicing spreading would operate regularly regardless of word position, affecting word-medial /k/, as in nikan [ˈni.gaŋ] 'here' and anka [ˈaŋ.ga] 'maybe', but also word-initial /k/, as in ne kal [ne ˈgal] 'the house' and ipan kisa [ˈipaŋ ˈgisa] 'it leaves from behind'. Since the voiced stop allophone [g] occurred only in contexts that favored /k/ voicing, the voiceless stop allophone [k] was produced in all other instances, such as in the post-obstruent context, as in wejka [ˈweh.ka]. Consequently, at some point in the history of Witzapan Nawat, there were synchronic allophonic alternations of word-initial /k/. On the one hand, it was produced as [k] in the post-obstruent context, as in yejemet kisat [je.ˈhe.met ˈki.sat] 'they leave', and utterance-initially, as in ##kisa [ˈki.sa] 'she leaves'. On the other hand, word-initial /k/ was produced as [g] in the intervocalic context, as in ne kisa [ne ˈgi.sa] 'the one who leaves', and after a nasal, as in ipan kisa [ˈipaŋ ˈgisa] 'it leaves from behind'.