Hostname: page-component-8448b6f56d-c47g7 Total loading time: 0 Render date: 2024-04-18T06:56:21.447Z Has data issue: false hasContentIssue false

Phonetic Analysis and the Automatic Segmentation and Labeling of Speech Sounds

Published online by Cambridge University Press:  06 February 2009

Peter Roach
Affiliation:
Department of Linguistics & Phonetics, University of Leeds, LS2 9JT, UK
Helen Roach
Affiliation:
School of Computer Studies, University of Leeds, LS2 9JT, UK
Andrea Dew
Affiliation:
Department of Linguistics & Phonetics, University of Leeds, LS2 9JT, UK
Paul Rowlands
Affiliation:
Department of Linguistics & Phonetics, University of Leeds, LS2 9JT, UK

Extract

A fundamentally important practice in phonetics is the analysis of continuous speech into a sequence of discrete segments. There has been considerable debate about the theoretical validity of this practice within classical auditory/kinaesthetic phonetics and in phonology as well as in the context of the acoustic analysis of speech; in recent years the issue has become more widely important as research work in automatic speech processing has resulted in computer algorithms for segmenting speech and assigning phonetic labels to the segments. Work on the automatic segmentation and labeling of speech sounds has been carried on in our department since 1980. This paper begins by examining the theoretical issues involved in segmentation and labeling, then describes our own work.

Type
Articles
Copyright
Copyright © Journal of the International Phonetic Association 1990

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Abercrombie, D. (1965). Parameters and phonemes. In Studies in Phonetics and Linguistics. Oxford: Oxford University Press, 120124.Google Scholar
Abercrombie, D. (1967). Elements of General Phonetics. Edinburgh: Edinburgh University Press.Google Scholar
Dalby, J., Laver, J. and Hiller, S.M. (1986). Mid-class phonetic analysis for a continuous speech recognition system. Proceedings of the Institute of Acoustics, 8.7, 347354.Google Scholar
Dew, D. and Jensen, P.J. (1977). Phonetic Processing. Ohio: Merrill.Google Scholar
Diehl, R.L. (1987). On segments and segment boundaries. Journal of Phonetics, 15, 288–9.CrossRefGoogle Scholar
Elman, J. and McLelland, J. (1986). Exploiting lawful variability in the speech wave, in Perkell, J.S. and Klatt, D.H.Invariance and Variability in the Speech Wave. Hillsdale, New Jersey: Erlbaum.Google Scholar
Fant, C.G.M. (1962). Descriptive analysis of the acoustic aspects of speech. Logos, 5, 317 (reprinted in G. Fant, Speech Sounds and Features. Cambridge: M.I.T., 1973, 17–31).Google ScholarPubMed
Fowler, C.A. (1986). An event approach to the study of speech from a direct-realistic perspective. Journal of Phonetics 14, 328.Google Scholar
Green, P.D. (1985). Speech recognition - what is happening now. Computer Bulletin, September 1985, 57.Google Scholar
Hammarberg, R. (1987). Why segments might have boundaries even if these boundaries have no function. Journal of Phonetics, 15, 287–8.Google Scholar
House, A.S. and Neuberg, E.P. (1977). Toward automatic identification of the language of an utterance. Journal of the Acoustical Society of America, 62(3), 708713.CrossRefGoogle Scholar
Jakobson, R., Fant, C.G.M. and Halle, M. (1951). Preliminaries to Speech Analysis. Cambridge, Mass.: M.I.T. Press.Google Scholar
Ladefoged, P. (1982). A Course in Phonetics. 2nd Edition, New York: Harcourt Brace Jovanovich Inc.Google Scholar
Laver, J. (1970). The production of speech. In: Lyons, J. (ed.), New Horizons in Linguistics, London: Pelican, 5375.Google Scholar
Pike, K.L. (1943). Phonetics. Ann Arbor, Michigan: University of Michigan Press.CrossRefGoogle Scholar
Roach, H.N. and Roach, P.J. (1983). Automatic identification of speech sounds from different languages. Working Papers in Linguistics and Phonetics, 1, 9195.Google Scholar
Roach, P. J. (1989). Phonetic Feature Extraction by Automatic Segmentation and Labeling. Final Report to S.E.R.C. on Alvey project MMI/053, Grant GR/D/28782.Google Scholar
Roach, P.J., Dew, A.M. and Rowlands, P. (1987). Automatic assessment of machine transcriptions. Proc. XI Int. Cong. Phon. Sci., Tallinn, Estonia, 5, 217220.Google Scholar
Roach, P.J. and Roach, H.N. (1984). Automatic language recognition using broadly classified segments. Working Papers in Linguistics and Phonetics. Leeds University, 2, 7986.Google Scholar
Roach, P.J., Roach, H.N. and Dew, A.M. (1986). “Assessing accuracy in automatic identification of phonetic segments”. Proc. Int. Conf. on Speech I/O, London, IEE, 216219.Google Scholar
Roach, P.J., Rowlands, P. and Dew, A.M. (1987). Assessment of accuracy in automatic phonetic analysis. Proc. Euro. Conf. on Speech Tech., Edinburgh, 2, 158160.Google Scholar
Stevens, K.N. and Blumstein, S. (1981). The search for invariant acoustic correlates of phonetic features. In Eimas, P.D. and Miller, J.L.Perspectives on the Study of Speech. Hillsdale, New Jersey: Erlbaum.Google Scholar