Phonetic Analysis and the Automatic Segmentation and Labeling of Speech Sounds

Peter Roach; Helen Roach; Andrea Dew; Paul Rowlands

doi:10.1017/S002510030000400X

Phonetic Analysis and the Automatic Segmentation and Labeling of Speech Sounds

Published online by Cambridge University Press: 06 February 2009

Andrea Dew and

Peter Roach: Affiliation:
Department of Linguistics & Phonetics, University of Leeds, LS2 9JT, UK
Helen Roach: Affiliation:
School of Computer Studies, University of Leeds, LS2 9JT, UK
Andrea Dew: Affiliation:
Department of Linguistics & Phonetics, University of Leeds, LS2 9JT, UK
Paul Rowlands: Affiliation:
Department of Linguistics & Phonetics, University of Leeds, LS2 9JT, UK

Article contents

Extract
References

Get access

Rights & Permissions

Extract

A fundamentally important practice in phonetics is the analysis of continuous speech into a sequence of discrete segments. There has been considerable debate about the theoretical validity of this practice within classical auditory/kinaesthetic phonetics and in phonology as well as in the context of the acoustic analysis of speech; in recent years the issue has become more widely important as research work in automatic speech processing has resulted in computer algorithms for segmenting speech and assigning phonetic labels to the segments. Work on the automatic segmentation and labeling of speech sounds has been carried on in our department since 1980. This paper begins by examining the theoretical issues involved in segmentation and labeling, then describes our own work.

Type: Articles
Information: Journal of the International Phonetic Association , Volume 20 , Issue 1 , July 1990 , pp. 15 - 21

DOI: https://doi.org/10.1017/S002510030000400X [Opens in a new window]
Copyright: Copyright © Journal of the International Phonetic Association 1990

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Abercrombie, D. (1965). Parameters and phonemes. In Studies in Phonetics and Linguistics. Oxford: Oxford University Press, 120–124.Google Scholar

Abercrombie, D. (1967). Elements of General Phonetics. Edinburgh: Edinburgh University Press.Google Scholar

Dalby, J., Laver, J. and Hiller, S.M. (1986). Mid-class phonetic analysis for a continuous speech recognition system. Proceedings of the Institute of Acoustics, 8.7, 347–354.Google Scholar

Dew, D. and Jensen, P.J. (1977). Phonetic Processing. Ohio: Merrill.Google Scholar

Diehl, R.L. (1987). On segments and segment boundaries. Journal of Phonetics, 15, 288–9.CrossRef Google Scholar

Elman, J. and McLelland, J. (1986). Exploiting lawful variability in the speech wave, in Perkell, J.S. and Klatt, D.H.Invariance and Variability in the Speech Wave. Hillsdale, New Jersey: Erlbaum.Google Scholar

Fant, C.G.M. (1962). Descriptive analysis of the acoustic aspects of speech. Logos, 5, 3–17 (reprinted in G. Fant, Speech Sounds and Features. Cambridge: M.I.T., 1973, 17–31).Google Scholar PubMed

Fowler, C.A. (1986). An event approach to the study of speech from a direct-realistic perspective. Journal of Phonetics 14, 3–28.Google Scholar

Green, P.D. (1985). Speech recognition - what is happening now. Computer Bulletin, September 1985, 5–7.Google Scholar

Hammarberg, R. (1987). Why segments might have boundaries even if these boundaries have no function. Journal of Phonetics, 15, 287–8.Google Scholar

House, A.S. and Neuberg, E.P. (1977). Toward automatic identification of the language of an utterance. Journal of the Acoustical Society of America, 62(3), 708–713.CrossRef Google Scholar

Jakobson, R., Fant, C.G.M. and Halle, M. (1951). Preliminaries to Speech Analysis. Cambridge, Mass.: M.I.T. Press.Google Scholar

Ladefoged, P. (1982). A Course in Phonetics. 2nd Edition, New York: Harcourt Brace Jovanovich Inc.Google Scholar

Laver, J. (1970). The production of speech. In: Lyons, J. (ed.), New Horizons in Linguistics, London: Pelican, 53–75.Google Scholar

Pike, K.L. (1943). Phonetics. Ann Arbor, Michigan: University of Michigan Press.CrossRef Google Scholar

Roach, H.N. and Roach, P.J. (1983). Automatic identification of speech sounds from different languages. Working Papers in Linguistics and Phonetics, 1, 91–95.Google Scholar

Roach, P. J. (1989). Phonetic Feature Extraction by Automatic Segmentation and Labeling. Final Report to S.E.R.C. on Alvey project MMI/053, Grant GR/D/28782.Google Scholar

Roach, P.J., Dew, A.M. and Rowlands, P. (1987). Automatic assessment of machine transcriptions. Proc. XI Int. Cong. Phon. Sci., Tallinn, Estonia, 5, 217–220.Google Scholar

Roach, P.J. and Roach, H.N. (1984). Automatic language recognition using broadly classified segments. Working Papers in Linguistics and Phonetics. Leeds University, 2, 79–86.Google Scholar

Roach, P.J., Roach, H.N. and Dew, A.M. (1986). “Assessing accuracy in automatic identification of phonetic segments”. Proc. Int. Conf. on Speech I/O, London, IEE, 216–219.Google Scholar

Roach, P.J., Rowlands, P. and Dew, A.M. (1987). Assessment of accuracy in automatic phonetic analysis. Proc. Euro. Conf. on Speech Tech., Edinburgh, 2, 158–160.Google Scholar

Stevens, K.N. and Blumstein, S. (1981). The search for invariant acoustic correlates of phonetic features. In Eimas, P.D. and Miller, J.L.Perspectives on the Study of Speech. Hillsdale, New Jersey: Erlbaum.Google Scholar

Article contents

Phonetic Analysis and the Automatic Segmentation and Labeling of Speech Sounds

Extract

Access options

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests