|

A corpus is a collection of texts stored
on a computer. This dictionary was created by a group of editors
using sophisticated software tools to search the American
English parts of the Cambridge International Corpus (CIC) for words
and phrases and to analyze the results.
Cambridge University Press has assembled a corpus that includes
700 million words of written and spoken English texts. The
texts in the CIC come from newspapers, novels, nonfiction
books on a wide range of topics, websites, magazines, junk
mail, TV and radio programs, recordings of people's everyday
conversations, and many other sources. The American English
parts of the CIC include 175 million words of written texts,
22 million words of spoken English, 7 million words of academic
texts, and 25 million words taken from business texts.
Although idioms in general are used frequently in both speech
and writing, individual idioms may not be used frequently.
For example, the idiom give up the ghost occurs approximately
once in every 7 million words of corpus texts. (By comparison,
the word give occurs approximately 3,300 times in every
7 million words.) Because there are many more idioms in
American English than we could include in this book, we decided
to include only idioms that occurred at least once in every
10 million words of corpus texts.
The idioms included in the Cambridge Dictionary of American
Idioms have been carefully checked in the American English
parts of the CIC. Our editors have checked to make sure that
each idiom is actually used in current American English, that
it is used often enough to include in this book, and that
the information we give about each idiom describes what people
mean when they say or write it.
|