What is a corpus?
A corpus is a collection of texts stored in a database. It may be quite small or very large.

The uses of a corpus
A corpus can be analyzed with software tools similar to those used to search for keywords on the Internet, but more sophisticated. Evaluating the results of these searches shows how language is really used and helps answer questions like these:

What are the most frequent words and phrases in English?

Which tenses do people use most often?

What prepositions follow particular verbs?

How do people use words like can, may, and might?

How many words must a learner know in order to participate in everyday conversation?

 
Materials developed with a corpus can therefore be more authentic and can illustrate language as it is really used.
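
As a rough illustration of the kind of search described above, the following Python sketch counts the most frequent words and two-word phrases in a tiny sample. The three sample sentences and the function names (tokenize, word_frequencies, phrase_frequencies) are invented for this example; they do not come from any real corpus or from the tools used to develop Touchstone, and real corpus software is far more sophisticated.

```python
import re
from collections import Counter

# Hypothetical mini-corpus, invented for illustration only.
# A real corpus would contain many thousands or millions of words.
SAMPLE_TEXTS = [
    "I think you might be right about that.",
    "You know, I think we can go now.",
    "I think it may rain, you know.",
]


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(texts):
    """Count how often each word occurs across all texts."""
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return counts


def phrase_frequencies(texts, n=2):
    """Count how often each n-word phrase occurs within a single text."""
    counts = Counter()
    for text in texts:
        tokens = tokenize(text)
        counts.update(
            " ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)
        )
    return counts


words = word_frequencies(SAMPLE_TEXTS)
phrases = phrase_frequencies(SAMPLE_TEXTS)

print("Most frequent words:", words.most_common(3))
print("Most frequent two-word phrases:", phrases.most_common(2))
print("Modal verbs:", {m: words[m] for m in ("can", "may", "might")})
```

Even in this small sample, "i think" and "you know" stand out as recurring phrases, which hints at how frequency counts over a much larger corpus can reveal the words and phrases people really use.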
 
 
Learn more about the Corpus
 

In From Corpus to Course Book, Michael McCarthy discusses the fundamentals of corpus research and the ways this research has been used to develop Touchstone.

From Corpus to Course Book

In Teaching Vocabulary: Lessons from the Corpus, Lessons for the Classroom, Jeanne McCarten discusses the fundamentals of corpus linguistics and how information about language derived from corpus research can be used to inform vocabulary teaching.

Teaching Vocabulary (PDF)

Explorations in Corpus Linguistics, by Michael McCarthy, is a collection of articles that successively challenge the notion of "fluency," underscore the importance of word clusters, and provide a set of criteria for a grammar of spoken English.

Explorations in Corpus Linguistics (PDF)