Who We Are
- Cambridge University Press at a Glance
- The Press Syndicate
- The Press Board
- History of the Press
- The Queen's Printer's Patent
- Cambridge's Ethics
- Annual Report
- UK Gender Pay Gap 2017
- What We Do
Rights & Permissions
- Community & Environment
- Our Bookshop
- Current Jobs
- Agency Policy
- Meet the Press
- Our Culture
- Graduate Programme
- Contact Us
- Legal Notices
- Annual Report 2017
The Cambridge English Corpus is a multi-billion word collection of written and spoken English.
Our Corpus helps us to understand more about the English language, and how people use it when they speak and when they write.
Our learning materials are developed using our Corpus, making them more authentic and useful - illustrating language as it is really used.
World-leading spoken language research
Cambridge University Press and Lancaster University are running a research project to create world-leading freely-availably resource for linguistic research – the British National Corpus 2014.
About the project
We are compiling a very large collection of recordings of real-life, informal, spoken interactions between speakers of British English from across the United Kingdom.
These recordings will be transcribed and made freely available for a wide range of research purposes.
Here at Cambridge, we’ll use the collection to further improve our learning materials.
Why collect spoken language?
The last project of this scale and type was completed in the UK in the early 1990s – before Twitter, selfies, smartphones, and Facebook!
We think it is of great importance to collect new recordings, from the 2010s, in order to understand the nature of British English speech as it is today and not how it was over two decades ago.
To find out more about the Corpus, go to the link in the Extras menu.