Skip to main content Accessibility help
×
Hostname: page-component-76fb5796d-dfsvx Total loading time: 0 Render date: 2024-04-29T16:25:33.847Z Has data issue: false hasContentIssue false
Coming soon

Programming for Corpus Linguistics with Python and Dataframes

Expected online publication date:  24 June 2024

Daniel Keller
Affiliation:
Western Kentucky University

Summary

This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.
Type
Element
Information
Online ISBN: 9781108904094
Publisher: Cambridge University Press

Save element to Kindle

To save this element to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Programming for Corpus Linguistics with Python and Dataframes
  • Daniel Keller, Western Kentucky University
  • Online ISBN: 9781108904094
Available formats No formats are currently available for this content.
×

Save element to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Programming for Corpus Linguistics with Python and Dataframes
  • Daniel Keller, Western Kentucky University
  • Online ISBN: 9781108904094
Available formats No formats are currently available for this content.
×

Save element to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Programming for Corpus Linguistics with Python and Dataframes
  • Daniel Keller, Western Kentucky University
  • Online ISBN: 9781108904094
Available formats No formats are currently available for this content.
×