Hostname: page-component-89b8bd64d-7zcd7 Total loading time: 0 Render date: 2026-05-09T08:11:15.046Z Has data issue: false hasContentIssue false

Analyzing lexical emergence in Modern American English online1

Published online by Cambridge University Press:  25 May 2016

JACK GRIEVE
Affiliation:
Centre for Forensic Linguistics, Aston University, Birmingham B4 7ET, UK j.grieve1@aston.ac.uk, a.nini1@aston.ac.uk
ANDREA NINI
Affiliation:
Centre for Forensic Linguistics, Aston University, Birmingham B4 7ET, UK j.grieve1@aston.ac.uk, a.nini1@aston.ac.uk
DIANSHENG GUO
Affiliation:
Department of Geography, University of South Carolina, Columbia South Carolina 29208, USA guod@mailbox.sc.edu
Rights & Permissions [Opens in a new window]

Abstract

This article introduces a quantitative method for identifying newly emerging word forms in large time-stamped corpora of natural language and then describes an analysis of lexical emergence in American social media using this method, based on a multi-billion-word corpus of Tweets collected between October 2013 and November 2014. In total 29 emerging word forms, which represent various semantic classes, grammatical parts-of-speech and word formation processes, were identified through this analysis. These 29 forms are then examined from various perspectives in order to begin to better understand the process of lexical emergence.

Information

Type
Research Article
Copyright
Copyright © Cambridge University Press 2016 
Figure 0

Figure 1. Spearman's coefficient vs. 2013 relative frequency (67,022 forms)

Figure 1

Figure 2. Spearman's coefficient vs. 2013 relative frequency (29 forms)

Figure 2

Table 1. Emerging word forms

Figure 3

Table 2. Recency of emerging words

Figure 4

Figure 3. Emerging words time charts (part 1)

Figure 5

Figure 4. Emerging words time charts (part 2)

Figure 6

Figure 5. Emerging words time charts (part 3)

Figure 7

Figure 6. Onomasiological competition time charts

Figure 8

Table 3. Fleek meaning generalization over time

Figure 9

Table 4. Established words

Figure 10

Figure 7. Semasiological change in (on) fleek

Figure 11

Figure 8. Established words time charts