Change

Part III Change

5 Antecedents of innovation: exploring general extenders in conservative dialects

5.1 Introduction

Discourse-pragmatic change is perhaps the least well-studied type of change in quantitative approaches to language variation and change. This is undoubtedly due in part to the early debates questioning the extension of the variationist enterprise to levels of grammar above phonology (see, for example, Lavandera Reference Lavandera1978; Rickford Reference Rickford, Fasold and Shuy1975; Sankoff Reference Sankoff, Bailey and Shuy1973; see Waters [Chapter 2] for an overview). More recently, these earlier concerns have resurfaced as cautionary notes on the use of quantitative methods in the study of discourse-pragmatic variation. There is also a concomitant unease that current generalisations about the nature of change in phonology, morphology and syntax, including a constant rate of change (Kroch Reference Kroch1989), layering effects, phonetic reduction/erosion and analogical levelling (Hopper Reference Hopper, Traugott and Heine1991; Hopper and Traugott Reference Hopper and Traugott2003), will apply in the same way to discourse-pragmatic phenomena (see, for example, Pichler Reference Pichler2013: 13). Yet numerous studies over the past decade have determinedly opened up this area of grammar to quantitative investigation. Among the most prominently studied features is the general extender (GE) system (see, for example, Cheshire Reference Cheshire2007; Denis Reference Denis2011; Dines Reference Dines1980; Overstreet Reference Overstreet1999; Pichler and Levey Reference Pichler and Levey2011; Tagliamonte and Denis Reference Tagliamonte and Denis2010; Winter and Norrby Reference Winter, Norrby and Henderson2000). Given the extensive knowledge base that is developing for this area of grammar, the time is ripe to offer continued breadth to the emerging contemporary situation (see Pichler and Levey Reference Pichler and Levey2011).

GEs can be identified using a combined structural and functional approach (see also Denis and Tagliamonte Chapter 4; Pichler Chapter 3; Waters Chapter 2). Structurally, they are semi-fixed constructions that share a common schematic pattern (Pichler and Levey Reference Pichler and Levey2011: 448) which typically involves the following items: a connector (i.e., and, or), a generic (e.g., stuff, thing) or indefinite pronoun (e.g., something, everything, anything), and an optional comparative (e.g., like that), as in the examples in (1). The GE system also includes a number of expressions whose structural configuration is more fixed, as in (2a), as well as the occasional idiosyncratic variants, as in (2b).

(1)

a.
Dad always had the garden. We grew our own vegetables, potatoes and onion-sets and cabbages, sprouts or something like that, yes.
(Grace Kenway, 74, MPT)Footnote ¹
b.
See Aunty Jessie, she made wee sheets and wee pillowcases for the cot and everything.
(Esther Hamilton, 88, CMK)
c.
We were told there was no saluting or medals or anything.
(Roger Bishop, 88, MPT)
d.
You go back thirty years and look at all the old maps and all. Portavogie was only put down as a headland or something.
(Robin Mawhinney, 55, PVG)
e.
Then the harvest time you know where they cut the corn and that.
(Hugh Keane, 84, CLB)
f.
I seen there they brought some nettles over and things.
(Rob Paisley, 78, CLB)

(2)

a.
I says, well I would like some nice gifts to take back … I’ll take this, that, blah, blah, blah.
(Heather Baker, 67, CMK)
b.
Kerry would send the wee one up with his box of chocolates and what have you for Gran.
(Iain Ferguson, 72, CMK)

A typical GE will occur at the end of a phrase and will often evoke a general category of similar objects that the speaker has in mind (Dubois Reference Dubois1992: 198). For example, the use of or something like that in (1a) calls to mind different types of vegetables that would grow in a garden. The GE and everything in (1b) refers to a group of attributes relating to the things that make up a child’s bedding. The GE in (1c) refers to the paraphernalia suitable for a military parade, that in (1d) to historical materials that one might find in an archive, that in (1e) to the things that are cut at harvest time, and that in (1f) to any of a variety of herbs. Thus, speakers use a GE in order ‘to suggest the multitude of possible elements of the set’ that they are thinking or talking about (Dubois Reference Dubois1992: 182). As the research on GEs has expanded, the boundaries of what can be treated as a GE has extended to encompass additional forms and in some cases additional functions. As we shall see, however, the core remains fairly constant.

Research shows that the use of GEs is conditioned by social factors such as age, sex, education and socio-economic class (Dubois Reference Dubois1992; Stubbe and Holmes Reference Stubbe and Holmes1995). The distribution of GEs has also been found to be constrained by the semantic-syntactic characteristics of the generic/pro-form they contain (Aijmer Reference Aijmer and Togeby1985; Cheshire Reference Cheshire2007; Dines Reference Dines1980; Overstreet and Yule Reference Overstreet and Yule1997). Other research suggests GEs encode various interactional functions (Aijmer Reference Aijmer and Togeby1985), marking politeness (Overstreet and Yule Reference Overstreet and Yule1997; Winter and Norrby Reference Winter, Norrby and Henderson2000), topic shift and speaker exchange (Dubois Reference Dubois1992).

A rich aspect of the research on GEs is the implications they have for theories of grammaticalisation. Early research on GEs suggested that they were grammaticalising in some varieties of contemporary English (see, for example, Aijmer Reference Aijmer2002; Brinton Reference Brinton1996). Grammaticalisation is change that involves grammatical development. Originally, Meillet (Reference Meillet1912) defined grammaticalisation as ‘the change of an autonomous word into a grammatical element’; however, later research demonstrated that not only lexical items, but also grammatical forms and even phrases can develop along a pathway of grammaticalisation (Traugott Reference Traugott, Lehmann and Malkiel1982). Furthermore, the processes by which items develop grammatical meanings are multiplex and in some cases antithetic. For example, grammaticalisation is associated with reduction in some models (Heine et al. Reference Heine, Claudi and Hünnemeyer1991; Lehmann Reference Lehmann1982) but with expansion in more recent formulations (see, for example, Himmelmann Reference Himmelmann, Bisang, Himmelmann and Wiemer2004; Traugott and Trousdale Reference Traugott, Trousdale, Traugott and Trousdale2010). These hypotheses are particularly germane to GEs since the processes invoked to explain their development include phonetic reduction or clipping, decategorialisation and semantic-pragmatic change (see, for example, Cheshire Reference Cheshire2007), all among the suite of mechanisms that have been discussed in the grammaticalisation literature. In the analyses that follow, I will test the operation of some of these mechanisms.

The originality of the present analysis lies in the viewpoint it offers from conservative dialects. The argumentation and interpretation is built on the foundations of distributional findings presented in Tagliamonte (Reference Tagliamonte2013: 172–84),Footnote ² based on a unique archive of language materials from south-west Scotland, north-west England and Northern Ireland. In contrast, virtually all of the research on GEs has been on contemporary data, much of it from mainstream communities and contemporary youth. Further, claims in the literature that implicate the diachronic development of GEs have never been fully explored (see, however, Carroll Reference Carroll2008). Perhaps one of the most compelling claims is that GEs develop from long forms to short forms, paralleling linguistic change in other areas of the grammar. Using the apparent-time construct (Bailey et al. Reference Bailey, Wikle, Tillery and Sand1991), recent quantitative research (Denis Reference Denis2011; Pichler and Levey Reference Pichler and Levey2011; Tagliamonte and Denis Reference Tagliamonte and Denis2010) took up the challenge of testing this claim by studying the distribution of long and short GE variants in synchronic datasets. The consistent finding arising from these investigations is that there is no relationship between speaker age and the length of GEs. Indeed, it appears that short GE variants such as and stuff and long GE variants such as and stuff like that exist in relative stasis in contemporary varieties. Thus, if the short variants did indeed derive from the longer variants, they must have done so before most of the speakers interviewed for the apparent-time studies quoted above were born. Of course, the real problem is the availability of comparable time-depth to test this hypothesis. Appropriate data, i.e., spoken, interactional and vernacular, from earlier points in time does not exist. Spoken-like materials in drama, diaries and court reports provide some evidence for GEs at earlier periods (see, for example, Culpeper and Kytö Reference Culpeper and Kytö2010) but they reflect a qualitatively different register. There is no perfect solution to this problem given the impossibility of bona fide spoken historical data from earlier than the twentieth century. However, there is another worthwhile data source that may shed light on the diachronic development of GEs: conservative British dialects. Peripheral dialects are known to reflect earlier stages in the evolution of English and thus make possible the analysis of the antecedents of contemporary GE variants. If so, they can offer insight into the evolution of GEs to complement the synchronic corpora studied by Cheshire (Reference Cheshire2007), Pichler and Levey (Reference Pichler and Levey2011) and Tagliamonte and Denis (Reference Tagliamonte and Denis2010).

In what follows, I will first describe in more detail the nature of the data under investigation in this chapter. In Sections 5.3 and 5.4, I will provide a brief historical and synchronic backdrop for the study of GEs. The methods of data analysis employed will be outlined in Section 5.5. The results are then presented in terms of distributions, Section 5.6, and constraints, Section 5.7. I will end with a discussion of the results in Section 5.8 and my conclusions in Section 5.9.

5.2 The data

An informative construct that has been successful in recovering earlier stages in the development of linguistic variables is the relic area (Poplack and Tagliamonte Reference Poplack and Tagliamonte1991: 97; Tagliamonte and Smith Reference Tagliamonte, Smith and Poplack2000: 141). Such areas, because of their peripheral geographic location and/or isolated socio-political circumstances, tend to preserve older features of a language (Anttila Reference Anttila1989: 294; Hock Reference Hock1986: 442). Among the more recent criteria for establishing the status of a variety as ‘peripheral’ are geographic location, historical continuity and group identity (Wolfram and Thomas Reference Wolfram and Thomas2002: 26–36). The present study is based on the Roots Archive (Tagliamonte Reference Tagliamonte2013), a database which comprises four such peripheral communities. Their location is shown by the grey circles on the map in Figure 5.1: Maryport in north-west England, Cumnock in south-west Scotland, and Cullybackey and Portavogie in Northern Ireland. In addition, the white circles on the map in Figure 5.1 show northern England locations where GEs have been studied recently: Kingston-upon-Hull (henceforth Hull) (Cheshire Reference Cheshire2007), the city of York (Denis Reference Denis2011) and Berwick-upon-Tweed (Pichler and Levey Reference Pichler and Levey2011).

Figure 5.1 Location of targeted peripheral communities (Maryport, Cumnock, Cullybackey, Portavogie – grey circles) and locations of previous GE research (Kingston-upon-Hull, York, Berwick-upon-Tweed – white circles)

Contains OS data © Crown copyright and database right (2014)

In the early 2000s, over 100 individuals among the oldest living generation in each of the peripheral communities were interviewed using sociolinguistic fieldwork techniques (Labov Reference Labov1972c; Schilling Reference Schilling2013; Tagliamonte 2006). These audio-recorded conversations contain a wealth of cultural information, local traditions, narratives of personal experience, local gossip and informal discussions, reflecting the typical discourse found in each community. GEs are a vibrant component of the vernacular in these communities, as evident in (3):

(3)

a.
When I worked there, I worked in the stores department and one of my jobs at that particular time was clock cards and things like that. Pay documents all this, making them out. Anybody that was entitled till extra money or owt like that.
(Jack Dobson, 66, MPT)
b.
Well, she took natural sciences. It covers a wide area. I think she wants to work with people like you know like sort of these trouble shooters people and that you know.
(Lucy Fisher, 73, MPT)

Most other research on GEs in the UK is based in urban settings, for example London (Stenström and Andersen Reference Stenström, Andersen, Percy, Meyer and Lancashire1996). More generally, research on GEs has in large part been conducted on other major varieties of English, including Australian English (Norrby and Winter Reference Norrby, Winter and Allen2002) and Canadian English (Tagliamonte and Denis Reference Tagliamonte and Denis2010). One of the compelling findings arising from the literature on GEs is the extent to which varieties of English share the same variants. From England to Canada to Australia, virtually the same inventory of GE variants has been reported (at least with respect to the most frequent variants), suggesting longitudinal continuity of GEs. Yet this remains open to investigation. Pichler and Levey (Reference Pichler and Levey2011: 464) call for an extension of current variationist work on GEs to studies of real time and most especially to deeper time-depth and consideration of the earlier antecedents of the contemporary GE system. Exploration of the conservative dialects in the Roots Archive offers the possibility to gain access to a window on the past through which an earlier stage in GE variation and change may become visible, allowing us to probe the formal consistency of the GE system as well as hypotheses about the development of short GE variants from long GE variants. The question is: how do the GEs in these relic communities compare with the extant body of knowledge of this linguistic phenomenon?

5.3 Historical perspective

GEs can be traced far back in the history of English, at least to the fourteenth century (Poutsma Reference Poutsma1926: 914). The earliest documented forms comprise fixed expressions such as and such, and so forth, and so on, and what not. Carroll (Reference Carroll2008: 16) provides an example of and so forth from Chaucer, 1390. Many of these early forms are found in the Roots Archive, as shown in (4).

(4)

a.
He was doing the lambing and such.
(Elspeth Ferguson, 70, CMK)
b.
And they sold fruit and so forth.
(Charlie Baxter, 67, CMK)
c.
Saddie McCaine would sell broken biscuits, chocolates, broken chocolates and so on.
(Fergus Bell, 90, CMK)
d.
My mother went to see my cousin in Workington. They were confectioners you know sort of they had a café and what not.
(Miriam White, 81, MPT)

GEs tend to comprise a core set of generic nouns and pronouns, including thing(s), stuff, anything, something, everything and nothing. Dialectal variants of the generic pronouns also occur such as summat in (5a) and owt in (5b).

(5)

a.
She’s a black belt and first dan or summat.
(John Edwards, 70, MPT)
b.
Cos he’s not a fellow to mix with other men or owt like that or do anything or owt, isn’t Bob.
(Grace Kenway, 74, MPT)

GEs with a generic but no connector occur in the Roots Archive as well, consistent with reports from nearby Hull and Berwick-upon-Tweed (Cheshire Reference Cheshire2007; Pichler and Levey Reference Pichler and Levey2011), as in (6).

(6)

a.
That’s right, aye, you’d the place to yourself kind of thing.
(Dan James, 64, PVG)
b.
I’ve a feeling it was summat like er- one of the H’s, Hunslett, Huddersfield something like that.
(Jack Dobson, 66, MPT)

Examination of the historical record shows that GEs with and developed first and those with or arose somewhat later. The earliest GE with or, or something, is attested in the early 1800s; or whatever is attested in the early 1900s (Tagliamonte and Denis Reference Tagliamonte and Denis2010). This suggests that there is incremental historical layering in this system over time that may be corroborated by the Roots Archive.

5.4 Synchronic perspective

GEs have been studied in contemporary data from the following countries: England, Scotland and Ireland (Cheshire Reference Cheshire2007; Denis Reference Denis2011; Levey Reference Levey2012; Macaulay Reference Macaulay1991; O’Keeffe Reference O’Keeffe2004; Pichler and Levey Reference Pichler and Levey2011); the United States and Canada (Overstreet Reference Overstreet1999; Overstreet and Yule Reference Overstreet and Yule1997; Tagliamonte and Denis Reference Tagliamonte and Denis2010; Wagner et al. Chapter 9); and Australia and New Zealand (Britain and Sudbury Reference Britain, Sudbury, Jones and Esch2002; Dines Reference Dines1980; Norrby and Winter Reference Norrby, Winter and Allen2002; Stubbe and Holmes Reference Stubbe and Holmes1995). As mentioned earlier, a common set of variants are reported, including: or something (like that), and everything (like that), and things (like that), and stuff (like that). Some GE types or categoriesFootnote ³ (e.g., those containing stuff and whatever) are more frequent in large urban centres such as Toronto, Canada (Tagliamonte and Denis Reference Tagliamonte and Denis2010) or London, England (Stenström et al. Reference Stenström, Andersen and Hasund2002). On the whole, however, there is remarkable consistency in the inventory of main forms across widely divergent geographic locales, social groups and registers. Another overarching characterisation is that GEs are typical of the vernacular.

A frequent observation is that GEs are rising in frequency, with a greater usage among younger individuals and in particular females (Denis Reference Denis2011; Stubbe and Holmes Reference Stubbe and Holmes1995; Tagliamonte and Denis Reference Tagliamonte and Denis2010). Further, in part due to the wide-ranging variety of GE variants, from almost literary (and conservative) (e.g., and so forth) to highly vernacular and nonstandard (e.g., and that), different GE variants are often associated with varying levels of formality, register or style (see, however, Wagner et al. Chapter 9). Certain GE variants are also often associated with class and other socio-economic indicators. For example, GEs such as and things are associated with middle-class individuals while others such as and that or and that lot are associated with working-class individuals (Cheshire Reference Cheshire2007: 165; Stenström and Andersen Reference Stenström, Andersen, Percy, Meyer and Lancashire1996: 102).

Further, GEs are claimed to have a range of interpersonal and textual functions (see, for example, Cheshire [Reference Cheshire2007] for a recent overview). Among other things, they are claimed to mark in-group affiliation and rapport (see, for example, Youssef [Reference Youssef1993] on an ting ‘and things’ in Trinidad Creole). The in-group nature of GEs is evident in (7a) and (7b) where one must know what a ‘bran tub’ and ‘cobbles’ are to interpret the reference set of the GE.Footnote ⁴ Because cultural knowledge is required to interpret many GEs, their use is thought to signal that speakers perceive an in-group relationship to their listeners.

(7)

a.
Aye, and at Haleve time it was real smashing at Haleve time. Aye, they had the- the bran tub and all this sort of stuff at Haleve.
(Bob Cottell, 85, CLB)
b.
And a-fore they knew where they were at there was cobbles and all sorts flying at windows.
(Keith Price, 89, MPT)

The semantic-syntactic properties of the GE generic (e.g., stuff, things) have been shown to influence the choice of GE variant. Grammatically, a form such as stuff refers to mass nouns and a form such as thing(s) to count nouns, as in (8a)–(8b). However, researchers have noticed that GE generics do not always match the semantic-syntactic properties of their antecedent nouns, as in (8c)–(8e). Further, note that it is much more difficult to retrieve a set for the referent pumps in (8e) than for the parsley and leeks and kale in (8b). What does one sink with a pump? This is not part of the knowledge set of most people.

(8)

a.
When they made anything new, they started to make oh meats and stuff like that, all kinds of things in the factory, all chutneys and things.
(Helen Philips, 79, MPT)
b.
My granny would go out and get parsley and leeks and kale and things out the garden for soup.
(Esther Hamilton, 88, CMK)
c.
Oh aye, that’s the sort of things I used to give them too, you know, clothes and things.
(Nancy Taylor, 78, CMK)
d.
It’s a shame when it’s stuff like- even affects like pigeons and stuff.
(John Edwards, 70, MPT)
e.
Did they not sink pumps and stuff?
(Kate Devoy, 62, PVG)

More critically, it is sometimes the case that GEs have no clear-cut reference set (Winter and Norrby Reference Winter, Norrby and Henderson2000: 4). What should be inferred by and that in the examples in (9)? What else might be involved in being ‘sent back out’ to work in (9a) and what is involved in being ‘a big friend of Josh’ in (9b)? These examples demonstrate that some GEs are quite vague, not always functioning to generalise to a set (Aijmer Reference Aijmer and Togeby1985). If not, they may have other functions.

(9)

a.
There was nae unions or protective clothing then, they just … and I mean she had to work because her mother made sure she worked. She never missed a day. You know she was Irish. Go to work, that’s it. Cut hand or no she was sent back out the next morning and that.
(Graeme Nesbitt, 69, CMK)
b.
Pickering chemist, he was a big friend of Josh and that.
(Keith Price, 89, MPT)

Changes to the syntagmatic length of GEs may be a reflection of shifting function. For example, and things like that may evolve into and things as part of the grammaticalisation process whereby clipping or shortening occurs as the original form expands its functional domain. This is, of course, not a discrete stepwise development but part of a suite of shifts that evolve in the milieu of synchronic variation. In this case, we can conceive of GEs starting out as a generalisation to a set. At a later stage, the forms may not generalise to a set at all. In such a process, the form and that in (9) could be a development. However, researchers are discovering a multitude of forms within each GE type (e.g., and all things like that, and things like that, and things, and thing) and a broader set of contexts (i.e., extension beyond set-marking) (see, for example, Carroll Reference Carroll2008: 14; Pichler and Levey Reference Pichler and Levey2011). The question is: what are the characteristics and patterns of GE variant types and forms that obtain in the peripheral northern communities under investigation?

5.5 Methodology

Discourse-pragmatic variables are notoriously difficult to study from a quantitative perspective because they comprise many different variants with potentially diverse functions (see Waters Chapter 2). To approach the GE system scientifically, I used the combined structural and functional approach employed in early variationist studies of GEs (Dubois Reference Dubois1992) and other discourse-pragmatic variables (Vincent Reference Vincent1992; Vincent and Sankoff Reference Vincent and Sankoff1992), also consistent with Pichler’s (Reference Pichler2010, Reference Pichler2013) more recent methods for studying discourse-pragmatic variation. First, forms and structural configurations are identified. Quantitative research on GEs, which often includes the full inventory of GEs in the data under investigation, provides a precedent. These procedures are aided by the well-known GE template: (connector) (modifier) (generic noun/pro-form) (similative) (deictic) (Pichler and Levey Reference Pichler and Levey2011: 448). With known structural and functional possibilities in mind, additional forms can be identified and included. A total of 1662 GE tokens were accumulated across the four communities presented in the Roots Archive. Each token was then coded so as to identify the generic/pro-form (e.g., thing, stuff, something) and to probe two salient claims in the literature for grammaticalisation of GEs: the relative length of the construction (e.g., and things vs. and things like that) tests for the relationship between grammaticalisation and the syntagmatic length of the GE; the co-occurrence of GEs with other discourse-pragmatic variables in the same clause (e.g., ken, you know, I mean) tests whether there is a relationship between GE use and support from other discourse-pragmatic variables.

5.6 Distributional analysis

Table 5.1 provides a breakdown of the inventory of the GE types in the Roots Archive. The table documents all the types in the data by grouping the GE variants into categories according to the type of generic/pro-form they contain.Footnote ⁵ The most frequent form of each category is displayed first, followed by a combination of all the other variants in the category with that generic type. There were 88 different forms in all. As with many linguistic variables, several main forms dominate the system while many others occur only once or twice. The distribution reveals that nearly a full third of all GEs in the data are some combination of and (all) (that), representing 38.1% of all GEs. Variants with something or thing(s) make up the bulk of the remainder, followed by everything, anything and stuff variants. The large catch-all category ‘other’ at 12.3% comprises innumerable idiosyncratic forms along with a group of fixed expressions which include and whatnot, and such, and all the rest etc., each one alone well under 2% of the data.

Table 5.1. Distribution of GE types in the Roots Archive

GE type	%	N
and that	19.1	318
and all	15.8	262
and all that	3.2	53
Combined ‘and (all) (that)’	38.1	633
or something	15.0	249
other something-variants	5.1	84
and things	3.4	56
other thing(s)-variants	10.8	179
and everything	6.7	112
other everything-variants	0.7	12
or anything	3.8	63
other anything-variants	2.1	35
and stuff	1.4	24
other stuff-variants	0.6	10
Other	12.3	205
TOTAL GEs		1662

Table 5.2 provides a breakdown of the most frequent fixed GEs, which were grouped with idiosyncratic forms as ‘other’ in Table 5.1. These are constructions that do not entirely fit the structural characteristics described earlier, but perform the same range of functions and occur in the same syntactic environments as the more prototypical and less formulaic GE variants in Table 5.1 (see also Pichler and Levey Reference Pichler and Levey2011: 448).

Table 5.2. Distribution of ‘other’ GEs in the Roots Archive

	%	N
and what have you	2.2	36
or nothing	2	34
or what	1.7	28
and all the rest of it	1.3	21
and so on	.96	16
and such	.84	14
and what not	.72	12
remaining ‘other’ GEs	2.6	44

In sum, the Roots Archive is consistent with previous research on GEs in its rich variation of forms. However, the particular GE inventory in these communities contrasts from existing reports outside the northern climes of the United Kingdom, particularly that reported for Reading in the south of England where thing(s)-variants dominate, and Toronto in Canada where stuff-variants dominate. In the Roots Archive, variants with the generic stuff are rare (2%, N = 34/1662). While stuff-variants were also infrequent in Berwick-upon-Tweed (6%, N = 50/783; Pichler and Levey Reference Pichler and Levey2011), the rate in the current data is even lower, perhaps approaching the rate of working-class speech in Hull (Cheshire Reference Cheshire2007: 164, Table 4). Second, the frequency in the Roots Archive of the category of short forms, i.e., and that, and all, and all that, is relatively high. These variants have been reported in earlier research in Scotland (Macaulay Reference Macaulay and Görlach1985) and both Pichler and Levey (Reference Pichler and Levey2011) and Cheshire (Reference Cheshire2007: 164) report these to be frequent among working-class speakers in Berwick-upon-Tweed (34%, N = 268/783) and Hull. Moreover, recent work reports them to be frequent among working-class children in London (Levey Reference Levey2012). In the Roots communities, the rate of and that is only 19% overall, compared to 31% in Berwick-upon-Tweed (Pichler and Levey Reference Pichler and Levey2011). However, when and that, and all and and all that are taken together in the Roots Archive, they represent nearly 40% of the GEs in the data, i.e., their combined rate in the Roots Archive is higher than that in Pichler and Levey’s (Reference Pichler and Levey2011) data from Berwick-upon Tweed, as mentioned earlier. Given that the data under investigation were collected in relic dialect areas and from conservative speakers, the overwhelming presence of these short variants, which are thought to be a later development, is notable.

The next step is to consider whether the Roots Archive communities are differentiated with respect to this overall picture in type/variant distribution. Figure 5.2 shows the distribution of the major GE types by community. It confirms that the communities have a parallel pattern with respect to their inventory of GE types. And (all) (that) (light grey columns) is the prevailing category in nearly every community. Indeed, in Cullybackey and Maryport, it is the dominant category by a wide margin. The GE category with something (dark grey columns) is most robust in Cumnock and Portavogie whereas in Cullybackey and Maryport it is less frequent. The rate of use of forms in the ‘other’ category (nonce forms that are also reported elsewhere, e.g., and so on, and whatnot) is relatively stable throughout. None of the communities show more than a few tokens of stuff- or whatever-variants. Yet GEs with stuff and whatever are reported as frequent in North America (Tagliamonte and Denis Reference Tagliamonte and Denis2010) and to be rising in frequency in apparent time in Berwick-upon-Tweed (Pichler and Levey Reference Pichler and Levey2011: 454, Figure 2) and York (Denis Reference Denis2011). These discrepancies between the conservative dialects in the Roots Archive and the dialects from two larger centres in north-east England support the hypothesis that there is change in the composition of variants within the GE system over time but no change in terms of grammatical development. If it is simply the lexical components that are changing, this may reflect renewal rather than the evolution of lexical items into new grammatical functions.

Figure 5.2 Distribution of GE types by community

Further support for this interpretation comes from a fortuitous comparison that can be made between individuals of different ages in the corpora. As it happens, two of the fieldworkers were in their twenties, i.e., considerably younger than the individuals they were interviewing who were all over the age of sixty.Footnote ⁶ Thus, their use of GEs within the interview context offers a unique generational perspective on usage.Footnote ⁷ Figure 5.3 shows the distribution of the major GE categories but this time contrasts the elderly participants with these interviewers. The results expose some pointed apparent-time differences in GE use. GE use among the elderly community members is dominated by the and (all) (that) category, forms rarely used by the interviewers. In contrast, the interviewers exhibit dramatic use of stuff-variants, a GE type rarely used by the interviewees. These discrepancies between the conservative dialect speakers and the younger interviewers provide a suggestive reflection of the type of change the GE system may have undergone over the twentieth century, namely lexical replacement of: thing > stuff. At the same time, it is interesting to note the relatively parallel frequencies of the something/anything-types across the two groups. This shows that while some parts of the GE system are changing, other areas remain stable, at least across the (apparent-time) span of these materials. While other factors could be contributing to the differential use of GE variants among these two groups (e.g., contrasting roles in the interview context, gender asymmetry), it is striking that the very differences visible here in frequency and proportion of GE variants across elderly interviewees and young interviewers is also consistent with the patterns reported for older vs. younger individuals in urban locales elsewhere, either in North America (Tagliamonte and Denis Reference Tagliamonte and Denis2010) or in more urban settings relatively nearby in the United Kingdom (Denis Reference Denis2011; Pichler and Levey Reference Pichler and Levey2011), namely more stuff-variants among youth and more and (all) (that)-variants among elderly. These results directly impact the hypothesis stated earlier regarding the purported nature of the GE system. The hypothesis is that conservative dialects can be taken as a cautious representation of an earlier stage in the development of this system. If they are, then that system was no more grammaticalised than the GE systems reported in any other study. What contrasts is the composition of the system. These conservative dialects are dominated by forms of the type and (all) (that).

Figure 5.3 Distribution of GE types comparing elderly interviewees and younger interviewers

5.7 Constraints analysis

A central issue that emerges is whether or not GEs have been subject to grammatical change or lexical replacement (see also Denis and Tagliamonte Chapter 4). Two of the main indicators of grammaticalisation are: (i) a change from longer GEs to shorter ones, and (ii) a change from co-occurrence of GEs with other discourse features to independent usage. The key evidence for lexical replacement is marked change in the use of the generic but no shifts in the contextual grammatical constraints. In the analyses that follow, I subject the data to an analysis of these constraints in order to test these potential indicators.

5.7.1 Length of the GE

Cheshire’s (Reference Cheshire2007) research carried out in the 2000s on young people’s GE usage in three English towns found a high frequency of and that. Following Aijmer (Reference Aijmer2002: 227), Erman (Reference Erman, Moen, Simonsen and Lødrup1995: 145) and others, Cheshire (Reference Cheshire2007: 167) reasoned that this was the result of a gradual loss of the longer GE variants because of reduction via clipping. In this hypothesis, longer GEs such as and things like that would have been present at earlier stages in the development of the GE system. Then, these would have gradually evolved into shorter GEs to arrive at a situation of frequent use of and things. If this trajectory is accurate, we would expect a greater number of longer GEs than shorter GEs in the Roots Archive data which represent an earlier stage in the development of GEs than the data analysed by Cheshire (Reference Cheshire2007).

Figure 5.4 tests this possibility by comparing the frequency of short and long GEs in each of the Roots Archive communities. The results provide a striking demonstration that the shortest GEs are predominant across all communities. Interestingly, the longer GEs show the highest frequency in Maryport. This is the locale that has previously exhibited the most advanced profile among the Roots Archive communities (see, for example, Tagliamonte Reference Tagliamonte2013). If anything, this suggests that the longer GEs, rather than the short ones, are developmentally advanced.

Figure 5.4 Frequency of length of GE by community

Figure 5.4 groups all of the GEs together. However, Figures 5.5–5.8 split the data so that the cross-community distribution of long vs. short variants can be viewed separately for the highest-frequency GE categories (see Table 5.1). Figures 5.5–5.8 confirm that the long vs. short phenomenon reported in the overall perspective in Figure 5.4 accurately reflects the situation for each GE type and for each Roots community. There is no evidence for a trajectory of change from long to short. This means that the GE system in these varieties shows no evidence of an earlier stage in which longer GEs were present. Instead, just like more mainstream varieties (e.g., Toronto, York, London), they simply have more two-word variants.

Figure 5.5 Proportion of long vs. short GE variants with pro-form ‘something’

Figure 5.6 Proportion of long vs. short GE variants with generic ‘thing(s)’

Figure 5.7 Proportion of long vs. short GE variants with pro-form ‘everything’

Figure 5.8 Proportion of long vs. short GE variants with generic ‘stuff’

5.7.2 Collocation of GEs with other discourse-pragmatic features

Another proposed indicator of grammatical change in the GE system is collocation with other discourse-pragmatic-pragmatic features. Cheshire (Reference Cheshire2007: 185) argues that as GEs develop new pragmatic functions, they no longer require the support of other discourse-pragmatic features in the same clause. In this hypothesis, not only would longer GEs such as and things like that occur more often at earlier stages in the development of the GE system but they would also co-occur with discourse-pragmatic features to a greater extent than would shorter GEs. To test this hypothesis, we first tabulated the frequency and type of discourse-pragmatic features that co-occur with each of the main GE categories introduced in Table 5.2.

The most frequent discourse-pragmatic features used with GEs in the Roots Archive are you know (N = 76), aye (N = 25) and ken (N = 25). Because the frequency of these markers was so rare, I offer a basic overview of co-occurrence patterns in Table 5.3. While most GEs can occur with these features, GEs with stuff rarely occur with any discourse-pragmatic feature. Moreover, not a single one occurs with anything-variants. The ‘other’ category is distinguished from the other GE categories by the co-occurrence with like and I don’t know. Importantly, and (all) (that) co-occurs with the most wide-ranging discourse-pragmatic features in these materials.

Table 5.3. Co-occurrence of GEs and other discourse-pragmatic features

	Co-occurring discourse-pragmatic features
GE type	you know	aye	ken	I mean	eh	just	anyhow	like	kind of	I don’t know
and (all) (that)	√	√	√	√	√			√
something-variants	√	√	√	√			√
thing(s)-variants	√	√	√		√
everything-variants	√	√		√		√
stuff-variants			√
anything-variants
Other		√			√			√		√

The results in Table 5.3 cannot be feasibly split between short and long GEs, which would be the true test of the hypothesis, because the co-occurring discourse-pragmatic features themselves are too sparse. However, the overall results demonstrate that and (all) (that) – an inherently short GE type – co-occurs with the broadest range of discourse-pragmatic features while others have a comparatively narrow range. This trend goes in the opposite direction of prediction, i.e., more co-occurring discourse-pragmatic features with short GEs.

5.7.3 Statistical modelling

Statistical confirmation of the major trends and patterns among the main GE types in the Roots Archive is provided in Table 5.4. The model tests the effect on variant category choice of length and the presence of a co-occurring discourse-pragmatic feature where this is possible.Footnote ⁸ The analysis is a fixed-effects logistic model using Goldvarb X (Sankoff et al. Reference Sankoff, Tagliamonte and Smith2005). The results reveal that the only factor that is significant to the variation among the major types of GEs in these data is community. Each community favours different GE types. Further, certain varieties privilege one or more types and in some cases certain types rarely occur. For example, something-variants are disfavoured in Maryport while thing(s)-variants are disfavoured in Cumnock. More importantly, however, observe that neither the length of the GE nor the presence of a co-occurring discourse-pragmatic feature is significant for any of the main GE types (where testing this effect is possible). Only in the catch-all group of ‘other’ GE variants is there an effect of length and co-occurrence with a discourse-pragmatic feature. The reason for this is the wide range of forms, mostly long (e.g., and all the rest of it), that comprise this part of the system.

Table 5.4. Logistic regression analysis of main GE types (sorted by generic/pro-form)

	something	anything	thing(s)	stuff	other
	.20	.05	.11	.02	.11
Community
Cumnock	.59	.59	.39	.62	not
Maryport	.33	.62	.51	.59	significant
Cullybackey	.48	.42	.69	.29
Portavogie	.67	.62	.51	.64
Length of GE
Long	not	not	not	not	.74
Short	significant	significant	significant	significant	.43
Discourse-pragmatic feature
Yes	not	not	not	not	.52
No	significant	significant	significant	significant	.35

5.7.4 What of ‘and that’?

Given the notable differences across communities based on the type of GE, it will be instructive to determine how the major GE type in these data, and (all) (that), distributes across communities when the individual variants are viewed independently. The form and that is the pivotal form proffered as evidence that the GE system is changing over time. The argument is that it is the most advanced in terms of grammaticalisation based on tests for frequency, length and co-occurrence with discourse-pragmatic features (see, for example, Cheshire [Reference Cheshire2007] based on earlier research by Aijmer [Reference Aijmer2002]; Brinton [Reference Brinton1996]). Figure 5.9 plots the frequency of each of the variants of this type, i.e., and all, and that and and all that. The results reveal that the distribution of these forms is highly differentiated from one community to the next. Maryport favours the and that-variant, Portavogie favours and all that and Cumnock and Cullybackey favour and all. The results for Maryport, in north-west England, show remarkable correspondence with working-class individuals in the communities of Hull (Cheshire Reference Cheshire2007) and Berwick-upon-Tweed (Pichler and Levey Reference Pichler and Levey2011), also both located in northern England. The fact that, in addition, these data comprise a majority of working-class speakers corroborates the hypothesis that and that is a working-class variant (Cheshire Reference Cheshire2007: 164). Further, correspondence across these northern communities suggests that and that may well be a more general northern working-class feature.

Figure 5.9 Distribution of and all, and that, and all that by community

A final question to consider is whether and that, as in (10), is old or new. In working-class speech in the early 2000s in Reading, Milton Keynes and Hull in the United Kingdom, and that is reported to be ‘firmly entrenched’ (Cheshire Reference Cheshire2007: 165). Quantitative investigation in Berwick-upon-Tweed (Pichler and Levey Reference Pichler and Levey2011) allows a consistent comparison with the Roots Archive data, as in Figure 5.10. This demonstrates that the oldest living generation in the small north-east England town of Maryport also use and that to a relatively high degree, patterning along with Berwick-upon-Tweed. This distributional pattern suggests that and that is not an advancing GE in England but rather the retention of a conservative northern English feature generally, as in (10a)–(10b). In sum, the short GE and that is not a development, i.e., a reflex of ongoing grammaticalisation, but it is a retention.

(10)

a.
We had greenhouses like and green tomatoes and that you know.
(Phil Stevenson, 84, MPT)
b.
And then there were maybe a wee dance and that.
(Angus Milroy, 66, CMK)

Figure 5.10 Proportion of and that in the Roots Archive compared to Berwick-upon-Tweed (as reported in Pichler and Levey Reference Pichler and Levey2011)

5.8 Discussion

The study of GEs in the Roots Archive has offered a new perspective on the GE system in relatively conservative dialects among elderly speakers from the general vantage point of northern England, south-west Scotland and Northern Ireland. GE use in these conservative dialect data is an indication into a possible earlier stage in the GE system. Given the competing hypotheses regarding the status of the GE system in contemporary varieties in England (and elsewhere), this is an important addition to the current body of knowledge. Both frequency and patterns of use combine to show the inner workings of the system across dialects. While there is very little use of stuff-variants, a type that is reported to be accelerating in North America as well as in the northern English city of York (Denis Reference Denis2011), other short forms are prevalent, especially and that, and all and and all that.

Comparing these conservative northern English dialects with earlier analyses of Hull (Cheshire Reference Cheshire2007), Berwick-upon-Tweed (Pichler and Levey Reference Pichler and Levey2011) and in some cases York (Denis Reference Denis2011) reveals that both short and long forms are part of the GE inventory, regardless of place or nature of the locality or period represented by the data. Thus, while some previous research (see, for example, Aijmer Reference Aijmer2002; Brinton Reference Brinton1996) has built a picture of the GE system as one that involves increasing grammatical development of forms (e.g., and things like that > and things) as they grammaticalise over time, the results presented here continue to corroborate the alternative perspective (Pichler and Levey Reference Pichler and Levey2011; Tagliamonte and Denis Reference Tagliamonte and Denis2010), namely that there is no evidence for ongoing grammaticalisation.

Indeed, it seems likely that the English GE system has had a varied inventory of forms over an extended period of time. Perusal of the Oxford English Dictionary shows that most of the currently available variants have been attested for centuries. At the same time, it is becoming apparent that the generic component of this system is undergoing shift (i.e., that > stuff). Conservative dialect speakers in the four isolated communities in the Roots Archive as well as working-class speakers in Berwick-upon-Tweed and Hull retain a healthy proportion of variants with and (all) (that) and show scant incursion of stuff-variants.

Another attribute of the GE system is its marked dialectal dimension. The cross-dialect comparison has exposed the local aspects of this area of grammar. While a core inventory of forms is stable both across these conservative dialects and across studies (e.g., or something, and things, and everything), the individual communities in the Roots Archive each privilege one of the other forms in their inventory: and all, and all that, and that (Figure 5.9). These results suggest that further research on different varieties will offer additional new perspectives on this grammatical system. Which parts of the system are universal (in the sense that they occur consistently across varieties) and which are local (in the sense that they are unique to particular communities), and what is the linguistic mechanism underlying the difference between these types across locales?

5.9 Conclusion

The use of relic dialects as a proxy for diachronic data in the study of discourse-pragmatic change introduces a novel application of research practice that extends the time-depth for discourse-pragmatic change research. As far as methods themselves are concerned, none of the analyses I have implemented here are particularly new. They follow in line with a long-standing tradition in variationist sociolinguistics to approach linguistic phenomena systematically, employing methods that are both accountable and replicable (Labov Reference Labov1972b). The extension of quantitative methods to features beyond phonology has been going on since the early 1970s, beginning with Sankoff’s (Reference Sankoff, Bailey and Shuy1973) research on tense-aspect phenomena, indefinites and complementisers, then extending into broader areas of more quintessential discourse-pragmatic phenomena, including punctors (Vincent and Sankoff Reference Vincent and Sankoff1992), exemplification (Vincent Reference Vincent1992) and, of course, GEs (Dubois Reference Dubois1992). Within the same time-frame, many developments were concurrently underway in the investigation of the pragmatic and interactional functions of these features (see, for example, Aijmer Reference Aijmer and Togeby1985; Schiffrin Reference Schiffrin1987; Stenström Reference Stenström, Jucker and Ziv1998). What is new in more recent approaches to discourse-pragmatics is to follow through on: (i) Lavandera’s (Reference Lavandera1978: 181) suggestion to ‘relax the condition that the referential meaning must be the same for all the alternants and substitute for it a condition of functional comparability’; and (ii) Sankoff and Brown’s (Reference Sankoff and Brown1976: 650) method to include the grammatical function of a construction as well as the ‘the uses to which it is put’. In other words, variationist methods are employed but are guided by structural, functional and interactional information (see, for example, Cheshire Reference Cheshire2007; Pichler Reference Pichler2013, Chapter 3; Tagliamonte and Denis Reference Tagliamonte and Denis2010, Chapter 4; Wagner et al. Chapter 9). In essence, this practice leads to a synthesis of information that can then be used for delimiting the grammatical dimensions of the variable set of which the discourse-pragmatic variants are a part. These developments tap the best of both worlds (quantitative and qualitative) and together serve to advance the host of discourse-pragmatic features available for study. The next step will be to ensure that a consistent methodological practice is applied rigorously across datasets so that cross-variety comparisons and diachronic pathways can be meaningfully explored.

With regard to more theoretical issues, these findings offer an illuminating window into discourse-pragmatic change. First, the building evidence suggests that the nature of changes in the GE system is not a singular or linear grammaticalisation pathway as often documented in the literature. There is no apparent influence of the Constant Rate Effect (Kroch Reference Kroch1989), the well-attested pattern of parallel development of constraint effects over time. While the GE inventory seems to exhibit a cohesive set of forms and patterns, their compositional inventory varies and local (idiosyncratic) forms appear. Nor is analogical levelling, the foundational process underlying grammatical change (Kurylowicz Reference Kurylowicz1949), visible in the evolution of GEs either singularly or as a group, at least not insofar as I have been able to test for this here.Footnote ⁹ At the same time, the GE system has evidently changed dramatically over the past century as new components have emerged (stuff) and taken a prominent position within the system, at least in some locales and populations. As research on discourse-pragmatic features continues, a more comprehensive inventory of variables, time points and types of data will enrich this building picture of the nature of change in features such as GEs and their like.

6 Quotatives across time: West Australian English then and now

6.1 Introduction

Labov’s (Reference Labov1972c: 98) ‘cumulative principle’ which states that ‘the more that is known about a language, the more we can find out about it’ crystallises in a plethora of studies on direct speech- and thought-encoding across Englishes, exemplified in (1)–(4).Footnote ¹

(1)
I walked down the hill and as I was going down the hill I thought, ‘By Jove, I might be able to get into the Flying Corps. Wouldn’t it be lovely?’
(HB-M-1889)

(2)
Dad had no time for religion. He reckons it was just a big, bloody business robbing all the poor people! And he says, ‘Well, that’s her thing. She can go for her life on that so long as it doesn’t interfere with me!’
(FB-M-1922)

(3)
So you’re thinking positive and the next thing you know, they find out you’ve got a record or something. Ø, ‘Tough luck Charlie. You don’t get the job.’
(TM-M-1970)

(4)
She’s like, ‘Oh!’ and then goes off into yelling, you know, and you can’t have a decent argument any more.
(AM-M-1970)

Quotation plays a key role in how narratives are constructed and researchers have investigated the use of quotative verbs in some depth. A wealth of mostly synchronic research into the quotative system of predominantly inner-circle Englishes has noted the emergence of innovative forms such as be like in (4) to introduce speech, sound, thought and gesture (see, inter alia, Baird Reference Baird2001; Barbieri Reference Barbieri2007; Blyth et al. Reference Blyth, Recktenwald and Wang1990; Buchstaller Reference Buchstaller2006a; Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009; Cukor-Avila Reference Cukor-Avila2002; D’Arcy Reference D’Arcy2004, Reference D’Arcy2010; Dailey-O’Cain Reference Dailey-O’Cain2000; Ferrara and Bell Reference Ferrara and Bell1995; Johnstone Reference Johnstone1987; Macaulay Reference Macaulay2001; Romaine and Lange Reference Romaine and Lange1991; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2004; Tagliamonte and Hudson Reference Tagliamonte and Hudson1999; Terraschke Reference Terraschke, de Beuzeville and Peters2010a; Winter Reference Winter2002). Crucially, quotative be like is not a fleeting innovation but is instead strongly entrenched in the quotative system of many contemporary varieties of English (Buchstaller Reference Buchstaller2006a; Cukor-Avila Reference Cukor-Avila2002; Levey Chapter 7; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007). In my recent study of Australian English (AusE) (Rodríguez Louro Reference Rodríguez Louro2013), I have reported an adolescent peak in the use of be like, an indication of a change in progress (Labov Reference Labov2001b: 455). If be like usage in AusE is a recent innovation (as my earlier findings suggest), two questions remain: (i) what was the AusE quotative system like before the incursion of quotative be like?; and (ii) given that self-revelation has emerged across varieties of English as a discourse mode where speakers imbue their narratives with reports of their inner thoughts, attitudes and feelings (Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009; D’Arcy Reference D’Arcy2012; Ferrara and Bell Reference Ferrara and Bell1995; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007), is the incursion of be like motivated by an increase in self-revelation in earlier AusE narratives?

The evolution of the English quotative system has received less attention than its synchronic variation. Notable exceptions are listed in Table 6.1. Buchstaller (Reference Buchstaller2011) and D’Arcy (Reference D’Arcy2012) performed variationist analyses on the diachrony of quotation in north-east England and New Zealand, respectively, in order to examine the quotative system of these varieties before the rise of be like. They document little variation in earlier quotative systems, with say uncontested well into the twentieth century and think emerging as an encoder of thought and attitude before the ingress of be like in the second half of the twentieth century. Crucially, both studies show marked restructuring of the quotative system over time. As D’Arcy (Reference D’Arcy2012: 360) notes, the contemporary quotative system of English results from historical evolution and is not the ‘reflex of recent lexical innovations and incursions’, a finding also underscored in the present study and only made possible through the examination of longitudinal data. Moreover, the rise of self-revelation in narrative mentioned earlier is viewed as one of the main reasons motivating the expansion of the English quotative system. Ferrara and Bell (Reference Ferrara and Bell1995: 283) observe that internal state reporting in their American English data is to be expected given Carbaugh’s (Reference Carbaugh1988) socio-cultural observation that American narrative tends towards, as they put it, the ‘lionisation of self-revelation as a preferred cultural mode’. The increase of self-revelation in narrative – and the concomitant rise in the use of quoted material – has been documented beyond American English to include Canadian, British and New Zealand English (NZE) (Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009; D’Arcy Reference D’Arcy2012; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007). The question remains as to whether these changes are also attested in earlier AusE.

Table 6.1. Details of existing longitudinal studies on English quotatives

Study	Variety	Corpora	Data collection	Speakers’ years of birth
Buchstaller (Reference Buchstaller2011)	British English	Diachronic Electronic Corpus of Tyneside English (Beal and Corrigan Reference Beal, Corrigan, Mallinson, Childs and van Herk2013)	1960s/1970s 1990s 2007–2009	1895–1990
D’Arcy (Reference D’Arcy2012)	New Zealand English	Origins of New Zealand English Archive (Gordon et al. Reference Gordon, Hay, Maclagan, Beal, Corrigan and Moisl2007)	1946 & 1948 1990 & 1996 2006	1851–1935
Höhn (Reference Höhn, Hundt and Gut2012)	Irish English	International Corpus of English-Ireland (Kallen and Kirk Reference Kallen, Kirk, Beal, Corrigan and Moisl2007)	1990–1994 2002–2005	Unspecified

Drawing on spontaneous narratives of personal experience collected in Perth, Western Australia, between 1963 and 2011, this chapter offers the first longitudinal study of quotatives in AusE, a variety severely underdocumented in the variationist literature (Schneider Reference Schneider and Mesthrie2012: 349).Footnote ² A singular real-time depth of forty-eight years and a sample of speakers aged thirty-five to ninety-three show that the Australian quotative system has undergone dramatic changes from the nineteenth to the twenty-first century and that – in line with D’Arcy’s (Reference D’Arcy2012) findings for NZE – the quotative system was in flux well before the arrival of be like. These findings contribute an original perspective to current knowledge about the expansion of the quotative system in AusE. They suggest that the functional niche of internal thought- and attitude-encoding was already in place by the time be like entered the system in the late twentieth century, which, in turn, may explain why be like’s expansion was so rapid. The analysis also reveals variability in the use of quotative frames across time: a highly invariable system dominated by say giving way to patterned variability in how quotative introducers, grammatical person and tense are deployed in narratives. Moreover, variation in AusE quotatives across time is socially constrained. These results are important in highlighting the complex nature of discourse-pragmatic variation and change.

The chapter is organised as follows. The data and methods used in this study are presented in Section 6.2, followed by the presentation of the longitudinal results in Section 6.3. The chapter ends with a discussion and conclusion in Sections 6.4 and 6.5.

6.2 Data and methods

6.2.1 Data: the University of Western Australia Corpus of English in Australia

The present study is based on an analysis of spontaneous narratives of personal experience extracted from two corpora of West Australian vernacular English: (i) the State Library of Western Australia Oral History Collection collected between 1963 and 2007 and including speakers born between 1870 and 1980; and (ii) the University of Western Australia (UWA) Sociolinguistic Interview Corpus collected in 2011 and including speakers born between 1950 and 1960. Both data sources are components of the UWA Corpus of English in Australia (Rodríguez Louro under construction); they are comparable in that the data were collected in the Australian city of Perth by an interviewer asking questions about the interviewees’ lives, including danger of death scenarios as well as childhood experiences and reminiscences (Labov Reference Labov, Baugh and Sherzer1984: 34). The combined dataset of 822,144 words includes forty-four interviews with twenty-three female and twenty-one male Anglo-Australians of professional (N = 29) and non-professional (N = 15) backgrounds.Footnote ³ A breakdown of the speaker sample is given in Table 6.2. For the present analysis, it is organised into four periods by speakers’ year of birth: 1870–1890, 1920–1930, 1950–1960 and 1970–1980. Data for each of these periods were collected at different points in time during the period spanning 1963 and 2011; speaker ages range from thirty-five to ninety-three across time periods. Importantly, all speakers included in the sample were aged thirty-five and above at the time of the interview, i.e., they were all interviewed well after the stabilisation of their vernaculars (Labov Reference Labov2001b: 447; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2009: 66).

Table 6.2. West Australian English speaker sample

Speaker year of birth	Period of data collection	Speaker age at the time of data collection	N of speakers
Speaker year of birth	Period of data collection	Speaker age at the time of data collection	Female	Male
1870–1890	1963–1980	88–93	4	4
1920–1930	1993–2005	68–83	6	5
1950–1960	1990–2011	37–54	10	8
1970–1980	2007	35–37	3	4
Total N of speakers			23	21

6.2.2 Methods of data analysis

6.2.2.1 Variable context

Research into linguistic variables ‘above and beyond phonology’ (Sankoff Reference Sankoff, Bailey and Shuy1973) must grapple with the type of ‘form-function polyvalence’ (Sankoff Reference Sankoff and Newmeyer1988: 157) characteristic of natural discourse. In the tradition of variationist research into discourse-pragmatic variables (Dines Reference Dines1980: 15) and in line with previous research on quotatives (see, inter alia, Macaulay Reference Macaulay2001; Tagliamonte and Hudson Reference Tagliamonte and Hudson1999), the variable context for the present analysis was functionally defined to include ‘all strategies used to introduce reported speech, sounds, gesture and thought by self or other’ (Buchstaller Reference Buchstaller2006a: 5). I thus included in the analysis the range of quotative frames, including variants with an overt quotative introducer, as in (5)–(12), and the zero variant represented by Ø in (13).

(5)
A mate said, ‘Look do you want to meet a good girl, beautiful girl? She’s staying with my sister.’
(RKT-M-1925)

(6)
Because I thought, ‘The government has no right to take that money.’
(JT-M-1886)

(7)
I put my arms out and went, ‘Nooo!’
(SIBUR-F-1955)

(8)
We were listening to it and as it goes on it gets worse and I’m like, ‘Oh my God!’ But yeah, you should listen to it.
(HGCD-M-1950)

(9)
And it’s like, ‘Calm down, it’s me that’s going, not her.’
(AM-M-1970)Footnote ⁴

(10)
Cause every job you go for it’s, ‘What experience have you had? What experience have you had?’
(VC-F-1960)

(11)
Her boyfriend was the one that got her pregnant, or supposedly pregnant, and after he got her pregnant, he just told her, ‘Piss off. I was only using you.’
(TM-M-1970)

(12)
Garry yelled out, ‘Mum, what’s that man ‘itting them ‘orses for?’ in his worst South Fremantle accent.
(GB-M-1926)

(13)
And his high beam didn’t work, so they pulled him over Ø, ‘Flick your high beam on for us please.’
(HC-F-1970)

The data were analysed drawing on the available verbatim transcripts and their corresponding digitised audio files. Checking each transcription against the recorded material proved crucial to providing accurate results. Quotative frames and the quote/inner thought they introduce had at times been omitted from the original transcription. This may be due to the fact that ‘discourse-pragmatic features […] are referentially and syntactically optional elements of discourse that can be omitted without necessarily altering the propositional meaning or syntactic structure of an utterance’ (Pichler Reference Pichler2010: 588). However, making sure they are extracted from the available materials is paramount to exhaustive data mining. An example is shown in (14), where the bolded and italicised text consists of material missing from the original transcription which was later recovered by double-checking the transcript.

(14)
Me and most people get tired in the afternoon when it’s about half past two, three o’clock, because you’re standing on your feet eight hours a day and your legs just can’t take it anymore, but then they get used to it but you still feel tired at the end of the day. You say, ‘Oh I can’t wait till half past four and you’re at the door.’
(TMA-F-1971)

6.2.2.2 Exclusions

Incomplete and ambiguous tokens were excluded from the analysis. Instances such as (15) were viewed as incomplete as the quoted content was missing. In (16), the connector and makes utterance interpretation ambiguous: while in (16a) call is analysed as the quotative verb, in (16b) the quotative is zero.

(15)
And he just had it going cheap so we just thought …
(AM-M-1970)

(16)

a.
And the tradesmen would call, ‘And Good morning Mrs Thompson,’ it was getting embarrassing so it wasn’t long and we thought we would get married.
(JT-M-1886)
b.
And the tradesmen would call and Ø, ‘Good morning Mrs Thompson,’ it was getting embarrassing so it wasn’t long and we thought we would get married.
(JT-M-1886)

Canonically, English quotative frames are positioned immediately before the quoted content (D’Arcy Reference D’Arcy2012; Romaine and Lange Reference Romaine and Lange1991; Schiffrin Reference Schiffrin1981). However, among those born in the earlier periods, say is differentially positioned before or after the quoted material, as shown in (17). Such tokens were included in the distributional analysis in Section 6.3.1. However, given the fact that post-posed syntactic position is restricted to the variant say, post-posed say tokens were excluded from the multivariate analysis presented in Section 6.3.2 which focuses on quotative variation in pre-quote position.

(17)
‘No blokes,’ they said.
(FB-M-1922)

6.2.2.3 Sociolinguistic variables

The adoption in this chapter of variationist methods parallel to those in Buchstaller (Reference Buchstaller2011) and D’Arcy (Reference D’Arcy2012) – including circumscription of the variable context and coding procedures for independent variables – allows for comparability of results across longitudinal studies of the quotative system in order to ‘determine where intra- and inter-dialectal discourse variation occurs’ (Pichler Reference Pichler2010: 601).

The linguistic factors involved in synchronic variation in the English quotative system – usually described in relation to be like but also applicable to other quotative frames (D’Arcy Reference D’Arcy2012) – include the ‘classic’ variables of grammatical person (of the matrix subject) and content of the quote (Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007: 203), with the latter including the factors: speech, as in (18); inner thought, as in (19); and non-lexicalised sounds and gestures, as in (20). Previous research has shown that say and go tend to co-occur with third-person speech and that go tends to collocate with non-lexicalised sound (Blyth et al. Reference Blyth, Recktenwald and Wang1990; Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009; Romaine and Lange Reference Romaine and Lange1991; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007).

(18)
He said, ‘I never got a photo of it because we didn’t have a damn camera.’
(FB-M-1922)

(19)
I thought, ‘This looks like a really lovely little school.’
(RKG-F-1957)

(20)
And she goes, {sound of intake of breath} ‘did you hear that Peter … he’s going with a girl?’
(AM-M-1970)

The effect of tense/time reference on quotative variant choice has been shown to vary across locales (see, for example, Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009), with the most consistent finding suggesting that in narration be like tends to collocate with the historical present, i.e., with verbs that are morphologically present but have past tense reference (Schiffrin Reference Schiffrin1981). To test the effect of tense on quotative variant choice in AusE, I coded all tokens of the variable in my data for one of the following factors: present tense, as in (21); past tense, as in (22); and historical present, as in (23).

(21)
So, you know, he goes to the shop and says, ‘Oh, did any kids bring any crates in here today?’ and they go, ‘Yeah.’ And if you just happen to be hanging around he goes, ‘Oh, that kid there, there and there,’ and he looks around and he gets the cops and you hear this walling of sirens.
(TM-M-1970)

(22)
I remember he was an American sailor and he was just solitary and walking round and we picked him up and said, ‘Come in,’ and he sat down at our grand piano and played the piano for a couple of hours.
(MR-F-1928)

(23)
And of course when the contract was finished they wanted us to move and my father says, ‘No.’
(SH-F-1889)

The social factors of sex and occupation are included in the quantitative analysis to tease out the degree to which these impact potential changes in the choice of quotative variants across time. Information about speakers’ occupational status is available across corpora. Professional occupations include those for which speakers have completed one or more university-level courses (e.g., medical doctor, lawyer, university professor). Non-professional speakers are engaged in occupations for which an apprenticeship or trade certificate is needed (e.g., factory worker, welder, plumber).

6.3 Results

6.3.1 Overall distribution of variants

To examine the distribution of quotative variants before the onset of be like and to establish whether the AusE quotative system has changed across time, let us first consider in Figure 6.1 the overall usage frequency of quotative frames in the West AusE speaker sample introduced in Table 6.2.Footnote ⁵ Figure 6.1 shows that, across time periods and in line with previous research (Blyth et al. Reference Blyth, Recktenwald and Wang1990; Buchstaller Reference Buchstaller2011; Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009; Cukor-Avila Reference Cukor-Avila2002; D’Arcy Reference D’Arcy2004, Reference D’Arcy2012; Ferrara and Bell Reference Ferrara and Bell1995; Tagliamonte and Hudson Reference Tagliamonte and Hudson1999; Tannen Reference Tannen and Coulmas1986), say is the main quotative variant used. These results are noticeably different from those reported in recent comparable research into the contemporary AusE quotative system. Rodríguez Louro (Reference Rodríguez Louro2013) reports that be like is the most widely used quotative variant among Australian speakers aged eleven to twenty-six (born between 1985 and 2000), accounting for 80% of all quotatives used by speakers in this age cohort. The results presented in Figure 6.1 illustrate a vastly dissimilar system for AusE speakers born between 1870 and 1980, with quotative say dominating across cohorts and time periods.

Figure 6.1 Distribution of quotative variants used by adult Anglo-Australians born between 1870 and 1980

Example (24) illustrates typical uses of quotative say in late nineteenth century Anglo-AusE. A usage unattested in the later AusE data is that of say after the quotation (see also (17) in Section 6.2.2.2). This usage is most frequent in the speech of those born between 1870 and 1890 where it accounts for 11% (N = 42/392) of all quotatives used (see D’Arcy [Reference D’Arcy2012] for similar findings in NZE). As noted in Section 6.2.2.2, because post-posed syntactic position is restricted to say, these tokens were not included in the multivariate analysis presented in Section 6.3.2. However, a brief note on their usage is in order. With time, and in line with D’Arcy’s (Reference D’Arcy2012) findings for NZE, quotative post-posing has become obsolete and is absent from contemporary conversational AusE. We can witness a steady decrease in its use from the late nineteenth to the mid-twentieth century: 11% (1870–1890) > 4% (1920–1930) > 0% (1950–1960). This development suggests an important change in how direct speech is framed in AusE narratives.

(24)
On this particular day he said, ‘We are going to have company drill.’ He said, ‘I’m going to make a few purposeful mistakes and I want you to tell me when it’s over what mistakes I’ve made.’ Well, he made some palpable errors and I said, ‘Oh, there’s another mistake, Captain Thompson.’ He said, ‘And what was that?’ ‘Well,’ I said, ‘The company was in such and such a position and you gave a certain order which was quite incorrect.’ ‘Oh,’ he said, ‘Was it?’ So I thought, ‘I put my foot in it!’
(JT-M-1886)

6.3.2 Multivariate analysis

This section focuses on uncovering the sociolinguistic constraints on the use of say, the most frequent quotative variant in the dataset across the time periods represented in the current data (see Figure 6.1). What do these constraints tell us about the evolution of the AusE quotative system?

Multivariate analysis with Goldvarb X (Sankoff et al. Reference Sankoff, Tagliamonte and Smith2005) is used to uncover these constraints. Some notes on the analysis and the results reported in Tables 6.3–6.4 and 6.6–6.7 are in order. Firstly, across multivariate analyses, non-application values feature the most frequent quotative variants other than say, including zero, think, go and (where applicable) be like. Quotatives in the ‘other’ cohort are not included in the multivariate analysis. Secondly, the token count across factor groups is not always identical due to exclusions. For example, zero quotatives had to be excluded from consideration in the grammatical person and tense factor groups. Thirdly, the only factors considered in the analysis are those representing no more than 95% of the data in the respective factor group (Guy Reference Guy, Ferrara, Brown, Walters and Baugh1988: 131). Where tense variation is modest, all tokens with non-past tenses (i.e., simple present, historical present, modal constructions) are collapsed. Fourthly, content of the quote is not included as a factor group in the multivariate analyses of say because say categorically introduces direct speech (rather than thought or non-lexicalised sound/gesture). The relative frequency of speech- vs. thought-introduction is addressed in the discussion of the results in Section 6.4. Finally, across multivariate analyses, favouring factor weights are shaded while non-significant factor weights are given in square brackets.

Table 6.3. Contribution of linguistic and social factors to the use of say among speakers born between 1870 and 1890

Input	.92
Total N	495
	FW	%	N
Tense
Past	.51	93.1	407
Non-past	.25	81.2	16
Range	26
Occupation
Professional	.50	86.5	267
Non-professional	.49	70.6	228
Range	1
Sex
Female	[.69]	91.3	23
Male	[.49]	78.6	472

Table 6.4. Contribution of linguistic and social factors to the use of say among speakers born between 1920 and 1930

Input	.93
Total N	376
	FW	%	N
Occupation
Professional	.54	83.1	260
Non-professional	.40	50.0	116
Range	14
Tense
Past	.52	94.8	191
Non-past	.45	92.9	99
Range	7
Sex
Female	[.52]	83.0	153
Male	[.46]	65.9	223

Qualitative observations across time periods are also offered to illustrate how numerically marginal quotative variants have extended their pragmatic function across time.

6.3.2.1 The late nineteenth century (speakers born between 1870 and 1890)

The quotative system of those born between 1870 and 1890 is remarkably invariable, both in terms of the quotative variant used and in terms of the function of quotation. As shown in Figure 6.1, say represents 76% of the quotatives employed by speakers born in this period. Zero quotatives are well represented (14%); think is used minimally (5.4%); and (third-person-marked) quotative go is infrequent (0.6%) and pragmatically restricted to introducing either lexicalised sound, as in (25), or verbatim content, as in (26).

(25)

a.
And I went, ‘Whoof, whoof, whoof.’
(HB-M-1889)
b.
The machine was flying about a hundred thousand feet and all of a sudden it went, ‘Whoosh’ and nearly turned over.
(HB-M-1889)

(26)
The first verse went, ‘Let books for a while have a rest on their shelves Viva la compagnie. While we’re singing the praise of our excellent selves. Viva la compagnie.’
(BH-F-1889)

Table 6.3 displays the results of a multivariate analysis of the sociolinguistic factors constraining the use of say among speakers born between 1870 and 1890.Footnote ⁶ The results show that tense exerts the strongest influence on say usage, with say more likely to occur with the past tense in this cohort. Regarding social factors, occupation plays a significant role, with professionals slightly favouring say over non-professionals. A closer look at the use of say vs. other quotative variants across occupational groups shows that while professionals in this period use quotatives other than say (including zero, think and go) at a rate of 13.5%, their non-professional counterparts employ quotative variants other than say at a rate of 29.4%. These differences suggest that change in the quotative system of speakers born in the late nineteenth century was propelled by those from a non-professional background. Cross-tabulations of occupation and sex (not reproduced here) show that it is non-professional males who frequently use quotatives other than say, while non-professional females exclusively use say.

6.3.2.2 The early twentieth century (speakers born between 1920 and 1930)

As shown in Figure 6.1, the quotative system of those born between 1920 and 1930 is also relatively invariable, with say dominating at 68%. Zero quotatives show a sizeable increase in comparison with the previous cohort (21%), and think continues to feature minimally (2%). Quotative go (marked for third person) evinces a minimal increase in frequency across the two cohorts (from 0.6% to 2%), and the pragmatic contexts in which it appears are in line with its long-documented usage, i.e., the encoding of lexicalised sound and gesture (first attested in the Oxford English Dictionary in 1791, D’Arcy Reference D’Arcy2012: 353). Importantly, we find two variants in this cohort (subsumed in Figure 6.1 in the ‘other’ category) that were not attested in the previous cohort: the go like that {gesture}-construction, illustrated in (27) and the something like Ø-construction, shown in (28). Use of the latter variant is in line with Haddican and Zweig’s (Reference Haddican and Zweig2012: 12) proposal that ‘be like quotatives involve a null something indefinite’ and a null deictic ‘that’ usually unexpressed in inner-circle Englishes. The uses in (28) may thus be viewed as precursors to quotative be like, undocumented in the dataset used here until the mid-twentieth century (see Section 6.3.2.3).

(27)
A little girl at the till there was going like that {gesture} and she marked it up.
(RKT-M-1925)

(28)

a.
Something like Ø, ‘State School kids are stupid kids’ and something else, I don’t remember.
(FB-M-1922)
b.
What’s that rhyme they did? Something like Ø, ‘Pigs in and out the water.’ {laughs}
(MMS-F-1924)

The sociolinguistic conditioning of say in this cohort is considered in Table 6.4. The results are similar to those yielded for the predecessor cohort: say overwhelmingly occurs with third-person subjects (and was therefore not included in the multivariate run), and it is associated with past tense contexts. However, the strength of tense has diminished in relation to the previous period; this factor group no longer ranks first in the multivariate analysis. Instead, in this cohort, occupation exerts the most important effect on the variation between say and other quotative variants, with professionals favouring say over non-professionals. When occupation and sex are considered together, males in both occupational groups are more likely than their female counterparts to use quotatives other than say (i.e., zero, think, go): 34% (N = 76/223) vs. 17% (N = 26/153). Specifically, it is non-professional males who take the lead: their use of non-traditional quotatives, including think, zero and go, equals their say usage.

Particularly noticeable is non-professional males’ preference for the zero variant, exemplified in (29), which accounts for almost half of all quotatives used by this group (47%, N = 54/116). The rise of the zero variant goes hand-in-hand with an increase in the introduction of thought and attitude in narratives, and is in line with the change in narrative style mentioned at the outset of the chapter and explored in more detail in Section 6.5

(29)
Yes we had a few Italians at our school, due to them coming in from the vineyards and the Swan. Ha! Garlic all over you! You could smell who was sitting behind you. Ø, ‘Oh, stop your yawning. Blow your stinking garlic in some other direction, not down our neck.’
(FB-M-1922)

Previously dominated by say and direct speech encoding, the system of those born in the early twentieth century shows increasing variability in quotative variants, and the rise of the null form heralds the introduction of reported thought and non-lexicalised content, an upward trend that, as shown next, continues well into the mid-twentieth century.

6.3.2.3 The mid-twentieth century (speakers born between 1950 and 1960)

Although still dominated by say (58%), the quotative system of AusE speakers born between 1950 and 1960 nevertheless displays richer variation than it did previously. As shown in Figure 6.1, the frequency of think has risen considerably (16%), that of zero has declined somewhat (16%) and that of go is negligible (1%). While still incipient (1%), be like is first attested among those born in the mid-twentieth century. The increasing usage rates of think and the appearance in this cohort of be like reflect a rise in the encoding of inner thought and attitude (see, for example, (8) where be like introduces internal thought). Previously almost invariably used with the third person, the occurrence of say is no longer restricted to third-person subjects, and hence its inclusion in the statistical analysis reported in Table 6.5.

Table 6.5. Contribution of linguistic and social factors to the use of say among speakers born between 1950 and 1960

Input	.82
Total N	158
	FW	%	N
Person
Third	.74	92.5	67
First	.14	46.3	41
Range	60
Tense
Past	.59	77.9	68
Non-past	.39	74.6	59
Range	20
Sex
Female	[.52]	64.9	77
Male	[.47]	60.5	81
Occupation
Professional	[.50]	62.3	138
Non-professional	[.50]	65.0	20

Table 6.5 shows the results of a multivariate analysis of the sociolinguistic factors constraining the use of say by speakers born between 1950 and 1960. Say is significantly constrained by grammatical person and tense: it is favoured with third-person subjects and in the past tense. The rise of think among those born in the mid-twentieth century further contributes to the strong effect of third-person subjects on say usage. The hallmark of think is its co-occurrence with first-person subjects, and the effect of this trend impacts the entire quotative system. Say usage is not significantly constrained by social factors in this period. In line with previous cohorts, however, females continue to use say slightly more frequently than males. Crucially, while occupation significantly constrains the use of quotative variants in previous periods (with professionals consistently favouring say over non-professionals), this constraint is no longer operative among those born between 1950 and 1960: professionals and non-professionals now utilise the combination of non-say quotatives at comparable rates.

6.3.2.4 The late twentieth century (speakers born between 1970 and 1980)

As shown in Figure 6.1, the quotative system of AusE speakers born between 1970 and 1980 is the most variable in the corpus. Say is still the majority variant (55%), but other quotative frames also feature with different degrees of prominence: zero (14%), go (12%), think (9%) and be like (2%). The frequency of think has declined noticeably in relation to the 1950–1960 cohort (16% > 9%). In contrast to previous cohorts, go is the third most frequent quotative in late twentieth century AusE. Not only has quotative go increased in overall usage frequency, it has crucially expanded its pragmatic function and is now a firm encoder of ‘vivid’ third-person speech, in line with Winter (Reference Winter2002: 11). Starting in the go like that [gesture] construction used by participants born in the early twentieth century and as an introducer of lexicalised sound for speakers born in the late nineteenth century, quotative go has expanded to introduce a variety of quote content types, along the following cline: lexicalised sound/gesture > direct speech (see also Butters Reference Butters1980, Reference Butters1982; Vandelanotte Reference Vandelanotte, Buchstaller and Van Alphen2012). The final stage in this extension is evidenced in go’s preference for third-person subjects (see also Blyth et al. Reference Blyth, Recktenwald and Wang1990; Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009; Romaine and Lange Reference Romaine and Lange1991) and its co-occurrence with the historical present (in line with Winter Reference Winter2002), a fundamental feature of English narrative complicating action clauses (Wolfson Reference Wolfson1982). These trends are clearly documented among those born between 1970 and 1980: go collocates with third-person subjects at an overwhelming 94% (N = 34/36), and features with the historical present at 72% (N = 26/36). These trends for go are related to how say is used in narrative, as explained next.

Table 6.6 shows the results of a multivariate analysis of the sociolinguistic factors constraining the use of say among speakers born between 1970 and 1980. Sex and occupation do not make a significant contribution to the occurrence of say in this cohort. Say now appears across the factors constituting the factor groups person and tense, which for the first time allows us to include all factors (individually) in the multivariate analysis. In line with previous studies (Blyth et al. Reference Blyth, Recktenwald and Wang1990; Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009; Romaine and Lange Reference Romaine and Lange1991; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007), say in this cohort is significantly correlated with past tense usage and – to a lesser extent – the present tense. Contrary to the data from those born in previous decades, say is suddenly favoured here with first-person subjects. The change in the association of say from third to first person stems from the aforementioned encroachment of go as a narrative device and from the pivotal role of grammatical person in discourse organisation (see Section 6.4). Among those born between 1970 and 1980, say introduces first-person, past tense-marked speech while go is used to encode speech, thought, non-lexicalised sounds and gestures by third parties in the historical present. An example of speech encoding through third-person, historical present-marked go vs. first-person, past tense-marked say is given in (30).

(30)
And he goes, ‘Why aren’t you there?’ I said, ‘Because I was up here. I was going to get myself something to drink.’ And he goes, ‘If I see you around here again tonight I’m going to take you in and bust you for loitering.’
(TM-M-1970)

Table 6.6. Contribution of linguistic and social factors to the use of say among speakers born between 1970 and 1980

Input	.60
Total N	287
	FW	%	N
Tense
Past	.74	78.3	23
Present	.59	67.6	74
Historical present	.21	29.3	41
Range	53
Person
First	.77	89.7	29
Third	.40	51.7	89
Range	37
Sex
Male	[.53]	62.4	218
Female	[.39]	52.2	69
Occupation
Professional	[.62]	61.9	21
Non-professional	[.49]	59.8	266

6.3.3 Summary

The longitudinal development of the linguistic and social constraints on say use in AusE is summarised in Table 6.7.

Table 6.7. Longitudinal development of sociolinguistic constraints on say in AusE

Sociolinguistic factor	Trajectory of change
Grammatical person	Overwhelmingly third person; first-person constraint emerges among those born between 1970 and 1980
Tense	Overwhelmingly past tense; use in present simple and historical present emerges among those born between 1970 and 1980
Occupation	Slightly favoured by professionals, though effect loses significance among those born between 1950 and 1960
Sex	Not significant across cohorts; preferred by females except among those born between 1970 and 1980

6.4 Discussion

In line with previous research across varieties of English, the development of the AusE quotative system is such that say as the canonical introducer of direct speech by third parties is slowly but steadily replaced by a system featuring a wealth of quotative frames encompassing zero, think, go and – to a lesser extent – be like. The systems of those born between 1870 and 1980 reflect a trajectory of change which speaks of overarching shifts in how the variable grammar operates. Studying the quotative system as a whole allows for a nuanced understanding not only of why say shifts its sociolinguistic constraints with the passing of time but also of how these shifts ultimately influence the rise and fall of other quotative frames, notably the incomer be like, and their sociolinguistic conditioning. What are some of the forces that triggered these changes?

Close inspection of various developments in the data examined in Section 6.3 suggests a threefold answer. Firstly, the ingress of quotatives other than say is linked to a steady increase in the expression of internal thought. Secondly, not only does storytelling increasingly feature more internal thought expression, the way quotatives are deployed is crucially intertwined with the organisation of quoted content in narrative discourse. Thirdly, changes in the AusE quotative system –in terms of both the number of quotative variants used and the pragmatic contexts of their usage – are socially embedded. Each of these hypotheses is explored in detail here.

Regarding the first hypothesis, the data analysed in this chapter suggest that the decrease of say and gradual incursion of other variants, notably be like, in contemporary AusE (Rodríguez Louro Reference Rodríguez Louro2013) is motivated by the increase in self-revelation in narratives referred to in Section 6.1. Figure 6.2 depicts the relative frequency of quotative frame-introduced direct speech vs. quotative frame-introduced internal thought among Anglo-Australians born between 1870 and 1980. The results reveal that the changes in quotative variant choice and preference were accompanied by changes in the type of content introduced by quotative frames, namely a rise in the encoding of internal thought across time. Once negligible, thought-reporting represents a sizeable portion of direct quotation among those born post-1950 (in line with D’Arcy Reference D’Arcy2012: 367, fn. 18). The finding that internal thought-reporting was already in place when be like entered the AusE system is important given that be like is believed to have entered the English quotative system as an introducer of internal thought (Buchstaller Reference Buchstaller2014; Butters Reference Butters1982; see also D’Arcy Reference D’Arcy2012: 360). As such, the contemporary AusE quotative system is the result of rich historical evolution.

Figure 6.2 Relative frequency of quotative frame-introduced direct speech and internal thought among adult Anglo-Australians born between 1870 and 1980

Regarding the second hypothesis, the construction of narrative discourse is important to the development of the (AusE) quotative system. Two linguistic variables are intrinsically related to the organisation of complicating narrative clauses (in the sense of Labov and Waletzky Reference Labov, Waletzky and Helm1967): grammatical person and tense (Levey Reference Levey2006b). Examination of their effect on say usage in Section 6.3 revealed key differences between the speakers born before and after the mid-twentieth century, especially those born after 1970. Once overwhelmingly associated with third-person grammatical subjects, say is favoured with first-person subjects among those born after 1970; once restricted to the past tense, the grammars of speakers born after 1970 show a more complex tense-marking system, and say occurs in non-past tense contexts. The implications of these findings are best understood by focusing on how information is organised in narrative discourse. Rodríguez Louro and Ritz (Reference Rodríguez Louro and Ritz2014) analysed tense variation in the complicating clauses of personal narratives (the ‘then, what happened?’ portion of narratives, Labov Reference Labov1972a: 359) and established the following pattern: in contemporary AusE narrative discourse, the first person and the past tense introduce information pertaining to the narrator while the third person and the historical present encode situations relating to individuals other than the narrator. This trend is also evident in the longitudinal dataset examined in this study. Examples (31) and (32) contain vastly differently constructed narratives by two Anglo-Australian males: (31) was produced by a speaker born in 1922 where past tense-marked first- and third-person say is used to organise the narrative; (32) was produced by a 1970-born speaker where first person-marked say encodes quoted speech by the narrator, and go in the historical present introduces quoted speech by a third party. In this sense, AusE say does not only decrease in overall usage frequency across time periods (see Section 6.3) but, importantly, it narrows its pragmatic scope to encode first person-marked speech in the narratives of those born post-1970. These changes affect the usage and frequency of say’s competitor variants.

(31)
Well, I used to help a chap named John Edmondson, and he used to go up to the house to get his shovel and axe. He said, ‘I’ll be down shortly,’ and probably about quarter of an hour or half an hour later he’d come down, and I’d be digging away there. But I just didn’t like it, especially when I got down towards the box. I was frightened I’d go through the box or something. So I said to Will Smith, he was the superintendent at the time … He said to me one morning that I’d be going on the digging. I said, ‘Oh no, Will, I don’t like the digging. I don’t want to go on.’ So I went off and he came out and saw me later on and said to me, ‘Why wouldn’t you go on the digging?’ I said, ‘Well, I don’t like it, Will.’ And he said, ‘Oh well, you’re no good to me.’ And I said, ‘Oh well, I don’t care, Will.’ So he went off. He didn’t seem to be too concerned really, and I didn’t have to go digging, except if they were busy I went and helped them to fill in the graves, because everybody, everybody, had to go and fill in the graves.
(ES-M-1925)

(32)
And I went to the boss and I said, ‘I’d like to see the boss, please.’ I said, ‘I just want to see him. Now.’ And she goes, ‘Okay, hang on.’ I’m in there, and he goes, ‘What can I do for you?’ I said, ‘I resign.’ He goes, ‘What?’ And I said, ‘I resign.’ And he goes, ‘Why?’ And I said, ‘Because that bitch down there is giving me the shits, going off my face.’ And he goes, ‘Would you like to reconsider your decision here?’ and I said, ‘No, I wouldn’t.’
(TM-M-1970)

Given these findings, the importance of discourse context in the development of the English quotative system cannot be underestimated. One of the most salient syntactic contrasts attested in narrative discourse includes the use of direct speech. The more dramatic the narrative, the more likely the storyteller is to rely on direct speech reporting where ‘the reporter-speaker plays the role of the reported/original speaker’ (Li Reference Li and Coulmas1986: 38). The results of the present and previous longitudinal studies of English quotation (Buchstaller Reference Buchstaller2011; D’Arcy Reference D’Arcy2012) show important shifts in the type of quoted content introduced by speakers (e.g., speech vs. thought). The findings presented here offer a further contribution by considering discourse organisation more generally, addressing the important question of how narrative structure itself constrains quotative usage (see Rodríguez Louro Reference Rodríguez Louro2015).

The third and final hypothesis explored here concerns the social embedding of linguistic change in the English quotative system. As shown in Table 6.7, professional speakers born between 1870 and 1950 show a slight preference for say over other quotative variants. Women in this study also tend to prefer say over other quotatives, a (non-significant) trend operative until 1970. These results are in line with D’Arcy’s (Reference D’Arcy2012: 354) findings for NZE where women born between 1890 and 1935 significantly favour say over other quotative verbs. They support the view that innovation in the English quotative system was brought about by males, with females ‘catching up’ later on. However, given the non-significant findings across time periods, further research is needed. Sex and occupation interact in important ways. Firstly, although non-significant, Anglo-Australian women born before 1960 prefer say, the unmarked or ‘standard’ quotative variant (see D’Arcy Reference D’Arcy2012: 367, fn. 13) (see Table 6.7). Coincidentally, the occupation constraint whereby professionals (significantly) favour say disappears among those born before 1950. These findings indicate a shift in the social evaluation of quotative variants in early Anglo-AusE: once a prestigious form preferred by professionals and women, in the middle to late twentieth century say is no longer the sole favoured quotative frame in the system. Secondly, as noted in Section 6.3.2, non-professional males born in the early twentieth century contribute to variability in the AusE quotative system through their use of the zero variant. This, too, is partly in line with D’Arcy’s (Reference D’Arcy2012: 354) findings for NZE where men born between 1890 and 1935 (occupation unspecified) are responsible for an increase in the use of the zero variant. These trends suggest that what starts off as a virtually invariant system with say encoding direct speech gradually expands to include further quotative introducers (e.g., zero), and males play a central role in this expansion. The role of the non-professional male in the evolution of AusE should not be underestimated. Kiesling (Reference Kiesling and Hickey2004: 422) notes that the language of the predominantly working-class, male population of early colonial Australia constituted a source of covert prestige, a trend likely to have continued well into the late nineteenth century and, arguably, beyond. These findings are especially noteworthy when compared to Buchstaller’s (Reference Buchstaller2014: 169) claims that middle-class Tyneside English speakers in her 1960s/1970s data were responsible for the expansion of the quotative system, i.e., the use of quotative variants other than say. In this study, occupation proved useful in the identification of change in progress, especially in analysing quotation as a primordial feature of narrative.

6.5 Conclusion

Drawing on a previously unexplored corpus of archival and recently collected naturalistic interaction, this longitudinal study has established that quotation in Anglo-AusE has undergone significant changes in a period of some one hundred years. The results are in line with what has been shown for NZE (D’Arcy Reference D’Arcy2012), offering support for the claim that ‘there is a strong linguistic connection between New Zealand and Australia [and that] varieties with similar structural features may be linked by history even if separated by distance’ (Kiesling Reference Kiesling and Hickey2004: 432). The parallels arising from this research are noteworthy in that they indicate similarities in how discourse-pragmatic change operates across geographically distant speech communities.

The longitudinal nature of the original data investigated here, spanning forty-eight years of data collection and including speakers born between 1870 and 1980, has allowed us to trace the trajectory of change in the Australian quotative system, identifying shifting forms and revealing new insights into the AusE quotative system before the encroachment of be like late in the twentieth century. This longitudinal perspective is important to understanding whether what may be seen as recent innovations (e.g., the upsurge of be like) are rooted in the historical evolution of the system as a whole. Such historical evolution is best illustrated here in the rise of internal thought-encoding across time. The findings reported in this chapter underscore the importance of discourse context in understanding language change.

7 The role of children in the propagation of discourse-pragmatic change: insights from the acquisition of quotative variation

7.1 Introduction

Much of the foundational research on children’s acquisition of sociolinguistic patterns of variation and change has paid specific attention to phonological variables (see, for example, Foulkes et al. Reference Foulkes, Docherty and Watt2005; Roberts Reference Roberts1996, Reference Roberts1997), with an expanding number of studies addressing morpho-syntactic variables (see, for example, Kovac and Adamson Reference Kovac, Adamson, Sankoff and Cedergren1981; Miller and Schmitt Reference Miller and Schmitt2010; Smith et al. Reference Smith, Durham and Fortune2007; Youssef Reference Youssef1999). While cumulative advances in developmental sociolinguistics are refining our understanding of how children acquire variables situated in different grammatical domains (see, for example, Smith et al. Reference Smith, Durham and Richards2013), this line of inquiry remains hindered by the dearth of accountable quantitative studies of the acquisition of discourse-pragmatic variation.

An understanding of how children acquire, replicate and employ community-based patterns of discourse-pragmatic variation poses a number of methodological challenges. A first requirement is the availability of a large compendium of naturally occurring child language as well as a suitable adult control variety with which to compare child and adult usage patterns and to track children’s acquisition of community-based norms. A second requirement calls for access to an analytical approach capable of teasing apart the multiple factors that condition the acquisition process. Among the myriad factors to be considered are the age of the child (Kerswill Reference Kerswill1996); the type of variable being acquired (Smith et al. Reference Smith, Durham and Richards2013); its relative stability or engagement in change (Cameron Reference Cameron2005); the complexity of its sociolinguistic conditioning (Meyerhoff and Schleef Reference Meyerhoff, Schleef and Lawson2013; Roberts Reference Roberts, Chambers and Schilling2013); as well as the possible existence of dialect-specific constraints on variation (Rutter Reference Rutter2013). In view of the multiple factors that constrain variable use, many of which differ from one variable to another, it seems reasonable to hypothesise that the acquisition of discourse-pragmatic variation will not necessarily parallel that of phonological or morpho-syntactic variation. The investigation of this hypothesis constitutes a major impetus for the research described in this chapter. An additional motivation for the work reported here comes from the limited number of studies addressing the acquisition of changes in progress, as opposed to stable variation (Rutter Reference Rutter2013).

Focusing on the quotative system, I examine maturational constraints on children’s acquisition of discourse-pragmatic variation and situate children’s usage patterns with respect to baseline community norms. The quotative system is an ideal site to test the hypothesis described earlier because it typically accommodates multiple variants, and it is widely acknowledged to be the locus of rapid change and reorganisation (see, for example, Buchstaller Reference Buchstaller2004, Reference Buchstaller2011; D’Arcy Reference D’Arcy2012; Ferrara and Bell Reference Ferrara and Bell1995; Rodríguez Louro Chapter 6; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007). In the course of the past three decades, many varieties of English have witnessed the exponential rise of quotative be like, reported to be one of the most aggressive changes in the recent history of the language (Labov, p.c., 2000, cited in Cukor-Avila Reference Cukor-Avila2002: 21–2). A cursory glance at the strategies used by Canadian children (1)–(6) and adults (7)–(12) to report speech and thought confirms that both speaker groups make use of this innovative quotative alongside a number of competing variants, including the null or zero variant represented by Ø in (5):

Children’s quotative usage

(1)
she’s like, ‘Oh my god, ok, I’m coming home, I’m gonna go catch the bus, it comes in two minutes’
(OCLC:009/F/11/19.04)Footnote ¹
(2)
it’s like, ‘Are you talking about soccer?’
(OCLC:015/M/11/05.35)
(3)
if I- say I’m making some noise in my room she’ll be, ‘Jake be quiet I’m trying to do my homework’
(OCLC:015/M/11/01.58)
(4)
he’ll go, ‘Don’t say that’
(OCLC:031/F/11/22.23)
(5)
throughout the four periods, maybe two times Ø ‘I don’t feel well, I don’t feel well’
(OCLC:048/M/8/34.38)
(6)
they told them, ‘Oh you’re in first class too’
(OCLC:051/M/8/16.51)

Adults’ quotative usage

(7)
people keep coming in and being like, ‘Man, like your friend’s like really, really, really pale, like should we call an ambulance?’
(OEC:012/M/Y/41:31)
(8)
I would never lord myself over my staff like, ‘I’m your boss’
(OEC:006/M/Y/23:04)
(9)
I said, ‘Thank you, have a nice day’
(OEC:006/M/Y/22:38)
(10)
they always complain, ‘There’s not enough art, there’s not enough culture’
(OEC:003/F/Y/22:05)
(11)
and I thought, ‘Hmmm, I’m sort of on the spot here’
(OEC:009/F/O/52:01)
(12)
nowadays, they’re talking about, ‘Oh, I’ve got a huge class of twenty-five!’
(OEC:020/F/O/24:41)

The spread of quotative be like throughout mainstream urban varieties of Canadian English is reported to exhibit distinctive age-correlated patterns of use: older adolescents are spearheading this change, with preadolescents using the incoming form less frequently (Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2009: 82–3).Footnote ² Preadolescents’ lower rate of use is generally explained by appealing to the fact that children typically have had less time to accrue changes in progress. As children mature and their vernacular stabilises in late adolescence, they are expected to contribute to the ‘logistic incrementation’ of linguistic change by advancing the use of innovative features (Labov Reference Labov2007). The question of how children acquire and propagate ongoing changes merits further research, not least because this issue remains ‘a problem that lies at the forefront of […] efforts to understand the mechanism of linguistic change’ (Labov Reference Labov2012: 276).

Linguistic change can be tracked developmentally by comparing the stages that children pass through during their acquisition of a particular feature (Romaine Reference Romaine, Breivik and Jahr1989: 61). As I will show, it is particularly instructive to pursue such comparisons using narrowly defined age ranges capable of revealing differences which may be concealed by more broadly defined age cohorts (Llamas Reference Llamas, Llamas, Mullany and Stockwell2007: 75). Combining a developmental perspective with apparent- and real-time components to assess what has changed, I focus on: (i) children’s acquisition of social and linguistic constraints on quotative variation; (ii) the extent to which these constraints parallel those operating on the same variable in the corresponding adult community; and (iii) the role of children in sustaining and propagating change in the quotative system. In order to address these objectives, I make use of a number of complementary data sources. These include synchronic corpora of child and adult speech supplemented by real-time data, affording a diachronic perspective on the rise of be like and its progressive infiltration of the quotative system.

In the following sections, I first review in Section 7.2 the literature addressing children’s acquisition of variation and the emergence in childhood of strategies for reporting speech and thought. In Section 7.3, I supply information about the data on which this study is based before describing the theoretical framework as well as the methodological procedures underpinning the analysis. After presenting the results of the study in Section 7.4, I examine in Section 7.5 their contribution to addressing the research objectives before offering my conclusions in Section 7.6.

7.2 Acquiring sociolinguistic competence

7.2.1 The acquisition of multiple constraints on variation and change

An expanding number of studies indicate that sensitivity to socially motivated patterns of variation and change emerges well before the onset of adolescence, beginning as early as the third year of infancy (see, for example, Roberts Reference Roberts1996, Reference Roberts1997). Given that the input to which young children are exposed is inherently variable and socially situated, it is unsurprising that the acquisition of sociolinguistic variation should be ‘an integral part of the acquisition process itself’ (Roberts Reference Roberts and Ball2005: 153–4). Indeed, Chambers (Reference Chambers2009: 170) goes so far as to claim that there are ‘no studies indicating a time gap between the acquisition of grammatical competence and the development of sociolinguistic competence’.

A central preoccupation in the developmental sociolinguistic literature concerns the sequence in which multiple constraints on variable usage are acquired (Labov Reference Labov1989; Smith et al. Reference Smith, Durham and Fortune2007). Investigation of the acquisition of variable patterns of (t/d)-deletion in word-final consonant clusters revealed that children as young as three had mastered the phonological constraints and had partly acquired the grammatical constraints, but had not internalised the social constraints associated with this variable (Roberts Reference Roberts1996). Conversely, Labov’s (Reference Labov1989) examination of children’s use of (ing) indicated that stylistic constraints on variable use were acquired before grammatical constraints. In yet other studies, social and linguistic constraints are reported to be acquired contemporaneously (see, for example, Foulkes et al. Reference Foulkes, Docherty and Watt2005; Smith et al. Reference Smith, Durham and Fortune2007; Youssef Reference Youssef1991).

Another major research focus concerns the ability of young children to engage in ongoing change. Active participation in change has been documented in the case of very young children. Research undertaken by Roberts and Labov (Reference Roberts and Labov1995) on the tensing and raising of short /a/ in Philadelphia demonstrates that children between the ages of three and four are acquiring community norms associated with this change. In a similar vein, Rutter (Reference Rutter2013) reports that children as young as five in Oklahoma City are acquiring newly emerging variability associated with the palatalisation of /s/ in the onset cluster /str-/.

In brief, a number of studies, based primarily on the investigation of phonological variables, confirm that young children are capable of acquiring both social and linguistic constraints on variation but not necessarily in lockstep. Furthermore, not only do young children have the capacity to engage in ongoing change, they are even able to exceed adult rates of change by extending the environments in which innovative forms are used (Roberts Reference Roberts1997). The present study considers whether similar findings obtain when the investigation is extended to the acquisition of discourse-pragmatic variation.

7.2.2 The acquisition of strategies for reporting speech and thought

Explicitly framed direct speech is reported to emerge initially around twenty-eight to thirty months of age (Nordqvist Reference Nordqvist2001: 61). When young children (three to five years) start to use overt quotative frames productively, they almost categorically employ them to report speech but rarely thought (Nordqvist Reference Nordqvist2001: 263). Between the ages of four and ten, there is evidence of a significant expansion in children’s metalinguistic ability to report their own speech as well as that of others (Ely and McCabe Reference Ely and McCabe1993: 690; Hickmann Reference Hickmann and Lucy1993: 83). With maturation, there is also increasing diversification in the range of markers used to frame quotations (Goodell and Sachs Reference Goodell and Sachs1992: 411), although the extent to which this process follows a developmental sequence warrants further investigation. A preliminary study of quotatives (N = 136) in the spontaneous speech of twenty-two Canadian four- to six-year-olds (mean age = 4;9) revealed that these children preferentially introduce reported speech with say (38%), followed by the zero variant (22%), go/go like (18%) and be like (8%),Footnote ³ confirming that change in the quotative system used by the wider speech community is percolating down the age spectrum (Levey and McIntyre Reference Levey and McIntyreMS). Drawing on natural language data collected in London, Cheshire et al. (Reference Cheshire, Kerswill, Fox and Torgersen2011: 174) likewise note that four- to five-year-olds rely predominantly on say to report speech whereas children aged eight and above use a more extensive repertoire of variants, including go and be like. Increasing use of vernacular variants in preadolescence, concomitant with the ascendancy of peer group influence on linguistic behaviour during this life stage, suggests that expansion in variant usage may be (at least partly) motivated by children’s shifting social relationships and their progressive convergence on peer norms (Kerswill Reference Kerswill1996: 191–6).

Another factor implicated in the acquisition of strategies for reporting speech and thought includes speaker sex. Although sex differentiation has been documented in the speech of young children (see, for example, Romaine Reference Romaine1984a), its effects are claimed to become more salient in later childhood. Robust indications of sex-differentiated variation are believed to surface around the age of ten (Eckert Reference Eckert2000). To date, it is only in the discourse of older children and teenagers that statistically significant sex-based differences in quotative use have been reported. In the speech of ten- to fourteen-year-old Canadians, for example, females were found to favour be like (Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2004: 506) but, crucially, the relative importance of this effect was found to be weaker than those contributed by the linguistic factors associated with grammatical person and content of the quote. Moreover, when the strength of the sex effect in this age cohort was compared with an equivalent effect in the speech of older, seventeen- to nineteen-year-old adolescents, its relative strength was found to increase with age. These findings suggest that internal constraints on be like are internalised before the relevant social constraints, which may not be fully acquired until later adolescence.

Conspicuously absent from earlier studies of quotative variation is detailed quantitative information on speakers younger than ten. Little is known about the development of the sociolinguistic patterns of quotative variation in the years leading up to adolescence. Factoring younger speakers into the investigation is a prerequisite to understanding when children first begin to acquire such patterns, and how these are transformed over time into adult strategies (Eckert Reference Eckert2000: 11). These issues are pursued in the quantitative analyses detailed in the following sections.

7.3 Data and method

7.3.1 Nature of the data

The linguistic material on which this study is based was collected in Ottawa, Ontario. The primary source of material, compiled between 2011 and 2012, comes from the Ottawa Child Language Corpus (OCLC), a corpus of digitised recordings of children aged between eight and twelve. These children are grouped into two age divisions, eight- to nine-year-olds vs. eleven- to twelve-year-olds, with each age division stratified by speaker sex. No other stratifying measure was imposed during corpus construction. A total of forty-six children were recorded using a sociolinguistic interview methodology (Labov Reference Labov, Baugh and Sherzer1984). All the children participating in the study had English as their primary language and had resided in Ottawa since infancy. The distribution of sample members by age and sex is shown in Table 7.1.

Table 7.1. Sample constitution: Ottawa Child Language Corpus (OCLC)

Age range	Male	Female	Total
8–9	8	12	20
11–12	12	14	26
Total	20	26	46

The children are grouped according to significant life stages identified by Labov (Reference Labov2001b: 101): eight to nine representing alignment to the preadolescent peer group and eleven to twelve representing the age at which membership in the preadolescent peer group is typically achieved. The upper and lower boundaries of this age range provide a window on vernacular reorganisation, the process by which children begin to show strong linguistic influence from their peers and the speech of the community (Labov Reference Labov2001b: 416).

In order to situate children’s usage in relation to community-based patterns of variation, the Ottawa English Corpus (OEC) serves as a reference variety with which to assess children’s approximation to adult norms. As shown in Table 7.2, this dataset of recordings from nineteen adults incorporates an apparent-time component: speech from twenty- to thirty-year-olds and speech from over forty-year-olds. These recordings were collected in 2010.

Table 7.2. Sample constitution: Ottawa English Corpus (OEC)

Age range	Male	Female	Total
20–30	7	6	13
40+	3	3	6
Total	10	9	19

Because the apparent-time component incorporated into the synchronic dataset of the OEC cannot uncritically be assumed to reflect diachronic change (Cukor-Avila and Bailey Reference Cukor-Avila, Bailey, Chambers and Schilling2013: 241), there is an additional need for a real-time benchmark to determine what has changed in the quotative system. To satisfy this requirement, the synchronic dataset is compared with the Ottawa-Hull Spoken Language Archives (OHSLA), a diachronic corpus comprising thirteen speakers recorded in Ottawa in 1982. These speakers were born between the end of the nineteenth century and the 1970s. This corpus (see Table 7.3) is based on spontaneous speech obtained from adults aged thirty and above as well as children aged between eight and twelve.

Table 7.3. Sample constitution: Ottawa-Hull Spoken Language Archives 1982 (OHSLA)

Age range	Male	Female	Total
8–12	2	6	8
30+	3	2	5
Total	5	8	13

Potentially detracting from the utility of the OHSLA dataset as a diachronic reference point is its restricted size (circa 100,000 words) and the limited number of speakers drawn from disparate age groups. But for the purposes of this study, its great merit lies in the inclusion of preadolescents matching the age range of the contemporary child dataset. This asset qualifies the 1982 corpus as a viable, albeit limited, diachronic benchmark with which to investigate a precursor stage of the quotative system at a time when the use of be like is widely believed to have been incipient (see, for example, Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009: 292).

7.3.2 Theoretical framework

This study draws on the theoretical framework of comparative variationist sociolinguistics (Labov Reference Labov1972b; Poplack and Tagliamonte Reference Poplack and Tagliamonte2001). A key tenet of this approach is that language is inherently variable, and this variability is neither random nor indiscriminate but characterised by ‘structured heterogeneity’ (Weinreich et al. Reference Weinreich, Labov, Herzog, Lehmann and Malkiel1968: 99–100). Structured heterogeneity is typically subject to multiple social and linguistic constraints that are probabilistically correlated with the distribution of variant choices. The nature of the ‘system’ or ‘grammar’ underlying variable surface manifestations can be discerned by carefully inspecting the distribution and conditioning of competing variants in discourse (Poplack and Tagliamonte Reference Poplack and Tagliamonte2001: 6). Once characterised, the underlying grammar can be systematically compared across different speaker groups (i.e., children and adults in this study) in order to pinpoint any structural similarities or disparities between groups in the detailed sociolinguistic conditioning of variant selection. Where different speaker groups exhibit structural parallels in the conditioning of variant choice, a relationship between their underlying grammars can be inferred. Conversely, where such correspondences are limited, or absent, evidence of an affinity between the underlying grammars of the respective groups is vitiated.

7.3.3 Method

The variable context is defined in broad, functional terms and includes ‘all strategies used to introduce reported speech, sounds […] and thought by self or other’ (Buchstaller Reference Buchstaller2006a: 5; see also Rodríguez Louro Chapter 6).Footnote ⁴ In accordance with the ‘principle of accountability’ (Labov Reference Labov1972b: 72), eligible strategies include not only frames containing an overt quotative marker but also those in which a null or zero variant is used, as in (5).

In line with previous investigations of quotative variation, accurate delimitation of the variable context necessitates a number of exclusions (see, for example, Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2004: 500, 512). These include quotatives introducing indirect speech (13), excerpts of written language (14), as well as incomplete utterances (15).

(13)
Mom says maybe that Dad lied to Rachel
(OCLC:006/M/8/256.31)

(14)
it just said on a note, ‘Gone to get groceries, be back in an hour’
(OCLC:021/M/11/16.08)

(15)
it was all dark and my friends were like, ‘I’m scared to g-’
(OCLC:027/F/11/10.38)

Each token retained for analysis was coded for speaker age and sex as well as a number of linguistic factors hypothesised to affect variant choice. Most prominent among these are grammatical person and content of the quote (Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007: 202). With regard to grammatical person, be like is reported to be favoured with first-person subjects whereas say and go tend to occur with third-person subjects (Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2004, Reference Tagliamonte and D’Arcy2007; Tagliamonte and Hudson Reference Tagliamonte and Hudson1999). Accordingly, first-person contexts (16) were differentiated from third-person ones (17):

(16)
and I’m like, ‘See what he did?’
(OCLC:004/F/8/24.40)

(17)
she goes, ‘I’m your mom, I have the right’
(OCLC:018/F/11/15.59)

In terms of content of the quote, an emblematic feature of be like is its use to represent internal dialogue (Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009). By contrast, the prototypical speech verb, say, is overwhelmingly associated with externally realised speech. To detect any such associations in the current data, each token was distinguished according to whether it introduced reported speech (18) or internal dialogue/inner states (19) (Ferrara and Bell Reference Ferrara and Bell1995: 283):

(18)
I said, ‘The coast is clear’
(OCLC:052/M/8/29.55)

(19)
in my head I’m like, ‘Screw you, I can do what I want, like it’s me playing’
(OCLC:024/M/11/20.03)

Though less extensively researched, mimesis is another factor strongly implicated in variant selection, particularly in the case of quotatives derived from non-speech verb elements such as go and be like (Güldemann Reference Güldemann2008: 373). In this study, mimesis is understood to include sound symbolism and prosodic variation (e.g., loudness, pitch, syllable length) to construct dramatised dialogue. To capture these effects in the data, quotations that have mimetic content (20) were differentiated from those that contain none (21).

(20)
once the teacher leaves everybody’s like, ‘AAHHH no teacher!’
(OCLC:043/F/8/01.33)

(21)
I said, ‘Where is my mother?’
(OCLC:020/M/11/23.06)

The temporal reference of the quotative is cited as one of the most powerful determinants of variant choice although it does not operate uniformly across varieties of English (Buchstaller and D’Arcy Reference Buchstaller and D’Arcy2009: 308; Cukor-Avila Reference Cukor-Avila2012: 627). In Canadian English, a specific correlation between be like and the historical present has been noted for speakers under thirty (Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007: 209). To assess the combined effects of verb morphology and temporal reference on variant selection, each quotative marker containing a tensed verb was distinguished according to whether it was encoded with non-past-tense morphology used with past temporal reference (i.e., the historical present) (22); non-past-tense morphology used with present temporal reference (23); and past-tense morphology used with past temporal reference (24).

(22)
then he showed it to me and I’m like, ‘Kerry you’re disgusting’
(OCLC:015/M/11/26.52)

(23)
sometimes they’re just like, ‘No this is just for our fort’
(OCLC:004/F/8/04.42)

(24)
and then I said, ‘Open your mouth’
(OCLC:042/M/8/36.22)

7.4 Results

7.4.1 Distributional analysis

To determine the extent to which children have acquired adult-like rates of variant choice, it is first necessary to establish what these rates are in the corresponding adult corpus. Table 7.4 shows the distribution of variants by age and sex in the adult corpus (OEC).Footnote ⁵ Combining the data from males and females, a first important finding is that be like is the lead variant for speakers between twenty and thirty years of age, accounting for over half (56%) of their quotatives. The status of be like as an under-forties phenomenon is confirmed by the fact that it is not used once by any of the adults aged above forty, whose quotative system is largely restricted to the canonical variant say. In terms of the trajectory of change, the preponderance of be like in the young adult data offers a snapshot of the ‘upswing of the classic S-curve of linguistic change’ (Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007: 200). Closer inspection of the distribution of be like in the twenty-to-thirty adult cohort reveals a familiar asymmetrical sex effect, with females using this variant almost twice as often as males (see Ferrara and Bell Reference Ferrara and Bell1995; Singler Reference Singler2001; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2004, Reference Tagliamonte and D’Arcy2007). Say maintains a firm foothold in the speech of young males, with all remaining variants occurring at relatively low rates. Young females’ speech exhibits even greater uniformity, as no minority variant exceeds 8% of quotative use.

Table 7.4. Distribution of quotative variants by age and sex in the Ottawa English Corpus (OEC)

	20–30 years				40+ years
	Males		Females		Males		Females
Quotatives	%	N	%	N	%	N	%	N
*Be Like*	35	79	68	267	0	0	0	0
*Say*	28	63	8	33	65	54	54	87
*Zero*	8	17	6	22	6	5	20	32
*It’s like*	5	12	3	13	0	0	0	0
Go	5	11	3	10	7	6	1	1
*Disc. like*	4	9	6	23	1	1	0	0
*Think*	3	6	1	2	11	9	8	12
Be	1	2	2	6	1	1	3	5
Other	11	24	4	17	8	7	15	24
Total		223		393		83		161

Turning to the OCLC child data, Table 7.5 shows the distribution of quotatives stratified by age and speaker sex. In both the eight-to-nine and eleven-to-twelve age groups (amalgamating the data for males and females), be like is the primary exponent of the quotative system.Footnote ⁶ Restricted numbers of males and females in each age division as well as small token counts in the case of minority variants necessitate a cautious interpretation of sex-based differences in variant distribution. Unlike eight- to nine-year-olds and young adults, eleven- to twelve-year-old males and females use be like at comparable rates, and its overall frequency (54%) in this age group matches its corresponding rate of occurrence (56%) in the young adult data. In the eight-to-nine age group, males use be like far less often than females, but their rate of use is almost identical to that of the twenty- to thirty-year-old males in Table 7.4. Thus, not only have the youngest children appropriated quotative be like, the pattern of sex differentiation exhibited by the eight- to nine-year-olds resembles, at least superficially, the split in the corresponding adult community where females are clearly ahead of males in their use of this variant. Sex-based asymmetries are also apparent in the use of other variants. The zero quotative is a case in point. Although an infrequent option, it exhibits a male bias in both age groups. A similar pattern can be observed in males’ use of it’s like. Other minority variants exhibit less consistent patterning: go is used at appreciably the same rate by males and females in the eight- to nine-year-old cohort but, contrary to the male bias reported for North American populations (Singler Reference Singler2001: 272; Tagliamonte and Hudson Reference Tagliamonte and Hudson1999: 160), it is used more by females in the eleven-to-twelve age group.

Table 7.5. Distribution of quotative variants by age and sex in the Ottawa Child Language Corpus (OCLC)

	8–9 years				11–12 years
	Males		Females		Males		Females
Quotatives	%	N	%	N	%	N	%	N
*Be like*	33	45	50	71	52	121	56	183
*Say*	33	45	26	37	7	16	21	69
*Zero*	10	14	6	9	13	30	4	14
*It’s like*	7	9	1	2	13	31	2	6
Go	5	7	4	5	2	4	6	19
*Disc. like*	2	3	4	6	7	17	2	7
*Think*	1	1	0	0	0	0	1	4
Be	0	0	0	0	0.4	1	0.3	1
Other	9	12	8	11	6	13	7	24
Total		136		141		233		327

The children’s inventory of quotative markers and associated rates of occurrence offer compelling indications that they are registering linguistic influence from the wider speech community and are attuned to shifting norms in quotative use. An important question raised by the prevalence of be like in the child corpus concerns the potential source of its transmission to younger speakers. As no data from the children’s primary caregivers were collected, it is not possible to offer any detailed evaluation of the role that they may have played in transmitting be like (and its associated conditioning) to their children. The prevailing consensus in the literature is that older adults, the presumed locus of parental linguistic influence, play no determinative role in transmitting be like to younger speakers (Sankoff and Blondeau Reference Sankoff and Blondeau2007: 583; Tagliamonte and Denis Reference Tagliamonte and Denis2014: 121). Evidence in favour of this view can be found in Table 7.4, where the ‘temporal isogloss’ distinguishing young adult be like users from their older, i.e., above forty, non-be like counterparts vitiates the possibility that children inherit this change from older adults via the conventional route of parent-to-child transmission. This leaves the peer group, teenagers and young adults as potential agents of the transmission of change.Footnote ⁷ The primacy of the peer group in reinforcing vernacular norms is reported to play a key role in the transmission of linguistic features from older to younger children, especially in cases where the features in question are not readily available in caregiver speech (Kerswill Reference Kerswill1996: 190). Increased overall rates of be like use, as emerges from a comparison of the older children with their younger counterparts, suggest that convergence on the focused norms of the peer group helps to diffuse be like to preadolescent speakers.

Comparison with the OHSLA diachronic data, depicted in Table 7.6, foregrounds the spectacular rate of change. Although the data are restricted, an important finding to emerge from the diachronic control is the almost categorical absence of be like. There are only two instances of this variant, both produced by the same eleven-year-old speaker. In contrast with what is found some thirty years later, the quotative system in 1982 is circumscribed to say, go and, to a lesser extent, zero in the child data, and just say and zero in the corresponding adult data. If there is any candidate for change in Table 7.6, then quotative go, rather than be like, would be the frontrunner.

Table 7.6. Distribution of major quotative variants by age in the 1982 Ottawa-Hull Spoken Language Archives (OHSLA)

	8–12 years		30+ years
Quotatives	%	N	%	N
*Say*	42	42	71	115
Go	32	32	1	1
*Zero*	11	11	17	27
*Be like*	2	2	0	0
*Discourse like*	2	2	1	2
Be	2	2	1	1
*Think*	0	0	4	6
Other	9	9	5	9
Total		100		161

To summarise, these results corroborate the rapid rise of be like in a compressed time-frame and confirm that the eight- to twelve-year-old children in the synchronic datasets have clearly acquired this incoming variant, as evidenced by its elevated rate of occurrence in both preadolescent age groups. But consideration of rates alone is inadequate for establishing the extent to which children have acquired the underlying conditioning, or structure, of the variability associated with be like in the adult speech community. In order to address this question, the following section details the statistical procedures that were implemented to compare the underlying structure of the quotative system across speaker cohorts.

7.4.2 Multivariate analysis

To test which of the independent social and linguistic variables introduced in Section 7.3.3 exert a significant effect on variant choice, the stepwise multiple regression procedure in Goldvarb X (Sankoff et al. Reference Sankoff, Tagliamonte and Smith2005) is utilised. This procedure yields three major lines of evidence that are critical to the interpretation of the results (Poplack and Tagliamonte Reference Poplack and Tagliamonte2001: 93–4): (i) the statistical significance of independent variables at the 0.05 level; (ii) the relative strength of significant effects, as indicated by the range value; and (iii) the constraint hierarchy or ordering of probabilities (‘factor weights’) within factor groups. The constraint hierarchy associated with each independent variable represents a portion of the underlying grammar. By holding the same independent variables constant across speaker groups, constraint hierarchies can be systematically compared in order to assess the extent to which different groups share the social and linguistic conditioning of variant choice. This technique enables subtle adjustments in the underlying grammar to be detected and fine-grained differences between comparison groups to be identified. The information that emerges from such a comparison can be used to situate children’s usage patterns with respect to adult baseline community norms.

Another analytical advantage afforded by the constraint hierarchy resides in its capacity to determine the extent to which a form has grammaticalised. Grammaticalisation, the process whereby ‘lexical items and constructions come […] to serve grammatical functions or […] grammatical items develop new grammatical functions’ (Hopper and Traugott Reference Hopper and Traugott2003: 1), typically involves a set of gradient processes that can be tracked quantitatively. One such process involves the extended, or generalised, use of a construction in new contexts (Heine Reference Heine, Joseph and Janda2003: 579). To trace the progress of a grammaticalising form such as be like along the pathway of change, a number of diagnostic measures can be exploited. Key metrics exploited in earlier research and operationalised in this study include the expansion of be like into contexts associated with third-person subjects, reported speech and non-mimetic discourse as well as a stronger correlation with the historical present (Ferrara and Bell Reference Ferrara and Bell1995: 286; Levey et al. Reference Levey, Groulx and Roy2013: 12; Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007: 209). Evidence of these developments in synchronic data can be inferred from permutations in the underlying grammar (e.g., reweighting of constraints or their neutralisation) when constraint hierarchies associated with be like are compared across different speaker groups. The resultant information can then be used to situate these groups with respect to one another in terms of their advancement of change in specific linguistic environments.

Table 7.7 presents the results of nine independent multivariate analyses of the social and linguistic constraints that contribute to the selection of be like, say and zero in the speech of eight- to nine-year-olds, eleven- to twelve-year-olds and twenty- to thirty-year-old adults, respectively.Footnote ⁸ Because Goldvarb only performs binomial multivariate analysis generating a choice of two results (i.e., application or non-application value), each time that an independent analysis is performed, one variant of the dependent variable must be selected as the application value and the remaining variants must be treated as the non-application. Thus, for the three comparison groups, each of be like, say and zero was treated in turn as the application value, and the remaining variants were selected as the non-application, yielding three independent multivariate analyses for each comparison group. In terms of the numerical values associated with each analysis, the probabilities, or factor weights, are to be interpreted as follows: those above 0.50 (shaded in Table 7.7) favour variant selection in the contexts listed on the left-hand side of the table whereas those below 0.50 have a disfavouring effect. Factors selected as non-significant are enclosed in square brackets.

Table 7.7. Nine independent multivariate analyses of the factors contributing to the probability that be like, say and zero will be selected in the speech of eight- to nine-year-olds, eleven- to twelve-year-olds (OCLC) and twenty- to thirty-year-olds (OEC) (N/A = not applicable)

	8- to 9-year-olds			11- to 12-year-olds			20- to 30-year-olds
	be like	say	zero	be like	say	zero	be like	say	zero
Total N	116	82	23	304	85	44	346	96	39
Corrected mean	.42	.29	.08	.54	.14	.07	.57	.14	.06
	FW	FW	FW	FW	FW	FW	FW	FW	FW
Sex
Female	.59	[.46]	[.44]	[.52]	.63	.38	.62	.37	[.47]
Male	.41	[.54]	[.57]	[.48]	.32	.67	.30	.72	[.55]
Range	18				31	29	32	35
Temporal reference
Historical present	.88	.13	N/A	.66	.17	N/A	.70	.20	N/A
Present	.35	.62	N/A	.35	.67	N/A	.34	.59	N/A
Past	.25	.77	N/A	.48	.67	N/A	.39	.75	N/A
Range	63	64		31	50		36	55
Content of the quote
Internal dialogue	.95	.07		.59			[.50]	.12	[.59]
Direct speech	.44	.55	100%	.49	100%	100%	[.50]	.64	[.47]
Range	51	48		10				52
Mimesis
Voicing	.64	.39	[.49]	[.51]	.38	[.54]	.58	.41	[.49]
No-voicing	.32	.65	[.51]	[.49]	.62	[.46]	.44	.57	[.51]
Range	32	26			24		14	16
Grammatical person
First person	.52	[.47]	N/A	.55	.48	N/A	.54	[.47]	N/A
Third person	.49	[.52]	N/A	.47	.51	N/A	.46	[.53]	N/A
Range	3			8	3		8

Looking first at social constraints, speaker sex is selected as significant for eight- to nine-year-olds’ and adults’ use of be like, although the relative strength of this effect (as indicated by the range value) is greater in the adult data where it is the second-ranked constraint as opposed to the fourth-ranked one in the eight-to-nine age group. By contrast, this effect is neutralised for the eleven- to twelve-year-olds. Inspection of the constraint hierarchies for competing variants in the system reveals further social differences. In the case of say, the eight- to nine-year-olds match the direction of the sex effect (but not its statistical significance) found in the corresponding adult data whereas the same effect reverses direction completely in the eleven- to twelve-year-olds. As a marginal variant, zero is not selected as significant for any factor group, with the exception of speaker sex in the eleven-to-twelve age group where it is favoured by males. In summary, inconsistencies in the influence of speaker sex on variant selection, as revealed by comparison across speaker groups of statistical significance, direction and relative strength of effect suggest that the effect of this constraint is labile in the preadolescent groups studied here.

Turning to the effect of linguistic factors, the constraint hierarchies for say are broadly parallel in the child and adult data. As for the infrequent zero marker, the most noticeable difference relates to children’s categorical use of this variant to introduce direct speech, contrasting with its additional (albeit weak) association with internal dialogue in adult speech. As far as be like is concerned, temporal reference is the overriding determinant of its selection in the child and adult data (as assessed by the range values). Moreover, the strong association of be like with the historical present, shared by all speaker groups, indicates that preadolescents and adults alike are participating in a development reported to be characteristic of ongoing grammaticalisation. This development is schematised in the literature in terms of a three-stage model of change: in Stage 1, be like is initially encoded with non-past-tense morphology to express present and past temporal reference; in Stage 2, the specific association of be like with the historical present is favoured while its association with present temporal reference weakens; and in Stage 3, the association of be like with the historical present becomes particularly strong and its use with present temporal reference is disfavoured (Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2007: 208–9). With regard to the comparison groups’ engagement in this trajectory of change, all three can be located at Stage 3, judging from the respective constraint hierarchies for temporal reference. Another key piece of evidence aligning the children with the adults concerns the diminished contribution of grammatical person. Qualifying as the weakest significant constraint on be like in the child and adult data, its modest effect points to the attenuation of the robust association of be like with first-person subjects documented in earlier research targeting the quotative system (Tagliamonte and Hudson Reference Tagliamonte and Hudson1999). The reduced impact of this constraint can be interpreted as yet another indication of advanced grammaticalisation shared by children and adults. In sum, the results for temporal reference and grammatical person converge in demonstrating that all three comparison groups are participating in advancing change in particular linguistic environments.

By contrast, other factor groups reveal major disparities when constraint hierarchies for be like are compared across speaker cohorts. Content of the quote is a case in point. In the eight-to-nine age group, this makes a strong contribution (only moderately smaller than that of temporal reference) to the selection of be like. Conversely, in the eleven-to-twelve age group, its influence, though significant, is reduced, as indicated by the range value (=10) for this factor group which is three times smaller than the corresponding value (=31) for temporal reference. The modest effect of content of the quote in the eleven-to-twelve age group is taken one step further in the adult data, where the distinction between internal dialogue and direct speech is neutralised. Similar discrepancies in the conditioning of be like emerge from a comparison of the constraint hierarchies for mimesis. This factor group makes a moderately strong contribution to the selection of be like in the speech of the eight- to nine-year-olds, but exerts a neutral effect in the case of the eleven- to twelve-year-olds. In the adult data, its effect is significant but relatively minor. Comparison of the constraint hierarchies associated with content of the quote and mimesis suggests that the older children and the adults are more closely aligned with one another in relation to the relative progress of be like along the cline of grammaticalisation. Conversely, the linguistic behaviour of the younger children indicates that they are participating less fully in ongoing linguistic developments as far as these factor groups are concerned. Thus, in spite of compelling parallels between all three speaker groups with respect to temporal reference and grammatical person, nuanced differences between the two preadolescent age groups in relation to the remaining linguistic factor groups suggest that the acquisition of adult community norms is not complete before the age of eleven.

How are these differences to be explained? Inspection of the corrected means, or overall tendency of variant occurrence, sheds some light on this question. Note that the corrected mean for say is far higher for the eight- to nine-year-olds than the equivalent values for the older children or adults (0.29 vs. 0.14 vs. 0.14), indicating that in the youngest age cohort this variant still accounts for a moderate proportion (30%) of the variable context. For the youngest children, say (and, to a much lesser extent, the zero variant) functions as the introducer of externally realised speech whereas be like remains firmly entrenched with internal dialogue. By contrast, for the older children, these contextual effects are adjusted as be like diffuses further into the system and consolidates its status as the default quotative marker. To be sure, say retains its association with direct speech in the eleven-to-twelve age group, but this variant is now effectively consigned to the margins of the system, as confirmed by its substantially lower corrected mean. As say wanes in the system used by the older children, it stands to reason that its territory will be susceptible to appropriation by the incoming variant be like. The effects of older children’s higher rate of be like and the extent of its diffusion throughout their quotative system are visible in the reweighting of the probability values for internal dialogue and direct speech. This reweighting of contextual effects can be interpreted as evidence of specialisation (Hopper and Traugott Reference Hopper and Traugott2003: 116): as be like penetrates further into the quotative system, relegating competing variants to peripheral status, it assumes a more general quotative function, and its emblematic association with internal dialogue weakens. That this development is entirely in line with community-based norms is confirmed by the neutralisation of the distinction between internal dialogue and direct speech in the adult baseline variety.

In summary, adjustments in environmental constraints on be like, as emerges from the comparison of the youngest and oldest children with the adult control variety, provide a graphic illustration of the malleability of children’s grammars in response to ongoing discourse-pragmatic change in the ambient speech community.

7.5 Discussion

Innovative in the research reported here is the application of the variationist framework to tracking discourse-pragmatic change in later childhood, illuminating the transition between different stages of the grammar within a narrowly defined age range. A key component of this investigation involves the judicious use of a number of complementary data sources to track change in real and apparent time, and to situate children’s usage in relation to a commensurate adult control variety. A lynchpin of the methodological framework underpinning this investigation is the importance attached to variable structure, rather than rates of variant occurrence, as the primary heuristic for comparing the underlying grammar across different speaker groups. This approach is additionally informed by the recognition that an adequate understanding of how change is acquired can only be achieved when the incoming variant, be like, is situated in relation to the larger system of which it is part (Poplack and Malvar Reference Poplack and Malvar2007).

A major finding to emerge from the comparative component of this research is that although the primary exponents of the quotative system are essentially the same in the preadolescent and adult data, their rates of occurrence and associated conditioning differ in a number of important respects. An especially intriguing pattern concerns the existence of a sex effect associated with be like in the youngest age group, superficially mirroring a similar (though stronger) effect in the young adult data. At first sight, it is tempting to construe this effect in the eight- to nine-year-olds as an instantiation of the classic pattern of male retreat from female-led changes (Labov Reference Labov2001b), but the neutralisation of the same effect in the eleven-to-twelve age group militates against this interpretation. A more likely explanation is developmental: the eight- to nine-year-old males lag behind the females in acquiring be like but males subsequently increase their use of this variant in later childhood (see Table 7.5). What is striking in these data is the lability of sex effects: it is not just the contrast between the eight- to nine-year-old males and females in their rates of be like that is remarkable but also the rate increase that emerges when the males in the youngest age group are compared with males in the eleven-to-twelve cohort. Furthermore, differences between these two age groups in overall rates of be like are at odds with the claim that ‘the more diffused be like is, the more likely it is to differentiate male and female speech’ (Tagliamonte and Hudson Reference Tagliamonte and Hudson1999: 167). These discrepancies indicate that there is no constant relationship between the frequency of be like and sex-differentiated patterns of variation, at least when child and adult populations are compared, nor is there any simple correlation between the emergence of a sex effect in preadolescence and an equivalent effect in the corresponding adult community.

Among the major assets of the methodological approach adopted in this study are the refined measures it contributes to elucidating the extent to which preadolescents engage in and propagate discourse-pragmatic change. Comparison of constraint hierarchies across the preadolescent age cohorts provides key information in this regard. It is evident that the youngest children examined here have acquired more than just the form be like, as they have also internalised a complex set of linguistic constraints governing its use. Furthermore, not only are the eight- to nine-year-olds contributing to the incrementation of change, they are also extending its scope and specificity (Labov Reference Labov2007: 346). This information can be inferred from measures of grammaticalisation operationalised in this study. These reveal that the youngest children, along with their older peers, are advancing the use of be like in specific environments (i.e., temporal reference and grammatical person) associated with the progression of this variant along the cline of grammaticalisation (Tagliamonte and D’Arcy Reference Tagliamonte and D’Arcy2004, Reference Tagliamonte and D’Arcy2007). Such findings are not altogether unexpected considering that by the age of eight, restructuring of the vernacular in favour of the linguistic norms of the ascendant peer-based social order is liable to be well underway.

However, on other key measures, i.e., content of the quote and mimesis, the linguistic behaviour of the youngest children sets them apart from the older children who are more closely aligned with the adults, as indicated by correspondences in variant rates as well as the linguistic conditioning of variant choice. An important inference to be drawn from differences in quantitative patterning between the preadolescent age groups is that the grammar associated with the quotative system has not stabilised by the age of eight and remains malleable beyond this age, as demonstrated by the results of this investigation. This inference bolsters the general conclusion that the acquisition of adult-like patterns of quotative variation (and change) is a developmentally protracted process. The prolonged nature of this process points to fundamental differences between the acquisition of discourse-pragmatic variation and phonological variation. Whereas children are claimed to acquire the bulk of the phonology of their local variety by the age of six or seven (Kerswill Reference Kerswill1996: 190), the acquisition schedule for discourse-pragmatic variation extends well into later childhood. Among the possible explanations for this difference is the complexity of the syntax-discourse interface, reported to be the locus of learnability issues in language acquisition (Sorace Reference Sorace2004). Because successful use of quotative markers draws on different areas of linguistic competence transecting discourse and syntax (Meyerhoff and Schleef Reference Meyerhoff, Schleef and Lawson2013), integration of this knowledge may not be fully internalised before adolescence. Whatever the precise explanation, the hypothesis enunciated at the outset of this study, namely, that the acquisition of discourse-pragmatic variation will not necessarily resemble that of other variable components of the grammar, is sustained rather than refuted by the results reported here.

7.6 Conclusion

The findings uncovered in this study generate a number of additional questions for future research. An outstanding issue concerns the origins of the complex set of conditions on be like instantiated by the children in the present investigation. If, in the absence of a parental model, be like is acquired from the peer group and older children, when is this process initiated? At what age is be like first acquired and how is it conditioned? Resolution of these issues can only be achieved by extending the focus of the investigation to early childhood in order to probe the developmental roots of quotative variation (Levey and McIntyre Reference Levey and McIntyreMS). Another key, though understudied, question concerns the impact of inter-dialectal differences on the acquisition of change. In other communities where be like has made less dramatic inroads into the quotative system, earlier research found that neither the form nor its associated structure had been acquired by preadolescents to any appreciable extent (Levey Reference Levey2007). This raises the critical question of how quantitatively robust (and socially salient) a change needs to be in the speech community before children can begin to acquire it. It is not entirely clear to what extent constraints on incipient changes, as opposed to those operating on mid-range and advanced ones, are accessible to young children (see Cameron Reference Cameron2005). Engagement with these issues is a prerequisite for gaining new insights into children’s participation in and propagation of community-based change, with a view to acquiring a deeper understanding of how children ‘identify the newer pattern in the community system that they are learning, adopt that pattern and then move further in that direction’ (Labov Reference Labov2012: 276). The empirically accountable framework adopted in this chapter is intended as a contribution to addressing these issues.

Book contents

Part III - Change

Summary

Information