This chapter provides an in-depth analysis of one particular manifestation of macrogrammar, the linearization of forms without hierarchization, which involves expressions such as discourse markers, vocatives, highly conventionalized syntactic fragments, general extenders, or tag questions. As discussed in Chapter 3, the distribution of macrogrammatical expressions is not determined by hierarchical relationships, but by corresponding to particular temporal phases in the real-time production of a unit of talk. This idea will be further elaborated in this chapter, which argues that the occurrence of macrogrammatical expressions at particular moments in the step-by-step configuration of a structural unit is based on the habitualization of the use of linguistic expressions in particular phases in incremental speech production as they serve as local solutions to communicative tasks that are relevant in the respective phase. We can conceive of such temporal phases as fields, that is, communicative spaces in which the production of a particular set of expressions becomes expectable and relevant, but not obligatory.
4.1 Linearization in Macrogrammar: Cognitive Phases and Fields
Fields represent the recurrent temporal sequentiality of particular communicative routine tasks and their means of expression in the linear flow of time, which are identifiable by means of detailed empirical observation. Fields emerge in the step-by-step production of a unit of talk during which different kinds of information need to be aligned consecutively in the progression of time and where some kinds of information need to be conveyed earlier or later than others. Responding to a prior utterance and thus expressing e.g. alignment or disalignment, for instance, is usually relevant at the beginning of a unit of talk (e.g. initial yeah, right), whereas mitigation of illocutionary force tends to occur at later points in time in the construction of a unit of talk, often at its (preliminary) end (e.g. You should call her (.) definitely), and it is the analyst’s task to explain such regularities.
Macrogrammatical fields represent the local re-actualization of a “sedimented” (Hopper Reference Hopper and Tomasello1998, Reference Hopper2004) structure that involves a fixed set of expressions that are part of the speakers’ knowledge store. Following the idea that language is a collection of routines and patterns based on which the co-participants accomplish actions (Hopper Reference Hopper1987, Reference Hopper and Tomasello1998; Linell Reference Linell2009: 186; see also Chapter 2, this volume), sedimentation is the result of the routinization of communicative tasks; the stronger the routines, the more the linguistic devices used to deal with them and the places in which they are produced become regularized or “sedimented.” In the case of fields, sedimentation refers to the availability of conversational spaces at particular points in time in the incremental production of a unit of talk in which speakers deal with particular recurrent communicative tasks in a routinized way. Under this view, fields are no “absolute positions,” but emerge through practice as a by-product of the frequent usage of particular expressions for specific recurrent purposes in particular time slots. The correlation of fields with specific communicative tasks implies that the number of possible expressions available for use in a particular field is not indefinitely large, but finite; that is, fields are associated with a finite set of options. Fields are thus available only for a restricted set of macrogrammatical expressions that serve the tasks that are relevant at the respective moment in time.
Note, again, that speakers and listeners are forced into the temporal emergence of linear structure, being situated at the leading end of an emergent unit of talk that is continually expanded until a potential point of completion has been reached. The emergent character of speech is responsible for an important principle of the construction of units of talk formulated by Schegloff (Reference Schegloff, Ochs, Schegloff and Thompson1996: 82), which is that the starting and end points of such units are not symmetrical and thus not surrounding some kind of grammatical “midpoint.” This is precisely what one might be tempted to infer from fixed-code approaches and the use of terms like “left” and “right,” which suggest the existence of a mirror construction at both ends of a structural unit. Rather, structural units are directional in the sense that they are oriented forward in real time at each moment in ongoing speech. This implies that the tasks that are relevant at the beginning of an emergent unit of talk are not the same as those at their (potential) end since speakers develop a unit of talk over time rather than falling back to the point of departure at their possible end. In utterance production, speakers continuously create new tasks they need to deal with as they progress in time. Moreover, with each communicative act speakers change the conditions under which next utterances and actions are relevant or necessary – both for the current speaker and for next speakers. The macrogrammatical expressions in (23) (highlighted in gray), for instance, document the different tasks speaker B is dealing with predominantly at the beginning and end of each new unit of talk. Speaker B is disabled and in a wheelchair; he had become a member of a dancing group for disabled people some months prior to the conversation and talks about the first performance they were going to have at some time in the future.
(23)
157 B: and then hopefully (.) we will get some more pieces together (.) and uhm then see where we can perform (.) 158 uhm (.) personally I I would like to (.) to do that very much (.) 159 uhm (1.3) I always keep saying I wish it: had start- 160 I wish I’d got involved (.) ten years earlier because you know (.) I’m getting old. 161 <<laugh>> andh uhm (.) you know I mean, (.) 162 there you go. 163 you see I am talking about the stereotype dancer’s (.) life arent’t I? (.) life span o::f of dancing. 164 but:h uhm ↑I don’t know, [ICE-GB S1A-003]
Speaker B first deals with the continuation of expressing his ideas (e.g. and then, and) and speech planning (consider the occurrence of planning markers and pauses), but from line 161 on B appears to deal more with addressee-oriented activities, e.g. indicating shared understanding (you know, you see) and facilitating addressee-response, which are dealt with in particular time slots referred to a fields here.
Fields are one manifestation of the dualistic organization of grammar: speakers do not only deal with the assembling of linguistic forms into hierarchically organized units expressing propositional information (=microgrammar), but also with different tasks relating to speech planning, text organization (e.g. organizing sequences of units of talk into a coherent whole), and the organization of the relationship between the speaker and the addressee, which belong to macrogrammar. Based on these considerations, macrogrammatical fields can be defined as follows:
Macrogrammatical fields
Macrogrammatical fields are optional communicative spaces available at different moments in time in the production of a unit of talk for the production of linguistic expressions that are outside microgrammatical relations. These spaces derive from and are associated with a specific set of communicative tasks for which macrogrammatical expressions serve as solutions.
It has become clear at several places in the present study that the beginnings and ends of units of talk are major points in time at which speakers deal with the organization of a unit of talk and of conversational interaction. From an interactional point of view, both fields form the transition zones between two turns or two units of talk, and are therefore the locus of dense interactional activities related to turn-taking, of “getting in” and “out” of a unit of talk or a turn, and of cognitive activities related to the transition from one unit of talk to another in terms of orderliness, duration, and import. The main focus of this chapter is therefore on the analysis of the initial and the final field, and thus of points in time in the real-time production of a unit of talk at which speakers deal with various organizational tasks related to the initiation of a new unit of talk and its closing. However, I will also discuss the occurrence of fields in the continuing phase and the corresponding communicative tasks.
4.2 Field Models in Linguistic Research
Since fields are outside the relations holding between forms linked by linearization+ hierarchization, the concept should not to be confused with the notion of “field” as used in other, basically sentence-based or product-based approaches to grammar, as, for example, in the well-established topological field model in German syntactic analysis (Höhle Reference Höhle, Weiss, Wiegand and Reis1986; Zifonun, Hoffmann & Strecker Reference Zifonun, Hoffmann and Strecker1997: 1502; Eisenberg Reference Eisenberg2006: 384; Duden-Grammatik Reference Duden-Grammatik2009: 861–887), where the term field is based on the segmentation of sentences into smaller domains, each of which hosts particular kinds of constituent(s).Footnote 1 Auer (Reference Auer1996a) expanded the model by postulating a pre-front field, which hosts elements that are not integrated into the morphosyntactic structure of a sentence and that serve the indication of “metapragmatic” information, including e.g. isolated conditional clauses (wenn ich ehrlich bin – ich weiß es wirklich nicht [“to be honest – I really don’t know”]) or adverbials (natürlich – das verlangt viel Geduld [“of course – that requires a lot of patience”]), which frame the upcoming syntactic unit pragmatically and do not project a particular syntactic pattern. Thus, speakers need not have completed syntactic planning upon the production of a pre-front field expression. However, such expressions may be used to foreshadow a particular kind of conversational action.
Many of the more recent discussions of the functions and uses of “extra-clausal” elements have been concerned with the so-called periphery of a clause. Syntactic field models that combine syntax, information structure, and the use of “extra-clausal” constituents have been proposed for the French sentence by e.g. Danon-Boileau et al. (Reference Danon-Boileau, Meunier, Morel and Tournandre1991) and Morel (Reference Morel2007). In contrast to the model proposed here, they are basically designed for sentence-grammatical analyses, that is, for categorical units in which all constituents are integrated into morphosyntactic dependency relations. The sentence models proposed by Danon-Boileau et al. (Reference Danon-Boileau, Meunier, Morel and Tournandre1991) and Morel (Reference Morel2007) are similar in that they propose a segmentation of the French sentence into a core clause, the rheme, which is an obligatory element and specifies the event structure, and two peripheral fields called preamble and post-script, which surround a core clause and provide optional slots for the indication of different kinds of metatextual or circumstantial information, as shown in (24a–b), (24a) being an example discussed in Detges & Waltereit (Reference Detges, Waltereit, Beeching and Detges2014).
(24)
Preamble Rheme Post-script binder | viewpoint | frame | frame | lexical topic a.
tu vois moi hier en classe y avait un mec qui me fait rire quoi b.
alors toi tes amis ils sont ridicules alors
The field preceding the rheme, the so-called preamble, can be subdivided into smaller optional slots, each of which hosts elements with a particular function, such as “binding” elements with a phatic function (tu vois [“you see”] in [24a]), expressions of “viewpoint” (moi [“I”]), “frame elements” locating the information expressed in the rheme in time and space (e.g. hier [“yesterday”], en classe [“in school”]), and “lexical topic” expressions, i.e. expressions or structural units referring to what the respective rheme will be about. Thus, the preamble hosts various kinds of what is often referred to as “pragmatic markers,” but also lexical items or phrasal units such as adverbials, the latter of which clearly qualify as sentential constituents with a dependency relationship to the verb, all of them serving the function of anchoring the upcoming rheme in discourse. The rheme may be followed by the post-script, which hosts pragmatic markers or adjuncts and which often has a prosodic contour of its own. The model allows for the segmentation of a sentence into a core, the rheme, with a relatively fixed syntactic structure, and two optional fields surrounding it, where sequentiality appears to be determined by information structure or pragmatic principles.
In contrast to such field models, the present model does not posit a syntactic frame for the analysis of clauses or sentences, but is to be understood as way of accounting for temporal slots that become available in the real-time production of units of talk of any syntactic format for the expression of macrogrammatical units. The focus of linguistic description is thus not a sentence where all forms are embedded in morphosyntactic and semantic relationships and that may be “surrounded” by “extra-clausal” units, as it surfaces in the field models sketched above, but an emergent unit of talk whose format is not defined prior to the act of speaking.
4.3 The Initial Field
The initial field is a conversational space that occurs in the initial phase of the production of a unit of talk. Expressions in this time slot are not integrated into hierarchical relationships with other forms. Whether an expression forms the initial field of a unit of talk, or a unit of talk of its own depends on how the speaker continues after its production and can thus be determined only after the speaker has produced more talk. For instance, a response token such as yeah, right, or no may occur as a unit of its own without further talk, or be followed by more talk, and it is only in the latter case that it makes sense to speak of an initial field. Some expressions are usually not able to represent an independent utterance as they open a cognitive projection that requires continuation, e.g. well, now or actually. Such expressions project an upcoming structural unit without, however, requiring the speaker to have a syntactic plan at hand, i.e. they “project syntactically into the following space without defining one particular syntactic gestalt” (Auer Reference Auer1996a: 300) at a time at which the speaker may be still in the planning stage.
Typical expressions in the initial field are discourse markers, interjections, or parentheticals/comment clauses (e.g. I think), which are not part of hierarchical relationships and thus “outside” microgrammar, and many of which are pragmatically incomplete in that they lack an illocutionary force of their own. They show no sign of syntactic embedding, since the units following them have a syntactic projection of their own and since they are not assigned their morphosyntactic or semantic properties by another form. Prosodic integration into the following unit is gradient from full integration into the intonation contour of the upcoming unit to separation from it and production as an intonation unit of its own (for initial discourse markers, for instance, see Auer [Reference Auer1996a: 308] or Aijmer [Reference Aijmer2002: 34]). Prosodic gradience is discussed at some length from a theoretical point of view in Fischer (Reference Fischer2006: 8–12) and tested with prosodic analysis in Dehé (Reference Dehé2009, Reference Dehé2014) and Dehé & Wichmann (Reference Dehé and Wichmann2010a, Reference Dehé and Wichmann2010b).
As with all macrogrammatical expressions, those available for use in the initial field can be conventionalized monomorphemic elements (so, well, actually, anyway), but may also represent reusable fragments or chunks (e.g. you know, the thing is), that is, fixed syntactic configurations with little or no internal compositionality that have been assigned a particular communicative function. Examples for expressions in the initial field are given in (25).
(25)
a. 87 B: so what I mean what is it, 88 what’s it for? [ICE-GB S1A-069] b. 366 F: well actually the thing is it’s not even particularly fa:r. [ICE-GB S1A-019] c. 11 B: I mean I I think (2.5) in a very simp- in a simple way yes (..) but [ICE-GB S1A-004] it’s more than that. d. 138 C: well you know I sent my I sent my son [ICE-GB S1A-012] off to bording school. INITIAL FIELD
A superficial glance at the kinds of expressions occurring in the initial field suffices to see that this field is associated with predominantly projecting functions, serving the task of linking upcoming talk back to prior talk, e.g. by discourse markers like actually, which indicates a shift toward information that is potentially unexpected for the addressee, or so, which indicates a shift toward a concluding part. It is also an important point in time for dealing with turn-taking issues and response activities.
The set of expressions that are mainly used in the initial field in spoken English, based on the inspection of transcripts of spontaneous speech in the ICE-GB, is given in Table 4.1. The expressions are ordered according to their complexity, ranging from single lexical units to units with a more complex internal structure. Note that it is not easy to decide whether a categorization based on formal or on functional aspects is appropriate. The problem of using formal criteria is that almost all of the expressions in the different fields violate many of the defining features that characterize the lexical or syntactic category they seem to correspond to according to their form, i.e. they “look like x,” e.g. like a clause or an adverb, but strictly speaking they are not x. Therefore, classifying units such as those in the initial field (as well as those in other fields) into formal categories deriving from static, product- and clause-based approaches to grammar is often misleading. Units like the thing is, for instance, are formally affiliated with and thus “look like” main clauses as they seem to exhibit a subject–predicate structure and require a complement clause typically introduced by the subordinator that. However, the analysis as a main clause is misleading as such units are not interpretable as genuine clauses followed by a subordinated clause: they do not represent a free combination of individual constituents, but relatively specific, highly conventionalized constructions with their own pragmatic, rhetoric and discursive properties; they are not expressing propositional content; they are neither grammatically nor semantically representing a superordinate unit expressing a core proposition; they are often used without a complementizer (that) and thus part of a merely linear structure, and they may be followed not only by a clausal unit, but by a structural unit of any syntactic format and length, or by various units. As projecting constructions (Günthner Reference Günthner2008, Reference Günthner, Auer and Pfänder2011) they have functional, text-organizing uses, anticipating an upcoming focal message and thus serving the internal organization of a unit of talk.
Table 4.1 Macrogrammatical expressions in the initial field
The problem of applying the classical structuralist categorical and conceptual apparatus (parts of speech, syntactic categories) to macrogrammatical expressions is thus that they often do not fit into the established categories of grammatical analysis, neither formally nor functionally, which explains the continuous renewal of labels given to such expressions. In order to avoid terminological disintegration by postulating entirely new classes of linguistic expressions, the present study is based on a compromise solution that consists in using formal labels, which are important for identifying the different unit types among analysts, in combination with the specifying adjective initial (e.g. initial parenthetical), which is supposed to indicate the time slot in which they are produced.
Since all of the expressions discussed in this section are typically produced at the beginning of a unit of talk, we can assume that they are cognitively activated at a point in time at which the speaker is about to produce the onset of a unit of talk.
Expressions in the initial field represent sedimented patterns of initiating a unit of talk without making a particular syntactic format expectable. As elements of macrogrammar, these expressions (i) are based on macrogrammatical linearization principles, which are based on the temporal consecutiveness of communicative tasks, (ii) form more or less conventionalized expressions or chunks (pieces of syntax forming an integrated form–meaning pair), (iii) are organized paradigmatically, at least to certain degree, and (iv) are integrated into the illocutionary force of the unit they accompany, as will be discussed below. Moreover, when two or more of these units co-occur their order is not random, which suggests that the initial field has a syntax of its own. Note that some of the expressions that can occur in the initial field can also be produced at other points in time in the production of a unit of talk, e.g. in the final field, such as adverb-like units (e.g. really), some discourse markers (e.g. actually, anyway), or address terms/vocatives; other expressions, by contrast, can occur only in the initial field, e.g. discourse markers like well, interjections, or some clausal fragments. Since the tasks to be fulfilled by a speaker in the initial phase of the production of a unit of talk differ from those at its possible end, expressions that are available for use in different fields serve different communicative functions and are thus not functionally equivalent across different uses. I will now briefly discuss the functions of each of the expressions under (i)–(vi) in order to identify the set of communicative tasks that are relevant in the initial phase of the production of a unit of talk.
4.3.1 Interjections
Interjections are conventionalized verbal forms that may constitute an utterance of their own, but may also represent an “entry” into a unit of talk and thus be followed by more talk. They do not enter into formal relationships with other lexemes or syntactic units, and do not tend to have lexical content (Clark & Fox Tree Reference Clark and Fox Tree2002: 76). The production of interjections is motivated by the general communicative context as they represent an instantaneous and often unmonitored reaction to a verbal or nonverbal stimulus from the environment. They are produced in a quasi-automatic way as formally autonomous units.
Grammatically, interjections are at the boundary between lexical and nonlexical, merely phonological expressions of states of minds or response cries in terms of Goffman (Reference Goffman1978). While some authors do not regard interjections as part of the language system, others do attribute lexeme- or word-status to them. Quirk et al. (Reference Quirk, Greenbaum, Leech and Svartvik1985: 853), for instance, describe interjections as “purely emotive words which do not enter into syntactic relations.” Ameka (Reference Ameka1992), who argues that interjections have a semantic structure, distinguishes two classes of interjections: primary interjections, which have a lexical counterpart (e.g. hell, Jesus, shit), and secondary interjections, which have no lexical counterparts and cannot be used in any other sense than as an interjection (e.g. oh, oops, sh, wow). He further distinguishes two classes of interjections: those indicating a cognitive state or a change in knowledge state (e.g. oh, ah), and those expressing emotive stance such as delight, excitement, disgust or feeling of pain (e.g. wow, gosh, ouch). Wierzbicka (Reference Wierzbicka1992) adds a third type, namely interjections expressing desire or volition (e.g. shh, shoo, ahem) serving, for example, as requests for attention. Depending on how interjections are conceptualized, more routine functions can be distinguished, e.g. expressing greetings and farewell (e.g. hello, hi, bye), degrees of accepting a message (weak: hm, uh-uh, stronger: yo, yep), or confirming the validity of an assertion (bingo), and many interjections have parallel uses. While those authors that argue from a conceptualist point of view consider interjections as forms with semantic content (e.g. Ameka Reference Ameka1992; Wierzbicka Reference Wierzbicka1992; Wilkins Reference Wilkins1992: 119), others argue that the meaning of interjections cannot be defined in semantic terms; they can only be defined by the conventional practices they are used for (Clark & Fox Tree Reference Clark and Fox Tree2002: 77).
Some authors include uh and uhm (also er), which for others are merely “filled pauses,” in the category of interjections, e.g. Clark & Fox Tree (Reference Clark and Fox Tree2002). This is certainly plausible given that they are, like other interjections, produced as units of their own and provide insight into the current cognitive state of the speaker, documenting an ongoing mental planning activity. Basically, the function of uh and uhm is to project a delay in ongoing speech and to point forward to upcoming speech. There is some debate on whether or not the two interjections are freely interchangeable: based on evidence from larger corpora of spontaneous speech, Clark & Fox Tree (Reference Clark and Fox Tree2002) propose that uh and uhm signal different kinds of delays: while uh is used “to announce the initiation [ … ] of what is expected to be a minor delay,” uhm is used “to announce the initiation [ … ] of what is expected to be a major delay in speaking” (Clark & Fox Tree Reference Clark and Fox Tree2002: 79). These findings led the authors to the hypothesis that uh and uhm are planned like other words (Clark & Fox Tree Reference Clark and Fox Tree2002: 75). However, in her corpus-based study of hesitation phenomena within and at the boundaries of prefabricated units or “chunks,” Schneider (Reference Schneider2014: 60–62) provides counterevidence against this assumption: in her data, there are no significant differences in the mean length of pauses after uh and uhm, which in both cases are between 0.49 and 0.5 seconds on average. This would mean that their meanings are unlikely to differ.
In any case, both studies make clear that uh and uhm are expressions of speech planning activities, signaling shorter or longer hesitations, which shows that speakers are able to monitor the location of an imminent delay. To simply somewhat, the delays indicated by uh and uhm are usually caused by the need for stalling for time in order to reflect on the continuation of talk. The use is also motivated by marking discourse boundaries and signaling readiness to adopt or keep the conversational floor. Since speech planning can be assumed to be most difficult at the beginning of a unit of talk, where speakers have to plan the content of a message and select a structural pattern from a variety of possible syntactic formats, it would make sense to find a much larger proportion of uh and uhm in the initial field than at other points in time in the production of a unit of talk. This is indeed what Clark & Fox Tree (Reference Clark and Fox Tree2002: 94) observed, based on an empirical corpus-based study on the frequency of use of these interjections in different positions within and at the “boundaries” of an intonation unit (defined as a stretch of speech produced under a single intonation contour): the usage rate was highest at the initial boundary of an intonation unit, where also most of the prosodically prolonged realizations of these interjections were produced (roughly 50 percent of all prolonged uses, the rest occurring after the first word or later within a unit of talk), which indicates that delays tend to be longer at the beginning than at any other point in time in the production of a unit of talk. The use of uh and uhm attests that speakers keep track of how an upcoming unit is delivered (relating to performance, e.g. production with a delay), which is a task that is clearly within the confines of macrogrammar.
Clark & Fox Tree’s (Reference Clark and Fox Tree2002) observations on unit-of-talk-initial uses of uhm are congruent with my own observations on the use of uh and uhm in the ICE-GB: uhm is by far more frequent than uh in the initial field, and uhm it is more frequent in the initial field than in any other time slot in the production of a unit of talk. Based on a study of the occurrence of uh and uhm in the first ten conversations of the ICE-GB (S1A-001–S1A-010), I conducted a small-scale empirical study of the uses of these interjections in four contexts: in the initial field, in the final field, and at two different points in time at which uh/uhm occurred within the continuing phase, one being at a juncture from one syntactic segment to another, the other one at unspecific points within the production of a microgrammatical unit, e.g. after a subject or an auxiliary, in which uh/uhm clearly interrupt a segment-internal hierarchical relation. (26a) illustrates the use of uhm in the initial and final field within the same unit of talk. In (26b), uhm occurs in the transition zone from one (microgrammatical) syntactic segment to another: speaker B first produces a statement before he shifts to a major point deriving from it, which is introduced by so.
a.
129 A: wh- what do you personally get ou’ of (.) ↓this particular dance (.) 130 B: u:hm (3.1) it- (2.6) I find that I: have to (. ) be more honest about the way that I work. (.) uhm (.) 131 [ … ] [ICE-GB S1A-002] b.
136 A: do you want to continue into the performance. 137 B: yeah I will= 138 =I’m graduating in JUNE uhm ( . ) so (1.8 ) it’s ( . ) given me some direction alREAdy. [ICE-GB S1A-002]
Note that in the initial field, uh/uhm may occur together with other macrogrammatical expressions, e.g. so uhm or well uhm. A context in which uhm is produced within an emergent microgrammatical unit is illustrated in (27).
(27)
015 A: NO: because I plan to be out of phonetics as quickly as possible. (.) [°really°. ] 016 B: [yes, ] but it might well help if you’ve uhm actually done some. [ICE-GB S1A-008]
In (27), uhm occurs after the subject–auxiliary sequence you’ve, which makes a lexical verb and further arguments expectable and thus occurs during the production of forms that are involved in hierarchical relations. Such uses are least specifiable in terms of regularity.
The figures in Table 4.2 indicate the occurrence of uh and uhm in different time slots during the production of units of talk. Instances in which uh or uhm occurred two or more times in a row in the same time slot within a single unit of talk (e.g. uhm (.) uhm) were counted as one token. Based on this procedure, the ten conversations (=20,978 words) included into the empirical analysis yielded an overall number of 115 occurrences of uh and 315 occurrences of uhm. In the sample conversations uh was thus by far less frequent than uhm.
Table 4.2 The occurrence of uh and uhm within a unit of talk in the ICE-GB (S1A-001–S1A-010)
| uh (N=115) | uhm (N=315) | ||
|---|---|---|---|
| Initial field | 48 (42%) | 177 (56%) | |
| Continuing phase | a) at a juncture | 15 (13%) | 34 (11%) |
| b) anywhere | 46 (40%) | 63 (20%) | |
| Final field | 6 (5%) | 41 (13%) | |
The data show that uh and uhm occur preferably in the initial field, and thus serve as means to prepare or announce the production of an upcoming unit of talk. The functions range from getting the addressee’s attention, indicating turn uptake and preparing the ground for an upcoming turn, and delaying the production of a message. Delays have different motivations, such as avoiding overlap with a prior or another self-selected speaker or stalling for time in order to carry out conceptual and structural planning of the upcoming unit. The tendency toward uses in the initial field is clearer with uhm than with uh. One explanation for this finding could be that in the initial phase speakers require, or automatically produce, phonologically more salient indicators of turn-uptake or continuation than at other places in an emergent utterance, given that in this phase speakers are subjected to turn-taking mechanisms and thus need to compete for turn-uptake or continuation. Since uhm is phonologically more salient than uh the first serves the task of indicating the speaker’s intention to adopt or keep the role of the speaker better than the second. Another explanation is that uhm allows for a somewhat longer planning time of upcoming talk, given that it provides more phonological material for prosodic lengthening (both the vocalic nucleus and the bilabial nasal lend themselves for lengthening). The beginning of a unit of talk represents a phase of comparatively high cognitive stress since speakers need to assemble linguistic forms into a proposition and choose a particular syntactic format. Thus, it can be expected that it is particularly in the initial phase that speakers, once they have self-selected for the upcoming turn but not yet completed conceptual and syntactic planning, are somewhat more often caught in a situation that requires a more or less extended time span for carrying out planning activities. This assumption would be congruent with Clark & Fox Tree’s (Reference Clark and Fox Tree2002) observation that uhm signals a more extended delay than uh even though, as discussed above, this finding is not undisputed. Note that uh/uhm are also frequent in unspecified time slots (“anywhere”), in which their use can be related to ongoing planning activities in emergent talk caused by, for example, the temporal unavailability of a lexical expression in the speaker’s mind, careful selection of an adequate expression, or a brief suspension of continuation due to an idea or activity that competes with the further development of a current thought or activity. The lowest frequency of occurrence of uh and uhm is in the final field, which is plausible if we assume that at the potential end of a unit of talk speakers need no longer select from a broad range of possible continuations as they have (almost) completed the expression of a focus of consciousness. Monitoring possible continuation would then become relevant only at the beginning of a new unit of talk, and thus in the initial field of a subsequent unit, unless speakers decide to expand an emergent unit of talk beyond a potential point of completion, or to foreshadow continuation, in which case uh/uhm may occur as planning markers in the final field.
To conclude, interjections in the initial field serve different functions. On the cognitive level, they may indicate an ongoing planning activity, as with uh/uhm. On the discourse-structural level, they continue prior talk since they are per se produced as a reaction to some prior stimulus, e.g. surprise (oh) or disgust (ugh). In terms of the organization of speaker–addressee relationship, they may express an emotional (e.g. wow) or knowledge-based reaction (e.g. ah) to what was said before, serve the expression of a directive speech act (sh, shoo), or indicate that the speaker is taking over the conversational floor or continuing a turn. Since the indication of cognitive or emotive stance makes an account expectable, interjections raise expectations on further talk in which the speaker explicates his/her stance. In this sense, interjections serve the organization of conversational interaction in that they coordinate the speaker’s roles between the participants by legitimizing turn uptake.
4.3.2 Initial Adverbs
Some expressions in the initial field “look like” adverbs, from which they typically derive. However, strictly speaking they must be distinguished from “genuine” adverbs, that is, from forms that are integrated into hierarchical relationships with other forms. The group includes lexemes like of course, right, really, or honestly, which are classified in various ways in the literature, e.g. as disjuncts (Quirk et al. Reference Quirk, Greenbaum, Leech and Svartvik1985: 613, 648), stance adverbials (Biber et al. Reference Biber, Johansson, Leech, Conrad and Finegan1999: 133), sentence adverbs (Brinton & Traugott Reference Brinton and Traugott2005: 139), or pragmatic markers (Fraser Reference Fraser1996; Traugott [Reference Traugott, Brems, Ghesquière and Van de Velde2012] for surely and no doubt). While many authors tend to regard these lexemes as adverbs and thus as clausal constituents, in the present framework they are classified as macrogrammatical units for the following reasons. First, they are syntactically unintegrated in the sense that in the initial field they lack integration into a hierarchical relation with another form. They do not restrict or modify the meaning of a form or unit within a microgrammatical unit, e.g. that of the predicate, and they are not assigned their morphosyntactic or semantic properties from any other constituent. Moreover, in the initial field they lack important features that characterize prototypical adverbs, e.g. positional flexibility or possible premodification or intensification by other adverbs. An analysis as clause adverbs is inappropriate as they need not be followed by a clausal unit, but can introduce a unit of any syntactic format and length. Secondly, they differ from adverbs in the sentence-grammatical sense in their semantico–pragmatic scope potential: while adverbs have scope over a constituent of a phrase or a clause, adverb-like expressions in the initial field typically have a wider scope, which goes beyond the meaning of the upcoming unit as it extends to the communicative situation, encompassing, for example, illocutionary force, speaker attitude, or speaker–addressee relation.
Consider, for instance, the problem of so-called illocutionary adverbials such as unit-of-talk-initial (also -final) frankly or seriously, which encode a lexical concept but are, strictly speaking, not part of the proposition expressed in a microgrammatical unit, non-truth conditional, and not involved in a “basic-level explicature” in terms of Sperber & Wilson (Reference Sperber and Wilson1995). Yet, they are associated with a particular conceptual meaning, providing a “higher-level explicature,” that is a conceptual representation in which the proposition is embedded under a propositional attitude and illocutionary force (Sperber & Wilson Reference Sperber and Wilson1995: 182; Carston Reference Carston, Davis and Gillon2004). On a gradient from microgrammar to macrogrammar, they are clearly closer to the macrogrammatical end.
The use of really in the initial field is perhaps the clearest example for a microgrammatical element that has developed parallel uses within the domain of macrogrammar. As a form involved in microgrammatical relationships, really typically intensifies the meaning of an adjective (e.g. really nice) or a verb (e.g. It really brings you closer to the truth), and is thus part of a hierarchical relationship with the form in its scope, which is structurally higher-ranked. As an expression in the initial field, however, it is not included in a binary morphosyntactic and semantic relationship with another form. Consider the use of really in (28), where A and B talk about possible ways to refurnish B’s bedroom. A is advising B on the color of the curtains over B’s bed; later (line 37) B expands the topic to ask for B’s opinion on a wallpaper color she has chosen.
(28)
31 A: no but it’ll be it’ll be if you decide to have blue it’ll be the blue that you see. 32 B: yes. 33 A: see what I mean. 34 u::h let me get you a photograph. (.) 35 it will just be the outside little bit. 36 → really you wouldn’t see much at all. (.) uhm 37 B: so WHAT if I choose ↓this wallpaper, (.) uhm what shall we do with this? (.) [ICE-GB S1A-086]
In the initial field, really forms a unit of its own that is not part of the network of hierarchical relations within the following microgrammatical (clausal) unit you wouldn’t see much at all. It serves as a text-organizing device in the sense that the speaker foreshadows a reinforcement of her main argument, which is that the blue curtains would not let through much light from the outside. The meaning is not anchored in the structure of the following unit, but refers to the situation of discourse: really is used in a sequential slot in which A has already expressed her opinion (see what I mean, line 33), and continues fleshing it out. Really connects two subsequent units of discourse (the units of talk in lines 31–35 and the unit produced in line 36) and thus serves as a means to signal a coherent continuation of talk in that it marks the upcoming unit as arising out of the immediately preceding context and that the preceding talk is the interpretive anchor for its comprehension. Moreover, it underlines the speaker’s strong commitment toward what she is about to say and is thus highly subjectified. The communicative effect is that it focuses the attention of the addressee at the upcoming unit in that it projects an idea that the speaker fully endorses, and thus serves as a persuasive strategy on the interpersonal level.
Adverb-like elements in the initial field express meanings that relate to text organization, speaker–addressee interaction, and the cognitive processing of a message, and thus to meanings outside those expressed in the unit they precede. Some of them express, for instance, subjective meanings like attitude towards the co-participant (e.g. honestly or frankly, which indicate, at least rhetorically, a certain degree of intimacy between the co-participants), projecting the illocutionary act and the kind of action they are about to perform (e.g. asserting something confidential), or expressing stance or degree of commitment towards what was said before, such as full endorsement (e.g. of course). Note that those adverb-like lexemes that may occur in the initial field have parallel uses as constituents within a microgrammatical unit, where they are integrated into a hierarchical relationship with another form.
4.3.3 Discourse Markers
Discourse markers (DMs) are a rather elusive category as there is no generally accepted definition of the elements subsumed under this category and as the set of formal and functional features characterizing them is rather diffuse. Due to the lack of a clearly definable boundary between DMs and expressions without DM status, there is also no generally accepted list of DMs in English. Part of the problem is that the term discourse marker has been conceptualized in different ways. Basically, one can distinguish two different “schools,” one of which represents a more “narrow,” the other one a “broader” view on what qualifies a lexeme to be classified as a DM (see Diewald Reference Diewald, Degand, Cornillie and Pietrandrea2013). Under a more narrow view, discourse markers are elements that are usually (and for some authors even necessarily) syntactically integrated into a sentential matrix and thus syntactically dependent on a host syntagma. Their function is to signal a propositional relationship between two neighboring textual units or “discourse segments” (S), i.e. they relate aspects of the explicit message of a second unit to aspects associated with a message of a first unit under a schema “S1–DM+S2” (Fraser Reference Fraser1999: 938, 950). According to this definition, DMs are e.g. conjunctions (e.g. but, because, or, so) (Cuenca Reference Cuenca, Degand, Pietrandrea and Cornillie2013), adverbials (e.g. of course) or prepositional phrases (e.g. as a result of), provided that they serve the introduction of a separate message with a propositional content of its own, i.e. one that is not embedded in the message of the prior unit. Two examples (but and so) are provided in (29).
a. A: She didn’t even invite him. But that’s another story.
b. C: I’m not in a hurry so how about having a talk over coffee?
The defining function of such elements is discourse connectivity: DMs have a text-connecting function, linking two adjacent units that range in size from clauses to larger textual units and indicating the relationship between them. In terms of Schiffrin (Reference Schiffrin1987: 31), they “bracket units of talk,” marking a boundary between a prior and an upcoming discourse unit. The scope of a DM is over two adjacent textual units, each expressing a separate message.
Under a broader conceptualization, DMs are elements that are syntactically non-integrated lexemes with no fixed position and without constituent status; they are syntactically as well as semantically and often prosodically independent. Functionally, they do not only link two textual units, but refer to dialogic interaction in general, encompassing nonpropositional, metadiscursive aspects, such as relationships between different conversational actions or between the speaker’s and the addressee’s views. In this sense, the basic functional distinction between DMs in the two theoretical views is that in contrast to the first view, in the second, broader view the type of connectivity they establish is not restricted to propositional relations between adjacent utterances or discourse units, but also pertaining to relations between nonlinguistic, nonpropositional aspects of discourse (Diewald Reference Diewald and Fischer2006: 408). DMs serve the management of conversation in a broader sense and have scope over utterances, illocutionary types, and actions accompanying an interaction in general. Text connectivity is thus only one among several aspects of their functional spectrum: they refer to general aspects of dialogic interaction, such as thematic structure, the turn-taking system, strategies of argumentation, and cognitive aspects. The DM actually, for instance, does not only link two units of discourse, but refers to the expectations and thus the to cognitive state of the addressee, for whom the upcoming information will be or might be unexpected (Taglicht Reference Taglicht2001), as in (30); it may also signal topic change and thus organize discourse into a coherent whole despite potential disruption caused by a change of the topical direction.
(30)
133 B: uhm sh- she’s got quite a small house a small garden a small 134 A: uhm whereabouts 135 whereabouts 136 sh- she’s got a small GARden <unclear word> °the fire the bonfire and, 137 → B: actually it’s not a small garden. 138 no it’s not small it’s quite big. [ICE-GB S1A-025]
In (30), B marks her upcoming utterance as expressing a divergence from A’s belief or knowledge state by means of actually: while A assumes that she’s got a small garden, B seems to be certain that it’s quite big. As a DM, actually introduces a new unit of talk whose propositional content is seen as unexpected for the addressee by the speaker and dissonant to the addressee’s views or assumptions.
Considering examples such as (30), it appears more appropriate to adopt a broader view on DMs: DMs are syntactically unintegrated expressions that do not only establish links between neighboring discourse units on the propositional level, but serve a broader set of indexical functions on the interactional level, relating also to communicative, nonpropositional aspects of talk. In spite of these different conceptualizations of DMs, there seems to be agreement on some basic features that characterize them: DMs do not affect the truth conditions of the unit they accompany, they do not add or change propositional content, they tend to occur initially, though some DMs are positionally flexible, and they indicate how the following unit is to be understood or interpreted within a given communicative context, thus having procedural rather than referential, lexical meanings (Blakemore Reference Blakemore1987, Reference Blakemore2002).
Individual DMs typically exhibit a broad set of functions (see Fischer Reference Fischer2000). Jucker (Reference Jucker1993), for instance, identifies four different functions for the DM well: it serves as (i) a marker of insufficiency, indicating some problem on the content level relative to the preceding utterance, e.g. when speakers cannot give the information which the questioner has requested; (ii) a face-threat mitigator, mitigating some sort of confrontation, e.g. when an assessment is followed by disagreement or when an invitation is declined; (iii) a device used for indicating a topic change or introducing direct reported speech; or (iv) a delaying device. A similar diversity of functions has been observed for actually, which expresses functions like mild contradiction, surprise, change in discourse topic, or contrast to what was said before (Lenk Reference Lenk1998; Taglicht Reference Taglicht2001; Aijmer Reference Aijmer2002). As these examples show, DMs are multifunctional and adaptable to the sequential environment in which they are used. Descriptions of the multifunctionality of DMs differ as to whether the respective authors adopt a meaning-minimalist stance, according to which the diversity of functions can be reduced to one functional core, or a meaning-maximalist one, under which it is assumed that the different functions that can be identified for single DMs are unrelated and not plausibly derivable from a core meaning. I cannot do justice to the vast amount of studies dealing with DMs here; the reader is referred to studies that figure most prominently in this field, such as Schourup (Reference Schourup1985), Schiffrin (Reference Schiffrin1987), Jucker & Ziv (Reference Jucker and Ziv1998), Lenk (Reference Lenk1998), Fraser (Reference Fraser1999), Aijmer (Reference Aijmer2002), Fischer (Reference Fischer2006), and most recently Heine (Reference Heine2012) or Degand, Cornillie & Pietrandrea (Reference Degand, Cornillie and Pietrandrea2013).
From a macrogrammatical perspective, DMs are not syntactically integrated into the structural unit they project: they are not embedded in a binary hierarchical relationship with another form (and therefore do not classify as adverbs), but represent formally autonomous units in the initial field of a unit of talk. They establish structural relations outside single units of talk, thus serving conversation-organizing functions, and have an important cognitive function in that they project a particular kind of upcoming action, allowing the listener to process a message not as merely contributing propositional content, but as a message of a particular kind, e.g. one expressing mild contradiction (well) or potential violation of the addressee’s expectations (actually). They also serve a variety of functions on the interpersonal level, such as mitigating face-threats inherent in disagreements or topic shifts. As cues for discourse processing, they contribute to the development of a coherent mental model of ongoing discourse by integrating single units of discourse into a coherent whole.
4.3.4 Address Terms/Vocatives
Address terms in the initial field are primarily used to summon the attention of the addressee, especially in a communicative situation in which the co-participants need to achieve joint orientation to an upcoming message. This involves activities such as attracting and focusing the addressee’s attention and the arrangement of a joint interactional space including bodily co-orientation of the participants, e.g. in situations where mutual visibility and fixing gaze need to be arranged to provide the necessary ground for the incipient unit of talk (Mondada Reference Mondada2009; Deppermann Reference Deppermann2013). This is the case when speakers move in space and/or deal with different activities, or when they are located in different rooms. In such contexts, speakers need to secure the availability of the addressee for the message to be produced, dealing with mutual availability, attention, readiness for interaction, recognition of speaker roles, and coordination of gaze. These are the prerequisites that need to be achieved in order to establish joint orientation to the upcoming message and for which vocatives or address terms play a central role. This is illustrated in (31), where the co-participants are moving to a table for coffee. They seem to be situated in the visual field of speaker D, given that all co-participants are in the same room, but physically not co-oriented in the way necessary to recognize who D is addressing with his utterance, at least not right from the beginning, as they are busied with choosing a seat. D addresses his guests with their first name.
(31)
52 D: anybody else like a piece of anything that they can see on here that I haven’t given them? 53 A: <<unclear words>> 54 B: <<unclear words>> 55 → D: Peter where are you sitting? 56 you’re there. (.) 57 → TIM? (..) are you going were you going to sit here. [(.) ] 59 C: [here.] 58 D: would you like a piece of that? 60 C: yes please. [ICE-GB S1A-012]
From the design of D’s turn we may conclude that the addressees shift bodily positions in succession to select a seat at the table and have multiple, changing visual foci. Thus, D needs to coordinate the attention of the different potential addressees and to update the referential addressee as he changes interactional partners. By means of using the first name of the addressees as a beginning of the units of talk in lines 55 and 57, D establishes the prerequisites for the reception of his requests as he is not able to build on already established arrangements of the participants’ coordination, such as when conversing while sitting at a table, or in dyadic interactions. These prerequisites include securing the availability of the addressees for the request, signaling that one is willing to take the speaker role, picking out the intended addressees distinctively (in multiparty situations), and attracting the addressee’s attention for the upcoming request (which is the prerequisite for a response). In short, address terms serve as warrants for the addressee’s attention to the upcoming message.
Address terms are not part of microgrammatical relations as defined in Chapter 3, and not part of the propositional content expressed in a unit of talk as their omission would not alter the propositional content or truth-value of the unit they precede. However, they are not outside a unit of talk as they make some kind of continuation relevant: they mark the onset of a new unit of talk and are used as a device to achieve the necessary conditions for a successful uptake of what follows. Address terms have macrogrammatical functions in that they organize joint cognitive orientation to an upcoming unit of talk and situate this unit within the social coordinates. As explicit means to pick out the addressee distinctively, they not only serve an attention-getting function, but also convey subjective meanings such as positive affection since address terms make the interactional relation to the addressee and thus the speaker’s orientation toward him/her as a distinctive individual explicit. Address terms are also used to mark a unit of talk and the ensuing action as set off from prior talk, and thus have a discourse-organizing function. Clayman (Reference Clayman2012), for instance, notes that address terms used in responsive actions serve to mark the response as independent of the initiating action, e.g. a question, detaching it from its presuppositions and separating the upcoming utterance from the motivational entanglement with the preceding question. In this sense, they serve as cues on how an upcoming unit is to be understood in a given discourse context.
4.3.5 Parentheticals/Comment Clauses
The initial field may host a fixed set of configurations that, from a sentence-based grammatical perspective, look like main or matrix clauses consisting of a subject (usually a first person singular pronoun) and a mental verb that requires a complement clause as an object, such as I think, I believe, I guess, I mean, also you know or it seems. Such units, as well as forms like to-infinitive clauses, such as to be honest, or finite clauses introduced by as (e.g. as I see it), which usually function as adverbial clauses, are analyzed under different labels, such as parenthetical disjuncts by Quirk et al. (Reference Quirk, Greenbaum, Leech and Svartvik1985: 1112), comment clauses (Brinton Reference Brinton2008; Kaltenböck Reference Kaltenböck2008, Reference Kaltenböck, Aarts, Close, Leech and Wallis2013), or parentheticals (Dehé & Kavalova Reference Dehé and Kavalova2006; Dehé Reference Dehé2009, Reference Dehé2014). The term parenthetical suggests nonclausal and morphosyntactically “unintegrated” status of the expressions to which it refers. For practical reasons, I will restrict the category of parenthetical clauses to subject-cum-mental verb combinations.
The syntactic status of parentheticals allows for two alternative interpretations, at least in English: utterance-initial I think or I believe lacking a complementizer (that) are ambiguous between a main clause taking the rest of the sentence as object, and a parenthetical or discourse marker, which is morphosyntactically and semantically separated from the structural unit following them. Note that in medial and final positions an interpretation as “main clause+complement clause without complementizer” cannot be obtained since they do not introduce a complement clause, but even this is not undisputed since it may be argued that such an analysis is possible under the assumption that such clauses are the result of a syntactic movement operation (most recently Newmeyer Reference Newmeyer, Schneider, Glikman and Mathieu Avanzi2015).Footnote 2 In initial position, however, a main clause interpretation is, for many authors, not excluded. Dehé & Wichmann (Reference Dehé and Wichmann2010a, Reference Dehé and Wichmann2010b), for instance, have shown that the meaning of I think varies along different prosodic realizations, depending on whether the main stress falls on the subject, the verb, or whether I think is prosodically unmarked. In the last case, a discourse marker reading without propositional meanings is most likely, given that prosodic reduction correlates with reduced semantic content (see Pierrehumbert & Hirschberg Reference Pierrehumbert, Hirschberg, Cohen, Morgan and Pollack1990). Moreover, research by Dehé (Reference Dehé2009, Reference Dehé2014) has shown that a variety of intonational phrasing patterns is possible with parenthetical clauses such as I think, allowing both for prosodic separation such that the parenthetical clause is phrased in its own intonational phrase (IP), and prosodic integration, such that the clause is phrased in one IP with surrounding material. Parenthetical clauses whose primary function is one of mitigation and which have only very reduced semantic content are unstressed and integrated into the contour of the unit that follows. Additionally, they are prosodically characterized by increased speech and lowered volume. Prosody may thus disambiguate between various possible functions of parentheticals.
Since the present approach is based on the linear, emergent nature of speech, it will be argued – based on the findings referred to above – that initial parentheticals that lack prosodic prominence no longer function as matrix clauses with an embedded subsequent subordinate clause. Rather, they form the initial field of an upcoming unit and thus precede a structural unit without having a microgrammatical relation with it; that is, their use is not related to hierarchization and embedding. An important indicator for the lack of embeddedness is, next to the absence of a formal marker of a binary, hierarchically organized relationship, that parentheticals may be followed by units of any structural type, ranging from a single lexeme to a longer textual segment. Thus, from a macrogrammatical perspective the syntactic structure involving a parenthetical expression can be analyzed as consisting of an initial parenthetical followed by an independent (non-subordinate) assertion in any syntactic format. The parenthetical itself represents a chunk (pragmatic tag [Weinert Reference Weinert2012: 237] or epistemic parenthetical [Thompson & Mulac Reference Thompson and Mulac1991]) that is integrated into the illocutionary force of the unit it accompanies, but that does not establish or take part in an embedded structure in sentence-grammatical terms. The informational import does not lie on the parenthetical, but on the assertion, which also typically contains the topical subject, which is why “conversationalists are normally oriented to the material in the complements and independent clauses and not to the CTP-phrases” (CTP=complement-taking predicate; Thompson Reference Thompson2002: 154).
The case of parenthetical expressions is congruent with the observation that speakers favor the aggregation of segments that are not complements syntactically, but combinations of evidential or epistemic “phrases” and an assertion linked by loose discourse-pragmatic relations (Thompson Reference Thompson2002). In the initial field, the mental verbs involved in parenthetical expressions project an upcoming message whose syntactic format is not specifiable on a priori grounds (it may range from a single lexeme to a larger unit of discourse), and express an epistemic–evidential evaluation relating to the upcoming message. A semantic reading of these verbs cannot be obtained. Initial I think, for instance, does not indicate the speaker’s mental processing activity, but marks an upcoming message as an opinion or an assumption to which the speaker is principally committed, but whose commitment is mitigated for some reason, e.g. due to a lack of epistemic certainty, as illustrated in (32).
(32)
45 A: d’you d’you have d’you have that book °on him°. 46 do- 47 B: I couldn’t find it. 48 I don’t know what. 49 I think I must have lent it to somebody. [ICE-GB S1A-045]
In (32), prosodically unmarked I think marks the assertion following it as a vague idea that seems most plausible to the speaker, but whose validity cannot be scrutinized at the moment of speech. This is also reflected in the choice of modality (must have lent), where must has no deontic, but epistemic meaning indicating that the assertion is not a fact, but an inference based on some piece of evidence that is, however, not specified (see Traugott & Dasher Reference Traugott and Dasher2002: 113). I think is thus an interpretive cue for the addressee to take the validity of upcoming utterance as preliminary. I think also functions as a downtoner of illocutionary force with opinions, e.g. in contexts in which a speaker expresses a suggestion or an assessment (e.g. I think he’s very good actually, S1A-049). As a mitigator of illocutionary force, I think frequently co-occurs with other mitigators, e.g. probably (e.g. I think that’s probably what I’ll do, S1A-053).
A text-structuring – rather than modifying – function can be observed for I mean, which is used to signal that the upcoming utterance does not express new information, but a reformulation or specification of some prior piece of talk that is supposed to represent a clearer or more condensed way of expressing e.g. an idea or, as in (33), the main point of an information-seeking request.
(33)
95 B: it’s very good work but I think (.) that art and dance actually include (.) the idea of therapy within them. (.) 96 A: when you say recovering the whole person it suggests that there is something lost. 97 → I mean you know is there something incomplete? 98 → I mean [(.) ] uh what’s w- what’s what’s incomplete? (.) 99 B: [yeah. ] [ICE-GB S1A-004]
The parenthetical I mean is, in both cases, not analyzable as being embedded in the following unit: it could not be followed by a complementizer (that), and as a main clause it would be incompatible with the clause types of the following units (interrogative clauses). Rather, A marks the upcoming requests for information as being inferentially related to her initial assertion (line 96), which expresses an inference that she has drawn from B’s prior talk. I mean thus indicates that the requests following it will make an aspect of a prior utterance (in this case the question deriving from the inference that A has drawn) more explicit and thus more clearly identifiable for the addressee. A literal reading of I mean as referring to the expression of a personal opinion or meaning cannot be obtained; that is, I mean is not propositional, but rather providing a processing cue on how an upcoming piece of talk is related to prior talk.
4.3.6 Clause-Like Fragments
Clause-like fragments are chunks that derive from microgrammatical units that underwent a progressive loss of internal syntax, thus acquiring a more formulaic character by the loss of originally compositional semantics and of the ability to govern other structural elements or units, and the adoption of functional values that are not derivable from the component parts. A frequently occurring unit in the initial field is the “the+N+is” chunk, e.g. the thing/point/problem/trouble … is, which was originally composed of a definite noun phrase headed by an abstract noun and the third person singular form of the copula verb to be, which syntactically requires a complement. The format occurs also in other languages, e.g. in German (das Ding/die Sache ist (dass), see Günthner [Reference Günthner2008, Reference Günthner, Auer and Pfänder2011]) or Spanish (la cosa es (que)). Based on how such fragments are used in spontaneous speech, an analysis in terms of microgrammatical units in the format of a main clause followed by a complement clause can, however, not be supported as they often occur without a complementizer, are usually not followed by a clause, but by a larger discourse segment, and they do not express propositional content to which the content of the subsequent unit is subordinated. On the contrary, it is the information following the fragment that expresses the main idea. Moreover, the projected unit could occur without the initial fragment that frames it without being “ungrammatical,” i.e. the unit introduced by the “the+N+is” chunk is structurally not subordinated, but autonomous. In this sense, the fragment has a formal and functional affinity to other fragmented or formulaic expressions that project an upcoming message, such as pseudoclefts with lexical specification (e.g. what I mean (is)), or initial complement-taking predicates/parentheticals (e.g. I mean/think/believe), which exhibit the same features. Due to their function of projecting a piece of focal information (Günthner Reference Günthner2008, Reference Günthner, Auer and Pfänder2011; Miller & Weinert Reference Miller and Weinert2009 [1998]: 243), clause-like fragments cannot represent a unit of talk of their own but require an upcoming structural unit that expresses the main information. The main task they serve is that of focusing the addressee’s attention to what the speaker considers an important piece of information; the second part or segment is thus framed as focal information or a core argument or idea that remains relevant for ensuing talk. The second part can take various forms ranging from a clausal structure to a longer stretch of discourse that expands over several units of talk, and it may express a single proposition or, as in (34), a series of propositions. In (34), B explains how he (a person called Uts) has long been playing with a girl by never making explicit whether or not he had any genuine interest in her.
(34)
80 A: and she is she s- she spent the entire evening convincing her that Us UTS was desparately passionately in love with her (.) and it was: uhm, 81 B: fatal <<laugh>> 82 A: ↑well I mean it’s= 83 → =the thing is he was supposed to be coming over for her birthday °coming back for her birthday (.) uh because he’s over there now°. 84 a:nd uh she hadn’t seen him for nine months or something. 85 her birthday’s in May. (.) 86 and he phoned and yes he was going to get a f- a a standby flight (.) and he never turned ↑up. [ICE-GB S1A-093]
The chunk the thing is serves two functions: the speaker projects an upcoming piece of information that may have any syntactic format and length and that includes information that is focal, expressing a main argument or crucial information that is required to understand a particular state of affairs. Additionally, this construction allows speakers to justify the adoption of the turn or, as in (34), to keep it over a longer stretch of time. The speaker thus segments a unit of talk into two parts, an initial field and a core message. The expression in the initial field allows speakers to deal with (i) projections by opening up a conversational space that makes further talk expectable and necessary, and framing upcoming talk as expressing focal information, (ii) the organization of interaction (acquiring the right to adopt/continue a turn, focusing the addressee’s attention), and (iii) the cognitive planning of upcoming talk, which need not have been completed upon the production of the fragment. (ii) and (iii) are particularly important when the speaker is about to produce a longer discourse unit, as in (34), for which s/he requires the temporary suspension of transition-relevant places in order to complete the message. It is therefore not surprising that the fragment is typically used to introduce a more complex argument or description that spans over several syntactic segments or units of talk. Prosodically, clause-like fragments in the initial field and the following stretch of talk are usually not separated by a pause (though this is not excluded); the ensuing talk is typically “latched” to the initial fragment.
Clause-like fragments are characterized by minimal semantic content and belong to the domain of macrogrammar: the fragments are not part of a morphosyntactic relationship to another unit and thus not embedded, but serve the cognitive organization of a unit of talk in that they grant the speaker some planning time, focusing the listener’s attention and creating expectations in the listener on upcoming talk by delaying the production of the main information.
4.3.7 Expressions in the Initial Field as Macrogrammatical Elements
The discussion of the different kinds of expressions available for use in the initial field of a unit of talk has shown that all of them evade categorization in terms of established “parts of speech” or “word classes” and do not qualify as “constituents” in the traditional sense of sentence grammar: they are not morphosyntactically integrated into binary relations with other lexical or phrasal constituents and do not establish such relations. The expressions include lexical units and formulaic fragments or chunks and are thus conventionalized units that tend to form a relatively closed class. Expressions in the initial field exhibit the following features, which characterize them as macrogrammatical:
(i) they are used to construct and organize spoken text rather than to create microgrammatical relationships,
(ii) they tend to be morphosyntactically autonomous,
(iii) their internal structure is not or only weakly compositional,
(iv) they do not contribute to, change or restrict the propositional content of the unit they project; their semantic content is minimal,
(v) they serve as addressee-oriented processing or attentional cues, framing an upcoming message and indicating how it is related to prior talk and/or how it is supposed to be interpreted.
4.3.8 Functions of the Initial Field
Expressions in the initial field allow speakers to deal with a set of recurrent communicative tasks that are relevant in the initial phase of the production of a unit of talk. First, speakers need to deal with turn-taking issues, which include activities such as foreshadowing a longer contribution and legitimizing the adoption or continuation of the speaker role. The initial field is thus an important locus for the organization of conversational interaction, where speaker rights and the amount of conversational space that a next speaker is granted need to be negotiated. Second, spontaneous speech requires speakers to bridge the time gap between turn adoption and syntactic planning, the latter of which might not have been progressed when the speaker assumes the right to speak. The initial field thus correlates with intense speech planning activities.
Elements in the initial field are also understood by the co-participants as representing a particular kind of reaction to just-prior talk. This default assumption derives from the relationship of adjacency inherent in talk. As Schegloff (Reference Schegloff2007: 14–15) states, “[n]ext turns are understood by co-participants to display their speaker’s understanding of the just-prior turn and to embody an action responsive to the just-prior turn,” unless the speaker marks the turn as not addressing some aspect of the just-prior turn, e.g. through displacement markers such as by the way. Thus, listeners understand each new turn as responsive to the preceding one since prior turns typically form the starting point of subsequent turns. Speakers are naturally oriented to prior talk while designing a next contribution in order to provide a coherent (and intelligible) continuation of talk. The topic and the action established in a prior turn constrain listeners within “presuppositional matrices” (Heritage Reference Heritage2013: 333) in that next utterances need to be type-conform to immediately preceding ones. Typical reactions to prior turns encompass the expression of stance or the divergence from the normative constraints that a prior turn bears on the next turn. The principle of continuity also holds for multi-unit turns: next units are interpreted against prior units given that, in the default case, speakers continue their own talk in a coherent way.
Next to this backwards-oriented, responsive function, speakers use the initial field to provide an early interpretive cue as to the type of action that they are about to perform, and thus to project into the discursive future. Early projection facilitates cognitive processing in that addressees can start the interpretation of upcoming talk right from the point at which a unit of talk is initiated, which reduces the time gap between production and processing and allows for maximal synchronization between what the speaker is doing and the listener’s understanding of it (Auer Reference Auer2009: 3).
All of these tasks, which involve macrogrammatical expressions, have been predicted by the temporal logics of communicative tasks that speakers need to deal with in the initial phase of unit of talk production discussed in Chapter 3. The tasks are summarized in Table 4.3.
Table 4.3 Communicative tasks relevant in the initial phase of the production of a unit of talk
| Initial phase | |
|---|---|
| Cognition |
|
| Text organization |
|
| Speaker–addressee relationship |
To conclude, the initial field is a point in time at which speakers instruct their addressees to recognize the connection between a prior and an upcoming unit of talk in terms of continuity of topic and action. It is an important resource for setting up expectations in the listener on what there is to follow, focusing and guiding the listener’s interpretation from the earliest point of utterance interpretation possible. It is also used to reset expectations that derive from prior talk and to deal with various aspects of speaker–addressee interaction, such as turn-taking and attention management.
4.4 The Medial Fields
4.4.1 The Post-Initial Phase
The temporal phase immediately following the initial field, which can be regarded as forming the post-initial phase in which the linguistic forms produced still project more to come, is characterized by a high frequency of self-repairs, cut-offs and restarts. These tend to follow expressions in the initial field, as shown in (35a), which exemplifies two sudden changes of a syntactic trajectory that had already been underway (syntactic repairs), and (35b), which illustrates the restart of a syntactic fragment.
a.
72 A: have you still got that damn machine on. 73 B: uhm yes. (.) 74 → but you can’t- you see you’re not meant to notice it. 75 → it’s got to be: the tape’s got to be °on for° 76 ↑they disregard the first fifteen minutes anyway(..) because they reckon that it won’t be natural. [ICE-GB S1A-047] b.
138 → C: well you know I sent my- I my sent my son off to boarding school, 139 but I thought that as you were here I might as well send my daughter here. (.) <<unclear words>> [ICE-GB S1A-012]
The observation that the (post-)initial phase is highly susceptible to dysfluencies such as hesitations and repair phenomena has already been made by other authors (e.g. Beattie Reference Beattie1979; Barr Reference Barr, Cavé, Guaïtella and Santi2001). The reasons can be found in the greater cognitive demands on planning processes at this point in utterance production, that is, the kinds of cognitive activities that become relevant after backlinking and responsive activities: the speaker undergoes a phase in which a number of syntactic alternatives are “in play,” that is, available for use, which need to be simultaneously tracked, and in which several syntactic interpretations of the first word or words that has/have already been produced are possible. In this phase, the speaker needs to “build up syntactic projections while assembling syntactic patterns in a phase of high cognitive stress” (Auer Reference Auer2009: 4). In other words, the cognitive task of the speaker is to select a suitable structural format from a broad range of possibilities for the kind of message s/he wants to convey and the kind of action s/he intends to perform, to distribute thematic roles, and to make preliminary decisions on the information status and the relative size of an emergent pattern. Thus, in the initial and post-initial phase speakers deal with the assembling of linguistic signs in order to open a syntactic projection that creates expectations in the listener’s mind on the further development of a structural unit, and to eliminate possible alternative interpretations of the kind of structure and action they are about to produce. Early predictability of a structural unit-in-progress is the precondition for synchronization, which allows for fluent dialogic coordination between the actions performed by the speaker and the co-participant.
Due to this relatively high cognitive stress at utterance beginnings, it is not surprising to find macrogrammatical elements quite early after the initial field: if speakers are still in the planning phase, they are likely to encounter planning difficulties at the conceptual or syntactic level in the early phase of structure building, in which they are required to deal with the problem of committing themselves to a particular structure that builds around one or more words already produced. They are thus most likely to pause and signal various kinds of planning activities after the production of what could be heard as the first constituent of a microgrammatical unit, and thus before or immediately after an initial commitment is made, but less so when the unit is already in full progress (see Clark & Wasow Reference Clark and Wasow1998: 205) and thus when fewer syntactic alternatives are still “in play.”
This view implies that cognitive load predicts the occurrence of dysfluencies and the use of macrogrammatical expressions that document planning activities and delays. Some evidence for this assumption has been provided e.g. by Oviatt (Reference Oviatt1995) and Shriberg (Reference Shriberg1996), who have shown that dysfluencies occur more often before longer than before shorter utterances, and thus when speakers carry out more extensive planning that spans over a larger structural unit. Clark & Wasow (Reference Clark and Wasow1998) provide empirical evidence that speakers tend to disrupt an utterance-in-progress and produce more repetitions and dysfluencies in proportion to the relative complexity of an upcoming constituent: for instance, determiners and pronouns are repeated much more often at the beginning of longer units with a larger number of syntactic nodes in the constituent (a phrase or a clause) than before shorter constituents with lower syntactic complexity. The complexity hypothesis, which predicts that the more complex a constituent the more likely speakers are to suspend speaking after an initial commitment to it, is thus a claim about processing limitations.
These findings refer to dysfluencies in the post-initial phase, which refers to a point in time at which the first constituent of an upcoming structural unit has been produced and at which speakers make a first commitment to a particular structural pattern including that constituent. The commitment is, however, cancelable and thus preliminary, given that many commitments are premature: as argued by Maclay & Osgood (Reference Maclay and Osgood1959: 42), “since structural choices typically involve fewer alternatives than lexical choices the speaker will often initiate a constituent before he has completed his lexical decisions” and, one may add, his/her syntactic decisions. Premature commitments are perhaps driven by time pressure in ongoing talk, where time does not only belong to the speaker, but also to the addressee (Goffman Reference Goffman and Goffman1981: 198), and where speakers that pause too long, especially at the beginning of a unit of talk, might be heard as opting out, being distracted, or as being unsure of what to say. By making a preliminary commitment at the start of a new unit of talk speakers may forestall such misattributions, with “speakers already expecting, at some level of processing, to suspend speaking immediately afterward” (Clark & Wasow Reference Clark and Wasow1998: 208).
Thus, there is evidence that dysfluencies typically occur in contexts where speakers have a high number of syntactic options to choose from. Given that the choices are more numerous at the beginning of a structural unit than at later points in the production process, the cognitive load deriving from dealing with various syntactic (as well as lexical) formats at unit of talk beginnings is an explanatory factor for the phenomenon that structures are often abandoned and replaced by a different one (which means that a new commitment is made) after the production of the first element(s), or are taken up again and continued after a brief moment of hesitation when the commitment to a particular syntactic pattern is strong enough.
What is relevant for the discussion of the microgrammar–macrogrammar distinction is that macrogrammatical expressions occur frequently at such caesuras in the post-initial phase. These expressions serve different functions, such as smoothing out dysfluencies and indicating different kinds of ongoing speech planning activity, such as reformulation (I mean), searches for expressive means (uh, uhm), or references to shared knowledge or assumptions (e.g. you see or you know, as in (36a–b) below). The latter typically signal that the speaker is about to express a state of affairs that is accessible to or highly plausible for the addressee, which might release the speaker from spending too much effort in explicit formulation and preciseness of expression.
a.
125 B: uh I was (.) I mean I wasn’t (.) shocked that the that the the students had not met people with disabilities or people in wheelchairs. (.) uhm (..) [ICE-GB S1A-001] b.
127 A: and the physical contact [(.) ] out there is the is the 128 B: [uhm] 127 A: only <<unclear words>> problem of being isolated in your (.) °wheelchair°. 129 B: uh uh in a in a chair? 130 A: mhm. 131 B: I think so. 132 yes. 133 → I mean I think uhm (.) space (1.5) is you know I mean (.) you know just the obstacles that you have in a room (.) [ICE-GB S1A-003]
In (36a) speaker B replaces a syntactic unit introduced by I was by a negative construction (I wasn’t) after a micropause and the reformulation marker I mean; in (36b) the speaker appears to have more serious formulation difficulties, first starting a microgrammatical unit consisting of a “subject+copular verb” combination (space is), which projects a subject complement, but then abandons the structure switching to macrogrammar, shifting between addressee-involvement (you know) and an attempt at a reformulation (I mean), before initiating a new grammatical format that is, however, relatively fragmented as it represents an isolated noun phrase with internal postmodification and thus a referent that is not integrated into a state or an event schema involving a predicate. However, the unit expresses the speaker’s current focus of consciousness (obstacles that you have in a room), which can be plausibly linked to the meanings associated with the initial, abandoned referent (space) and the topic of the prior talk (being isolated in a wheelchair): the topic of feeling isolated in a wheelchair and the reference to space allow for an integration of the noun phrase into a topical frame centering around the problem that obstacles in a room that provides only little space for someone in a wheelchair to move around contribute to the feeling of isolation. The interpretation of the unit, however, requires the ability to construct a coherent model of discourse by the listener by integrating different segments of talk into a coherent whole. In both cases, macrogrammatical expressions occur right after the beginning of a microgrammatical unit.
A further regular phenomenon at unit of talk-beginnings that is relevant for explaining the deployment of macrogrammatical expressions shortly after the initial field relates to Schegloff’s (Reference Schegloff, Ochs, Schegloff and Thompson1996: 78) observation that units of talk often start with a candidate beginning that is cut off and followed either by a new element (e.g. a referent) or by the reemployment of the original beginning, the latter of which is illustrated in (37a–b).
a.
250 C: does an artist have to live with an artist? 251 B: yes, 252 what happens? 253 C: I don’t see why. 254 A: uh do do they have to or can they? 255 → B: Brow- well uh BROWning and uhm and Emily you see lived together. [ICE-GB S1A-020] b.
30 D: have you decided what to do with exercise number six. 31 C: when did you do that? 32 how many times did you do that? 33 D: a couple of times. 34 → A: u:hm the only time=I think the only time was at the bottom of what’s called a hundred and thirty. 35 C: yes. [ICE-GB S1A-026]
In (37a–b), the reemployment of the beginning is linked to the original beginning by the insertion of a macrogrammatical element, which allows the restart to be recognized as leading into a unit of talk of a particular kind. In (37a), the production of the proper name Browning is self-interrupted after the first syllable and followed by well, which serves as a delayed signal indicating that the response given by B will diverge from the premises set up by A’s information-seeking request: B will not respond to A’s question following the options offered by A (do they have to or can they?), which refer to whether an artist can only have a partnership with someone who is also an artist or whether this is just an option and more or less a coincidence. Rather, B provides an example in which both partners happened to be artists, from which A needs to infer the kind of relation to his request. B thus marks his utterance as not directly contingent to the premises of A’s request after he had already started producing the first constituent of an upcoming (microgrammatical) unit. Well seems to document B’s orientation to the apparent divergence from the kind of response made relevant by A, marking the unit of talk as disjunctive at the earliest point possible. In (37b), speaker A interrupts the beginning of a microgrammatical unit in order to insert the macrogrammatical expression I think, which serves as a delayed device to mark the content of the upcoming unit as epistemically weak, lacking facticity, and thus as potentially challengeable. In both cases, the speakers interrupt a candidate beginning at an early point in order to indicate information relevant for the processing of the upcoming unit of talk and to improve its integration into the sequential context or to indicate the way it is to be interpreted, before the abandoned beginning is redone. The insertions have a clear macrogrammatical character since they organize a unit of talk by linking smaller fragments into an integrated unit of talk. Additionally, they mark the relationship of the unit in which they are produced to prior talk or to the speaker’s mental world in a maximally economic way without making it the central concern of the speaker, who continues the structure already initiated after the reemployment of the beginning.
To sum up, in terms of dysfluencies we find two different ways in which the development of a unit of talk may be carried out in the initial phase of its production (“beginning” refers to the production of the initial constituent(s) of a microgrammatical unit):
(i) Beginning + cut-off (+ macrogrammatical insertion) + restart with a new element,
(ii) Beginning + cut-off (+ macrogrammatical insertion) + reemployment of beginning.
Macrogrammatical expressions are not always used in such contexts, as is indicated by round brackets above, but a frequent phenomenon. They document what the speaker is about to do or which tasks s/he is currently dealing with, e.g. indicating formulation difficulties, representing causalities of a high cognitive load in the planning phase, or marking epistemic uncertainty.
Theoretically, there should be a third possibility, which is that speakers begin a unit of talk and continue the structural unit without reemployment of the beginning (= “beginning [+ macrogrammatical insertion] + continuation”). However, this pattern occurs very rarely in the data and is by far outnumbered by either a repair of the initial element (usually a form that is part of an emergent microgrammatical structure), or its repetition. One reason for the preference of restarts may be the speaker’s intention to redo a microgrammatical form in its ideal delivery and thus to facilitate parsing for the listener (Clark Reference Clark2002): speakers might be guided by the intuition that forms that are part of hierarchical relations are easier to parse and understand when they are integrated into a coherent sequence with the forms to which they are hierarchically related than when they are disrupt. Thus, their reproduction restores continuity after the disruption caused by the suspension. A further possible reason refers to processing limitations on the side of the speaker: it might be easier for speakers to formulate a structural unit from the beginning rather than from the middle, assuming that producing a structural unit from the beginning helps speakers keeping track of where they are. Both reasons have been discussed in the context of word repetitions at utterance-beginnings by Clark & Wasow (Reference Clark and Wasow1998: 207), who point out that, in principle, both of them apply as they represent two sides of the same coin.
A brief note on pattern (ii) is in place here. The repetition of an initial element or segment of an emergent structure is a highly frequent pattern among dysfluency phenomena in the initial or post-initial phase of the production of a unit of talk. In many cases it is realized without macrogrammatical elements and delays, as shown in (38).
(38)
306 C: you’d better be nice to her. 307 A: yeah. 308 C: yeah. 309 → B: she could come she could come down on Tuesday and say I want fifty pounds (.) <<laughter>> 310 she could say or else I’m going to snitch or I’ve already taken down out down the West End and go and see <<unclear syllables>> something fantastic. [ICE-GB S1A-030]
Such repetitions at unit of talk-beginnings have various reasons. First, speakers need to set up the conditions for the successful uptake of their talk and focus the attention of the addressee(s) to their talk. As the initial phase is overlap-vulnerable, especially after turn-transition, e.g. through laughter caused by prior talk or overlap of two competing speakers that self-select for the next turn, speakers may reemploy the beginning of a structural unit in order to avoid it from being unattended by the addressee. In this sense, repetitions are everything else than functionless performance “errors.” A second reason is related to cognitive pressure deriving from real-time speech production: given that speakers act under time-pressure, they try to avoid temporal gaps (as these may be associated with unwanted implications, see above) and rather make preliminary commitments to a particular constituent. This would imply that speakers are more or less aware of the preliminariness of first constituents and count on the possibility of suspending speaking right after their production. The idea of a “premature” production of a constituent before decisions on the design of the structure and lexical choices have been completed is at the heart of Clark & Wasow’s (Reference Clark and Wasow1998) commitment hypothesis, which gave rise to several testable predictions on when and how speakers produce disruptions and restarts.
Dysfluencies become less frequent in the final phase of the production of a unit of talk. The reason is that during the course of the further production of a unit of talk, once a microgrammatical unit has been initiated, the variety of alternative developments becomes smaller until toward the closure of projections the final components are increasingly predictable so that speakers are more and more unlikely to revise an emergent structure that has become identifiable as a pattern of a particular kind. For the listener, the various alternatives that had been “in play” at the beginning of a structural trajectory are reduced over time until the ultimate structural format can be identified with reasonable certainty, which allows for the anticipation of a possible point of completion at which a transition of speaker roles is legitimate. The predictability allows not only for smooth turn transition to the next speaker with minimal time gaps, but also for early starts (Jefferson Reference Jefferson1973, Reference Jefferson, D’Urso and Leonardi1984), that is, overlapping talk at the potential end of a structure-in-progress. In such cases, a first speaker brings a structural unit to completion by producing highly predictable elements, while a second speaker already sets in at the point at which the production of further elements is almost entirely predictable and initiates a new unit of talk.
4.4.2 Medial Fields at Later Points in Time
It has become clear at several places in this study that understanding the internal structure of a unit of talk in grammatical terms begins with an analysis of what is in it, that is, by identifying and locating subunits, which may belong to the domain of microgrammar or macrogrammar. In other words, one needs to lay open the internal organization of units of talk and associate the different segments one may encounter with the different kinds of things the speaker is doing with the production of these segments.
The assumption that units of talk are internally organized in smaller segments allows the analyst to deal with even seemingly chaotic units, such as the ones in lines 20–22 in (39), where B is talking about how she told her grandmother that she had just been left by her boyfriend.
(39)
16 B: and she was just saying= 17 =oh yes well you know (.) you must get over it as best as you can= 18 =it’s not the end of the world °and stuff°. 19 ↑and I said I know= 20 → =but uh you know when I was talking to my cousin Emma uhm it was it was <<unclear word>>, 21 → so I was you know I said it was well Monday when I went to work I said it was a week two weeks ago yeah °two weeks ago last Monday°. 22 → but (.) ↑I kept on feeling wobbly all day you know and I kept on feeling really miserable and thinking but ↓you know. 23 I thought= 24 =God this is- if I’m feeling like this because this is the first person that’s really, 25 you know I meant something to him I was getting quite fond of him. [ICE-GB S1A-049]
The macrogrammatical elements (highlighted in gray) reflect the different kinds of actions speaker B is doing while incrementally building up a structural unit in the linear flow of time, separating one action from another. The unit of talk in line 20 begins after a series of macrogrammatical elements in the initial field: the first marks a shift in focus, namely from how she was talking to her grandmother toward the talk she had with her cousin, which is indicated by but, the second one is a planning marker indicating a cognitive resting point required to plan the new sequence (uh), the third one an addressee-oriented marker (you know) that expresses that the addressee will be able to grasp the implications of what was said in the preceding talk and/or of what will follow. What follows is the projection of a biclausal microgrammatical unit with a first part (when … ) that strongly projects a second part (then … ), the first one expressing an event that became the starting point for a projected second event. However, at the transition from the completed first part to the anticipated second part the speaker hesitates, producing a macrogrammatical expression (uhm) in order to deal with planning activities. The second part begins with a repetition (it was it was) and ends with an inaudible part. The medial field that hosts the cognitive planning marker uhm thus represents a major juncture in the production of the unit of talk, linking an already produced segment to an anticipated second segment that is required to resolve the initial projection of a biclausal structure.
In the next unit of talk (line 21), the speaker signals continuation with a conclusion by producing the macrogrammatical expression so in the initial field, which indicates that what follows is directly contingent to what was said before. This is followed by the beginning of a microgrammatical unit (I was), which makes a subject complement expectable and is likely to express the speaker’s emotional state. However, B changes her plans midway, dropping the further development of the unit and rather alluding to shared understanding (you know) before continuing with a different kind of information. A further, major juncture in the production of a microgrammatical unit occurs at the point at which the speaker is about to express a state (I said it was), but leaves the expectable subject complement unexpressed and shifts to background information (Monday when I went to work), the shift being marked by well. The next juncture in what is to become a microgrammatical unit is caused by a self-correction (it was a week two weeks ago), which goes almost unmarked (no cut-offs, hesitation or a filled pause) and reflects the real-time planning process in which B has to change the deictic perspective from last Monday to the moment of speaking. The repair is followed by a macrogrammatical expression of self-confirmation and deferred realization (yeah), which epistemically upgrades the new piece of information two weeks and strengthens the repair.
The unit of talk in line 22 in (39) is marked as focusing on a new aspect by means of initial but, which signals that something disjunct is being done: B is now focusing on her emotional and physical state, which is described as feeling wobbly. The expression of this state is followed by you know, which forms a final field as it occurs at a potential completion point, but is followed by a continuation that is interrupted at a point at which B is about to refer to her thoughts ( … and thinking). At this point, a shift of focus is indicated (but), followed by a marker that refers to shared understanding of the implications (you know), both of which are ambiguous between forming a medial field or a final field indicating potential completion from a process-based perspective, as can be concluded from the terminal pitch.
While, from an analytic perspective, the interpretation of macrogrammatical units is usually retro-constructed from the final speech product, a speaker- and listener-based perspective understands these units as documenting the kind of cognitive tasks a speaker is dealing with at particular points in time in the incremental construction of talk, and the kinds of projections s/he is creating when upcoming talk is yet unknown. The few, rather coincidental observations based on (39) above illustrate the relevance of macrogrammar for the organization of ongoing communicative tasks in terms of what has preceded and what will follow. Most of these tasks are recurrent generic jobs (Schegloff Reference Schegloff, Ochs, Schegloff and Thompson1996: 81), which are made relevant by prior talk or derive from how talk-in-progress proceeds. Some of them are highly relevant in the course of the production of an utterance and thus in medial fields, such as referring to shared understanding, highlighting the implications of what the speaker is about to say, or indicating a shift to a new piece of information midway. What follows from an analysis of this kind is that understanding the internal structure of a complex unit of talk is impossible without understanding the sequential organization of a speaker’s minor and major actions. The macrogrammatical expressions occurring at several places within a unit of talk were shown to mark major junctures in the continuation of a structure-in-progress, and are thus everything else than randomly occurring “fillers.”
Speakers need not only establish internal coherence within a single unit of talk, but also need to link their utterance in various ways to the prior speaker’s talk and thus to the sequential context in which it is produced. Backlinking may occur within (rather than at the beginning of) an emergent unit of talk when speakers need to deal with a concurring task beforehand. An example is (40), where A and B talk about one of A’s favorite books.
(40)
39 A: [I ] think it’s a really good book, 41 B: [uhm] 42 and do you read the book over and over again? 43 → A: WELL (.) after a certain length of time [yeah ] I would imagine. 44 B: [yes ] 43 A: because it was really (.) one that always sticks in my mind. 45 B: °right°. [ICE-GB S1A-016]
In line 43, speaker A first deals with a restriction of the propositional content of B’s information-seeking request before directly answering it, stating that he reads the book again after a certain time has passed. This prepositional phrase is followed by a marker of affirmation (yeah), which serves as a direct response to B’s request at a point at which the modification of the content of the question has been completed. A then continues with a clausal unit that expresses a somewhat mitigated confirmation of the content of B’s request (I would imagine) and with an explanation of the state of affairs. As a macrogrammatical device, yeah in the medial field serves the task of providing a straightforward response to the co-participant’s request for information and, as a side effect, also serves the integration of the unit of talk produced thus far into the sequential context, marking it as directly contingent to B’s utterance. Moreover, it marks a boundary in the utterance-in-progress as it represents the transition from specifying information that is relevant for responding to B’s request to a follow-up that partly mitigates the confirmation and weakens its epistemic value (I would imagine).
The ability to integrate different segments into one coherent unit of talk is particularly important in cases where syntactic segments that are morphosyntactically independent are aligned consecutively, as illustrated in (39) above, and in (41). A advises B, who is planning to teach English at an institute, to consider important aspects such as the age group B would be comfortable with. In line 189, speaker A starts producing a microgrammatical unit that projects a biclausal structure expressing two alternatives in which the first part involving a subject–operator inversion (would I like to work) strongly projects completion by a second part (or not). However, the production of the second part is delayed as the speaker engages in a specification of the first part.
(41)
188 A: but I susPECT primary might be °more likely°. 189 → but then you’d have to think would I like to work with that age range= 190 → =you might well you might say yes that’s such a you know the the the five six till (.) ten eleven sort of age a::nd or no:th (.) 191 that’s another thought. [ICE-GB S1A-033]
The divergence from the syntactically bipartite frame opened up with the production of a subject–operator inverted syntagma would I like to work with that age range is marked by well, which occurs somewhat delayed after the beginning of an intervening segment (you might well you might say, line 190). The next juncture occurs after A initiates what can be interpreted as the beginning of an evaluative construction (that’s such a … ), which projects an evaluative expression in the format of a lexical noun phrase that is, however, not delivered. The suspension of the continuation is marked by you know, which includes the addressee into the discourse as it alludes to shared understanding. The segment that follows (the the the five six … ) does not syntactically fit to the prior segment in the sense that a definite noun phrase cannot be morphosyntactically embedded in a such a construction. Rather, it forms a structurally autonomous unit that fits only loosely to the prior part and, in terms of its internal structure, represents an ad hoc formation analyzable as a phrasal compound (the five six till ten eleven sort of age). After the completion of the noun phrase – a microgrammatical unit with internal hierarchical relationships – the speaker projects a continuation with and that is followed by the production of the second part of the bipartite structure that was initially opened up (or not), which closes the unit of talk as a whole.
The example in (41) illustrates how syntactic junctures arise from ad hoc shifts in ongoing speech activity, which do not only occur after the completion of microgrammatical units, but may also temporally suspend the production of such a unit in order to attend the production of other segments. Such temporal suspensions of a segment-in-progress typically arise from competing communicative tasks that the speaker considers necessary to perform, e.g. the expression of two competing thoughts. This is also illustrated in (42), where speaker A talks about a flat he intends to buy. While producing his utterance, A deals with two concurring tasks, the expression of a “problem” related to the flat (=the lack of availability of public transport) vs. the indication of continued interest in it despite the problem, provided that the price will be the one that he offered.
(42)
207 → A: the only problem with that Estelle Road place. (.) >I mean I’d still be happy to have it at the price that I offered.< (.) uhm (.) is the other thing that we’ll need (..) about all those places= 208 =and that’s the: uhm, (..) actual public transport convenience. (..) [ICE-GB S1A-023]
In line 207 the speaker shifts from one communicative task (introducing the problem) to another (expressing his continued interest: I’d be happy to have it). Since both could be interpreted as going against each other (the problem might be interpreted as a possible counterargument to buying the flat) the speaker needs to attend the task of making explicit that he is nevertheless interested in it rather early before misunderstandings arise. This dual-task situation is iconically reflected in the structure of the unit talk in line 207, which begins with the introduction of the “problem” in the format of a projecting phrase that anticipates focal information, but whose completion with the projected predicate is temporarily suspended in favor of the expression of A’s general attitude toward the flat (I’d still be happy to have it … ). This digression, which is delivered with increased tempo, creates a juncture at which the speaker makes use of the macrogrammatical expression I mean, which marks the upcoming information as providing specifying information. The micropause and the planning marker uhm following the complex noun phrase the price that I offered indicate the end of the speaker’s current focus on the intervening task, upon which he resumes the initial syntactic thread by providing the copular verb is that is projected by the initial noun phrase the only problem … At this point, the speaker rephrases the original idea, which is now referred to as a the other thing that we’ll need and interpretable as a structural resonance in that is reflects the syntactic format of the initial projecting noun phrase. The structure is completed with the projected predicate only in the next unit of talk (line 208), which includes a resumptive pronoun (that in that’s the … ) that reactivates the prior syntactic subjects (the only problem/the other thing that we’ll need) and is followed by the statement of the problem (that’s the actual public transport convenience).
As shown above, major junctures occur when speakers carry out ad hoc shifts of a syntactic or communicative plan. They may also result from repairs, as in (43a). Moreover, they generally arise from the incremental character of spontaneous speech, where speakers move from a syntactic segment that may represent a completed unit of talk of its own to a next one. Junctures thus occur e.g. when increments are added to a potentially completed unit of talk, as in (43b), or generally when speakers move from one idea to the next, as is illustrated by the repeated use of you know in (43c).
a.
89 A: and I think that’s- you know I think people need (.) uhm skills for doing that and uh (.) I think Adam certainly has those skills for for our group (.) uhm [ICE-GB S1A-003] b.
94 B: because they were doing it on their own you see. 95 A: mm. 96 → could they get the right sort of materials (.) you know to make it tone in? [ICE-GB S1A-009] c.
258 B: and uh whenever you went and stayed in hotels abroad and things you know he’d be worrying about whether the tissue box was on the right side of the bed and whether you know why the flowers weren’t the right color and all that nonsense you know and sort of (.) busy dispatching people. [ICE-GB S1A-010]
It is precisely at such junctures that medial fields and thus macrogrammatical expressions available for use in such fields occur. These expressions establish textual cohesion within an emergent unit of talk in the continuing phase of utterance production. In order to find empirical support for this observation, a corpus-based study of two expressions that appear to occur regularly at the juncture between segments within a unit of talk or between two subsequent units of talk produced by the same speaker has been conducted, namely I mean and you know. Each of the expressions indicates a different kind of link between the neighboring segments: I mean typically indicates continuation with a reformulation, a specification, a conclusion, an assessment, an explanation or additional details (Swan Reference Swan1997: 151), that is, the expansion of an idea (Schiffrin Reference Schiffrin1987: 296), sometimes also in terms of a concession, e.g. after “extreme-case formulations” (Couper-Kuhlen & Thompson Reference Couper-Kuhlen, Thompson, Hakulinen and Selting2005); you know usually indicates that the addressee is able to grasp the implications derivable from what has just been said or that what follows is assumed to be plausible for the addressee or shared knowledge (Schiffrin Reference Schiffrin1987: 268–270). The empirical study distinguished five different contexts of use, which are defined and exemplified in Table 4.4 below. The contexts under (ii)–(iv) belong to the concept of the medial field.
Table 4.4 Contexts of use distinguished for a quantitative study of I mean and you know
The figures in Table 4.5 indicate the total frequency of the two macrogrammatical expressions in the five different contexts and the frequency relative to their overall uses in percentages for the first 300 tokens of each expression in the spoken sections S1A of the ICE-GB.Footnote 3
Table 4.5 Frequency of occurrence of I mean and you know in five different time slots in real-time utterance production in the ICE-GB (S1A) (N=300)
| Initial field | At major junctures | After repair | Planning pause | Final field | |
|---|---|---|---|---|---|
| I mean | 236 (79%) | 22 (7%) | 13 (4%) | 17 (6%) | 12 (4%) |
| you know | 67 (22%) | 84 (28%) | 25 (8%) | 70 (23%) | 54 (18%) |
The figures allow for the following conclusions. First, I mean and you know occur frequently in the initial field where they indicate continuation and a transition from a prior to an upcoming action or bit of information within a turn produced by a single speaker or when a speaker continues a prior action after a brief contribution by the co-participant, that is, at a juncture created by a co-participant’s intervening speech activity, as illustrated in (44).
(44)
44 B: >are you going to go to all< uhm the day on- of the phonology lectures. 45 A: °I think I ought to [do that.]° 46 B: [yes, ] 47 I think you had. 48 ye:ah. 49 → A: I mean [I don’t know how much I’ll take in.] 50 B: [I think I’ll go to most of them. ] [ICE-GB S1A-005]
By means of I mean in the initial field in line 49 speaker A indicates that she is continuing to talk about the question of whether she will attend the phonology lectures, thus bridging the upcoming utterance to the piece of talk (line 45) she had produced before B intervened, and marking it as an elaboration of what she has said before.
The second and perhaps most important conclusion deriving from the data in Table 4.5 is that the occurrence of I mean and you know in a unit of talk is not random. There seems to be a division of labor between both expressions, I mean predominantly bridging two subsequent units of talk in the initial field, you know smoothing out junctures within units of talk and gaps occurring through planning pauses. This division can be explained with the function of these expressions: while I mean projects a continuation with a specification or reformulation, which is a task that speakers typically deal with in the initial phase of the production of a unit of talk, you know indicates that the addressee is assumed to be able to grasp further implicatures or the relevance of what is said, which is a task that becomes relevant either at the beginning of a new unit of talk that continues a speaker’s turn (referring to prior talk), or while a unit of talk is in progress as a means to monitor the addressee’s interpretation of emergent talk. Given that you know indicates the speaker’s assumption that there is no significant discrepancy between the mental world of the speaker and that of the addressee (Schourup Reference Schourup1985: 102), the conversational behavior of the addressee may influence the further course of talk, e.g. making further explanations unnecessary.
To conclude, both expressions are predominantly used to bridge transitions in ongoing talk from one segment to another and creating cohesion by smoothing out the potential disintegration of syntactic segments that results from the linearization of various pieces of talk into an emergent structure, or from turn-taking and thus between prior talk and an upcoming continuation of a turn. Repair activities are among the least systematic time slots in which these expressions are used, but considering their frequency of occurrence in all other contexts they represent a minority.
Generally, the points at which syntactic junctures occur are, of course, not predictable. From the speaker’s perspective, such gaps occur in ad hoc ways, which explains the instantaneous character of medial fields. While starting points lend themselves to a relatively context-free description, given that these are absolute points in the production of a structural unit, what happens when a unit of talk is in progress is relatively difficult to conceptualize in an abstract way as the degree of segmentation depends on a larger set of idiosyncratic factors. Some of these are e.g. the schematicity of a format chosen by the speaker (certain formats facilitate the mental segmentation of a message, and thus real-time planning, such as temporal sequentiality, e.g. while x–y, when x–y, or cause–effect relations), or the frequency of talking about a certain topic. It has been observed in experiments that speakers produce more dysfluencies when describing an abstract scenario they had not described before than when describing one they had previously described (Arnold, Fagnano & Tanenhaus Reference Arnold, Fagnano and Tanenhaus2003; Barr Reference Barr, Cavé, Guaïtella and Santi2001) and that speakers have more difficulties introducing new items into the discourse than talking about old ones (Almor Reference Almor1999). Mental schemas are thus a powerful resource affecting the structural design of talk.
What we can conclude from the examples above is that medial macrogrammatical fields are likely to occur at junctures or gaps within an ongoing unit of talk. Major junctures and gaps originate from:
(i) syntactic planning that did not go far enough, or a loss of the syntactic “thread,”
(ii) word-finding problems and formulation difficulties,
(iii) transitions from one microgrammatical segment to another,
(iv) dealing with concurring tasks, such as forwarding a message while providing background information required for the listener to be able to understand,
(v) repair activities, which need to be carried out in situ, that is, immediately adjacent to the repairable item (Schegloff Reference Schegloff1997; Sidnell Reference Sidnell2010: 111).
(i)–(v) are phenomena that speakers need to deal with instantaneously and that arise in ad hoc ways. Note that many cut-offs of an emergent microgrammatical unit (e.g. in the case of reformulations) are smoothed out by macrogrammatical expressions, thus providing important orientation for real-time processing by the listener, as in (45). With such cut-offs, the co-participants need to mentally overwrite the pieces of microgrammatical configurations that are abandoned underway and not continued at a later point in time (highlighted in gray), i.e. they need to stop the further processing of a syntactic segment at some point in order to restart processing at the beginning of a new segment. In (45), speaker A and B talk about how people usually describe other people, particularly those who they cannot stand, and how people can manipulate other people’s impression by how they talk about an individual.
(45)
268 B: you think these are these are people that other people are describing as warm loving (.) friendly, 269 → you know=I mean it didn’t seem, 270 → uh here I am talking to all the- 271 → because I’m asking people their own point of vie:w (3.6) I’m obviously (1.9) I mean in a sense it’s a nice position to be in because I’m asking everybody to express what they think (.) so they like me because they think oh here’s kind of subconsciously they think here’s someone who’s prepared to listen to what I think. (1.3) so:, (4.3) 272 anyway. (1.6) 273 there’s too much of me °talking°. [ICE-GB S1A-037]
Such examples provide evidence for relatively small-scale planning distances that speakers carry out in real-time speech production and for the observation that macrogrammatical expressions occur at caesuras in emergent talk (note e.g. the uses of you know, I mean, uh, because in lines 269–271). An interesting question is in which ways the abandoned segments contribute to the speaker’s message and thus implicitly guide the listener’s understanding, e.g. in the sense that they foreshadow topical elements that turn up at a later point in time in a new microgrammatical format. Thus, even though they are no more than grammatical “junk” from a retrospective (analytic) point of view, they may provide important processing cues supporting the comprehension process and document the speaker’s cognitive activity of “trying out” different formats before committing to a particular structure. For instance, the abandoned segment I’m obviously in line 271 anticipates the expression of a self-evaluation or reflection of the speaker, which is provided later in a different format (they think here’s someone who’s … ) and thus already foreshadows a particular kind of information. This idea surfaces in Ferreira & Bailey’s (Reference Ferreira and Bailey2004) concept of Overlay, which refers to a cognitive operation that allows one structure to be put on top of another, creating a layered structural representation. The top layer is the one that supposedly determines the interpretation, but since the reparandum portion is “underneath,” it can continue to exert some influence (see also Ferreira Reference Ferreira2005: 375).
To sum up, one of the major tasks in the continuing phase of the production of a unit of talk is to integrate smaller segments into a coherent whole in the sense that all segments can be understood as contributing to the illocutionary force and thus to the communicative action the speaker is about to perform. This task is highly relevant given that cognitive constraints (limited capacity of the WM) work against far-reaching syntactic plans and thus often force the speaker into an incremental, segment-chaining style. Since syntactic planning has usually not been completed when the first units of an emergent structure are expressed, speakers need not only be able to create units with internal hierarchical relationships and embedding of all forms (=microgrammar), but also need to deploy strategies to achieve cohesion between smaller syntactic segments.
The different kinds of tasks in the continuing phase discussed in this section are summarized in Table 4.6. The tasks are all associated with macrogrammar.
Table 4.6 Communicative tasks relevant in the medial phase of the production of a unit of talk
| Continuing phase | |
|---|---|
| Cognition | |
| Text organization |
|
| Speaker–addressee relationship |
|
4.5 The Final Field
The final field is a communicative space that follows a structural unit that has reached a potential point of completion in real-time utterance production and that is available for the production of macrogrammatical expressions. The final field is a crucial moment in the production of a unit of talk in so far as it allows speakers to deal with a variety of communicative tasks before potential turn transition or continuation with a new idea.
The starting point for the postulation of a final field is the observation that in spoken English a limited set of linguistic expressions is used almost exclusively at the (preliminary) end of an emergent unit of talk, such as final linking adverbials/final particles (e.g. then, though, actually, Biber et al. Reference Biber, Johansson, Leech, Conrad and Finegan1999: 791; Haselow Reference Haselow2012a, Reference Haselow2013; Hancil, Haselow & Post Reference Hancil, Haselow and Post2015), tag questions (e.g. can you?, Tottie & Hoffmann Reference Tottie and Hoffmann2009), or general extenders (e.g. and all, or something, Cheshire Reference Cheshire2007; Tagliamonte & Denis Reference Tagliamonte and Denis2010; Pichler & Levey Reference Pichler and Levey2011), as exemplified in (46a–c).
(46)
a. 150 A: the mother looked like 151 B: Cher. 152 A: yea:h. 153 B: I can’t stand >that kind 154 of woman °though°<. [ICE-GB S1A-041] b. 241 A: but you can’t just pick them up off the counter °can you¿° 242 B: yeah you can. [ICE-GB S1A-079] c. 009 B: I’ve never understoo:d and this is what’s caused a lot of problems °n all°. °h [ICE-GB S1A-050] FINAL FIELD
These expressions represent sedimented patterns of expanding a unit of talk beyond a potential completion point and are not integrated into microgrammatical relationships, as they do not receive their morphosyntactic and semantic properties from any of the forms within the unit they accompany. However, they are not autonomous either as they are not able to represent a unit of talk of their own, but require a structural unit to which they are linked in various ways, e.g. in terms of pragmatic scope, illocutionary force, and prosody. Since it is not easy to define the end of a unit of talk from the speaker’s perspective, for whom a structural unit-in-progress is, in principle, always expandable, some comments need to be made on how the final field can be conceptualized best.
4.5.1 Conceptualization of the Final Field
The term final field refers to a point in time at which the speaker has brought a unit of talk-in-progress to potential completion and expands it by the production of further expressions. What speakers and listeners consider “complete” is a context-sensitive issue and cannot be predicted out of a concrete speech context or the co-text (Schegloff Reference Schegloff, Ochs, Schegloff and Thompson1996: 87–90; Auer Reference Auer, Couper- Kuhlen and Selting1996b: 61–62) as it does not merely depend on syntactic “well-formedness.” A relatively context-free definition of the potential end-point of a unit of talk can only be given on the basis of one major cognitive operation that guides language production and processing, namely the speaker’s and listener’s ability to draw mental projections, that is, to precalculate possible continuations on various dimensions of the communicative system (Auer Reference Auer2005, Reference Auer2009; Levy Reference Levy2011). Projection is a cognitive mechanism by which one (verbal or nonverbal) action or a part of it prefigures the next, thus creating expectations on the side of the participants on the further trajectory of a (syntactic, prosodic) configuration or a communicative action underway.Footnote 4 Projections are based on gestalts, which are schematic patterns created by speakers and identified by listeners that are perceived as an organized whole with the production of more or less well predictable elements in a gestalt-conforming way. Gestalt types in language can be found on the syntactic, prosodic, semantic, and pragmatic level.
Speakers are able to create projections and listeners are able to process projections based on their experience and intuitive knowledge of how gestalts are typically structured or sequentially organized. Knowledge of gestalts allows listeners to identify the points at which these may have reached potential completion on various dimensions. Syntactic projections require knowledge on the serialization of linguistic forms, which includes adjacent serialization (when the form following the one processed so far can be predicted with more or less certainty, e.g. adjacency of determiner and noun in English) and nonadjacent serialization (when the occurrence of a next form can be predicted, but not its exact position, e.g. a segment expressing the apodosis of an initial conditional if-segment) (Auer Reference Auer, Deppermann and Günthner2015). The ability to build up prosodic projections is based on segmental properties, involving knowledge about the possible prosodic patterns of intonation units, which are bounded by a terminal contour and contain at least one nuclear accent and one prominent pitch movement (Couper-Kuhlen & Selting Reference Couper-Kuhlen, Selting, Couper-Kuhlen and Selting1996: 14; Auer Reference Auer, Couper- Kuhlen and Selting1996b). It requires intuitive knowledge of global patterns of pitch movement to and from a syllable with nuclear pitch accent (a segment receiving highest pitch) and on the shape of terminal boundaries in a given language.Footnote 5 Determining when the highest or lowest pitch accent has occurred in an emergent utterance requires the co-participants to have some feeling for the pitch range in a concrete speaker’s speech in unmarked contexts (excluding e.g. intonation displaying surprise, anger or contrasts) since, in principle, listeners can make predictions only on the basis of their intuition regarding the limits of a speaker’s pitch contour (Auer Reference Auer, Couper- Kuhlen and Selting1996b: 68–69). Predicting possible completion points would then coincide with the point at which a speaker has reached a pitch limit, i.e. the upper or lower end of his/her intonational range, the first of which foreshadows an upcoming terminal intonation pattern, the latter usually signaling potential completion. Changes in rhythm, volume, and tempo are further resources for prosodic projections. However, the predictability of the number of units to follow a point of maximal or minimal pitch and thus of potential completeness is relatively small. Moreover, global intonation contours are only an ideal case since more often than not speakers assemble smaller accent units in a linear sequence, and it is impossible to predict how many and what kind of accent units or non-accented syllables will follow a final pitch accent. In other words, speakers do not tend to create far-reaching, but rather small-scale prosodic projections, which often do not exceed the limits of a single accent unit. Prosody with its local flexibility to adjust additional units into an emergent intonation contour is thus less suitable for building up far-reaching projections than syntax (Selting Reference Selting1996; Couper-Kuhlen & Selting Reference Couper-Kuhlen, Selting, Couper-Kuhlen and Selting1996: 14–24). Semantic projections relate to knowledge on what it takes for a unit to represent a complete proposition or an “idea,” definable as the point at which a piece of talk can be subjected to a test of their truth value and at which other speakers may express confirmation or negation. Pragmatic projections are based on the speakers’ knowledge on how particular illocutionary acts are usually carried out, and how conversational actions are grouped together and related to one another so as to form sequences. An individual action may be part of or foreshadow another one, as with adjacency pairs (Schegloff & Sacks Reference Schegloff and Sacks1973; Sacks Reference Sacks, Jefferson and Schegloff1995: 521–541), that is, with paired actions in which one action makes another one relevant, e.g. question–answer or offer–acceptance/decline pairs, or preparatory activities (“pres” and “pre-pres”), e.g. story prefaces (Did you hear that about Pete’s car?), pre-invitations (Are you busy tonight?), or pre-requests (Can I ask you a favor?), which project a specific action and are supposed to check on the appropriateness of its production and to prevent the occurrence of dispreferred second actions (e.g. rejections) or the blocking of an action underway (Schegloff Reference Schegloff1988, Reference Schegloff2007). Judgments on what it takes for a conversational action to be interpretable as complete is heavily determined by the sequential context in which it is performed: as Schegloff (Reference Schegloff, Ochs, Schegloff and Thompson1996: 88–89) notes, a simple no can in one context be taken as a completed action (e.g. when it serves as an agreement token with a preceding negative assertion), whereas in other contexts it cannot (e.g. when the speaker is not in the position to agree or disagree with the prior speaker). In such cases, no serves only the initiation of a turn that is expected to express an account.
It is important to note that projection does not determine what follows, but makes expectable a particular kind of continuation. Syntactic, prosodic, semantic, and pragmatic parameters thus serve as orientation cues in the real-time production of an utterance and the synchronization of the listener’s processing with the speaker’s activity. The different kinds of projections interact in complex ways, which cannot be discussed here in full detail. A detailed discussion of this issue has been provided by Auer (Reference Auer, Couper- Kuhlen and Selting1996b, Reference Auer2005) and Ford & Thompson (Reference Ford, Thompson, Ochs, Schegloff and Thompson1996).
Due to the participant’s ability to project continuations, the chances to predict the further trajectory of a structural unit underway continually increase during the production of a unit of talk, until projections on relevant dimensions are complete. This means that in the course of the production of a structural unit the processing effort continually decreases for both the speaker and the listener: while during the initial part of a trajectory participants go through a “phase of maximal planning,” where speakers need to select among various syntactic alternatives and listeners need to simultaneously track several possible trajectories of an emergent structure on various dimensions (e.g. syntax, pragmatics), the number of alternative trajectories decreases over time until in the later phase only a small number of alternatives is available for bringing a unit of talk to completion, many of which are less probable than others so that the remaining parts can be predicted (Auer Reference Auer2005: 9). As I have shown in Haselow (Reference Haselow2016b), a final field arises when the speaker has reached a point in time in the production of a structural unit at which projections relevant for the interpretation of a unit of talk are completed and at which the unit-thus-far is potentially independent of any following unit (t1). Expressions forming the final field are thus outside the projections holding within the unit to which they are attached. The final field expands a unit of talk and creates a new point of (potential) completion (t2 in Figure 4.1), without adding new propositional content or opening up new structural projections.

Figure 4.1 Schematic representation of the final field
The sequence produced up to the point in time t1 in Figure 4.1 is unit-of-talk-constitutive; that is, it can potentially represent a unit of talk of its own. The point of potential completion at t1 is not to be understood in categorical terms, like the end of a sentence. It is a decision made by the participants, based on whether or not listeners are able to identify the illocutionary point of a unit of talk underway, and thus the kind of action a speaker is performing, and whether or not speakers need to deal with the resolution of open syntactic, prosodic, semantic, and pragmatic projections.
However, it is not necessary that projections on all levels are completed in order to understand a structural unit produced until t1 as complete: participants tolerate the incompletion of projections on different dimensions to a certain extent if the context and the co-text as well as the unit produced thus far provide sufficient interpretive cues. Syntactic completion, for instance, is often secondary; that is, speakers often do not complete projections on the microgrammatical level when the unit-in-progress is advanced to such an extent that it allows for the identification of the speaker’s communicative goal. The reason is that a syntactically “incomplete” structure can usually be interpreted against the context, co-text, or knowledge on how a particular action is usually performed verbally, and thus allows for the identification of the kind of action a speaker intends to perform without syntactic completion, as illustrated in (47).
(47)
65 B: I understood that it’s not out on VIdeo. 66 A: no. 67 B: I just wondered if you’d seen it. 68 A: yes= 69 =I have. 70 → B: and you- >OH you were the one who-,< 71 A: I was the one who’s seen it. [ICE-GB S1A-006]
B’s sudden realization of the fact that of all people it was A who had already seen the film is syntactically “incomplete” in the sense that it lacks the continuation of the relative clause that is projected by the relative pronoun who. Yet, the unit-thus-far is interpretable for A as the expression of an instantaneous change of B’s knowledge state. Since there are not very many syntactic alternatives in play after the production of the relative pronoun who (a finite verb complex and an object, e.g. has seen it, are the most likely candidate items to follow) the continuation is highly predictable and thus not necessarily required in order to identify the action carried out by B. Speaker A is thus able to confirm B’s deferred cognitive realization by reproducing and expanding the syntactic format initiated by B. Note that the unit in line 70 also leaves projections on other levels open: it is, for instance, prosodically incomplete as it lacks a terminal contour, ending with a cut-off and mid-level pitch, the latter of which projects more to come.
There are units which, if considered out of their co-text and context, appear “incomplete” in the sense that they project more to come, but which are not heard as leaving projections on any level unresolved within their individual context. An example is provided in (48), where actually, which forms the final field of the unit of talk in line 265, is attached to a bare noun phrase.
(48)
262 D: uhm <<piano in the background>> (7.5) could you just play it through so we know what it’s (.) exactly cos >I’m not confident that I know what it’s supposed to sound like.< 263 A: it’s pretty. 264 it’s nice. 265 → a good chord actually. (.) [ICE-GB S1A-026]
If produced and heard in isolation, the noun phrase a good chord could be heard as projecting more to come: units beginning with a noun phrase are most likely to be followed by a verb in English since initial noun phrases are typically subjects, or they represent a “left-dislocated”/“fronted” noun phrase that is typically followed by a co-referential pronoun (e.g. a good chord – that’s what we need). However, given the co-text the unit is syntactically not leaving a projection unresolved as it follows a sequence with two copular constructions (it’s pretty, it’s nice), which can be considered as cognitively activated when the speaker produces the unit of talk in line 265. The noun phrase a good chord fits to the copular pattern produced in the preceding units of talk as it may also serve as a subject complement of it’s, parallel to the preceding adjectival complements. Semantically, the NP is non-projecting, as it fits to the list of positive assessments given by A before, providing a nominal reference to the assessment. Pragmatically, the unit of talk is also clearly interpretable as contributing to the sequence of units that express the speaker’s attitude, that is, to an action that can be accounted for, paraphrased, or questioned, and thus not leaving any projections unresolved. The final field represented by the FP actually is thus a point in time at which the unit-thus-far has reached a potential point of completion in the given sequential context. Such examples show that units of talk are usually not uttered as autonomous acts, but interact with prior aspects of talk and are deeply intertwined with the co-textual and contextual environment (see Linell Reference Linell2009: 296).
Identifying potential completion points is thus a complex issue, requiring the simultaneous tracking of several kinds of projections and the sequential context, and not tied to the rules of syntax alone. The role of syntax is important in so far as it is one important dimension on which projections are build up and completed, thus guiding the speaker’s planning activities and the listener’s anticipation of the further trajectory of a unit of talk-in-progress. It seems intuitively plausible that pragmatic projection is the crucial factor in deciding about the potential completeness of a unit of talk in the sense that unresolved projections on other levels can be tolerated as long as speakers have provided sufficient cues as to make their communicative goal identifiable. The relative weight of different kinds of projections in utterance interpretation requires further investigation.
4.5.2 Macrogrammatical Expressions in the Final Field
The final field does not only host single-word units such as final linking adverbs/final particles (e.g. then, though), but also more complex ones, e.g. independent if-clauses (e.g. if you don’t mind me saying), general extenders (e.g. and stuff, and things like that), or parentheticals (e.g. I think, I believe). Such units do not obey preconceived ideas of syntactic units based on lexical compositionality and exhibit a reduced semantics where only in some cases residual components of the original semantic content of the individual parts are identifiable. They form conventionalized form–meaning pairings or chunks with little or no internal compositionality and functional values that are not derivable from the meanings of the component parts.
Note that, as mentioned above for expressions in the initial field, using formal criteria based on sentence-grammar for classification is problematic since almost all of the expressions in the final field violate many of the defining features that characterize the lexical or syntactic category they seem to correspond to according to their form. Units like if you want or if you don’t mind me saying, for instance, are formally affiliated with and thus “look like” if-clauses as they seem to exhibit a subject–predicate structure introduced by the subordinator if. However, the classification as a clause is misleading since such forms are not interpretable as genuine clauses introduced by a subordinator: they do not represent a free combination of individual constituents, but relatively specific constructions with their own pragmatic, rhetoric, and discursive properties (see Brinton Reference Brinton2008: 163–166); semantically, they do not express a condition under which a state of affairs expressed in another clause is valid, and they are neither grammatically, nor semantically representing a subordinate clause (no embedding into a matrix clause, no semantic dependence on the proposition of the prior clause).Footnote 6 Another example are units known under the label “general extenders,” such as and stuff or and things like that, which appear to be interpretable as coordinated noun phrases from a sentence-based perspective but, again, represent formulaic units whose meaning cannot be generated by rules of compositionality. Their structural isolation is reflected in the frequent mismatches between the morphosyntactic and semantic properties of the antecedent unit and those of the generic noun that is part of the extender, e.g. you have to speak clearly and stuff (Pichler & Levey Reference Pichler and Levey2011: 444–445).
As with expressions in the initial field, the terminology used here is largely based on formal categories, which are important for identifying the different unit types among analysts, in combination with the specifying adjective final (e.g. final independent if-clause, final comment clause), which is supposed to indicate the syntactic distribution and the functional values that the respective expressions have in common. An overview of the different expressions in the final field, based on the ICE-GB, is provided in Table 4.7.Footnote 7 The order of the expressions reflects increasing internal complexity, ranging from low complexity (a single lexeme) at the top of the table to higher degrees of complexity (a more complex, “clause-like” construction) at the bottom. The functional values given in Table 4.7 are discussed below.
Table 4.7 Macrogrammatical expressions in the final field
Expressions in the final field are only interpretable with reference to the unit to which they are added and for which they typically provide an interpretive cue: they have no illocutionary force of their own and they do not represent a new conversational action, but are part of the action performed with the prior unit. They often improve the perceptibility of transition-relevant places (TRPs) as they are conventionalized in this position and thus create “marked” turn endings (Sidnell Reference Sidnell2010: 154) or endings of units of talk. Such elements therefore play a crucial role in the organization of sequentiality and conversational interaction. The different expressions in the final field form a temporary subsystem of grammar (Hopper Reference Hopper and Tomasello1998: 158) that is continuously reorganized and thus in flux, as the development and variation in the use of tags (e.g. tendencies toward invariant innit?), general extenders (see Pichler & Levey Reference Pichler and Levey2011), or final particles (Kim & Jahnke Reference Kim and Jahnke2011; Haselow Reference Haselow2013; Traugott Reference Traugott2016) shows.
The following analysis of the expressions in Table 4.7 in their individual contexts of use seeks to identify the core functions of the final field, which are indicative of the communicative tasks that speakers typically deal with at the end of an emergent unit of talk.
4.5.2.1 Final Adverbs
This category includes units that “look like” adverbs, such as final uses of really or of course, but that violate many class-defining features of adverbs: they do not depend on or modify another category (nouns, adjectives, verbs), and even postulating clausal or sentential scope (as terms like sentence adverb suggest) is misleading in so far as they are not only attached to clausal or sentential units, but also to lexical, subclausal or larger textual units. It is important to note that adverbs in the final field are not the kind of adverb we find in the “end-position” of a sentence (Quirk et al. Reference Quirk, Greenbaum, Leech and Svartvik1985: 490; Huddleston & Pullum Reference Huddleston and Pullum2002: 575; Biber et al. Reference Biber, Johansson, Leech, Conrad and Finegan1999: 772), which is defined as the part of a sentence following all “obligatory” elements and which usually involves time, place, and circumstance adverbs. The adverb-like expressions referred to here are not integrated in hierarchical relationships with other forms and peripheral to the meaning of the unit to which they are attached in the sense that their deletion would not result in a loss of semantic content. Many of these expressions can also be produced within a microgrammatical unit, where they are involved in hierarchical relationships with another form and express lexical meanings.
Adverb-like expressions in the final field are predominantly used to fine-tune epistemic stance, indicating either certainty (e.g. of course, yeah) or uncertainty/impreciseness (e.g. usually, probably, possibly), the latter of which may also mitigate an authoritative attitude, whereas others strengthen assertive force (really, absolutely). In this sense, they have procedural rather than propositional meaning in the sense of Blakemore (Reference Blakemore1987, Reference Blakemore2002); that is, they provide an interpretive cue to the addressee and guide the processing of a unit of talk rather than expressing lexical content.
Final really is an example for a macrogrammatical expression that retrospectively strengthens the assertive force of a proposition. In the final field, it has a broad scope over the unit it is attached to.
(49)
174 B: but (.) I did some volunteer work over the summer just one day a week [(.) ] at a ((coughs)) are they called Minds or Drop In? 179 A: [yeah.] 174 B: [(.)] Day Centre? 180 A: [mhm] yeah. 175 → B: and that was quite successful really. (.) [ICE-GB S1A-035]
In (49), B first mitigates her assessment of her volunteer work in a day center (quite successful), probably in order to avoid self-praising, but eventually strengthens the core of the propositional unit (“it was successful”) by using final really. This way, B manages to follow the modesty maxim, on the one hand, and to emphasize the success of her work, on the other hand. As with all expressions in the final field, really has a broad scope over the entire unit to which it is attached. This way, the emphatic meaning refers to the state of affairs expressed in a unit of talk as a whole.
A strengthening effect on the illocutionary force of a unit of talk can also be reached by what I call echoing reinforcers, that is, items that usually serve as response adverbs expressing affirmation (yes, yeah) or negation (no) and that can occur as utterances of their own in this function, but that are also often used in the final field. They are usually attached to a responsive turn in which they reinforce an affirmation or negation and reassure information provided in the preceding discourse unit (e.g. that’s right yeah, ICE-GB S1A-016, 199-D; no he is not no, ICE-GB S1A-015, 150-A). Thus, they indicate that a proposition is in accordance with an expectation or opinion of the addressee or with an assumption that had already been expressed or that had been “hanging in the air,” that is, a proposition that is expectable and thus plausible to the co-participants. In this use, they often accompany a unit of talk that merely repeats or “echoes” what the prior speaker had just said and reinforce its validity (e.g. B: it looks swollen that foot – A: it is yeah, ICE-GB S1A-047, 262–263). In such reiterative uses, final yeah/no indicate intersubjective understanding, a shared point of view, opinion or observation, and thus emphasize agreement. At the same time, the speaker retrospectively confirms his/her own utterance, signaling that the information can be added to the common ground with no further discussion. In some cases, speakers add final yeah to an utterance that expresses an inference, retrospectively indicating plausibility and confirmation of what the prior speaker has left implicit, as illustrated in (50).
(50)
242 A: yeah (.) I mean there’s quite a sort of overhead machine overhead on Windows isn’t there but I mean you you’ve got to have a big powerful machine to run it or plenty of RAM. 243 → C: at the same time it’s not going to have Windows then yeah. [ICE-GB S1A-029]
Again, final yeah is a retrospectively oriented device for reinforcing the validity of the propositional content of the unit of talk it accompanies, indicating that the content is in line with the prior speaker’s point or argument. It also marks self-assurance on the side of the speaker that the content of the utterance is true and that it was worth saying. Final yeah is thus a linking device relating a unit of talk to the co-participant’s view with which it coincides.
A similar function on the interpersonal level is fulfilled by final of course, which alludes to shared background knowledge or shared assumptions between speaker and addressee on the basis of which the propositional content of the unit it is attached to appears plausible and expectable (see also Simon-Vandenbergen & Aijmer Reference Simon-Vandenbergen and Aijmer2002). Only remnants of the original meaning of marking a state of affairs as the natural consequence of something (Lewis Reference Lewis, van der Wouden, Foolen and van de Craen2002) can be found, namely in those cases in which of course implies that, based on what the speaker and the addressee know about the world and about each other, the proposition is (almost) self-evident and conforming to expectations, as in (51).
(51)
97 B: I heard you speaking Welsh yesterday= 98 =it was really nice. 100 A: oh right 101 B: I was listening an’ fascinated. 102 A: you were listening, 103 were you understanding it? 104 → B: no not a ↑WORD of [course ]. 105 A: [no]. ((laughs)) [ICE-GB S1A-069]
In (51), an evidential interpretation “as a natural consequence of me not knowing Welsh” is certainly not excluded. However, since B cannot assume A to know that she does not know Welsh, it seems more plausible to interpret the use of of course as marking the proposition as intersubjectively evidential, based on world knowledge (Welsh is a language that only few people are able to understand, let alone to speak) and degree of likelihood (B is unlikely to be among these people). The function of of course is thus metatextual in that it serves as a device to mark the content as plausible and expectable in a given sequential context (“you will understand that p”) and to background the news value of the unit it accompanies.
Many adverbs pose serious problems for grammatical categorization, due to the variety of positional options, variable scope, and varying degrees of syntactic and prosodic integration. Of course, for instance, is classified as a subjunct, conjunct, and disjunct in Quirk et al. (Reference Quirk, Greenbaum, Leech and Svartvik1985: §9.5), as a pragmatic particle by Holmes (Reference Holmes1988), and as a pragmatic marker by Simon-Vandenbergen & Aijmer (Reference Aijmer2002).Footnote 8 By classifying the kinds of lexemes discussed above as adverbs, the important distinction between conceptual (lexical) and procedural meaning is lost: in contrast to adverbs as defined in standard grammars, these lexemes have no effect on the semantic content of the preceding unit. Rather, they serve as a last-minute device to modify the illocutionary (usually assertive) force of a unit of talk, to indicate the epistemic status of the information expressed (e.g. probability, certainty), and to express addressee-oriented meanings (e.g. shared knowledge, plausibility) before turn transition or topical moves.
Prosodically, adverb-like units in the final field show different degrees of integration into the intonation contour of the unit they accompany, ranging from minimal separation by a micropause and new pitch onset to full integration. They are typically produced with emphatic prosody when they strengthen illocutionary force (see e.g. [51]), but otherwise tend to be prosodically unmarked, which is consistent with the observation that a loss of semantic weight and lexical meaning correlates with a loss of prosodic prominence (Pierrehumbert & Hirschberg Reference Pierrehumbert, Hirschberg, Cohen, Morgan and Pollack1990).
4.5.2.2 Linking Adverbs/Final Particles
The elements in this category comprise the lexemes actually, anyway, but, even, so, then, and though used at the end of a unit of talk. They have been variously categorized as conjuncts (Quirk et al. Reference Quirk, Greenbaum, Leech and Svartvik1985: 631–647), linking adverbials (Biber et al. Reference Biber, Johansson, Leech, Conrad and Finegan1999: 889–892), connective adjuncts (Huddleston & Pullum Reference Huddleston and Pullum2002: 779), pragmatic markers (Traugott Reference Traugott2016), or final particles (Mulder & Thompson Reference Mulder, Thompson and Laury2008; Haselow Reference Haselow2012a, Reference Haselow2013; Izutsu & Izutsu Reference Izutsu, Izutsu, Hancil and König2014; Hancil, Haselow & Post Reference Hancil, Haselow and Post2015). The preference for the term final particle (FP) is based on the observation that they do not form part of a microgrammatical unit and therefore do not conform to the class of adverbs/adverbials in the categorical system developed within sentence-grammatical approaches in the strictest sense, as they are not related to another constituent with which they maintain a morphosyntactic and semantic relationship and whose properties they modify. Moreover, in final uses these lexemes have no propositional content and no effect on the semantic content and the truth-value of the preceding unit as they are deletable without a reduction in meaning and of the validity of the unit they accompany. They have a peripheral relationship with the unit they accompany and they can connect units of different size, from lexical units and clauses to larger chunks of text. Analyzing FPs as linking elements or connectors stresses their function of establishing a relation between two units of discourse. However, FPs also interact with the illocutionary force of an utterance, e.g. as mitigators of other-repair (final though, Couper-Kuhlen & Thompson Reference Couper-Kuhlen, Thompson, Hakulinen and Selting2005) or intensifiers of information-seeking requests (e.g. anyway, Lenk Reference Lenk1998: Ch. 4), which justifies the label pragmatic marker (Traugott Reference Traugott2016). It is, however, debatable if these elements operate in the pragmatic domain only and, if so, where to draw the boundary between grammar and pragmatics with respect to spoken discourse (see Ariel Reference Ariel2008; Haselow Reference Haselow2013).
All FPs have a lexical counterpart in non-final use. However, in final use they share a set of functions that justify a classification of the respective lexemes as a category of their own. Outside the final field, these lexemes are typically conjunctions (but, so), adverbs (e.g. then, actually), or scalar focus particles (even) (see Haselow Reference Haselow2013 for an overview).
The categorical value of FPs in spoken English is that they mark a unit of talk as reactive to a preceding one, which has usually been produced by the interlocutor. However, the two adjacent units they link may also be produced incrementally by the same speaker, e.g. when speakers correct inferences that could mistakenly be derived from what they have just said (e.g. I know her. I can’t remember her name though.) or with utterances expressing afterthoughts (e.g. I’m glad she’s gone. I didn’t like her anyway.). In any case, FPs indicate a particular kind of sequential relationship between two subsequent units of talk, namely the one they accompany and the one preceding it, as illustrated in (52).
(52)
a. 320 A: ZARA’s a Marks and Spencer’s person °aren’t you° (.) 321 B: well it’s easy to buy clothes and (.) [rubbish °like that° . ] (..) 322 → C: [they’re all right ↓then.] 323 B: I only buy my (.) from there. <<laughs>> (ICE-GB S1A-017) b. 212 B: d’you want me to put anything like cheese on toast on. 213 °or anything you’ve had lunch already.° 214 → A: I wouldn’t mind some coffee actually. 215 B: coffee. (ICE-GB S1A-045)
In both cases, the FP retrospectively relates the unit of talk it accompanies to a preceding one. In (52a), the unit of talk they’re all right is conditionally related to A and B’s utterance, from which C derives a particular conclusion. More precisely, C’s utterance can be analyzed as pragmatically expressing an inference that awaits confirmation and that is drawn from the content of A’s utterance and the illocutionary goal of B’s utterance, which are pragmatically rank-shifted to conditional protases whose premises are not hypothetical, but given (premise conditionals). Thus, final then triggers a reconfiguration of C’s utterance as the apodosis: “if, as is the case, U1A (=‘Zara is a Marks & Spencer’s person’) and U1B (=B provides another argument for buying at this store): U2 (=‘the clothes are all right’).” This way, C justifies the production of her utterance by marking it as the second part of an implicit conditional construction. In (52b), A’s response diverges from the premises of the request produced by B: while B refers to food for lunch in the first way, A is asking for a beverage, marking the divergence from an expected response by means of final actually. Note that in both cases the syntactic features of the FPs are not determined by any structural constraints holding within the unit to which they are attached. In this sense, FPs are clearly outside microgrammar, representing a final field. They do not participate in the internal organization of the structural unit to which they are attached, but create a two-place relation in discourse, relating the unit to which they are attached to prior talk.
FPs are strictly backwards-oriented and can be conceived of as components of a schematic pattern of the form “U1–U2+FP”: the pattern consists of a unit of talk (U1) that is followed by a second, reactive unit of talk (U2) that includes an FP, which establishes a particular kind of relationship between U1 and U2 (Haselow Reference Haselow2013: 384). U1 and U2 are typically adjacent and form a pair in which the content of U2 has been motivated by and is conceptually dependent on the content of U1. The retrospective linking function of FPs is represented in the abstract discourse schema in Figure 4.2.

Figure 4.2 Abstract discourse schema for final particles
FPs are paradigmatically organized and thus exhibit internal subdifferentiation, as shown in the example in Figure 4.2: replacing one FP by another one attached to a unit like he is right changes the type of rhetorical relation holding between two subsequent discourse segments. Final actually indicates that the content of the second unit of talk U2 (he is right), which expresses a confirmation of the hearsay knowledge expressed in the preceding unit U1, is potentially unexpected for the addressee; final anyway (if stressed) marks U2 as valid irrespective of U1, that is, as a kind of general truth that he is right no matter what has been said or implied before; final but indicates that U2 is dissonant to another, implied proposition that weighs heavier and thus assigns it a lower communicative value; even marks the assertion he is right as ranking low on an implied scale of expectability and signals that, on top of what he said (that she is a bad teacher), the fact that he is right is especially remarkable and noteworthy; so points to an implied conclusion related to U2; then indicates that he is right is a conclusion inferable from U1 and conditionally related to it; though signals that U2 is true in spite of an implied opposite view. This kind of paradigmaticity is one of the core features of grammatical categories (Diewald Reference Diewald2011) and an important indicator for the fact that FPs form a category of their own with clearly definable core functions and formal properties that determine category-membership. FPs have a framing function that spans two adjacent units of talk, U2 expressing information that is contingent to the information provided in the preceding unit of talk U1. They play an important role in the creation of coherent discourse produced in real time where syntactic disconnectedness, caused by the successive production of structurally isolated units as a result of incremental, real-time speech production and speaker change, is ubiquitous. The unavailability of structural embedding with speaker changes or the successive planning and production of smaller syntactic segments leaves linear-temporal sequentiality and the use of conventionalized linking devices as the only possibility for structural linkage. In this sense, FPs, like other macrogrammatical expressions, repair much of the structurally unintegrated character of spontaneous spoken discourse.
An important cognitive aspect that motivates the use of FPs in real-time speech production is thus that they allow a speaker to establish a relationship between two subsequent discourse units without prior planning of this relationship. In same-speaker turns, speakers are not required to have two ideas in mind before they produce an utterance, and to syntactically embed one of the ideas into the other, but may integrate later information into information expressed before in a post-hoc way. This view is also implied in Biber et al.’s (Reference Biber, Johansson, Leech, Conrad and Finegan1999: 851) explanation for the much higher occurrence of concessive final though in spoken texts compared to all written text types:
[a] possible factor at work is that considerable forward planning is required to construct a sentence with a concessive adverbial clause. The speaker has to have two propositions in mind, together with a realization that one runs counter to the other, before starting to speak. This is not easy to manage in the online production of speech.
Limitations in speech planning thus seem to be responsible for the use of final though in spoken language, as its use allows speakers to produce two propositions, both expressed as structurally independent units, subsequently in a linear sequence, and to mark the dissonant relationship between both propositions after they have been produced. This strategy is, for instance, relevant when speakers need to modify a statement already made, e.g. restricting its validity or canceling a possible inference derivable from it. This is the case in (53), where speaker A tells C that he wants to change the format of his CV, which currently extends over two pages, to have it on one page. C offers A to help him changing the format by converting the document on his computer.
(53)
178 A: I would like it done on Wednesday if possible. (.) 179 C: mm. 180 A: please. 181 C: .h well=if you give me it tomorrow I might be able to do some tomorrow [morning before I go to rugby. ] 183 A: [that would be excellent. ] 182 → I can only do an hour though. [ICE-GB S1A-038]
In (53), speaker C produces an assertion (he might be able to do some [converting] tomorrow morning), which might lead A to derive a particular inference or conclusion from it, namely that, in case C is dealing with it tomorrow morning, he will have no particular time limit for doing it, apart from the fact that he will do it before he goes to rugby. The example illustrates that speakers may become aware of possible, unwanted inferences during or after the completion of an assertion: in the next unit of talk (line 182), C limits the time frame available for dealing with A’s CV to an hour and links the unit to the preceding one by means of final though, thus canceling the possible inference that he can deal with the CV the entire morning.
FPs exemplify the importance of a dualistic approach to grammar as they are not integrated into the binary, hierarchical relationships holding with the structural unit they accompany. As shown above, both their functional description and their assignment to a particular grammatical category appear difficult on the basis of sentence-based grammars. Given the principles of macrogrammar discussed in Section 3.3 – all of which apply to the use of FPs – they can be treated as a category of their own in a grammatical domain that deals with the organization of language outside binary morphosyntactic and semantic relationships.Footnote 9
4.5.2.3 Address Terms/Vocatives
The category of vocatives or “address terms” refers to expressions by which speakers establish direct reference to the addressee. It encompasses proper names, that is, the name of the addressee, but also forms of endearment (e.g. dear, darling, honey) and conventionalized forms (e.g. mate), the latter of which are more common in the final than in the initial field. An example is (54), where speaker A uses a conventionalized form of address in direct speech. The use of the last mentioned type is less common in English than in other languages, e.g. Spanish (e.g. tío, tía, hombre; see Stenström & Jørgensen Reference Stenström and Jørgensen2008: 4) or German (e.g. Alter, Digger), and more typical for adolescent speakers than for speakers of other age groups. In (54), mate seems to mark emphasis, underlining the validity of the assertion in reference to the addressee.
(54)
280 B: he was STANding outside in front of the National Gallery going- isn’t it (.) beau:tiful we just don’t have anything like this in America? (.) [ … ] 284 → A: because you haven’t got a history MATE °you [should’ve said] to 285 B: [yeah. ] 284 A: him°. [ICE-GB S1A-006]
Address terms have no binary morphosyntactic relationship to any other form in the unit they accompany, but the status of isolated noun phrases in traditional syntactic analysis. From a macrogrammatical perspective, they are analyzed as being integrated into the illocutionary force of a unit of talk and contributing to it: they often indicate emphasis and underline the (alleged) communicative importance of a unit of talk. On the communicative level, they situate a unit of talk into the concrete interpersonal context: they express an attitude toward the addressee, e.g. positive affection with endearments, and create a higher degree of intimacy between the co-participants since they mark an utterance as specifically tailored to an individual addressee, as illustrated in (55).
(55)
01 A: THANK you (..) for giving up your time Paul. 02 B: well I don’t see it as giving up the time <<unclear words>> 03 A: well for for giving me the time. [ICE-GB S1A-072]
The expressive speech act is rendered more personal by speaker A through direct reference to the addressee Paul. It is therefore not merely the expression of gratitude or acknowledgement, but an utterance that appears to be specifically designed for an individual addressee. In other contexts, particularly with conventionalized forms such as mate, address terms serve the control or maintenance of the contact to the addressee (Briz Reference Briz2001: 225), e.g. in order to monitor the attention and involvement of the addressee into the conversation. Proper names are particularly common with illocutionary types appealing the attention of the addressee, above all directives (e.g. warning, advice), in which they express insistence or strengthen the expectation of a response to or compliance with a directive, e.g. with requests (e.g. Can you help him, Bill?). Moreover, they serve as a strategy to support an assertion or a point, which may be factual or evaluative, e.g. that’s not what I would do Emma (Clayman Reference Clayman2012: 1863). Prosodically, address terms in the final field are part of the intonation contour of the unit to which they are attached and are produced with the terminal rising or falling intonation associated with transition relevance and turn transfer (see also Clayman Reference Clayman2012: 1858).
4.5.2.4 General Extenders
General extenders (GEs) are expressions that typically occur final to a unit of talk, such as and stuff like that, and that, or something, or or whatever. They are also common in other languages, e.g. in German (und so weiter, und so’n Zeug, bla bla), Spanish (y tal (cosas), y tal y cual) or Russian (e.g. и такие вещи [i takiye veshchi] “and such things”). The term general extender refers to the fact that these expressions are nonspecific (“general”) and that they extend an otherwise complete “grammatical” unit (Overstreet Reference Overstreet1999: 3). Pichler & Levey (Reference Pichler and Levey2011: 448–449) use the following schema to describe the prototypical structural configuration of GEs in English:
| (connector) | (modifier) | (pro-form/generic noun) | (similative) | (deictic), | |
| e.g. | and | all, | |||
| and | things | like | that, | ||
| or | whatever |
GEs are semi-fixed, complex, and unbound chunks – that is, form–meaning pairs that occur in a specific or schematic format and whose meaning is not compositional in the sense that it cannot be generated by rules of linkage of semantics and syntax. The combination of individual elements is variable, as indicated by round brackets in the schema. The pro-form includes generic referents such as anything, stuff, everything, something, or things, and is often combined with a connector and/or an optional similative-cum-deictic combination. The connectors and/or provide the basis for a structural subdivision between two main types of GEs, adjunctive variants (e.g. and stuff, and things) and disjunctive variants (e.g. or something like that) (Overstreet Reference Overstreet1999: 3–4). The presence of a connector is common, but absent with more fixed structural configurations, as with things like that, stuff like that, or type of thing. GEs differ greatly in syntagmatic length, and sometimes shorter variants derive from longer ones, having undergone lexical reduction, e.g. or something < or something like that (Cheshire Reference Cheshire2007; Pichler & Levey Reference Pichler and Levey2011). Moreover, some variants seem to be more advanced in terms of phonological reduction (e.g. or something > o’ summin’) than others (Cheshire Reference Cheshire2007; Tagliamonte & Denis Reference Tagliamonte and Denis2010).
Variationist research on GEs in Canadian English (Tagliamonte & Denis Reference Tagliamonte and Denis2010) and British English (Pichler & Levey Reference Pichler and Levey2011) has shown that there is considerable synchronic variability in terms of the formal, morphosyntactic, and functional properties of GEs, and instability in the social conditioning of variant choice of GEs. The system is highly fragmented with a large number of relatively infrequent variants and a small number of highly productive (e.g. and that, or something) ones. The overall system of GEs is continuously in flux, with ongoing renewal of individual variants and synchronic variability in the use of variants (e.g. and things like that coexisting with and things) and no clearly discernible preferences for or tendencies toward the use of longer or shorter variants. No stable system has as yet evolved (Pichler & Levey Reference Pichler and Levey2011: 462).Footnote 10 Only some individual variants have become synchronically fossilized in terms of their form, function, contextual uses, and social contexts of use, such as and that, which typically follows nominal referents and is currently becoming the default variant among younger speakers in Northern England (Pichler & Levey Reference Pichler and Levey2011: 462).
A strong indicator for the microgrammatically unintegrated, macrogrammatical character of GEs is the typical mismatch between the morphosyntactic and semantic properties of the generic noun (e.g. thing, stuff) and those of the antecedent phrase to which GEs are attached, as mentioned above and illustrated in (56a–b).
(56)
a. 44 A: this person over in the Survey of English Usage is subscribe- transcribing it. 45 B: oh right. 46 → A: but only but the names are changed and things like that so, (.) then they won’t know. [ICE-GB S1A-068] b. 277→ B: they sent one to my mother after she di:ed or something. [ICE-GB S1A-007]
In (56a) there is a mismatch between the morphosyntactic and semantic properties of the antecedent unit and the properties of the generic noun things in the GE: the generic noun is not attached to a noun phrase with one or more countable common nouns, but to a clausal unit (the names are changed), and it does not semantically refer to entities, but to an activity (“changing names”). The GE and things like that leaves it to B to think of further activities related to that of transcribing talk. In (56b), the indefinite pronoun something in generic use is not appended to a noun phrase with a countable, singular, inanimate noun, but has indeterminate scope, which is either over the prepositional phrase (after she died) or over the utterance as a whole, either of which is retrospectively marked as a vague approximation to how or when exactly the event occurred in reality. Or something indicates that there are semantic alternatives to the proposition over which it has scope.
As macrogrammatical units, GEs are not analyzable as coordinated noun phrases or adjuncts, and thus not involved in the morphosyntactic and semantic relationships holding within the prior unit. They form the final field of a unit of talk, contributing to the way in which the unit to which they are attached is to be interpreted. Four functions can be distinguished: (i) referential, (ii) turn-taking, (iii) interpersonal, and (iv) cognitive functions. In referential uses they “cue the listener to interpret the preceding element as an illustrative example of a more general case” (Dines Reference Dines1980: 22); that is, the preceding referent is marked as a member of a more general set of things or of a particular conceptual category, or it represents a part of a larger, unspecified context. The turn-taking function consists of marking a TRP, thus legitimating turn transition. GEs often mark the end of a list of items, a single-instance exemplification, or of a quotation that could be expanded, but is suspended for some reason, indicating the speaker’s desire to close a unit of talk and probably the topic altogether and to “punctuate” or bracket units of discourse (Cheshire Reference Cheshire2007). The interpersonal function of GEs is to express informality and solidarity (Cheshire Reference Cheshire2007) which, one should add, is a pragmatic effect deriving from the implicational character of GEs: marking a unit of talk as potentially expandable, lacking precision or epistemic certainty presupposes that the speaker can rely on the addressee’s ability and, above all, his/her willingness to accept the conversational vagueness and the indication of mutual understanding implied in the use of GEs. In cognitive terms, GEs appeal to shared knowledge and common ground based on which an exhaustive set or a detailed expression is not required for mutual understanding and one item is sufficient to evoke a mental frame of utterance interpretation. GEs serve as a retrospective hedge indicating vagueness and potential expandability of a conversational activity and, in a more global sense, refer to how something is expressed by the speaker and should be processed by the addressee, namely as an approximation and as potentially expandable, leaving open how exactly the speaker would continue. This implies that the speaker refers to shared background knowledge and understanding on how what was said thus far can be continued or expanded in a given context. GEs indicate or allude to intersubjective understanding similar to other elements in the final field (e.g. you know, I mean, of course) and signal that a higher degree of precision is not required at a particular point in discourse. In this sense, they further the progression of talk when further examples or more precise expressions are cognitively unavailable at the moment of speech or unnecessary, or when accuracy of information would unnecessarily threaten the smooth flow of discourse. GEs give the addressee “a rough but sufficiently exact idea about a certain state of affairs for the general purpose of the conversation” (Erman Reference Erman, Moen, Simonsen and Lødrup1995: 144).
4.5.2.5 Parentheticals/Comment Clauses
The final field may host a fixed set of configurations that “look like” matrix clauses with a subject (usually a first person singular pronoun) and a mental verb that requires a complement clause as an object, such as I think, I believe, I guess, I mean, also you know or it seems. These expressions are also used in the initial field (see Section 4.3.5). As above, I will restrict the category of parenthetical clauses to subject-cum-mental verb combinations.
It has been stated above that the syntactic status of parentheticals/comment clauses is often open to interpretation, which holds particularly for initial parenthetical clauses such as I think or I believe, which may be ambiguous between a main clause taking the rest of the sentence as object, and a comment clause or discourse marker. In the final field, however, such expressions are unlikely to function as main clauses with an embedded, preposed subordinate clause, but rather represent a syntactic chunk of their own: syntactically, they are not required to integrate or embed a preceding, allegedly subordinate unit into a larger (sentential) structure, which is also reflected in the absence of an overt marker of clausal linkage, and semantically they do not express propositional information into which the content of the prior unit is integrated, but rather indicate how the prior content is to be interpreted. It is therefore more appropriate to assume that parentheticals form the final field of the unit they accompany and thus follow a structural unit without establishing a microgrammatical relation with it.
As argued above, a parenthetical unit is analyzed here as part of an independent (non-subordinate) assertion (in any syntactic format) that contains the topical subject. In the final field, parentheticals typically express a post-hoc epistemic–evidential evaluation relating to the speaker’s certainty concerning the validity of the prior message, rather than adding semantic content. Final I believe, for instance, does not indicate the speaker’s belief in the truth of an assertion, but expresses a retrospective epistemic evaluation or an opinion to which the speaker is committed, but which is mitigated for some reason, e.g. due to the lack of epistemic certainty, as in (57).
(57)
68 A: yeah he had some sort of disease. 69 → undiagnosed °I believe°. 70 so he had these big big cracks in his face. [ICE-GB S1A-015]
The use of I believe in (57) lends the adjectival “increment” (Schegloff Reference Schegloff, Ochs, Schegloff and Thompson1996: 59; Couper-Kuhlen & Ono Reference Couper-Kuhlen and Ono2007) undiagnosed the character of an assumption rather than a fact, based on some piece of evidence that the speaker might have (e.g. from what “he” has told him), an inference, or on a piece of information in memory that cannot be fully recovered at the moment of speech.
An epistemic meaning is, however, often absent with final parentheticals. Final I think, for instance, is predominantly used as a mere opinion marker, as in (58).
(58)
150 B: you can’t communicate with people (1.0) solely (1.0) through uh <unclear word> you need-, (.) 151 → you need physical contact too (..) in everyday things not (1.0) uhm (2.5) not just once in a while. (2.2) I think. [ICE-GB S1A-003]
The use of I think in (58) is not epistemic in the sense that it expresses the speaker’s degree of certainty concerning the validity of the content of unit of talk to which it is attached since the speaker, who is a dancing teacher working with disabled people, is not talking about a fact for which one needs objective evidence. Rather, B expresses his subjective experience with people at his job, from which he learned that an individual needs more than “words” in everyday interaction with other people, namely physical contact. B’s assertion is thus an opinion that is not supposed to be epistemically challengeable. It can be challenged only by divergent subjective experience or by a different opinion. I think is also often used as a hedging and face-saving device on the interpersonal level, weakening or mitigating the force of a claim or an immediate reaction in such a way that it “leaves room for intervention by the interaction partner” (Nuyts Reference Nuyts2001: 165). This interpretation of I think is applicable to (58): the long pause before the production of the parenthetical offers room for intervention by the interaction partner who, however, does not adopt the speaker role. B therefore reissues the invitation for speaker change by marking his utterance as an opinion to which A may position him/herself.
Another, functionally different kind of chunked expression in the final field is I mean. In uses other than in the final field, I mean indicates that the speaker modifies his/her own talk (Schiffrin Reference Schiffrin1987: 299) by expanding or reformulating an idea or opinion just expressed. In the final field it is not projecting, but marking the preceding utterance or discourse segment as potentially expandable by a reformulation or an explication of an inference that is, however, left implicit as it is potentially interpretable based on mutual understanding (“you know what I mean”), leaving the projected continuation or an inferred meaning pragmatically “hanging,” as in (59).
(59)
32 B: how detailed do you want me to get really because I could: (.) go off (.) I could talk quite a long TIME about my father on a question like this and in what kind of way (3.0) do you want him [described ] as a as a person? (..) 33 A: [well, ] 34 → well uh- bearing in mind that I know nothing about him at all=°I mean°, 35 B: all right. 36 uhm (..) well I’m bound to say he’s he’s an actor <unclear word> [ICE-GB S1A-076]
In (59), A answers B’s request for information as to how detailed he should get in talking about his father with an assertion that only indirectly answers the question, i.e. from which B needs to infer that he should offer a detailed account. The fact that B displays understanding indicates that he is able to interpret the inference and understands I mean as an offer for turn-transition.
In contrast to parentheticals/comment clauses in the initial and medial fields, where a variety of intonational phrasing patterns is possible (Dehé Reference Dehé2009, Reference Dehé2016; Dehé & Wichmann Reference Dehé and Wichmann2010a, Reference Dehé and Wichmann2010b), they are prosodically unmarked and integrated into the intonation contour of the prior unit in the final field. The relatively uniform realization of final parentheticals provides evidence that they exhibit no longer semantic variability and do not convey propositional content.
4.5.2.6 Tag Questions
Quirk et al. (Reference Quirk, Greenbaum, Leech and Svartvik1985: 810) define tag questions as a type of yes–no question that conveys a positive or negative orientation and that is appended to a sentence expressing a statement. If the statement is positive, the tag is generally negative (60a), and vice versa (60b).Footnote 11
(60)
a. 372 A: and he thinks he’s SO: cool ↓°doesn’t he°. (.) 373 oh he’s a WINDOW CLEANER. [ICE-GB S1A-041] b. 305 A: you haven’t seen her=have you? 306 B: no- no- no- I have not made the effort. [ICE-GB S1A-098]
Formally, tag questions consist of an operator and a subject (in that order), the operator usually being the same as the operator of the preceding structural unit or, if the latter has no operator, the auxiliary do, as in (60a). The subject of the tag is usually a pronoun that is co-referent with the subject in the preceding unit and agreeing with it in person, number and semantic gender. Note that many varieties of English have invariant (“non concord”) tags, e.g. innit, which is relatively frequent in British English where its use correlates with the sociolinguistic variables age, gender, ethnicity and geographical origin (Pichler Reference Pichler2013: Ch. 6).Footnote 12 The syntactic status of tags is unclear: Quirk et al. (Reference Quirk, Greenbaum, Leech and Svartvik1985) do not use a specific terminology, but treat tags as units that are “appended” to a sentence expressing a statement and that are in a paratactic relation to this sentence (Quirk et al. Reference Quirk, Greenbaum, Leech and Svartvik1985: 919) which, according to the authors’ definition of parataxis, means that they are constituents at the same level of constituent structure. As “appendices,” they are “outside” the clause or sentence they are appended to, but unable to occur on their own, that is, without a syntactic “host.”
Tags are functionally variant: they can express a genuine information- or confirmation-seeking request eliciting a response by the addressee, but they also serve other interactional functions. As genuine requests for information/confirmation, tags are used to invite the addressee’s response to the statement to which they are appended. In such uses, they are typically produced with rising intonation if the addressee is about to decide the truth of the proposition. The statement to which the tag is attached thus acquires the character of an assumption (for which the speaker may nevertheless have strong evidence). In (60b), for instance, the speaker elicits confirmation of a statement to which he is not fully committed and attributes epistemic power to the addressee.
However, tag questions are not only used to express a request for information or confirmation and thus to elicit a response, but often already imply the addressee’s consent, i.e. they mark a statement as already agreeing with the addressee’s view, based on assumed shared knowledge. In such facilitative uses (Holmes Reference Holmes1983; Tottie & Hoffmann Reference Tottie and Hoffmann2009: 146), the speaker is committed to the truth of the proposition and the tag merely implies a particular expectation on the side of the speaker, namely that the addressee will agree, as in (60a), where a response by addressee is not required for negotiating the validity of a proposition. Tags in this function facilitate interaction in that they offer the addressee to engage in ongoing interaction by eliciting a minimal affirmative response (e.g. hmm, yeah); that is, they invite the addressee to contribute to the discourse. Other tags are “attitudinal” (Tottie & Hoffmann Reference Tottie and Hoffmann2009: 145) in that they indicate the speaker’s attitude, such as disapproval or solidarity, and they may be pragmatically challenging, softening, hortatory, or emphatic (e.g. I’ve told you not to touch it haven’t I?).
Some tags are ambiguous between a genuine question, a mere invitation for confirmation, and a rhetorical device with the same speaker continuing his/her turn. Tags thus align differently on a scale from representing a TRP in a maximal way (genuine question, rising tone, pause) and a form of address inviting for a minimal contribution to ongoing talk where the opportunity for the addressee to respond is restricted to minimal tokens (e.g. a head-nod or continuers like mhm) overlapping with the current speaker’s ongoing turn.
4.5.2.7 Independent If-Clauses (“Chunks”)
This type of macrogrammatical expression in the final field is represented by a fixed set of routinized expressions that formally “look like” if-clauses but that are neither subordinated to another clause, nor necessarily combined with another clausal unit. They are conventionalized expressions without external syntactic dependencies and represent lexically specified chunks in the sense that their meaning, their functional value, and their pragmatic uses cannot be derived from the constituent parts. Some types that are common in the ICE-GB are given in (61a–c).
(61)
a. 83 B: I’ve got the other- (.) the other one to that one Vicky and Tom Pitts has got his profile. 84 A: oh have you? 85 that’s it. 86 that’s the one. 87 yeah. (..) 88 B: profile here. 89 → very short skirt on °if you don’t mind me saying°. 90 A: <<laughing>> with all my hair on one side of my head. [ICE-GB S1A-040] b. 156 B: uhm (0.5) so I wouldn’t say that I actually looked on religion as a BAD thing ↓if you see what I mean. [ICE-GB S1A-076] c. 345 B: I told you I only read- (.) text books= 346 =that’s all. (0.5) uh 347 → reference books if you li:ke. [ICE-GB S1A-016]
Unlike “genuine” conditional clauses, if-chunks in the final field show no signs of syntactic embedding: they are not integrated hypotactically into a complex sentence, but loosely appended to a potentially complete unit, which can be a lexical, phrasal, or clausal unit. Semantically, these if-chunks do not express a conditional protasis that needs to be accompanied by an apodosis. Thus, unlike most “genuine” conditionals, they do not set up a mental space (a condition) within which the state of affairs expressed in the another clause is valid (Dancygier & Sweetser Reference Dancygier and Sweetser1997); that is, the unit introduced by if no longer evokes a hypothetical frame in the strictest sense, but expresses metaconditional meanings that require a speech-act related interpretation, relating to different aspects of an utterance. Some of these chunks refer to the adequacy of expression (if you like, if you want), others serve as a rhetorical device to mitigate the illocutionary force of speech acts that represent a potential face-threat to the addressee or the speaker him/herself (if you don’t mind me saying), or facilitate addressee involvement (if you see what I mean).
The if-units discussed here represent chunks that were formed on the basis of microgrammatical principles and that have been transferred to macrogrammar. Final if you like, for instance (see [61c]), is to be analyzed as a chunk since its meaning cannot be broken down into separate semantic, syntactic, and pragmatic modules: it does not express the literal meaning of a condition of the kind “in case you find sth. agreeable/enjoyable, q” and does not require embedding into a main clause, but forms a symbolic unit in the sense of Langacker (Reference Langacker1987), integrating various conventionalized aspects relating to its use and function into a higher- order meaning. Moreover, if is no longer analyzable as marker of subordination and conditionality as it has extended its scope from syntax to connected discourse and from conditionality to pragmatic felicity. The conditional element surfaces only indirectly in that the felicity of the speech act is, at least rhetorically, expressed as depending on a pragmatic condition expressed in the if-clause, e.g. the condition that “you don’t mind” or “see what I mean,” which is the addressee’s consent or understanding. It thus expresses addressee-orientation and serves the involvement of the addressee in the dialogic activity (see also Brinton Reference Brinton2008: 163–166).
Prosodically, final if-chunks may represent an intonation contour of their own and be separated from the host unit by a pause, but typically they combine with the latter into one coherent intonation contour. In any case, such units are produced without pitch movement; that is, the pitch remains on the same low level it has reached with the last syllable of the preceding unit, and they are prosodically unmarked.
4.5.3 Expressions in the Final Field as Macrogrammatical Elements
The different kinds of expressions in the final field do not qualify as constituents in the traditional sense of sentence grammar: they are not morphosyntactically integrated into hierarchical relations with other lexical, phrasal or clausal units, they do not receive their morphosyntactic and semantic properties from any other form or assign them to another form, and they do not relate clausal constituents to one another. Expressions in the final field tend to form a relatively closed class and are characterized by the following features, which coincide with those of expressions in the initial field and which justify their assignment to the macrogrammatical domain:
(i) they are used to organize spoken text rather than creating microgrammatical relationships,
(ii) they are morphosyntactically autonomous,
(iii) their internal structure is not or only weakly compositional,
(iv) they do not contribute to, change, or restrict the propositional content of the unit they accompany; their semantic content is minimal;
(v) they serve as processing cues, framing or fine-tuning a preceding message and retrospectively integrating it into a sequential context.
Note that expressions in the final field can occur in combination. A tag question can, for instance, be followed by a general extender, as in (62a), or an FP by another FP, as in (62b).
(62)
a. 112 B: ↑well he has this stupid girl he falls in love with ↓°doesn’t he or something°. 113 A: NO:: [ICE-GB S1A-006] b. 175 B: we should=maybe just leave a message here saying head over. (.) 176 A: she won’t bother coming then though. [ICE-GB S1A-039]
The combinability of expressions in the final field is not unconstrained: some combinations seem to be excluded or at least do not occur in the ICE-GB and intuitively appear unnatural, such as an independent if-chunk followed or preceded by a tag question. Moreover, the serialization of units in the final field is often constrained: an FP can, for instance, be followed by a parenthetical, but it is usually not preceded by it (e.g. *He’s not invited I think then). This indicates that the final field has a syntax of its own that requires further investigation.
4.5.4 Functions of the Final Field
The functions of the different kinds of expressions in the final field are very similar, which suggests that speakers deal with recurrent communicative tasks at the potential end of a unit of talk in the linear flow of time. These tasks are related to the fine-tuning of epistemic or pragmatic aspects of a message or a “focus of consciousness” just expressed in order to facilitate the interpretation of a unit of talk, the organization of the further course of interaction (e.g. by prompting the addressee to engage in ongoing interaction), and the integration of a unit of talk just produced into the discourse context. They provide a last procedural cue guiding the addressee’s interpretation of a message before (potential) turn transition or continuation with a next unit of talk. The tasks performed by speakers in the final phase of the production of a unit of talk are summarized in Table 4.8.
The final field is a resource that is used when the interactive past has not already contributed to a solution of the tasks listed above or when the speaker considers the conditions required for the successful processing of a unit of talk-thus-far as not fulfilled. For instance, speakers may notice unwanted effects on different dimensions (e.g. on the content or the interpersonal level) that they wish to remove. From a dynamic, temporal perspective on utterance production the final field is the last opportunity to make whatever adjustments are necessary according to the speaker’s intuition or contextual demands before possible turn transition or continuation with a new idea.
4.6 Beyond the Final Field
There seem to be cases where expressions forming the final field appear to occur utterance-medially from an analytic perspective, that is, when considering the finished speech product. However, a closer look at such cases shows that they tend to occur at a point in the temporal emergence of a structural unit at which the speaker has come to a preliminary end of a projection and is uncertain as to the way in which talk is continued. Consider, for instance, the use of the general extender and the parenthetical in (63a), and that of the tag question in (63b).
(63)
a. B: when we had the talk on clinical psychology and stuff (.) you know you need a lot of experience not just with: people with problems and stuff. [ICE-GB S1A-035, 047] b. A: I mean in in that piece we’ve just heard from The Revenger’s tragedy it’s a mixture isn’t it of original instruments. (.) and kind of- what sound to me like modern [ (.) ] trumpets. B: [ye:s. ] [ICE-GB S1B-023, 140]
In (63a), the speaker projects a complex turn that consists of an initial temporal segment (here in the format of a clause) introduced by the subordinator when which, upon its production, raises expectations on a bipartite structure in which the speaker expresses a temporal relation between two co-occurring or successive events. The general extender and stuff in (63a) is produced after the first unit can be heard as complete, that is, after the syntactic projections of the first segment have been completed. At this point the speaker begins to hesitate, adding the addressee-oriented parenthetical you know, before he continues. However, the continuation is semantically odd as it does not complete the cognitive projection initiated by when in the sense that it does not match the semantic template offered by syntactic when … (then) … structures. Rather, the speaker is creating a continuation on the fly, improvising as the unit of talk unfolds. The when-segment merely provides a general topical or event frame within which the following unit is anchored, thus forming a structural unit of its own. A similar strategy can be observed in (63b), where A adds the question tag isn’t it to an intermediate completion point of a unit-so-far expressing a core idea (it’s a mixture), alluding to the addressee’s agreement. The idea is specified by a postmodifying prepositional phrase after this potential end point, based on the speaker’s impression that the nominal reference mixture might be semantically too unspecific to be identifiable for the addressee. This suggests that the final field is, from an analytic perspective, not always final to a unit of talk, but sometimes arising at the end of a syntactic segment that might stand for itself from the speaker’s perspective in real-time speech production, but is then followed by an ad hoc continuation.
The ad hoc character of continuations of a unit of talk-so-far is also reflected in the addition of increments to a potentially completed unit, which follow expressions that form the final field, as in (64). These increments are, like expressions in the final field, nonprojected units that merely expand a structural unit beyond a point of possible completion (Couper-Kuhlen & Ono Reference Couper-Kuhlen and Ono2007).
(64)
B: so she’s got about four thousand books now °I think°. (.) or three thousand. [ICE-GB S1A-025, 319]
Continuations beyond expressions in the final field are thus based on the speaker’s ad hoc decision to expand a unit of talk for various reasons, e.g. in order to express post-hoc ideas, associations, or repairs that need to be integrated into an ongoing unit, or to smooth out gaps resulting from a lack of turn uptake and responsive activities by the addressee, which offers the current speaker the opportunity to continue.
From a product- rather than process-based perspective, cases like (64) look like utterance-medial uses of expressions that otherwise occur in the final field. This interpretation, however, misses an important point: speakers may re-analyze their utterances in the moment-by-moment unfolding of structure since “the leading edge of an utterance is always open to renegotiation through increments which [ … ] may work to recreate an original grammatical project and (seen retrospectively) transform it into a new construction” (Hopper Reference Hopper, Auer and Pfänder2011: 32). It is at such crucial points in the real-time emergence of structure – intermediate between the completion of a (sub)unit and different alternatives as to how to continue – that expressions forming the final field are produced.
4.7 Units of Talk without Fields
Thus far, we have treated units of talk as regularly including a communicative space that can be interpreted as the beginning and one that is recognizable as the end. However, as has already been pointed out by Schegloff (Reference Schegloff, Ochs, Schegloff and Thompson1996: 75), there are units or “spates of talk” that start with something that is audibly not a beginning, and that end with something that is not a recognizable possible completion. Such units are typically completions of or additions to a prior speaker’s unit of talk, or units that are produced in overlap with a prior speaker’s talk. In both cases, they are structurally oriented to another unit to which they are related. An example is given in (65).
(65)
60 B: whereabouts in the Black Sea. 61 <<unclear word>> 62 C: I can’t remember to be honest. 63 it was like a sort of- 64 because it was four day visa you sort of spent most of three days travelling [(.) ] then one night there at this 67 → B: [getting there.] 64 C: incredibly crowded camp site which seemed to be like (.) 65 >I don’t know<. 66 it seemed to be where everybody went for the holidays. [ICE-GB S1A-014]
The unit at the arrow (getting there) is grammatically built as a recompletion of speaker C’s possibly completed unit of talk at travelling. It is not built as a structural unit of its own, but “parasitic” to a unit already produced, given that it is not intended as a starting point for a new unit or activity, as we may conclude from the terminal pitch and the fact that it resonates the grammatical form of the last item (a participle form of the verb). By producing it, B displays attention to C’s talk and understanding. The grammatical format of this activity is designed in such a way that the stretch of talk can be directly linked to C’s talk, rather than being hearable as an autonomous unit of talk initiating a new sequential step within the interaction.
Many units of talk are designed as follow-ups to a prior unit and thus structurally oriented to the prior speaker’s talk. As follow-ups, they initiate a continuation of a prior unit beyond the completion point it has already reached, and thus lack an initial field. An example is (66), where A tells B, C and D that he recently met a girl on the train. B and C use this as an opportunity to mock at A, insinuating that this “meeting” had been intentional rather than coincidental and that A actually “picked her up.”
(66)
12 A: I I met her and <<unclear words>> 13 B: you PIcked her up and DROPPed her °did you°. 14 A: picked her up and dropped her yeah. 15 → B: only to pick her up again= 16 =I hope. (.) 17 D: oh well if you’ve got her telephone number you picked her up. [ICE-GB S1A-020]
B’s unit of talk in line 15 is built as a continuation of A’s prior talk, the latter of which is thus treated by B as not having come to possible completion, but as leaving an important message unsaid. The unit is produced in the format of a collaborative completion (Lerner Reference Lerner, Ochs, Schegloff and Thompson1996), by which B pretends to spell out what the prior speaker (A) appears to have left unsaid, thus expanding A’s utterance. B’s utterance has no apparent beginning since it is supposed to be heard as continuing a structural unit initiated by A.
A similar orientation to prior talk can be observed with follow-up questions, which continue a prior structure and include an in-situ wh-pronoun. In (67), for instance, speaker B formulates his request for specification by using a structural format that expands the unit of talk produced by B.
(67)
179 B: and then he changed to Surrey University where all the <unclear word> people went. <<laugh>> 180 → A: like who? 181 B: you know, 182 they thought they were the best <<unclear word>> and they should all stick together= 183 =and two of them got dysentery <<unclear words>>. <<laughs>> [ICE-GB S1A-014]
Similar to (66), the utterance produced by speaker A in line 180 is designed as to fit the grammatical format of the prior speaker’s unit of talk. A requests an exemplification of the relatively unspecific reference all the [ … ] people. The unit of talk in line 180 does not start with a recognizable beginning as it is constructed as a continuation of or follow-up to the prior unit of talk. It is grammatically designed in such a way that it is maximally coherent to the grammatical format of the prior unit and as requesting an immediate repair of the referential underspecification in this unit. To put it more generally: it is designed for the sequential turn juncture at which it is produced.
What distinguishes such units from other units of talk is their lack of clearly recognizable beginnings as they are formed as a continuation of a structural unit that has already been produced, or adding to it; that is, they derive from the speaker’s cognitive orientation to the kind of structure produced by the prior speaker. Thus, units of talk can be grammatically designed to lack a beginning or ending in terms of macrogrammar (lack of macrogrammatical fields) in order to be maximally coherent to a prior unit. Such units are built on the microgrammatical structure of a prior unit and are themselves microgrammatical if, however, often in reduced (“elliptical”) form (see Section 5.3). Their interpretation requires access to the linear development of discourse since they need to be related to a prior unit and thus rest upon the co-participants’ ability to construct a coherent model of discourse in which separate segments are mentally integrated into a coherent whole.
4.8 Conclusion
This chapter provided an analysis of important time slots in the real-time production of units of talk, so-called fields, at which speakers deal with routine tasks and which may host a specific set of macrogrammatical expressions. From a macrogrammatical point of view, fields emerge in the step-by-step creation of structure in real time and do not represent fixed “positions” available to a priori syntactic planning. They are optional spaces in which speakers may give expression to specific communicative tasks they are dealing with in particular phases of utterance production. The expressions available for use in the different fields cannot occur on their own as they are integrated into the illocutionary force of a unit of talk and provide a processing cue for its interpretation.
The study of macrogrammatical fields has shown that macrogrammatical expressions are everything else than positionally mobile, “floating” units that can be produced at any time in the production of a unit of talk. Based on the communicative tasks they serve and the points in time at which these tasks become relevant in the construction of a unit of talk it is possible to correlate different time slots with the set of expressions that are preferably produced in these slots. The analytic approach to the study of such expressions is strictly deductive, empirically based, and can never be deterministic and based on a priori rules. Macrogrammar is thus a probabilistic cognitive grammar describing the use of macrogrammatical expressions not in terms of (quasi-)obligatoriness, but in terms of abstract schemas of sequentially organized communicative tasks accompanying the creation of linguistic structure in the linear flow of time. Thus, temporal (or serial) regularities determine distributional probabilities.
Under this “emergentist view,” language structure is not only based on predetermined ways of configuring linguistic forms into a structural sequence, based on fixed positions, but “subject to the exigencies of communication” (Hopper Reference Hopper and Tomasello1998: 157). Speakers create structure as they go along in the moment-by-moment configuration of talk whereas listeners, for whom the occurrence of fields is not projectable, need to process structure as “‘mobile observers’, travelling along with the stream of speech and on the spot producing hypotheses of understanding which change and vary with the point the utterance has reached” (Franck Reference Franck and Dascal1985: 238).

