5 On the free choice potential of epistemic and deontic modals
1. Introduction
A number of constructions in various languages display a different behavior in the scope of epistemic and deontic modals. For example, the German indefinite determiner irgendein gives rise to different inferences under the two kinds of modals (Aloni and PortReference Aloni, Port, Fainleib, LaCara and Park2010; Kratzer and ShimoyamaReference Kratzer, Shimoyama and Otsu2002). Furthermore, while the Romanian determiner vreun is licensed under epistemic modals, but not under deontic modals (Fălăuş Reference Fălăuş2009), concessive scalar particles like Slovenian magari are licensed under deontic modals but not under epistemic ones (e.g. Crnič¸, Reference Crnič, Fainleib, LaCara and Park2011b). Although seemingly a diverse set of observations, there is a curious commonality in the proposed explanations for the relevant data: all of the three case studies independently assume that deontic and epistemic modals have different free choice (FC) potential, in the sense that these modals differ in the way that they license so-called FC-inferences. We consider FC-inferences to be inferences of the form in (1b) and (2b) associated with either existentials or disjunctions under modals, as in (1a) and (2a) respectively:
a. ☐∃x : ϕ(x)
b. ◇∃x : ϕ(x)
c. ∀x: ◇ϕ(x)
a. ☐(p1 ∨ p2 ∨ . . . pn )
b. ◇(p1 ∨ p2 ∨ . . . pn )
c. ∀i ∈ {1, 2, . . ., n } : ◇ pi
While epistemic FC-inferences, i.e., FC-inferences associated with epistemic modals, mostly seem to be well-behaved pragmatic inferences outside of compositional semantics, deontic FC-inferences seem able to penetrate into the computation of semantic values much more freely. It is this Modal Variability Hypothesis, as we will call it, that is the main object of scrutiny in this chapter.
We argue that the hypothesized differences in FC-potential are not due to a sharp semantic contrast between modalities, but propose a pragmatic explanation for the Modal Variability Hypothesis. We acknowledge the possibility, explored in great depth by Chierchia (Reference Chierchia2006, to appear a) and others, of (quasi-)pragmatic inferences taking effect “locally” during the composition of semantic meaning, but we still argue for a genuine pragmatic perspective on the distribution of such local pragmatic effects. We suggest that whether a pragmatic inference reliably enters semantics proper is a matter of pragmatic fossilization: since deontic FC-inferences are of a different type of information than epistemic FC-inferences, the former are more readily adaptable to the computation of semantic values.
Our main argument for the pragmatic difference between deontic and epistemic information hinges on the contextual variability of universal FC-inferences, i.e., FC-inferences that apparently occur in the scope of universal quantifiers, as is the case when an utterance of (3a) conveys the strengthened meaning expressed in (3b).
a. All of the boys may go to the beach or to the cinema.
b. ⇒ All of the boys may go to the beach and all of the boys may go to the cinema.
We argue that the availability of universal FC-inferences depends, among other things, on the contextual relevance of the inference in question. Contrasting situations in which deontic and epistemic universal FC-inferences do or do not arise, we argue that deontic FC-inferences are more clearly relevant for practical purposes than their epistemic counterparts. This suggests that deontic information is more readily perceived as “base-level” information about what the actual state of affairs is than epistemic information. In turn, this makes plausible why, from a diachronic perspective, deontic FC would fossilize sooner and/or more readily than epistemic FC.
The proposed intuitive difference in relevance for “base-level” information can be made tangible in dynamic semantics. In standard dynamic accounts of epistemic modality (VeltmanReference Veltman1996; GilliesReference Gillies2004; YalcinReference Yalcin2007), epistemic modals are not eliminative updates, but tests on whether the currently accumulated information supports or is compatible with some piece of further information. In contrast, deontic modal statements that provide directly useful information about practically relevant permissions and obligations are most plausibly treated as base-level informative, on a par with propositional information. We show that the difference between deontic and epistemic FC-inferences clearly shows in a dynamic semantics in the way that the former are persistent, while the latter are not only not persistent but even anti-persistent. Persistent information survives information growth; anti-persistent information survives information loss.
The chapter is structured as follows. After reviewing the relevant data and theories on Romanian vreun, Slovenian magari (and related constructions), and German irgendein in Section 2, we formulate the Modal Variability Hypothesis in Section 2.4. Section 3 discusses but dismisses an obvious semantic explanation of modal variability implemented in an alternative-based framework (AloniReference Aloni2007b; Chierchia to appear a). Building on data concerning universal FC-inferences in Section 4, Section 5 states our pragmatic explanation of modal variability. Section 6, finally, tries to corroborate the informal picture by tracing the underlying intuitions in dynamic semantics. Section 7 concludes with some reflections on the conceptual consequences of our proposal.
2. Data in support of modal variability
In this section, we briefly review the main properties of Romanian vreun (Fălăuş Reference Fălăuş2009, Reference Fălăuş, Li and Lutz2011, Reference Fălăuş2012), Slovenian magari (and related constructions) (CrničReference Crnič, Fainleib, LaCara and Park2011b), and German irgendein (Aloni and PortReference Aloni, Port, Fainleib, LaCara and Park2010; Kratzer and ShimoyamaReference Kratzer, Shimoyama and Otsu2002), focusing on their behavior in modal contexts. We also sketch three of the most recent accounts of these data sets. Interestingly, all of these analyses assume a different FC-potential for deontic and epistemic modals. We therefore formulate the Modal Variability Hypothesis in Section 2.4.
2.1 Romanian vreun
Epistemic indefinites are existential constructions that signal ignorance or indifference on the part of the speaker. Romanian vreun is an example of an epistemic indefinite with a very limited distribution (Fălăuş Reference Fălăuş2009, Reference Fălăuş, Li and Lutz2011, Reference Fălăuş2012): its occurrences are restricted to negative polarity and a subset of modal contexts. We will focus here on the behavior of vreun in epistemic and deontic contexts. As Fălăuş reports, vreun is licensed under epistemic modals, as illustrated in (4), while it is ungrammatical under deontic modals, as illustrated in (5).
(4)
| Trebuie/Poate | să | fie | vreun angajat | care lucrează | până târziu. |
| must/may | subj | be.3sg | vreun employee | who work.3sg | until late |
(5)
| # Trebuie/Pot | să | citesc | vreo | carte până | mâine. |
| must/can | subj | read.1sg | vreun | book by | tomorrow |
An interesting question concerns the type of modal inference vreun sustains in examples like (4). Following Alonso-Ovalle and Menéndez-Benito (Reference Alonso-Ovalle and Menéndez-Benito2010), Fălăus¸ distinguishes between standard FC-inference, which, adopting terminology from Chierchia (to appear a), she labels as total variation, and a weaker partial variation inference:
Using examples like (7), Fălăuş shows that vreun in epistemic contexts merely conveys partial variation effects (Fălăuş Reference Fălăuş, Li and Lutz2011: 418):
(7)
| E | posibil | ca | Irina | să | se | fi | întâlnit | cu | vreun | prieten, | dar | nu | poate |
| be.3sg | possible | that | Irina | refl | subj | be | met | with | vreun | friend, | but | neg | can |
| fi | Luca, | tocmai | l-am | văzut. |
| be | Luca, | just | cl.3sg.masc-have.1sg | seen |
But in addition to this, vreun seems actually to be incompatible with situations in which total variation would hold, as illustrated by the following example (Fălăuş¸ Reference Fălăuş2012: 38):
(8) Shell Game Scenario: The shell game requires three shells or boxes and a small ball. The ball is placed under one of the shells and then the operator quickly shuffles the shells around. In order to win, the player has to correctly identify the shell containing the ball.
(9)
| Mingea | trebuie | să | fie | în | vreo | cutie. |
| ball_the | must | subj | be.3sg | in | vreun | box |
According to Fălăuş (Reference Fălăuş2012), example (9) cannot be used in the scenario described in (8) because the context makes it clear that the ball could be in any of the three shells under consideration. Vreun then seems to express, beside partial variation, also an anti-total variation inference:
(10) Anti-Total variation: ¬∀x◇ϕ
not all alternatives in the relevant domain qualify as a possible option
To explain these facts Fălăuş proposes an analysis couched in the alternative-based approach developed in Chierchia (Reference Chierchia2006, to appear a). Chierchia’s proposal has two main ingredients: (i) all polarity items (including negative polarity items, FC-items, and epistemic indefinites) activate alternatives; (ii) active alternatives require the application of an independently established mechanism of exhaustification (e.g. Fox Reference Fox, Sauerland and Stateva2007a), which otherwise would be optional. Differences between different items are accounted for in terms of variation in (i) the type of alternatives they may activate (e.g. scalar, domain, singleton domain alternatives), and (ii) the mode of exhaustification they employ (see Chierchia to appear a; Fălăuş Reference Fălăuş2009, this volume for details). In this approach, different types of alternatives give rise to different modal inferences: recursive exhaustification with respect to domain alternatives derives total variation effects; recursive exhaustification with respect to singleton domain alternatives derives partial variation effects (Fălăuş Reference Fălăuş2009, Reference Fălăuş, Li and Lutz2011, Reference Fălăuş2012; Alonso-Ovalle and Menéndez-BenitoReference Alonso-Ovalle and Menéndez-Benito2010).
On Fălăuş’ account, vreun activates singleton domain alternatives. Through recursive application of an exhaustivity operator in the style of Chierchia (Reference Chierchia2006) and Fox (Reference Fox, Sauerland and Stateva2007a), this derives partial variation effects under epistemic modals; see (11a). But in addition to this, as we saw, vreun should also encode a ban on total variation, represented in (11b). In Fălăuş¸ (Reference Fălăuş2012), inference (11b) is derived by exhaustification with respect to ‘complex’ alternatives brought about by the competition of vreun with the Romanian epistemic ‘total variation’ indefinite un oarecare (see Fălăuş Reference Fălăuş2012, for details). This ban on total variation explains example (9), but it would also explain why vreun is excluded from deontic sentences under the assumption that existentials under deontic modals give rise to total variation (FC) inferences for independent reasons, as illustrated in (12):1
(11) Epistemic: Trebuie/Poate să fie vreun angajat care lucrează până târziu.
b. Anti-total variation: ¬∀x ◇e ϕ
(12) Deontic: #Trebuie/Pot să citesc vreo carte până mâine.
a. Partial variation: ∃x∃y (◇d ϕ(x) ∧ ◇d ϕ(y) ∧ ¬x=y)
b. Anti-total variation: ¬∀x◇d ϕ
Crucial to this explanation is that total variation (FC) inferences are independently generated under deontic modals, but not under epistemic modals. It is important to stress that a different method from Chierchia/Fox’s recursive exhaustification should be employed here to derive (12c). Indeed, Chierchia/Fox’s exhaustivity operator is blind towards the difference between epistemic and deontic modals. If total variation is derivable via this operator under deontic modals, it will also be derivable under epistemic modals, unless we assume that vreun activates domain alternatives under deontic modals, and singleton domain alternatives under epistemic ones. But why would different sets of alternatives be selected by vreun under different types of modals?
To summarize, under the assumption that vreun activates singleton domain alternatives and ‘complex’ alternatives, Fălăuş¸ derives partial variation and anti-total variation inferences via (recursive) application of Chierchia/Fox’s exhaustification. To derive the systematic exclusion of vreun from deontic sentences one needs the additional assumption that total variation (FC) inferences are independently generated for vreun under deontic modals by a method which is different from Chierchia’s or Fox’s exhaustification. In Section 3 we will propose an explicit implementation of the strategy illustrated in (12) which derives (12c) as an entailment, employing the analysis of deontic modals developed in Aloni (Reference Aloni2007b).
2.2 Concessive scalar particles
Crnič (Reference Crnič, Fainleib, LaCara and Park2011b) uses *magari as a blanket term for concessive scalar particles including Greek esto ke (GiannakidouReference Giannakidou2007), Spanish aunque sea and siquiera (Alonso-OvalleReference Alonso-Ovalle, Cormany, Ito and Lutz2009; LahiriReference Lahiri2010), and Slovenian magari/makr. There are two main kinds of environments in which *magari can occur: in downward entailing contexts, where it is glossed with even, and under priority modals. Priority modals include imperatives, as (13), deontic, bouletic and teleological modals. Crucially, *magari is reported not to be licensed under epistemic modals (Crnič Reference Crnič, Fainleib, LaCara and Park2011b: 4).
(13)
| Preberi | magari | Sintaktične | Structure! |
| read.imp | magari | Syntactic | Structures |
According to Crnič, *magari spells out two operators that he calls even and at least,2 defined as follows (≤c stands for at most as likely as, and Heim and Kratzer’s (Reference Heim and Kratzer1998) notation for presupposition is assumed):
(14) [[even]]g,c = λC . λ p : ∀q ∈ C [¬( p = q) → p <c q]. λw . p(w)
(15) [[at least]]g,c = λC . λ p . λw . ∃q ∈ C [q ≤c p ∧ ( p(w) ∨ q(w))]
For example, assuming that C consists of the propositions that Peter won bronze, that he won silver and that he won gold,, i.e., the focus-semantic value of the prejacent:
(16) [[[even C] Peter won goldF ]]g,,c (w) is defined only if it is least likely that Peter won gold. If defined, it is true iff Peter won gold in w.
(17) [[[at least C] Peter won bronzeF ]]g,,c (w) is true iff Peter won bronze or silver or gold in w.
The distribution of *magari is regulated by the inferences that these two operators generate. As illustrated in (18), in positive episodic environments, *magari is predicted to be out because it produces contradictory presuppositions: in (18f), (bronze ∨ silver ∨ gold) cannot be less likely than the other alternatives in C0, because the latter asymmetrically entail the former.
a. # Peter won *magari a bronzeF medal.
b. # [ZP [even C’ [XP [at least C] Peter won a bronzeF medal]]
c. [[C]]g,c = {bronze, silver, gold}
d. [[XP]]g,c = (bronze ∨ silver ∨ gold)
e. [[C’]]g,c = {bronze ∨ silver ∨ gold, silver ∨ gold, gold}
f. [[ZP]]g,c (w) is defined only if
(bronze ∨ silver ∨ gold) <c (silver ∨ gold), gold.
If defined, it is true iff Peter won bronze or silver or gold in w.
In deontic modal sentences like (19), the modal can crucially intervene between even and at least. Assuming that deontic modals when applying to an existential sentence return a strengthened FC-meaning, see (19d), the presupposition of even, fleshed out in (19f) can now be satisfied: since no entailment relation obtains between the relata, the presupposition is consistent and, as Crnič observes, it is also plausible: that you are allowed to win an unremarkable bronze medal (and silver and gold) can be less likely than that you are required to win some shinier medal (silver or gold, just gold).
a. You must win *magari a bronzeF medal.
b. [ZP [even C′][ XP ☐d [[at least C] you win bronzeF]]]
c. [[ C]]g,c = {bronze, silver, gold}
d. [[XP]]g,c = ☐d (bronze ∨ silver ∨ gold) ∧ (◇d bronze ∧◇d silver ∧◇d gold)
e. [[ C′]]g,c = {☐d (bronze ∨ silver ∨ gold) ∧ (◇d bronze ∧◇d silver ∧◇d gold)},
☐d (silver ∨ gold) ∧ (◇d silver ∧◇d gold), (☐d gold∧◇d gold)]
f. [[ZP]]g,c (w) is defined only if
☐d (bronze ∨ silver ∨ gold) ∧ (◇d bronze ∧◇d silver∧◇d gold) <c
☐d (silver ∨ gold) ∧ (◇d silver ∧◇d gold), ☐d gold ∧ ◇d gold
If defined, it is true iff you must win bronze or silver or gold in w.
To derive FC-effects with priority modals, Crnič (Reference Crnič, Fainleib, LaCara and Park2011b) adopts the mechanism developed in Aloni (Reference Aloni2007b), but he observes that any mechanism which generates FC-effects in the grammar could be employed as well.3 Crucial to this explanation of the distribution of *magari is again the assumption that deontic modals and epistemic modals have a different FC-potential.4 We have to derive the tendency that *magari occurs solely under priority modals. If FC-effects were generated in the grammar also with epistemic modals, we would miss this generalization.
To summarize, on Crnič’s (Reference Crnič2011) account *magari is correctly predicted to be excluded from positive episodic sentences because it produces a contradictory presupposition. In a deontic sentence like (19), if embeddable FC-inferences are generated, no contradictory presupposition is produced and *magari is predicted to be grammatical. To explain the systematic exclusion of *magari from epistemic contexts, we have to assume that potentially rescuing FC-inferences are not generated by the grammar under epistemic modals (unless we want to rely on at least dubious alternative explanations in terms of implausible presupposition).
2.3 German irgendein
The German epistemic determiner irgendein is felicitous in positive episodic contexts where it gives rise to an ignorance effect (epistemic partial variation), but it can also be used in downward-entailing contexts where it expresses a plain narrow scope existential meaning (Aloni and PortReference Aloni, Port, Fainleib, LaCara and Park2010; Kratzer and ShimoyamaReference Kratzer, Shimoyama and Otsu2002):
Quite puzzling is the different behavior irgendein displays under epistemic and deontic modals:
(20)
| Irgendein | Student | hat | angerufen, | (#nämlich Peter). |
| irgend_one | student | has | called, | (#namely Peter) |
(21)
| Niemand | hat | irgendeine | Frage | beantwortet. |
| nobody | has | irgend_one | question | answered |
Quite puzzling is the different behavior irgendein displays under epistemic and deontic modals:
a. Epistemic: ☐e (. . . irgend . . . ) ⇒ partial variation
Under epistemic modals, irgendein gives rise to a partial variation inference: Aloni and Port (Reference Aloni, Port, Fainleib, LaCara and Park2010) observed that example (24) can be used in the HideAnd Seek Scenario described in (23), where a statement like (25), employing a genuine FC-item which induces total variation, would be inappropriate (see also LauerReference Lauer2010 for similar observations).
(23) Hide and Seek Scenario: María, Juan, and Pedro are playing hide-and-seek in their country house. Juan is hiding. María, and Pedro haven’t started looking for Juan yet. Pedro believes that Juan is not hiding in the garden or in the barn: he is sure that Juan is inside the house. Furthermore, Pedro is sure that Juan is not in the bathroom or in the kitchen. As far as he knows, Juan could be in any of the other rooms in the house. (Alonso-Ovalle and Menéndez-BenitoReference Alonso-Ovalle and Menéndez-Benito2010: 6)
(24)
| Juan muss in | irgendeinem | Zimmer | im | Haus | sein. |
| Juan must in | irgend_one | room | in_the | house | be |
(25) Juan might be in any room of the house.
Under deontic modals, instead, irgendein can give rise to a total variation inference, as illustrated by example (26) from Kratzer and Shimoyama (Reference Kratzer, Shimoyama and Otsu2002). The example is ambiguous between a wide scope epistemic interpretation for the indefinite represented in (26a) and a lower scope FC-interpretation represented in (26b).5
(26)
| Mary musste | irgendeinen | Mann | heiraten. |
| Mary had_to | irgend_one | man | marry |
a. ‘There was some man Mary had to marry. The speaker doesn’t know who it was.’
b. ‘Mary had to marry a man; any man was a permitted marriage option for her.’
Aloni and Port (Reference Aloni, Port, Fainleib, LaCara and Park2010) analyze epistemic indefinites as existentials with two additional characteristics: (i) they induce an obligatory domain shift; and (ii) they are licensed only if such a shift is for a reason. Differences between different epistemic indefinites can be captured in terms of the different kinds of domain shift they can induce. German irgendein is assumed to be able to shift the domain of quantification in two different ways: it can either shift method of identification (conceptual-cover shift, henceforth, CC-shift), or it can widen the domain (domain widening, henceforth DW). CC-shifts are justified only if otherwise, i.e., with respect to the old (default) method of identification, the speaker would not have been able to identify the witness of the existential claim. DW is justified only if it does not create a weaker statement.
The operation of DW is well-known since Kadmon and Landman (Reference Kadmon and Landman1993). The intuition behind the notion of a CC-shift is best illustrated by an example. Assume you know that Macky Sall is the new president of Senegal, but you have never seen the man and therefore you would not be able to point him out. Consider now the sentence “You know who the president of Senegal is” used in the following two contexts: (a) during an exam on African politics; (b) at a party with many African leaders where you need to find Macky Sall for an interview. Intuitively, in context (a) the sentence would be judged true: you know that Macky Sall is the president of Senegal, so you know who the president of Senegal is. In context (b), instead, the sentence would be judged false: as far as you know, this person could be the president of Senegal or that person over there, so you don’t know who the president of Senegal is. Individuals, like Macky Sall, can be identified in various ways: by name, by ostension, or by description. Our evaluation of knowledge attributions seems then to depend on what identification method is at play in the context of use. In context (a) where identification by name is sufficient, the sentence is true; in context (b), where identification by ostension is required, the sentence is false. Aloni (Reference Aloni2001) accounted for these facts by formalizing identification methods in terms of conceptual covers (sets of individual concepts satisfying certain constraints) and assuming that (i) quantificational expressions quantify over conceptual covers, rather than over sets of individuals simpliciter, and (ii) different covers can be selected as domain in different contexts.
The main intuition behind Aloni and Port (Reference Aloni, Port, Fainleib, LaCara and Park2010) is that referents of epistemic indefinites like irgend-indefinites are typically identified via a method different from the one required for knowledge. The notion of a CC-shift is the technical counterpart of this intuition. Suppose m is the cover representing the identification method contextually required for knowledge. Then irgend-indefinites, at least in their specific uses, signal an obligatory shift to a cover n different from m, i.e., they existentially quantify over a cover which represents a method of identification which is not the one at play in the relevant context. For example, in context (b) above, a specific use of an irgend-indefinite would signal a shift to a method of identification, say n, different from ostension, and a sentence of the form ϕ(irgend-indefinite) such as (27) in its specific reading, analyzed as an existential sentence in a dynamic semantics, would say that there is an individual d, identified by method n, which satisfies ϕ.
(27)
| Ich | muss | irgendeinen | (bestimmten) | Mann | interviewen. |
| I | have_to | irgend_one | (certain) | man | interview |
If the witness of the existential claim were also identifiable by ostension, such a CC-shift would be vacuous in this context. By assuming that only non-vacuous CC-shifts are justified, Aloni and Port (Reference Aloni, Port, Fainleib, LaCara and Park2010) derive cover-dependent partial variation effects for specific uses of irgendein, namely that the witness of the existential claim cannot be identified by the identification method at play, e.g., in context (b), ostension. This accounts for examples (20) and (26a). Partial variation effects of irgendein under epistemic modals, as in (24), are explained by CC-shift in a similar fashion (see Aloni and PortReference Aloni, Port, Fainleib, LaCara and Park2010, for details). Occurrences in downward-entailing environments, like (21), are explained by DW.
Deontic FC-uses of irgendein, as in (26b), constitute a potential problem for the approach. Under a standard analysis of deontic modals, neither CC-shift nor DW is justified; the latter fact is shown in (28a). So Aloni and Port’s analysis wrongly predicts that irgendein is infelicitous under deontic modals. Assume now a grammar that generates (total variation) FC-effects for existentials under deontic modals. Then, extending the domain of an existential under a modal no longer leads to a weaker statement if we incorporate its universal FC-inference as in (28b):
| ☐∃xφ |= ☐∃xDW φ | without FC-inference |
| ☐∃xφ ∧ ∀x ◇φ ⊭☐∃xDW φ ∧ ∀xDW ◇φ | with FC-inference |
DW would then be justified in this case, irgendein would be correctly predicted to have deontic FC-uses. Again for this analysis to work, (total variation) FC-inferences should be automatically generated with deontic modals, but not with epistemic modals, otherwise the felicity of (24) in the Hideand Seek Scenario would remain unexplained.
To summarize, in Aloni and Port (Reference Aloni, Port, Fainleib, LaCara and Park2010) German irgendein is an existential which triggers an obligatory domain shift (either CC-shift or DW), which has to come for a reason. Ignorance uses of irgendein (specific or under epistemic modals) are explained by CC-shift, occurrences in downward-entailing environments by DW. Deontic FC-uses can be also explained by DW, but only if FC-inferences are incorporated. Again, FC-effects should be independently generated to rescue irgendein under deontic modals, but not under epistemic modals.
2.4 Modal variability
In summary, this section presented three case studies that independently of each other provided indirect evidence for the hypothesis in (29) below:
(29) Modal Variability Hypothesis
Deontic and epistemic modals have a different FC-potential. In particular, deontic FC seems to enter into the recursive computation of compositional semantic values, whereas epistemic FC does not.
Of course, the evidence in favor of (29) is only indirect: it is the assumption of (29) that helps explain the intricate modal data from three different languages. But in that case, it is worthwhile to inquire why exactly (29) should hold.
We suggest here that deontic and epistemic modality differ in the kind of information they provide. The former is able to penetrate into the compositional computation of semantic values because it conveys information of the right type more reliably than epistemic modality does. This is not as aberrant and arbitrary a proposal as it might sound. Evidence comes from the availability of FC-inferences in embedded contexts, in particular from FC-inferences scoping under universal quantifiers, so called universal FC-inferences, which we present in Section 4. Interestingly, the difference we argue for can be formalized rather nicely in dynamic semantics, where indeed epistemic and deontic modals are often treated differently. To flesh out our proposal in a formal way, we therefore present a toy fragment of a dynamic semantics for both modalities in Section 6 where we trace the intuitive pragmatic differences that we propose is relevant for (29) to a notion of persistence of the FC-inference. But before doing so, Section 3 attempts to formulate a straightforward explanation for (29) in terms of a formal/sortal difference between deontic and epistemic modals, and identifies the problems faced by this approach.
3. Semantic explanations for modal variability
A natural way to implement the Modal Variability Hypothesis assumes a formal/sortal difference between deontic and epistemic modals: FC-inferences are derived as semantic entailments for disjunctions and existentials under the former, but not under the latter. We are not aware of any such account in the literature, so we attempt to develop one that is as viable as possible. This hypothetical analysis adopts Chierchia’s (to appear a) framework of polarity and free choice and attempts to accommodate the phenomena we discussed in Sections 2.1 and 2.3, namely the different behavior of vreunand irgendein under deontic and epistemic modals, by employing, within Chierchia’s framework, an explicit semantic analysis of deontic FC-inferences along the lines of Aloni (Reference Aloni2007b).
The main motivation for adopting Aloni’s (Reference Aloni2007b) analysis comes from the following considerations. A first potential difficulty for an entailment account of deontic FC-inferences is that it would predict total variation effects for all indefinites under deontic modals, even unmarked ones as in “John may/must marry someone,” which clearly does not entail that any person is a permissible marriage option for John. Aloni’s (Reference Aloni2007b) analysis provides a potential way out of this difficulty by allowing two different representations for indefinites – an alternative-inducing and a flat representation – and by letting modals be sensitive to the alternatives generated in their scope. Only alternative-inducing indefinites give rise to free choice entailments on this account. Unmarked indefinites can then be required to adopt flat representations.
According to Aloni (Reference Aloni2007b), alternative-inducing representations of indefinites generate a genuine set of propositional alternatives, but not flat representations:
(30) Alternative inducing representation:
(31) Flat representation:
Possibility modals and imperatives (but nothing prevents us from extending this analysis to necessity modals as well)6 are assumed to entail that all alternatives generated in their scope should be compatible with the relevant modal base:
(32) Building on Aloni (Reference Aloni2007b):
a. ◇ϕ is true in w iff every alternative induced by ϕ is compatible with the set of accessible worlds λv . wRv;
b. ☐ϕ is true in w iff every alternative induced by ϕ is compatible with the set of accessible worlds λv . wRv and at least one alternative induced by ϕ is entailed by λv . wRv.
On this analysis of modality, only alternative-inducing indefinites give rise to FC-inferences in modal environments. Unmarked indefinites like someone can be then required to employ flat representations. Marked indefinites like irgendein and vreun instead should use alternative-inducing representations. FC-effects would be generated solely for the latter.
Let’s adopt the analysis of modality in (32), but solely for priority modals. Since FC-inferences would be generated under priority modals but not under epistemic modals, the distribution of concessive scalar particles would be explained along the lines of Crnič (Reference Crnič, Fainleib, LaCara and Park2011b). As for epistemic indefinites, adopting Chierchia’s (to appear a) alternative-based account, we could assume that both irgendein and vreun generate singleton domain alternatives as in (30b). Under epistemic modals, via recursive exhaustification in the vein of Chierchia or Fox we can derive partial variation effects, as in example (11) from Fălăuş¸. Under deontic modals, total variation effects are independently derived because of (32). Vreun would then be predicted to be out under deontic modals, because of a clash between the deontic FC-entailment and the lexically encoded ban on total variation, as in example (12). The different behavior of irgendein under deontic and epistemic modals would also be accounted for: semantic total variation would be derived under the former (via (32)), and obligatory pragmatic partial variation effects would be generated under the latter (via recursive exhaustification). There are however a number of problems for this strategy.
First of all there are other examples of marked indefinites for which we want to derive partial variation under epistemic modals, but, as it seems, no total variation under deontic or other priority modals, like Spanish algún or Italian un qualche. Let us illustrate these facts for the Italian case. The compatibility of (33) with the HideAnd Seek Scenario in (23) shows that in epistemic contexts un qualche only generates partial variation effects (Aloni and PortReference Aloni, Port, Fainleib, LaCara and Park2010):
(33)
| Juan deve | essere | in una | qualche | stanza | della | casa. |
| Juan must | be | in un | qualche | room | of_the | house |
That un qualche does not give rise to total variation effects under priority modals is demonstrated by the following example from the web, where the continuation explicitly specifies that not any kind of basic skill is enough:
(34)
| Per | diventare | traduttore | devi | avere | un qualche | tipo | di | base. | Di | sicuro |
| to | become | translator | must.2sg | have | un qualche | kind | of | basic_skill. | Of | sure |
| devi | saper | leggere | e | in | alcuni | casi | devi | anche | sapere | scrivere. |
| must.2sg | know | read | and | in | some | cases | must.2sg | also | know | write. |
Why do these examples constitute a problem for the strategy we are investigating in this section? To generate epistemic partial variation in Chierchia’s approach we need to assume that un qualche activates singleton alternatives, but then, given (32), total variation under priority modals would be automatically generated as well (unless we assume that in Italian priority modals behave differently than in Romanian and German, which seems quite implausible).
A second more conceptual problem of this implementation is that there is no reason why the generation of deontic FC-inferences formalized in (32) should not automatically extend to epistemic modals. The difference in FC-potential between epistemic and deontic modals would be stipulated. Why would only the latter be sensitive to the alternatives generated in their scope?7
A final and quite general problem with a semantic explanation of (29) is that the to-be-explained difference in FC-potential does not seem to be a black-and-white matter, but rather calls for a more nuanced picture of why only some FC-inferences penetrate into compositional semantics. This is the topic of the following section.
4. Contextual-variability of universal free choice
In order to come to grips with the variable FC-potential of deontic and epistemic modals, this section looks more closely at the behavior of disjunction in the scope of these modals. FC-inference can arise for both deontic and epistemic modals if they embed a disjunction, as witnessed by the following examples:
a. You may go to the beach or to the cinema.
b. ⇒ You may go to the beach and you may go to the cinema.
a. Mr. X might be in Victoria or in Brixton.
b. ⇒ Mr. X might be in Victoria and Mr. X might be in Brixton.
But also other modalities, e.g., ability modals, can give rise to FC-inferences:
a. Hans can play the violin or the trombone.
b. ⇒ Hans can play the violin and Hans can play the trombone.
Moreover, as observed by Klinedinst (Reference Klinedinst2006), Eckardt (Reference Eckardt, Sauerland and Stateva2007), and Fox (Reference Fox, Sauerland and Stateva2007), FC-like inferences may even arise for disjunctions when embedded under non-modal existential constructions (38).
a. Some of the guests had burgers or fruit salads.
b. ⇒ Some of the guests had burgers and some of the guests had fruit salads.
Under a standard semantic analysis of the components involved, all of these putative FC-inferences would not be considered semantic entailments. This could mean that these inferences are pragmatic inferences, but it could also mean that the standard semantics is wrong. The main argument why these inferences are probably not semantic entailments comes from the observation that FC-inferences do not seem to arise in all downward-entailing environments, such as in the scope of quantifier none:
a. None of the boys may go to the beach or to the cinema.
b. ⇒ All of the boys are not permitted to go to either.
c. ⇏All of the boys are permitted one option, but none is free to choose.
a. None of the agents might be in Victoria or in Brixton.
b. ⇒ All of the agents must be somewhere other than Victoria or Brixton.
c. ⇏For all agents x, either (i) it is conceivable that x is in Victoria and it is certain that x is not in Brixton, or (ii) it is conceivable that x is in Brixton and it is certain that x is not in Victoria.
So far the picture seems quite homogeneous, with different kinds of modals (and even non-modal constructions) giving rise to FC-inferences, with some evidence that these are pragmatic entailments. But the impression of peaceful homogeneity is spoiled, when we look at how readily FC-inferences associated with disjunctions and different modals embed. Indeed, in spite of the data in (39) and (40), there is not a total ban on FC-inferences possibly occurring in the scope of other logical operators. It has been observed, for instance, that FC-inferences associated with disjunction under deontic modals apparently can take scope under universal quantifiers, so-called universal free choice (henceforth: UFC) (cf. ChemlaReference Chemla2009b, for empirical evidence):
a. All of the boys may go to the beach or to the cinema.
b. ⇒ All of the boys may go to the beach and all of the boys may go to the cinema.
For our purposes here it is important to notice that UFC does not arise as readily for other sorts of modals. More concretely, the apparent strength of UFC seems to depend on the kind of modality. This was suggested first by Geurts and Pouscoulous (Reference Geurts and Pouscoulous2009) based on introspective judgments, but is also backed-up by empirical data (Tiel Reference Tiel2011). The study of Tiel provides clear evidence that deontic UFC, as in (41) is much more readily endorsed by the participants than epistemic UFC, as in (42).
a. According to the professor, every research question might be answered by a survey or an experiment.
b. ⇒ According to the professor, every research question might be answered by a survey, and, according to the professor, every research question might be answered by an experiment.
On a coarse-grained picture, the different UFC-potential of deontic and epistemic modals seems to support the Modal Variability Hypothesis in (29): deontic FC-inferences infiltrate compositional semantics, epistemic FC-inferences do not, or at least not to the same degree. The problem is that this coarsed-grained picture is too coarse-grained. As soon as we zoom in, we realize that there is a devil in the quantitative details. The problem is that the Modal Variability Hypothesis in (29) would rather lead us to expect that while deontic UFC is readily attested, epistemic UFC should be impossible, or at least very inaccessible. But that is not supported by Tiel’s data: epistemic UFC is attested, but much less prominent than deontic UFC. In other words, although (29) does square well with the qualitative difference in UFC-potential of deontic and epistemic modals, it does not square well with the more nuanced quantitative picture.
Of course, there can be many factors figuring into the explanation why epistemic UFC is less readily attested. We could think that epistemic FC is more computationally costly than deontic FC (irrespective of which system computes these inferences), or less available for other reasons – reasons that lie outside of formal semantics proper. In essence, this is exactly the line of explanation that we would like to explore here. We would like to suggest for consideration the idea that the strength of an FC-inference is relative to the relevance of the inferred information.8, 9 To demonstrate this, we consider sentences that could give rise to UFC-inferences in two contexts each: one which makes the UFC-inference relevant for practical purposes and one which does not (cf. Malamud forthcoming, for a similar approach to the interpretation of plural definites). Our main argument is that contexts of the former type are not equally natural for different kinds of modals, suggesting that the availability of embedded FC-inferences depends on pragmatic factors of relevance of information in context.
Let’s consider a “neutral” case first, namely the UFC-potential of ability modals. The study of Tiel attests only very little UFC-potential to ability modals. And, indeed, it is also intuitive that the sentence in (43a) does not readily invite the UFC-inference in (43b).
a. Everybody at the ILLC can play the violin or the trombone.
b. ⇒? Everybody at the ILLC can play the violin, and everybody at the ILLC can play the trombone.
However, the availability of (43b) as an inference from (43a) seems to depend crucially on the context of utterance and, in particular, on the relevance of the information in (43b). To see this consider two scenarios:
(44) Scenario 1: After many of the established musicians had to cancel on short notice due to a flu epidemic, your task is to re-assemble an improvised university orchestra. The dean has given you permission to recruit and assign to instruments whoever you like. However, it is essential for the quality of the performance that you do not recruit people for instruments that they cannot play. You know that we know our fellow colleagues (and we know that you know . . .). Time is short and the only piece of information that you get from us is (43a). After that you go pick arbitrary members of the ILLC (possibly none) and assign them to instruments.
(45) Scenario 2: You claim that researchers at the ILLC, though certainly capable logicians, are lacking in musical talent, as nobody is able to play an instrument. We rebut your statement using (43a).
When uttered in a context like (45), the sentence (43a) clearly does not convey the strengthened meaning in (43b). In contrast, when uttered in a context like (44), it seems to us that (43a) does convey (43b). It seems to us that, when you get the information in (43a) in context (44), it would be justified for you to assign to either the violin or the trombone any member of the ILLC you choose. If we had not meant you to do that, we should have given you more specific information. In other words, the UFC-inference in (43b) is available in (44) but not (45) because the question whether the “ability predicate”
(46) λ x λ y . x is able to play instrument y
holds true of any arbitrary member of the ILLC and any arbitrary relevant instrument is of acute practical relevance in context (44), but not in context (45).
We suggest that the availability of UFC-inferences under deontic and epistemic modals similarly depends on matters of contextual relevance. Take the more critical case of epistemic UFC. We claim that whether an utterance of (47a) triggers the UFC-inference in (47b) depends on context, similarly to the previous case of ability modals.
a. According to Inspector Clouseau, everybody at the ILLC might have strangled Prof. X or shot Prof. Y.
b. ⇒? According to Inspector Clouseau, everybody at the ILLC might have strangled Prof. X, and, according to Inspector Clouseau, everybody at the ILLC might have shot Prof. Y.
To test this intuition, consider again two scenarios:
(48) Scenario 1: There has been a triple murder at the university. Professors X, Y, and Z have been strangled, shot, and poisoned respectively. Each murder occurred at a different time. (Chief!) Inspector Clouseau is present to investigate. Going somewhat out of his way, Clouseau sticks to antiquated method, the Art of Deduction, and considers any university employee a suspect for a particular murder who has no alibi for the time of that murder. Clouseau has assigned three helpers to lead interviews, one for each case. Your job is to send each employee of the university home or to one of the three interviewers concerned with the case for which the person is a suspect. This is the first round of interrogation, so the only thing that really matters is that nobody is assigned to an interview if she is not a suspect for that case. Again time is short and the only information you get from Clouseau’s assistant, Cato Fong, is (47a). After that your task is to schedule everybody from the ILLC for interviewing or to send us home.
(49) Scenario 2: We demand that everybody at the ILLC is obviously innocent, but Cato Fong dismisses our demand by uttering (47a).
In context (49), the UFC-inference in (47b) clearly does not heavily invite itself, because it is not relevant in this context. But in the context (48), where it is relevant for your practical choice of action whether the “epistemic relation”
(50) λ x λ y . Clouseau considers it possible that x is the murderer of y
is true of each member of the ILLC and each victim in question, the UFC-inference does seem to arise. Of course, the example is quite contrived and set up deliberately so that Clouseau’s epistemic state about whether x is a suspect for case y matters for practical purposes. But the artificiality of this context is exactly what we are interested in. We hypothesize that, in general, it takes rather exceptional and involved situations in which an epistemic UFC-inference would arise because “epistemic relations” like (50) are seldom practically relevant.
This is different for deontic UFC in context. Let’s consider the question whether (51a) gives rise to the UFC-inference in (51b).
a. Everybody at the ILLC is entitled to a pay raise or a research assistant.
b. ⇒? Everybody at the ILLC is free to choose between pay raise and assistant.
Parallel to the previous cases, here are two scenarios, one which makes the UFC inference practically relevant, and the other one which does not.
(52) Scenario 1: It is your task to make sure that none of the employees of the university lays claims on bonuses that she is not entitled to. When it comes to the ILLC, the dean tells you (51a).
(53) Scenario 2: A research institute receives the predicate “distinguished” if all of its members are entitled to at least one bonus from the university’s special bonus programme. You claim that the ILLC is not distinguished, we correct you stating (51a).
According to our intuitions, the UFC-inference in (51b) clearly arises in context (52), and not, or at least clearly less strongly, in context (53). The context (52) is one where it is a relevant issue whether the “deontic predicate”
(54) λ x λ y . x is entitled to bonus y
is true of all relevant x and y. What is interesting for our argument is that the context (52) does not at all seem very contrived or artificially set up. If at all, then it’s rather (53) that appears less natural or common.
Of course, having provided only one pair of contexts for each case, we extrapolate from a weak inductive basis here, but we still feel that there is a general difference in the naturalness of contexts in which UFC-inferences for different types of modals would be (practically) relevant. We suggest that this is due to a pragmatic circumstance: whether information expressed by modalized predicates like (46), (50), or (54) are frequently relevant for concrete practical purposes.
5. Modal variability from pragmatic relevance
If the picture we just sketched is correct, then a pragmatic explanation for the Modal Variability Hypothesis pops into view, especially when we consider a diachronic perspective: if putative FC-inferences of different modal flavors differ in the frequency or strength of their contextual relevance, and so are available more or less frequently/strongly, then, over time, this will lead to differences in the extent to which these inferences could infiltrate compositional semantics. Essentially, this is a direct application of the well-established notion of pragmatic fossilization that yesterday’s pragmatics will gradually become tomorrow’s semantics.10
More concretely, suppose a community of language users has available two kinds of constructions, type-1 and type-2. Type-1 constructions are able to give rise to deontic FC-inferences; type-2 to epistemic FC-inferences. Suppose further that these FC-inferences are originally entirely pragmatic and do not enter the recursive computation of semantic values at all. But now let’s also assume, as we tried to make plausible above, that deontic FC-inferences are more readily available than epistemic FC-inferences if we look at a random sample of conversational contexts in which either type of construction is used. In that case, if pragmatics fossilizes, then it must be that deontic FC-inferences, simply due to a slight advance in contextual prominence, would be more prone to enter the compositional computation of semantics values. As the differences in informational relevance are presumably universal, the hypothesized asymmetry in the pattern of fossilization would likewise presumably be universal.11 Consequently, the Modal Variability Hypothesis might be a true empirical generalization at the current stage of language evolution, not because of a strong sortal difference between modalities, but because of a tendentious pragmatic difference that is operative on a long time scale.
Essentially, this diachronic view endorses the general possibility of “local implicatures” in the spirit of Chierchia (Reference Chierchia2006, to appear a, to appear b) and Chierchia et al. (Reference Chierchia, Fox, Spector, Maienborn, von Heusinger and Portner2012), but also adds its own particular flavor to the overall picture. Chierchia’s approach to “grammaticalized implicatures” has obvious empirical advantages over purely pragmatic approaches when it comes to positive fit to the empirical data: allowing pragmatic inferences to systematically intrude into compositional semantics yields more fine-grained, but also seemingly better empirical predictions in many domains of application than standard “globalist” accounts of pragmatic inferences (cf. Chierchia et al. Reference Chierchia, Fox, Spector, Maienborn, von Heusinger and Portner2012; SauerlandReference Sauerland2012, for recent arguments). But allowing implicatures into compositional semantics runs the risk of generating problems with negative fit: the generating power of potential “local implicatures” is overwhelming and not borne out empirically. For instance, lacking special intonation that might trigger alternative pragmatic processes, FC-inferences do not arise in the scope of none as in (39). To cope with issues of “negative fit,” usually some variant of the strongest meaning hypothesis of Dalrympleet al. (Reference Dalrymple, Kanazawa, Kim, Mchombo and Peters1998) is employed (cf. Chierchia to appear b; Chierchia et al. Reference Chierchia, Fox, Spector, Maienborn, von Heusinger and Portner2012; Sauerland Reference Sauerland2012). Roughly speaking, the strongest meaning hypothesis states that if two alternative semantic interpretations are available, then ceteris paribus the semantically stronger one is to be preferred. Unfortunately, the case of modal variability cannot be explained by appeal to a strongest meaning hypothesis. Adding FC-inference to an existential modal sentence always creates a stronger meaning irrespective of the nature of the modal. Hence, the strongest meaning hypothesis cannot explain any differences in FC-potential. Consequently, the pattern of available local pragmatic effects must be explained in a different way. This is exactly where our present proposal adds to the picture. The proposed diachronic perspective vindicates the general possibility of local pragmatic effects, but both adds a general explanation for patterns of distribution of local pragmatic effects, and also yields, not a black-or-white picture, but the more nuanced quantitative perspective that, arguably, the empirical data demand.
To make these ideas at least somewhat more precise, we would like to conclude this book by sketching the outlines of a framework, couched in dynamic semantics, that shows how deontic, but not epistemic FC-inferences could be treated as embeddable pragmatic effects, because of a difference in the usual base-level relevance of the information conveyed by these modals in discourse (cf. Aloni Reference Aloni, Guevara, Chernilovskaya and Nouwen2012).
6. Deontic and epistemic FC in discourse
To account for the different discourse behavior of deontic and epistemic FC, we will adopt a dynamic account of epistemic modality (GilliesReference Gillies2004; VeltmanReference Veltman1996; YalcinReference Yalcin2007) combined with a more classical approach to deontic modality and propose a genuinely dynamic mechanism to generate and incorporate pragmatic inferences which is sensitive to the differences between the two modalities. We will assume that FC-inferences of a given form ϕ are implicatures that can be captured by specifying a set of states in which ϕ would optimally or rationally be uttered by a speaker (given the usual assumptions of cooperativity, mutual recognition of relevance of information etc.). Aloni (Reference Aloni, ten Cate and Zeevat2007a) and Franke (Reference Franke2011), among others, both spell out systems that give us exactly that, and we will stay close to these accounts, but nothing hinges on specifics here. Also, our exposition here focuses on FC-inferences derived from disjunction, but this too is for convenience only. The crucial bit is the integration of the information provided by deontic and epistemic FC-inferences into the discourse: only deontic FC-inferences are persistent and therefore can be fully integrated (via Aloni’s (Reference Aloni, Guevara, Chernilovskaya and Nouwen2012) +I-operator) into the dynamic process of interpretation in a non-trivial way. The reason why deontic and epistemic information is treated differently in dynamics semantics is, we suggest, exactly the difference in practical relevance of information of that kind, as argued above in Section 4.
6.1 Modals in dynamic semantics
In dynamic semantics (ChierchiaReference Chierchia1995; HeimReference Heim, Bäuerle, Schwarze and von Stechow1983a; Kamp and ReyleReference Kamp and Reyle1993), meanings are identified with context change potentials.12 In the simplest case, an information state is a set of possible worlds that captures different ways the world could be, according to the mutually accepted information of the discourse so far. The meaning of a sentence (represented by a logical formula) is captured by defining its context change potential as a function that specifies how that sentence would transform any given information state into another information state. In the simplest case of propositional information uptake, a sentence like “it is raining” would simply rule out all worlds in the input state where it is not raining.
Crucial for our purposes is the analysis of epistemic and deontic modals. Following standard treatments (GilliesReference Gillies2004; VeltmanReference Veltman1996; YalcinReference Yalcin2007), epistemic modals are evaluated as tests on the input state. Essentially, an epistemic modal statement, be it universal ☐eϕ or existential ◇eϕ, tests whether the current information information state σ contains the information in ϕ already (for ☐eϕ) or whether σ is compatible with the information in ϕ (for ◇eϕ). This is because an information state as such is already an epistemic notion: it contains the possibilities currently not ruled out by accepted factual information. In contrast to epistemic modals, the most straightforward evaluation of deontic modals is not as tests but as informative updates that can also rule out some but not all possibilities from an input information state.13 In that case, deontic statements, ☐dϕ and ◇dϕ, receive a classical interpretation: ☐dϕ discards a possible world w from an input state if ϕ is false in some world deontically accessible from w; ◇dϕ discards a possible world w from an input state if ϕ is false in all worlds deontically accessible from w. This implementation is reasonable because deontic information is not on a par with epistemic information states, but rather what the latter are about. In rough but accessible terms, the difference between modals in dynamic semantics comes down to this: deontic information is what information states are about, epistemic information is what information states are.
Of course, information states could be defined to be about epistemic information as well. In this case we would essentially lift the notion of an information state to a higher-order object. The point of interest here is not that this cannot or should not be done. The point is that the most straightforward way of treating modals in dynamic semantics neatly formalizes exactly the hypothesis that we worked out previously, namely that normally in discourse deontic information and epistemic information are not relevant in the same way.
6.2 Implicatures in dynamic semantics
The difference in dynamic impact of deontic and epistemic modals will naturally also show in the impact of deontic and epistemic FC-implicatures. In order to see this, it is necessary, firstly, to spell out the FC-implicatures in question in a format suitable for integration into a simple dynamic framework and, secondly, to define a mechanism for the dynamic uptake of implicatures.
In order to be able to merge implicated information with information states in dynamics semantics, we consider the implicatures of an utterance of ϕ as what is supported, in the dynamic sense (see Appendix 5A), by any optimal information state for ϕ. Recently a number of algorithms have been proposed to compute optimal states suitable for our present purposes (e.g., AloniReference Aloni, ten Cate and Zeevat2007a; Franke Reference Franke2009, Reference Franke2011; SchulzReference Schulz2005).14 Although there are minor differences in the way optimal states are derived in these systems, these do not affect our argument, so that, for illustration, we will only consider the most recent approach of Franke (Reference Franke2009, Reference Franke2011). Franke calculates optimal states as the outcome of reasoning rationally about language use and interpretation in a formal game-theoretic model of the context of utterance. Essentially, this kind of reasoning gives rise to a mapping between forms ϕ as possible speaker utterances and information states t of the speaker as the receiver’s interpretation of ϕ. These mappings constitute an equilibrium between the speaker’s use of language and the receiver’s interpretation in an abstract language game that captures basic Gricean assumptions about conversational behavior. Let opt(ϕ) be the set of states t such that (ϕ, t) is optimal in this equilibrium sense. Within a dynamic setting, the implicatures of an utterance of ϕ are then defined as what is supported in any state in opt(ϕ) but not entailed by ϕ itself.
Definition 1 (Implicature) ψ is an implicature of ϕ iff σ |= ψ for all σ ∈ opt(ϕ), while not ϕ |= ψ.
To give an example, assume that our dynamic model contains a simple set W = {wa, wb, wab, w∅} as our logical space, where a, b ∈ P are proposition letters and the index in wx gives the proposition letters true in world wx. For example, in wa, a is true and b is false, whereas in w∅ both a and b are false. For the case of a plain disjunction like (55a), Franke (Reference Franke2009, Reference Franke2011) predicts as unique optimal state the state in (55b) (assuming a and b are both relevant and that a, b, and a ∧ b are alternatives to a ∨ b).
a.
| a ∨ b | [plain disjunction] |
b. opt(a ∨ b) = {{wa, wb}}
c. predicted implicatures: ◇e a ∧ ◇e b, ☐e¬(a ∧ b), ¬(a ∧ b), . . .
The information state { wa, wb } describes a belief state of an agent who believes that exactly one out of a and b is true and is uncertain which one. That { wa, wb } is optimal means that a ∨ b is predicted to be the most rational/pragmatically efficient sentence to use for a speaker in such a state. If the speaker had known which of a and b is true, she would have played the language game better saying so, and similarly had she thought that a and b are both true. Some of the implicatures of an utterance of (55a) are given in (55c). Implicatures are thus inferences from the abducted rational explanation of the speaker’s observed behavior.
Examples (56) and (57) similarly give the predictions of Franke (Reference Franke2009, Reference Franke2011) for disjunction under epistemic modals. Epistemic FC-implicatures are derived for both the possibility and the necessity case.15
a.
| ◇e (a ∨ b) | [epistemic possibility] |
b. opt(◇e (a ∨ b)) = {{wa, wb}, {wa, wb, w∅}}
c. predicted implicatures: ◇e a ∧ ◇e b, ¬ ◇e (a ∧ b), . . .
a.
| ☐e (a ∨ b) | [epistemic necessity] |
b. opt(☐e (a ∨ b)) = {{wa, wb } , { wa, wb, wab }}
c. predicted implicatures: ◇e a ∧ ◇e b, ¬☐e (a ∧ b), . . .
To be able to state the predictions for deontic modality, let’s fix that with Rd ⊆ W × W being the deontic accessibility relation of the underlying model; Rd (w) = {v ∈ W | 〈w, v〉∈ Rd } is the deontic possibility correspondence. Franke’s system then computes the following optimal states (and thereby implicatures) for disjunctions under deontic modals:
a.
| ◇d (a ∨ b) | [deontic possibility] |
b.
| opt(◇d(a ∨ b)) = | {{w ∈ W | Rd (w) = {wa, wb }}, |
| {w ∈ W | Rd (w) = {wa, wb, w∅}}} |
c. predicted implicatures: ◇d a ∧ ◇d b,¬◇d(a ∧ b), . . .
a.
| ☐d(a ∨ b) | [deontic necessity] |
b. opt(☐d (a ∨· b)) = {{w ∈ W | Rd (w) = { wa, wb }}}
Given a notion of implicature in terms of optimal states, the next question to address is how optimal states should be merged with information states in dynamic semantics. The most natural way of doing so is simply to add the information that is contained in all the optimal states in opt(ϕ) after updating with ϕ (cf. AloniReference Aloni, Guevara, Chernilovskaya and Nouwen2012):
Definition 2 (Implicature uptake)σ[ϕ+I] = σ[ϕ] ∩ ∪(opt(ϕ)).
As a first illustration consider the uptaking of the implicatures of plain disjunction, starting from a state of minimal information:
(60) {wa,wb,wab,w∅}[(a ∨ b) + I] = {wa ,wb ,wab} ∩ {wa ,wb} = {wa ,wb}
We first update with a ∨ b, which eliminates w∅, and then intersect the output state with the optimal state for a ∨ b. The resulting state supports both the scalar implicature ¬ (a ∧ b), and the clausal implicature ◇e a ∧ ◇e b. However, there is a crucial difference between these two inferences, the first is persistent, the second is anti-persistent:
In a propositional system, σ is at least as strong as τ iff σ ⊆ τ. Negative sentences like ¬(a ∧ b) are persistent because they assert the unavailability of a possibility, namely the possibility that a and b are both true. Eliminating possibilities (going to a smaller state) will never make that possibility available. Epistemic possibility sentences like ◇e a ∧ ◇e b instead are not persistent because they express the availability of two epistemic possibilities that might fail to be available as soon as more information is achieved (if the state gets smaller). Actually ◇e a ∧ ◇e b is anti-persistent because if an epistemic possibility is available, it will stay available if we enlarge our state.
Since σ[ϕ + I] ⊆ σ[ϕ] (by definition of +I in terms of intersection), uptaking anti-persistent implicatures is vacuous in the following sense: if ψ is anti-persistent and σ[ϕ + I] |= ψ, then σ[ϕ] |= ψ. As we saw, epistemic FC-implicatures are anti-persistent. Deontic FC-implicatures instead are persistent: ◇d a ∧ ◇d b expresses a property of all available possibilities, namely that they can deontically access both a-worlds and b-worlds. Adding possibilities might affect the validity of this universal property, but eliminating possibilities will not. But then uptaking epistemic FC-implicatures is vacuous, while uptaking deontic FC-implicatures is not:
(63)
| σ[ϕ + I] |= ◇e a ∧ ◇e b | implies | σ[ϕ] |= ◇e a ∧ ◇e b | ||
| σ[ϕ + I] |= ◇d a ∧ ◇d b | doesn’t imply | σ[ϕ] |= ◇d a ∧ ◇d b |
Examples (64) and (65) illustrate these facts.
(64) {wa}☐e (a ∨ b) + I] = {wa} ∩ {wa, wb, wab} = {wa}
(65)
| σ[☐d (a ∨ b) + I] | = σ ∩ {w ∈ W | Rd (w) = {wa , wb}} |
| = {w ∈ W | Rd (w) = {wa, wb}} |
These considerations of persistence tell us that epistemic FC-inferences (due to anti-persistence and vacuity) are not relevant in the usual information-accumulating sense of dynamic semantics. Deontic FC-implicatures, on the other hand, are. Again, we need to stress for clarity that we do not claim that the dynamic formalization as such proves that this contrast is real across the board, but merely pins down in a formally traceable manner our intuition that epistemic and modal information are not on equal footing in normal information-seeking discourse.
We conclude this section by briefly discussing how the dynamic account we have just sketched out can be employed to explain the three case-studies which constituted our point of departure. Let us assume that such a dynamic account would extend to existential statements in much the way one would expect with only deontic FC-inference derivable via application of +I (see AloniReference Aloni, Guevara, Chernilovskaya and Nouwen2012):
(66) Deontic: ☐d∃xϕ+I ⊨ ∀x◇dϕ
(67) Epistemic: ☐e∃xϕ+I ⊨ ∀x◇eϕ
Concessive scalar particles. Given the fact that only deontic FC-implicatures can be uptaken in a non-vacuous way, the possibility of adding +I can rescue *magari under deontic, but not under epistemic modals. Assuming a dynamic version of Crnič (Reference Crnič, Fainleib, LaCara and Park2011b), the scalar presupposition triggered by even in the deontic case is plausible (68c), while in the epistemic case it is contradictory (69c).
a. You must win *magari a bronzeF medal.
b. evenC’ [☐d [at leastC [you win bronzeF ]] + I]
c. Scalar presupposition triggered by even:
☐d (bronze ∨ silver ∨ gold) ∧ (◇d bronze ∧ ◇d silver ∧ ◇d gold) <c
☐d (silver ∨ gold) ∧ (◇d silver ∧◇d gold), ☐d gold ∧ ◇d gold
a. #You must have won *magari a bronzeF medal.
b. evenC′ [☐e [at leastC [you have won bronzeF ]] + I]
c. Scalar presupposition triggered by even:
☐e (bronze ∨ silver ∨ gold) <c ☐e (silver ∨ gold), ☐e gold
Epistemic indefinites. Building on Aloni and Port (Reference Aloni, Port, Fainleib, LaCara and Park2010), we propose that both irgendein and vreun are existentials which trigger an obligatory domain shift (either CC-shift or DW), which has to come for a reason. Recall that DW is justified only if it does not create a weaker statement, and that under a modal, as shown in (28) here rewritten as (70), DW ceases to create a weaker statement only if FC-inferences are incorporated:
| ☐∃xφ |= ☐∃xDWφ | without FC-inference |
| ☐∃xφ ∧ ∀x◇φ ⊭ ☐∃xDWφ ∧ ∀xDW◇φ | with DC-inference |
But then since only deontic FC-inferences can be uptaken in a non-vacuous way (66), when incorporating FC-implicatures via +I, DW can be justified in the deontic case, but not in the epistemic case:
| Deontic: ☐d ∃x ϕ +I ⊭ ☐d ∃xDW ϕ +I | DW justified | |
| Epistemic: ☐e ∃x ϕ +I |⊭ ☐e ∃xDW ϕ +I | DW unjustified |
Normally optional, +I becomes then obligatory in uses of irgendein under deontic modals: CC-shifts are vacuous in these contexts and without implicature uptake DW would also have been unjustified. Total variation (FC) effects are then predicted to arise systematically for the German indefinite in deontic contexts:
(72)
| Mary | musste | irgendeinen | Mann | heiraten. |
| Mary | had_to | irgend_one | man | marry |
a.
| # ☐d ∃x ϕ | neither CC-shift, nor DW can apply |
b.
| ☐d ∃xϕ +I | with FC-inference DW can apply |
To account for the exclusion of Romanian vreun from deontic contexts, we follow Fălăuş (Reference Fălăuş2009, Reference Fălăuş, Li and Lutz2011, Reference Fălăuş2012) in assuming that vreun expresses a ban on total variation.16 But then rescuing the indefinite under deontic modals via +I is no longer possible:
(73)
| # Trebuie | să | citesc | vreo | carte | până | mâine. |
| must | subj | read.1sg | vreun | book | by | tomorrow |
a.
| # ☐d ∃xϕ | neither CC-shift, nor DW can apply |
b.
| # ☐d ∃xφ +I | clashes with anti-total variation |
The epistemic case works similarly for Romanian and German. As shown in (71), DW cannot apply under epistemic modals, but then CC-shift must apply and only a cover-dependent partial variation effect is generated:
a. Juan must be in irgendeinem/vreun room of the house.
b.
| ☐e ∃xϕ | only CC-shift can apply, partial variation generated |
The infelicity of vreun in the Shell Game Scenario described in (8) can be accounted for again in terms of a clash with an independently generated anti-total variation inference.
7. Conclusion
Summing up, we have here reviewed evidence from three independent case-studies that the FC-inferences associated with deontic modals appear more “semanticized” than their epistemic counterparts. Taking the possibility of local pragmatic effects seriously, but noting that the usual explanation for the distribution of local pragmatic effects in terms of the strongest meaning hypothesis (cf. Chierchia to appear b; Chierchia et al. Reference Chierchia, Fox, Spector, Maienborn, von Heusinger and Portner2012; SauerlandReference Sauerland2012) does not apply in the case at hand, we tried to give a diachronic pragmatic explanation for the different degree to which deontic and epistemic FC-inferences penetrate into compositional semantics. Our proposal revolves around the notion of pragmatic fossilization, and we argued that deontic FC-inferences are practically relevant more often than epistemic FC-inferences. We discussed data on universal FC-inferences in favor of this claim and showed how this difference cashes out in dynamic semantics as a difference in persistence of the FC-information. In sum, according to our proposal, it is essentially basic features of contextual relevance that affect why some inferences penetrate compositional semantics more readily than others, even to the extent that these asymmetries in fossilization patterns inform grammaticality judgments (as in the case of vreun or *magari).
An important conceptual question is how this explanation of the Modal Variability Hypothesis relates to the “grammatical view” of scalar implicatures, as championed by Chierchia (Reference Chierchia2006, to appear a) and others. As mentioned before, our account acknowledges the possibility of local pragmatic effects, but, migrating beyond applications of the strongest meaning hypothesis, we have tried to give a diachronic pragmatic explanation for the distribution of these. This essentially commits us to a particular interpretation of the “grammatical view.” In a recent paper, Chierchia (to appear b) argues that the “grammatical view” of scalar implicature calculation can (but need not) be seen as a stepwise expansion of the more confined classical Gricean reasoning that has long been taken for gospel (Grice Reference Grice, Cole and Morgan1975). But it can also be considered a purely grammatical thing. What we suggested in this chapter commits us to consider the “grammatical view” in the first manner, as intrinsically pragmatic, even if integrated into formal semantic theory – that is, subject to general functionalist considerations about language use. This is opposed to a strictly grammatical view which would consider language an object of inquiry on its own, detached from issues of language use, that would then not admit functional considerations to play a role in explanations of local pragmatic effects.
Even if equivalent in terms of actual predictions, there is a conceptual difference between these two perspectives on the “grammatical view” that shows if we look at which auxiliary assumptions would be deemed feasible to adapt the formalism to new sets of empirical data. The data we discussed in Section 2 is such a set of novel empirical data that challenges the current formulation of the “grammatical view,” in particular, its sole reliance on the strongest meaning hypothesis to account for the distribution of embedded implicatures. To account for (something like) the Modal Variability Hypothesis, an intrinsically pragmatic grammatical view can appeal to differences in frequencies of contextual relevance. But a strict grammatical view cannot. Viable explanations for the strict grammatical view could involve stipulating (and motivating) formal differences in the way different modals interact with domain alternatives and the like. No knock-down argument has been given here that the Modal Variability Hypothesis cannot be explained in terms of a strict grammatical view. But the burden of proof that modal variability is amenable to a strictly non-pragmatic explanation lies with those who favor this view.
Acknowledgments
This research has been largely inspired by the work of Gennaro Chierchia: mostly by his recent research on free choice and local implicatures but also by his less recent contributions on dynamic semantics. Maria Aloni would further like to thank Gennaro Chierchia for supervising her 1996 Master thesis on “Dynamic semantics and epistemic modalities,” and for his contagious passion for linguistic research: as this article tries to demonstrate, in even the most intricate empirical data some logical pattern can be found. We also thank Stephanie Solt and four anonymous reviewers for helpful comments, as well as the financial support of the Netherlands Organization for Scientific Research (NWO).
Appendix 5A Formal definitions supporting Section 6
Let P be a set of proposition letters and, with p ∈ P, let L* be the propositional base language, i.e., the smallest set that contains P such that if ϕ, ψ ∈ L*, then also ¬ϕ ∈ L* and ϕ ∧ ψ ∈ L*. Let L be the modal language we are interested in that, for simplicity, does not allow nesting of modal operators, defined in the usual way as the smallest set that contains L* such that if ϕ, ψ ∈ L, then also ¬ϕ ∈ L and ϕ ∧ ψ ∈ L and if ϕ ∈ L*, then also ☐eϕ ∈ L and ☐dϕ ∈ L. Additionally, the usual notational variants apply:
A model for language L is a triple M =〈 W, V, (Rd ) 〉 such that: W is a set of worlds, V : W × P ⇒ {0, 1} is a valuation function, and Rd ⊆ W × W is a deontic accessibility relation. We require that Rd is serial, i.e., for each w ∈ W there is some v ∈ W such that 〈w, v〉 ∈ Rd. We defined static truth conditions for a formula ϕ ∈ L* with respect to a model and a world in that model in the standard way:

Entailment is defined as usual: ϕ |= ψ iff for all M, w if M, w |= ϕ, then M, w |= ψ.
An information state σ based on a given model M with possible worlds W is just a subset of W. Let ΣM be the set of all information states for model M. The semantics for expressions in L is given for a fixed model M in terms of context change potentials, i.e., a function ΣM × L → ΣM. Using standard notation this function is defined as follows:

The dynamic interpretation of epistemic modals is a test: update with ☐eϕ either yields the input state or the empty set. This is because epistemic information is of a higher-order type than base-level information. Deontic modality, however, is defined as an eliminative update that adds information in the same way as propositional information. Finally, we say that an information state σ that is non-empty supports a formula ϕ, σ |= ϕ, iff σ[ϕ] = σ.
Notes
1. Fălăuş does not explicitly work out this line of explanation, but this seems to be her strategy as is clear from passages like the following: “. . . I take these facts to indicate that deontic modals with existentials activating alternatives in their scope have a strong free choice flavor, i.e., they give rise to a total variation inference. As we have seen, epistemic operators do not impose such a requirement. This distinct behavior should follow from the semantics of these modals, something which cannot be properly explored here” (Fălăuş Reference Fălăuş2012: 43).
2. These analyses are not necessarily faithful to the meaning of English even and at least.
3. Indeed, Crnič (Reference Crnič2011a) adopts Fox (Reference Fox, Sauerland and Stateva2007a) to generate FC-effects, but as we have already observed, Fox’s machinery is blind towards the differences between deontic and epistemic modals and therefore overgenerates: potentially rescuing FC-inferences are derived also in the scope of epistemic modals. Crnič (Reference Crnič2011a) notes this problem and expresses a hope that it might disappear on the account of the differences in felicity of the scalar presuppositions triggered by even. For example, for a doxastic sentence like “John thinks that Peter wins *magari a bronze medal,” Crnič (Reference Crnič2011a) predicts the quite implausible presupposition “that there is an alternative that is more likely than that John thinks that Peter won a medal and that he might have won any medal.” (Crnič Reference Crnič2011a: 126). We agree that this presupposition is implausible (see also our discussion in Section 4), but it is not contradictory (in contrast to the presupposition triggered by even in (18)). But then such an explanation might be too weak for a proper account of the systematic exclusion of *magari from epistemic contexts.
4. See Crnič (2011: 11, footnote 12). In the same footnote, Crnič also mentions a different possible strategy to account for the data building on Heim’s (Reference Heim1992) non-monotonic, preference-based semantics for bouletic modals. If deontic modals were non-monotone, we would not need to rely on an FC-effect to account for the data. Crnič (Reference Crnič2011a) discusses this strategy in more detail, but, as far as we understand, dismisses it because of lack of independent evidence for the non-monotonic behavior of deontic operators.
5. In FC-uses, irgendein is typically stressed. Notice that stressing irgendein under epistemic modals as in (24) results in oddity.
6. Actually Aloni’s (Reference Aloni2007b) assumption that under necessity modals no FC-inferences are generated was crucial for her explanation of the distribution of any in terms of Kadmon and Landman’s (Reference Kadmon and Landman1993) analysis. In this section we will adopt Chierchia’s account of polarity and free choice, therefore that assumption is no longer needed.
7. Most semantic theories of FC-inferences do share this problem with Aloni (Reference Aloni2007b) (also BarkerReference Barker2011) with the exception of performative accounts like Rooij (Reference Rooij2000). Performative accounts however fail to explain FC-inferences of non-performative uses of deontic modals (e.g., in the past tense).
8. An alternative explanation of the variability in UFC-potential revolves around the notion of homogeneity. The idea is that the availability of an UFC-inference like in (41) is proportional to the plausibility of the assumption that everybody in the group has the same rights, i.e., that the group is homogeneous (FrankeReference Franke2011; TielReference Franke2011). Our proposal here is orthogonal to this issue. According to the picture sketched here, homogeneity assumptions may very well affect the availability of UFC-inferences for de-contextualized sentences. All we claim here is independent of that: we argue merely that, if a proper context is supplied, then the contextual relevance of certain information will affect the availability of an UFC-inference. It is not crucial for our purposes here whether homogeneity assumptions mingle in some fashion or other with considerations of relevance.
9. It is important to note that we propose that it is the inferred information that is at stake, not the relevance of contextually given alternatives. This is important, because, as a helpful reviewer correctly pointed out to us, the subsequent discussion about differences in contextual availability of UFC could possibly be accounted for as well by a theory that treats UFC inferences purely as a grammatical phenomenon in the vein of Chierchia (Reference Chierchia2006, to appear a, to appear b) or Chierchia et al. (Reference Chierchia, Fox, Spector, Maienborn, von Heusinger and Portner2012) with a pragmatic constraint on contextually relevant alternatives. However, we are not sure, at the present moment, how suitable and justified restrictions on relevant alternatives could do the trick. This is not to say that this is necessarily impossible, but until proven, we stick to the distinction between relevant information and relevant alternatives and therefore prefer accounts where both relevant interpretations and relevant alternatives are explicitly handled, such as done in optimality theory (e.g., Aloni Reference Aloni, ten Cate and Zeevat2007a), game-theoretic approaches (e.g., Franke Reference Franke2011) or Geurts’s (Reference Franke2011) intention-driven account. Be that as it may, this issue does not really matter for our overall argument.
10. Grice already stated: “It may not be impossible for what starts life, so to speak, as a conversational implicature to become conventionalized” (Grice Reference Grice, Cole and Morgan1975: 58). This idea has been later explored by many researchers, notably Hopper and Traugott (Reference Hopper and Traugott1993).
11. A reviewer raised an important point here, namely whether we would predict different stages of fossilization in different languages and/or for different types of modals. To answer this question, more detailed and more quantitative data would be necessary. In principle, we would expect some differences, but the main tendency that deontic FC-inferences fossilize stronger and more readily should be universally attested.
12. A fragment of a minimal system that covers the formal details necessary for our present concerns is given in Appendix 5A.
13. Another natural idea is to treat deontic modals as performative elements that change obligations and permissions (e.g., KampReference Kamp1973; RooijReference Rooij2000), which is straightforwardly modeled in dynamic semantics. However, we believe that performative analyses are problematic, for instance, because they cannot straightforwardly explain FC-inferences in the past tense, etc. (see also note 7). It is therefore often attempted to derive performative effects from informative uses of deontic modals, e.g., under the assumption that the speaker is an authority (e.g., KaufmannReference Kaufmann2012).
14. Other approaches to the calculation of FC-implicatures (e.g., FoxReference Fox, Sauerland and Stateva2007a) would equally qualify, but would require more conceptual shifting to yield a description of optimal states.
15. To be able to distinguish between partial variation and total variation we would need at least three alternatives. For ◇/☐ (a ∨ b ∨ c) Aloni (Reference Aloni, ten Cate and Zeevat2007a), for example, does indeed predict the total variation FC-implicature ◇a ∧ ◇b ∧ ◇c without further ado; Franke’s system needs extra assumptions, but can achieve the same thing.
16. This ban on total variation should be derived as a blocking effect. The availability of a specialized total variation indefinite in Romanian (but not in German) blocks the FC-interpretation of vreun expressed in (73b). No other interpretation is compatible with the felicity conditions. The indefinite is predicted to be out under deontic modals.