Diachronic Development of the K-suffixes: Evidence from Classical New Persian, Contemporary Written Persian, and Contemporary Spoken Persian

Abstract This paper aims to investigate the usage and frequency of what we refer to as K-suffixes in Classical New Persian of the ninth to thirteenth centuries, Contemporary Written Persian of the late nineteenth to mid-twentieth centuries, and Contemporary Spoken Persian. It shows that K-suffixes are most likely to be the reflexes of earlier evaluative morphemes, traditionally called “diminutives,” and are characterized by a high degree of multifunctionality. While evaluative functions continue to dominate in the Classical New Persian works, they have largely been lost in contemporary spoken Persian, and the suffix is now systematically used to express definiteness. The development of the K-suffix as a definiteness marker in contemporary colloquial Persian appears to be innovative, and is mainly dependent on genre, speaker, and speech situation. Data for Classical New Persian is taken from critical editions of works from the ninth to thirteenth centuries. The data for Contemporary Written Persian comes from comprehensive books of fiction from the late nineteenth to mid-twentieth centuries, and for Contemporary Spoken Persian from an extensive corpus of spoken Persian narratives and a questionnaire answered by fifteen speakers. The results suggest that evaluative morphology can develop into definiteness marking, with the development passing through a stage of combination with a deictic marker. This paper concludes that the development of definiteness marking can proceed down a new pathway that is different from the one normally assumed for demonstrative-based definite marking, though the endpoint may be similar. The study contributes the second detailed documentation of this process for any Iranian language, and one of the few well-documented cases of a non-demonstrative origin of definiteness marking worldwide.


Introduction
Persian is a term for a collection of closely related western Iranian varieties. It is spoken in Iran, Afghanistan, and Tajikistan, and serves as an official language in these countries. This paper deals with the K-suffix in Classical New Persian of the ninth to thirteenth centuries (CNP), Contemporary Written Persian of the late nineteenth to mid-twentieth centuries (CWP), and Contemporary Spoken Persian (Tehran variety) in Iran (CSP).
In all CNP written works, a suffix of the form -ak/ek/ag is attested, primarily occurring with nouns but also with adjectives and adverbs. It has traditionally been classified as the usage of evaluative morphology is, by definition, primarily determined by interactional context, this finding is not surprising.
This paper is organized as follows: first, it deals with definiteness and types of definiteness contexts and provides an overview of the Persian language and data. Then it covers previous studies of the K-suffix in Persian and demonstrates the multifunctionality of the K-suffix. The evaluative function of K-suffixes in CNP and CWP is then presented, after which K-suffixes functioning as definiteness markers in CSP are illustrated. Data is presented from an extensive text corpus and questionnaire data, and a suggestion is made regarding the original K-suffix in CNP, CWP, and CSP. Finally, the findings are discussed in light of a new grammaticalization pathway from evaluative to definiteness marker.

Definiteness
Definiteness will be understood here as a property of a noun phrase that is derived from its information status in a given linguistic context. It is thus a contextual property of referring expressions rather than an inherent property of nouns. A number of different approaches to definiteness have been pursued in the literature, including a philosophical approach invoking uniqueness, 7 and a discourse-pragmatic approach. 8 I follow Lyon in considering the primary component of definiteness to be the notion of identifiability. 9 A noun phrase is considered definite if the speaker assumes that its referent is uniquely identifiable by the addressee. Languages differ cross-linguistically in the extent to which, and means by which, they systematically indicate definiteness in morphosyntax. In English, French, or Arabic, definiteness is marked fairly consistently using items generally referred to as "articles." Other languages may mark definiteness by affixes, clitics, word-order properties, or various combinations of these strategies; alternatively, they may have no regular means for indicating definiteness. A noun phrase may have definite status by virtue of several pos-

Unique referents
Entities which are assumed to be uniquely identifiable by all members of a given speech community, hence requiring no preceding or inferable mention: the sun, the river (in a given community), the president.

Situational definiteness
Identifiability is achieved through the immediate speech context, possibly aided by additional gestures and adverbial expressions: the man over there (pointing).
In contrast to the seven definiteness contexts outlined above, nouns may be indefinite (either specific or non-specific), or have generic or sortal reference. The correct analysis of generics is beyond the scope of this paper. 12

The Persian Language
Persian belongs to the Western Iranian branch of the Iranian languages, which in turn belong to the Indo-Iranian branch of Indo-European. Persian is the only Iranian language that has documents available from the Old Persian of the Achaemenids, through the Middle Persian of the Sassanids, to New Persian (since the eighth century). Different delimitations of the phases in the development of New Persian have been presented by Iranian scholars. For instance, Lazard introduces the following phases: Early New Persian for the language of the tenth to eleventh centuries, and Classical New Persian for the New Persian of the twelfth to nineteenth centuries, with the twelfth century as a transitional period. 13 I find these classifications to be a bit too complicated for the present study. For the sake of brevity, I use Classical New Persian (CNP) of the ninth to thirteenth centuries, Contemporary Written Persian (CWP) of the late nineteenth to mid-twentieth centuries, and Contemporary Spoken Persian (CSP) in the present paper.
Modern Persian is a verb-final language that lacks a morphological case system and therefore shows the same alignment in the past and non-past tenses. Persian is mainly spoken in Iran, Afghanistan, and Tajikistan, and is considered a language of education in these countries. The area where Persian is spoken is highly diverse linguistically. Contact languages include four different language families and different genera: Indo-European (Indo-Aryan and Iranian), Dravidian, Turkic, and Semitic.
Data for CNP is taken from critical editions of works from the ninth to thirteenth centuries (see Table 1), data for CWP comes from books of fiction from the late nineteenth to mid-twentieth centuries (see Table 2), and data for CSP comes from an extensive corpus of spoken Iranian Persian narrative and a questionnaire answered by fifteen speakers from Tehran (see Section 5). Fig. 1 presents the location of the data for Contemporary Spoken Persian.
Before we begin our journey into the K-suffixes in the Persian language, I will briefly comment on functions of the K-suffix other than the evaluative one, namely its derivational functions.
Derivations with the suffix *-ka- are well attested in Old Indo-Iranic (especially in Old Indo-Aryan). Edgerton offers a detailed survey in two papers with the same title, published in the consecutive issues 2-3 of volume 31 of the Journal of the American Oriental Society. 14 He identifies the core semantics of *-ka- for Proto-Indo-Iranic by comparing the Vedic, Sanskrit, and Avestan evidence: 15 "1) the formation of nouns of likeness or adjectiv[e]s of characteristic; 2) the diminutiv[e] and (perhaps) pejorativ[e] formations, 3) occasional formations with 2 ka [i.e., adjectives of appurtenance or relationship], 16 mainly pronominal adjectiv[e]s, and 4) the primary formations from verbal bases, apparently inclining towards the meaning of verbal adjectives or nouns of agent." The K-suffix -ak in Persian largely reflects Edgerton's classification. Iranian traditional grammarians already report a similar classification. 17
In the CNP works under study, the evaluative semantics of K-suffixes are more predominant than other (derivational) functions, including adjective<adverb and N<adjective. Note that the K-suffix -ak is more productive as a word-creation suffix in CWP and CSP than in CNP, probably because of a national need for the creation of words.
In the following example, the adjective narm "soft" has changed into the adverb narmak, "softly, slowly."

The K-suffixes in CNP: Initial Observations 19
Data for analyzing the K-suffixes in CNP comes from critical editions of works from the ninth to thirteenth centuries. Table 1 provides a list of these works. Across CNP texts, a nominal suffix is found with the forms -ak/ek/ag. 22 These are likely to be reflexes of the K-suffix -ag in Middle Persian, 23 e.g., pus-ag "boy" and CNP pesar-ak "boy." The K-suffix has been attested with nouns, e.g., pesar-ak "boy," darvīš-ak "dervish," adjectives, e.g., ǰavān-ak "young," saqīr-ak 24 "small," andak "little," and adverbs, e.g., ānak "now." 25

Ex. (2)
ammā kas=ī-rā ke ranǰ kam resad ū-rā gūšt=e
but person=IND-OBJ CLM pain little arrive.NPST.3SG PN.3SG-OBJ meat=EZ
gūsāle=e xord-ak beh-tar bovād
calf=EZ small-EV good-COMP be.NPST.3SG
"but a person who has less pain, it is better for him/her [to eat] meat of a young calf" 26
Traditionally this suffix is referred to as a "diminutive." Investigation of the K-suffix in CNP has largely been ignored. However, its existence has been reported by scholars. For Early New Judeo-Persian, Paul reports that "-ak functions as diminutive, or it appears without
The date here refers to the first edition of the book.
21 This book is a translation from Azerbaijani Turkish into Persian by Mirzā JaꜤfar Qarājedaghi.
22 I have not found the suffix -ag in my data. However, Sadeghi, "Pasvandha-ye Tahbibi-ye Farsi," reports a few items with the K-suffix -ag instead of -ak, for instance, farzandag "child," xordag "little," and Sahlagī "?" He also mentions that in another manuscript of Qorʾān-e Qods "son" is attested with the K-suffix -ag, as in pusag, which is similar to pusag in Middle Persian. In addition, Khatamipoor, "yā-ye maʿrefeh," based on three manuscripts (titled hezār hekāyate sūfīyān, from the thirteenth century), reports the -ī suffix including ak and considers the -ī suffix to be a definiteness marker.
23 See Durkin-Meisterernst, Grammatik des Westmitteliranischen, 253; and Nourzaei and Jügel, "On the Function of -ag Suffix in MP," for a detailed discussion of the K-suffix -ag in Middle Persian.
24 The word saqīr is an Arabic word meaning small.
25 See Ciancaglini, "Outcomes of the Indo-Iranian Suffix *-ka in Old Persian and Avestan," for the attestation of this suffix in Old Persian.
26 Al-abniye, 287.

Evaluative and Diminutive Usage in CNP
The most frequent usage of the K-suffix is to express evaluative or diminutive semantics, and it is even compatible with indefinite contexts. The term "diminutive" implies the descriptive content "smaller than normally expected," and this is evident in some usages of K-suffixes. However, even in these contexts, an evaluative connotation is often discernible and, for the sake of brevity, following Nourzaei 34 I gloss the suffix with EV, as the most general indication of function, regardless of actual context.
27 Paul, A Grammar of Early Judaeo-Persian, 63.
28 Gindin, "The Early Judeo-Persian Tafsīrs of Ezekiel."
29 Qarib et al., 46.
30 Ahmadi Givi and Anvari, Dastur-e zabān-e Fārsi 1, 77.
In example (3) the K-suffix gives a description of the physical size of the branch, šāx-ak=ī "a small branch." Note that the K-suffix is compatible with the indefiniteness context.
Ex. (3) The K-suffix with small size
šāx-ak=ī az īn toxm-hā bar ǰast
branch-EV=IND from PROX seed-PL PREV spring.PST.3SG
"a small branch grew up from these seeds" 35
Similarly, in example (4), the K-suffix provides a description of the physical size of the deer's fawn. Note that the K-suffix follows a distal demonstrative ān "that."
Ex. (4) The K-suffix with small size
be-dān ke ān baxšāyeš ke bar ān āhū=ye
be36-know.NPST.2SG CLM PROX forgiveness CLM to PROX deer=EZ
mādeh kard-ī va ān bačeg-ak be=dū bāz dād-ī
female do.PST-2SG and DIST child-EV to=PC.3SG again give.PST-3SG
"Know that the mercy that you have shown to that female deer and that small child returned to her […]" 37
In example (5), the K-suffix provides a description of a small amount of water.

PST-3SG
"There is a spring to this village that comes out of a stone with little water and they have paved (lit. cut) a long stream from it" 38
In example (6), the K-suffix adds a flavor of sorrow on the part of the speaker regarding the Hendu male slave, rather than a description of the physical size of the male slave.
Ex. (6) The K-suffix conveys a flavor of sorrow
man va barādar=m va ġolām-ak=ī hendū ke bā
PN.1SG and brother=PC.1SG and male.slave-EV=IND Hendu CLM with
mā būd vāred šod-īm
PN.1PL be.PST.3SG enter become.PST-1PL
"I and my brother and a poor Hendu male slave who was with us arrived (lit. entered) to [xarzawīl]" 39
Similar to example (6), example (7) adds a flavor of sorrow on the part of the speaker regarding the deer's mother, who was following the hunter when she repeatedly fell down, rather than a description of the physical size of the deer's mother. Note that the K-suffix follows a proximal demonstrative īn "this."
Ex. (7) The K-suffix conveys a flavor of sorrow
bāz gašt-am va do se bār ham=čenīn mī-oftād
PREV turn.PST-1SG and two three time EMPH=PROX IMP-fall.PST.3SG
va īn bīčāreg-ak mī-ām-ad
and PROX poor-EV IMP-come.PST-3SG
"I returned and [I saw that the female deer] two or three times it fell down and this poor one still was coming" 40
The evaluative component is more obvious in the following examples. In example (8), Joseph's father refers to his son with a K-suffix, although the son is grown up. This is obviously a signal of endearment and affection on the part of the speaker towards the son, rather than a description of his physical size. Note that the K-suffix has been attested in vocative and non-vocative contexts.
Ex. (8)
Similar to example (8), in the following passage, a dialogue between God and the prophet Noah, Noah refers to his son with a K-suffix, although the son is grown up. Again, this is obviously a signal of endearment and affection on the part of the speaker towards the son, rather than a description of his physical size.
Ex. (9) The K-suffix with endearment
pesar-ak=e man az ahl=e man ast
son-EV=EZ PN.1SG from group=EZ PN.1SG COP.NPST.3SG
"my lovely son is from my group" 42
The K-suffix occurs here with an "admiration and respect" connotation. The K-suffix on "Hasan" demonstrates respect towards Hasan, who was an important and influential figure in the Ghaznavid state, rather than a description of his physical size.

Ex. (11) The K-suffix with respect
Abulqāsem=e Hakīm-ak ke nadīm Amir Yusef bud mard=ī
Abulqāsem=EZ Hakīm-EV CLM friend Amir Yusef be.PST.3SG man=IND
"Abulqāsem-e Hakimak, who was a friend of Amir Yusuf, he was an educated and skilled [man], he was not at the service of anyone, and he was generous" 45
40 Tārikh-e Beyhaqi 1, 250.
41 Ibid., 136.
43 This term refers to a branch of Islam whose adherents believe in seven Imams.
44 Tārikh-e Beyhaqi 1, 229.
K-suffixes also occur with pejorative connotations. This can be seen in vocative contexts such as in example (13). The following passage is taken from a dispute between the king and a dervish. Here the K-suffix reflects the king's anger and disapproval of the dervish in the given context.
Ex. (12) The K-suffix with disapproval
īn darvīš mard-ak=ī nādān va kūhparvar ast
PROX dervish man-EV=IND ignorance and cave.man COP.NPST.3SG
"This dervish is an ignoramus and a caveman" 46
This can be observed in vocative contexts, as in example (13), which is taken from a dispute between Halāl and the holy man. Here the K-suffix reflects the king's anger and disapproval of the holy man in the given context.
Ex. (13)
Finally, we should point out that certain words typically indicating both human and nonhuman referents seem to include the K-suffix as part of the word stem. The suffix lacks any apparent separate semantic content. In sum, the K-suffixes of CNP are widely attested with some kind of evaluative semantics, but also as lexicalized and semantically empty elements, and are presumably remnants of the high-frequency evaluative usage associated with certain words. We assume that the multifunctionality of the K-suffix is reasonably representative of earlier stages of Persian and is also compatible with what is known about K-suffixes in earlier stages of other New Western Iranian languages such as Shirazi, Lari, and Balochi. However, in the three phases of Persian (CNP, CWP, and CSP) being studied here, the functionality and frequency of K-suffixes have diverged quite considerably. In particular, in specific genres of CSP, the K-suffix -e/he exhibits a regular marking of definiteness in anaphoric and bridging contexts (see Section 6).
I begin with an outline of K-suffixes in CNP, before focusing on the usage of the K-suffix in CWP (Section 5) and CSP (Section 6) and presenting frequency data from the corpora (Section 7).
Ex. (14)
The K-suffix in CNP has a variety of functions, with no obvious structural constraints. However, there is one type of context that demonstrates a different reading than the normal multifunctional semantics of the K-suffix (see Sections 3.4 and 3.5).
The K-suffix in CNP is compatible with indefinite contexts, as in examples (15) and (16).
Ex. (15)
Examples (17) and (18) show that the K-suffix is compatible with proper nouns, for example, the personal names Hasan-ak "Hasan," Mahmūd-ak "Mahmud," gandom-ak "Gandom," xayr-ak "Xayrak," mār-ak ebne allsalāt "Marak ebne allsalāt," and sarbāt-ak "Sarbātak." Note that proper nouns such as these, where the stem and this suffix can be clearly distinguished, are very rare in the manuscripts. The lack of such examples in these works is probably indicative of the strongly interactional nature of the K-suffix in CNP. 51
Ex. (17)
As with proper nouns, the K-suffix is compatible with place names, for example, "čenāša," "koškak," and "ġūzak," as in the following example:
Ex. (21) The K-suffix with place names
vazīr bar rāh=e bež=e ġūzak raft
vizir to way=EZ hill=EZ ġūzak go.PST.3SG
"the vizier set out towards ġūzak hill" 57
Note that it is not at all obvious what semantic content the K-suffixes have in these contexts; they appear to be relatively vacuous. In contrast to the proper nouns, this type of noun has a high frequency across the critical editions of works, with Tārikh-e Sistān being an example.
In CNP, there is no constraint against combining the K-suffix with the plural suffix (see Sections 5 and 6 on this point in CWP and CSP). The following examples illustrate a K-suffix with evaluative sense followed by a plural marker "-ān."
Ex. (22)
To sum up, the K-suffix in CNP texts has various functions, 62 and is not subject to structural constraints such as those that obtain for CWP and CSP (see Sections 5 and 6). However, we find singular nouns, often accompanied by proximal/distal demonstratives, taking a K-suffix with no apparent connection to small size or any particular evaluative notion. Such examples are very rare and would require a larger corpus to study. However, in Old Shirazi these functions of the K-suffix predominate. 63
Before demonstrating the use of K-suffixes as signals of proximity and familiarity/recognition, it would be helpful to outline indefiniteness and definiteness strategies in CNP.

Indefiniteness and Definiteness Strategies in CNP
In CNP, discourse-new, 64 specific, singular NPs are overtly marked for indefiniteness with an enclitic =ī on the noun, as in dōst=ī "a friend" and zan=ī "a woman" in the following examples. This pattern has been attested in Middle Persian 65 and Old Shirazi. 66 Definite NPs, on the other hand, are generally considered to lack any consistent marker of definiteness and are left unmarked.

Ex. (26)
az dost=ī šenīd-am
from friend=IND hear.PST-1SG
"I heard from a friend" 67
Ex. (27)
zan=ī būd dīvāneh
woman=IND be.PST.3SG crazy
"it was a crazy woman" 68
60 Nowruznāme, 29.
61 Qorʾān-e Qods, 136.
62 Similar functions are attested for the Balochi of Sistan; see Nourzaei, "Definiteness Marking."
63 Nourzaei, "History of the Suffix -ū in Shirazi"; Firoozbakhsh, "The Former Dialect of Šīrāz."
64 The term "discourse-new" is here defined as the first mention of a noun in the discourse.
65 Nourzaei and Jügel, "On the Function of -ag Suffix in MP"; Josephson, "Definiteness and Deixis in Middle Persian."
66 Nourzaei, "History of the Suffix -ū in Shirazi"; Firoozbakhsh, "The Former Dialect of Šīrāz."
67 Nowruznāme, 24.
68 Ibid.
https://doi.org/10.1017/irn.2021.27 Published online by Cambridge University Press
Once introduced, a referent has the status of definite (anaphoric definite). The two most common strategies for indicating definiteness across CNP (ignoring anaphoric pronouns and zero anaphora) are either combining the noun with a demonstrative pronoun, preferably the distal demonstrative ān, or using the bare form of the noun with no additional marking. 69 The following passages (taken from Dārābname) demonstrate these two possibilities. A garden is introduced as a singular indefinite in example (28):
Ex. (28)
dar bīrūn=e šahr bāġ=ī būd
in outside=EZ city garden=IND be.PST.3SG
"[Enalhayāt] he had a garden outside of the city, (lit. there was a garden for him)" 70
The second mention (anaphoric definite) takes the distal demonstrative ān "that" in combination with the noun ān bāġ-rā, "that garden":
Ex. (29)
ān bāġ-rā nešāt ābād mī-gū-yand
DIST garden-OBJ Neshāt Abad IMP-say.NPST-3PL
"they call that garden Neshāt Abad" 71
After this introductory sequence, there are several lines of intervening text with distal demonstratives referring to the garden before it is mentioned again as a bare noun bāġ, "the garden":
Ex. (30)
ayām=e bahār būd va Enalhayāt dar bāġ būd
time=EZ spring be.PST.3SG and Enalhayāt in garden be.PST.3SG
"it was spring, Enalhayāt was in the garden" 72
Similar examples with bare nouns can be found in comparable contexts in all works. A similar system has been noted for other Iranian languages such as Vasfi, 73 Balochi, 74 and Kurdish. 75
In sum, I can conclude that, although discourse-new, singular nouns are consistently marked throughout CNP, the marking of definiteness is not consistent. The two strategies most commonly mentioned are the use of the demonstrative plus noun, or the bare form of the noun.

K-suffixes as Signals of Proximity
The K-suffixes occur in what I will refer to as contexts of proximity. By this I mean contexts in which the referent is an item within the immediate perceptual range of the interlocutors, and will therefore often be accompanied by a proximate demonstrative. Thus, we have a combination of a proximal demonstrative and a noun carrying a K-suffix, as in example (31).
Ex. (31)
Note that this example lacks any obvious physical size connotations. Instead, it seems to be dependent on a deictic concept of proximity. This is one of the most prevalent functions of the K-suffix -ō in Old Shirazi. 78

K-suffixes as Signals of Recognition and Familiarity
The only evidence of a familiarity/recognitional reading of the K-suffixes occurs in some works under a relatively tightly constrained set of conditions, and only with the singular nouns discussed in examples (32) and (33).
The following passage is taken from an account in Nowruznāme. 79 In line 3 of the story, the boy has been introduced for the first time with pesar=ī "a boy," and the writer refers to the same referent, "boy," with a proximal demonstrative plus a K-suffix. Among the spectators, the king is pointing to the boy. He says "bring that boy to me," in line 5 of the story, which refers to the same referent again with a demonstrative pronoun plus a K-suffix (when the king commands his ministers to bring that boy to the palace). Interestingly enough, at the end of the same line, he refers to him with a K-suffix without a demonstrative pronoun. In the rest of this account, the writer refers to him either with a bare noun pesar-rā "the boy" or a distal demonstrative pronoun plus null form īn pesar/ān pesar "this boy/that boy." This passage demonstrates that the K-suffix does not convey the physical size of the boy, but instead illustrates a familiarity/recognitional notion of the reference.
Ex. (32)
"Sultan Mahmud, having arrived at the door of the city gate, among the spectators, he saw (lit. his eyes fell to) a boy, in dirty clothes, about 12 years old, but, very handsome and charming and pretty, with a perfect disposition and of moderate stature. He pulled on the bridle and said, 'bring this boy to me'; when they brought [him], he said, 'O boy, who are you and who is your father?'; he said, 'I do not have a father, but my mother is living in such and such an area.' He said, 'what skill are you learning?' He said, 'I am memorizing the Quran'; he commanded that the boy be brought to the palace; when the sultan got off his horse, he called the boy […]" 80
In the works, I only found one particular case of this. In line 1 the doctor is introduced in the discourse for the first time without the K-suffix, tabīb=ī "a physician," and in line 5 the writer refers to the same referent with a K-suffix, tabīb-ak "the doctor." In the rest of the story, the same referent appears without the K-suffix, tabīb "the physician." Such passages demonstrate that the K-suffix does not express any physical notion about the physician. Instead, it conveys familiarity/recognition.
Ex. (33)
every day physician-OBJ IMP-ask.PST-3SG
"He gave a rich reward to a physician from Samānīyan; the physician brought a wooden stick and band and said 'his leg was broken'; he asked the physician every day." 81
Note that we do not have sufficient examples of this type to draw any significant conclusion. In the later stages of Persian, for instance in Golestān Saʿdī and Totināme, we cannot find these types of passages. It would be interesting to closely examine this suffix from the fourteenth to the early nineteenth centuries to see which evaluative notions are more predominant.

Summary
The corpus data for CNP demonstrate that the K-suffix has evaluative semantics that account for most of its usage. It is compatible with indefiniteness contexts, and there are no structural constraints (see CWP and CSP on this issue). It somewhat resembles a sporadic remnant of a now defunct morphology that appears to have been incorporated into some items without any discernible change in meaning; see examples (19) and (21).
In CNP, however, we find nouns accompanied by demonstratives and nouns taking a K-suffix, with no clear connotation of small size, little amount, or clear evaluative content. These passages provide some evidence of how evaluative markers might have evolved towards definiteness marking. One of the most recent cross-linguistic studies on diminutives demonstrates that diminutives also convey meanings of endearment, familiarity, and proximity. 82 In the case of the proximity and recognitional contexts shown in examples (32) and (33), the concept of familiarity is reduced to physical proximity and shared common ground. Thus, it is not unreasonable to see an evaluative suffix becoming associated with proximity in a non-evaluative sense. We have already observed the concepts of proximity and shared common ground in the K-suffix in Balochi, 83 and it is the most prominent function of the K-suffix -ō in Old Shirazi Persian, 84 although in both Sistani Balochi and Old Shirazi, evaluative usage prevails overall. The suggestion here is that the proximate and shared-knowledge usage may have provided a bridging context for the transition from evaluative meaning to definiteness marking.
80 Ibid.
81 Tārikh-e Beyhaqi 2, 495.

The K-suffix in Contemporary Written Persian: Initial Observations
Data for Contemporary Written Persian are taken from books written in colloquial Persian published from the late nineteenth to mid-twentieth centuries. Table 2 gives an overview of these books.
So far, I have given a detailed discussion of the nature of the K-suffix -ak in CNP (see Section 3). Across the works, we only found one form of the K-suffix, namely, -ak. However, in the CWP books we found four varied forms of the K-suffix (see Section 7 for a discussion of their origin): (a) a continuation of the K-suffix -ak in CNP as an evaluative notion, e.g., Hammad-ak, "Ahmad," dīb-ak "demon," 85 and hamūm-ak "bathroom." (b) the existence of new K-suffixes, e.g., -īk, in zan-īk-e, "woman," -ū, in yār-ū "friend," 86 -ī, in Hasan-ī "Hasan," and -e in pesar-e, "boy," which are mostly found in colloquial and informal written texts with mostly singular nouns. 87 I assume the -ī suffix to be a short form of the -īk suffix in Hasan-ī "Hasan." 88 Determining whether or not they derive from the same origin is not the main point of this paper; what is important is that they display similar (evaluative) semantics.
87 I have found forms with such words as martīke, mardīke, mardake "man" and zanīke/zanake "woman," and once with pesarīe/pesarīke "boy." I am uncertain of the origin of -īk; it is an evaluative suffix. Cross-linguistically, it is possible to have more than one diminutive suffix on words, such as in Slavic languages; for Russian, see Volek, Emotive Signs. We find the same nouns with two evaluative suffixes in Balochi: mard-ak-ok "man," ǰan-ak-ok "woman," maškečok "goat skin," where the first K-suffix appears to have been re-analyzed as part of a word stem. It is also attested in Kurdish as ženek. Note that these words are not common in CSP, but they can be found in some older speakers' daily speech (unpublished Hamedani tale); the standard terms are mard and zan.
88 Khatamipoor, "yā-ye maʿrefeh," 18, mentions that the K-suffix -ī is a definiteness marker in Kashmari dialect. Future corpus-based investigation is needed to ascertain how far this suffix has been grammaticalized as a definiteness marker.
89 various scholars. 91
In the following section, I will discuss the K-suffix -e in Contemporary Written Persian (see the section below) and Contemporary Spoken Persian in Iran (Section 5).

K-suffix -e in Contemporary Written Persian
Before we study the status of the K-suffix -e/he in CSP, I will give a detailed description of the K-suffix -e in CWP. In contrast to the K-suffix -ak in CNP (Section 3), the K-suffix -e is mostly attested in informal and colloquially written books with a handful of singular nouns. 92 Note that I found three instances of the K-suffix -e with the plural marker -hā, e.g., čerā mesl e xāle zan-īk-e-hā harf mīzanī "why are you talking like gossiping women?" 93 Its semantic domains in CWP are, to a large extent, similar to those in CNP. However, there are some examples of K-suffixes that distinguish CWP from CNP (see Section 4.2).

Analysis of the K-suffix in CWP
As in CNP, the K-suffix in CWP is compatible with indefinite contexts. See example (34). 94
Ex. (34)
It has been attested with the proper nouns ādm-e and Havvā-e, which are signals of the endearment connotations of this suffix. 97 Note that the same writer used ādm and Havvā without marking them with a K-suffix in his short story titled Afsāneye Afarīnesh.
92 For a detailed discussion of different forms of plural markers and their relation to definiteness, see Lazard, A Grammar of Contemporary Persian, 57-66, among others.
93 Zende be gur, 88.
94 My Hamedani speaker informed me that the K-suffix is expected in contexts of indefiniteness such as ye peser-e=ī bū "there was a boy."
95 Tamsilāt, 295.
96 Chamedān, 68.
97 My Tehrani speaker informed me that the K-suffix -ak is sporadically used with the proper nouns (adding an endearment notion) in intimate social settings as in Negin-ak ūmad "lovely Negin came." She also confirmed that the K-suffix -e can be used on proper nouns (adding a pejorative sense) as in īn negīn-e bāz umad "this Negin came again." Obviously such cases demonstrate some traces of an earlier stage of multifunctionality of the K-suffix -e, as we observe in CWP.
Ex. (36)
Example (37) is an ambiguous case. The K-suffix could be interpreted as adding a flavor of sorrow/empathy on the part of the speaker regarding the fate of the small, orphaned boy. It could also be interpreted as a recognitional context, when the girl again refers to the boy after several intervening lines.
Ex. (37)
Similar to the K-suffix -ū in modern Shirazi Persian, I find it in indefiniteness contexts, as in example (42).

Ex. (42)
dīd-an ye mart-īk=e ġūzal-ū lāġar-ū see.PST-3PL one man-EV=EZ humpbacked-EV thin-EV "they saw a humpbacked and skinny man" 105
Finally, I should point out that certain words, typically indicating place referents, seem to include the K-suffix as part of the word stem, such as in example (43). Note that some compound nouns, such as Albālū xošk-e "dried cherry" in ʿAlaviye khānom, need further investigation regarding the function of -e. 106
Ex. (43)
se nafarī aġlā=šūn-o rū ham rīx-tan ke three person wisdom.PL=PC.3PL-OBJ on add pour.PST-3PL CLM be-r-an emām-e SUBJV-go.NPST-3PL NP- "all three decided to go to Emāme" 107
In contrast to the K-suffix in CNP, the K-suffixes are not attested with possessed nouns formed with person-marking clitics or copula verbs (see example 24). When a noun and an adjective are combined, the K-suffix is attached to the second constituent of the NP, as in pesar bozorg-e "the old brother." See the following example. Note that in some books written earlier in the period being studied, we find the K-suffix on the first constituent of compound nouns (a noun combined with an adjective) such as doxtar-e=ye češm sefīd "impudent girl." 109 It seems that the movement of the K-suffix to the second constituent of the noun phrase occurred in its later stages of grammaticalization.

Attestation of the K-suffix -e in Non-evaluative Contexts 110
We have already found some contexts where the K-suffix -e does not express a diminutive or evaluative sense.Instead, the item marked with the K-suffix has a referent in the previous clauses or, in some cases, the marked items can refer to common background knowledge.
Before introducing these passages, I will briefly summarize definite and indefinite strategies in CWP. As in CNP (Section 3), discourse-new, specific, singular NPs are overtly marked for indefiniteness across the CWP texts. Definite NPs, on the other hand, are generally considered to lack any consistent signal of definiteness.
Indefinites are marked slightly differently than in CNP (see Section 3.3). The word ye/yek "one" preceding the noun (ye kaftār, "a hyena") may combine with a suffix =ī (yek martīke=ī "a man"). Once introduced, a referent has the status of definite (anaphoric definite). As in CNP, there are two common strategies for indicating definiteness throughout CWP: (a) combining the noun with a demonstrative (ān doxtar, "that girl"), (b) using the bare form of the noun with no additional marking (kaftār "the hyena"). 111 In the following passage, taken from a story in ʿAlaviye khānom, the word kaftār "hyena" is introduced in the discourse as a singular indefinite.

Ex. (45)
az tū=ye qabrestān=e kohne=ī yek kaftār bar mā mī-gūz-īd from in=EZ graveyard=EZ old=IND one hyena to PN.1PL IMP-fart.PST-3SG paydā kard-an find do.PST-3PL "in an old graveyard, they found an arrogant/conceited hyena" 112 Following the introduction, the second mention (anaphoric definite) takes a bare noun kaftār.The writer refers to it several times in the story with a bare noun kaftār.He only marks it with the K-suffix -e once (on page 127), while in the rest of the story it appears as a bare noun.

Ex. (46)
kaftār-e-ro bā dāyereh va dombak vāred=e kešvar=e xar dar hyena-EV-OBJ with tambourine and tombak enter=EZ country=EZ donkey in čaman kard lawn do.PST.3SG"[the fox] accompanied the hyena ceremoniously (lit.with tambourine and drum) in the land where donkeys [graze] on the lawn" 113 It is evident from these passages that the K-suffix does not express an evaluative sense.Still, the K-suffix does not mark the items consistently or systematically.It is hard to find a motivation for the writer to mark the same item with a K-suffix only once, and not in the remaining passages of the story.
Similarly, in the following example, the NP, girl, has been introduced for the first time in the story in a restrictive relative clause, ān doxtarī ke "that girl who." 110 Because at this stage the K-suffix -e does not systematically appear as a definiteness marker in the texts, I would prefer to keep "EV" as a general term in the glosses. 111Note that Meshkat al-Dini, Dastur-e zabān-e Fārsi, 148, and Ahmadi Givi and Anvari, Dastur-e zabān-e Fārsi 1, 64, consider the first possibility to be a definiteness reading of nouns in Persian.
Ex. ( 47 The second mention in line 12 takes the distal demonstrative ān doxtar, "that girl."In line 36, the writer again refers to the girl and marks it with the K-suffix, as in the following example.

Ex. (48)
doxtar-e bekolī az yādam raft-e būd girl-EV totally from memory go.PST-PP COP.PST.3SG"I forgot the girl (lit.the girl has gone from my memory)" 115 In line 38 the writer refers to the girl with a combination of the distal demonstrative and the K-suffix -e, ān doxtar-e "that girl." In the following example, the item abre "cloud" marked with the K-suffix -e has a referent in the previous context yek teke abr "a bit of cloud."Note that it comes with the distal demonstrative.It is also worth noting that throughout the books, there are very few passages where the second mention (anaphoric) is marked with a K-suffix (see CSP on this issue).
Ex. ( 49 Similarly, in the following example, the item doxtar-e "the girl" marked with the K-suffix -e has a referent in the previous context ye yatīm=ī "an orphan."In the continuation of the story, the same referent appears as a bare noun and PROX+NP.It is notable that, after 17 lines, the writer refers to the girl and marks the referent with a K-suffix -e, as doxtar-e "the girl." Ex. ( 50 Example ( 50) is a unique case in the corpus.In the story, pesar "the boy" appears as a bare noun.It is marked just once with the K-suffix in combination with the demonstrative when the man points to the boy and says, "he is not a painter, he is reciting a poem for this boy who is sitting in front of the shop."In the rest of the text, the same referent "boy" appears as a bare noun.
Ex. ( 51) be-dīn pesar-e šeʿr mī-band-ad to-PROX boy-EV poem IMP-close.NPST-3SG "he is reciting poem [s] for this boy" 118 The writer similarly marks the item zan azīz-e "beloved wife" with a K-suffix, when the woman is pointing to another woman standing close by and says to the man that the beloved wife (lit.dear woman) is over there.

Ex. (52)
zan azīz-e ānǰāst wife beloved-EV DIST.COP.NPST.3SG"the beloved wife is there"119 After this, the writer refers back to it either with a bare NP or a combination of demonstrative plus noun.
The following examples, ( 53) and ( 54), demonstrate a mutuality reading.Mutuality involves contexts in which the identity of the referent is known by both speakers through their shared world knowledge, even though the referent has not previously been introduced in the linguistic context.
The marked noun dom=e šotor-e "the tail of the camel" does not have a referent in the previous clauses.However, the writer still marks it with the K-suffix because it is familiar to both writer and reader via their common cultural background.This usage has been reported for the K-suffix -ō in Old Shirazi.

Ex. (53)
tā ūn bīy-ā-d mard beše dom=e till DIST SUBJV-come.NPST-3SG man SUBJV.become.NPST.3SG tail=EZ šotor-e be zamīn mī-res-e camel-EV to earth IMP-arrive.NPST-3SG "until that one has become mature (lit. man) the tail of the camel will reach to the ground" 120
Note that the same expression is not marked with the K-suffix in his other book Zende be gur. 121
Ex. (54)
ġese-e mā be sar resīd kalāġ-e be xūn=aš story=EZ PN.1PL to end arrive.PST.3SG crow-EV to home=PC.3SG na-res-īd NEG-arrive.PST-3SG "our story finished (lit. came to the end) [but] the crow did not arrive at its home" 122

Summary
Across the texts, the K-suffix -e of CWP is quite similar to that of CNP, with evaluative connotations accounting for the greatest amount of use. It has been attested in indefiniteness contexts. It shares deictic and recognitional uses with CNP in broader contexts. However, we also encounter some instances in which the K-suffix marks items that have a referent in a previous context and do not convey any evaluative sense. Such examples are rare, but they indicate how an evaluative suffix can develop into a definiteness marker and pave the way towards anaphoric definiteness (for discussion of this as a typical pattern in CSP, see Section 5). In contrast to the K-suffix in CNP (see examples 22-23), the K-suffix -e does not occur with plural markers and possessive constructions, typically when the latter are formed with person-marking clitics and enclitic verb copulas.
This observation can be linked to Hawkins's suggestion that each stage of grammaticalization "maintains the usage possibilities of the previous stage and introduces more ambiguity and polysemy, but expands the grammatical environments and the frequency of usage of the definite article." 123 Finally, what should we call the K-suffix -e in CWP? 124 In my view, this is an open question. However, as we can see above and in Section 4.1, the K-suffix -e is not yet mature and has not grammaticalized as a definiteness marker as such. It is scattered unsystematically throughout the texts and largely preserves its original evaluative connotations. It is still on the way towards becoming a definiteness marker in Persian, as will be discussed in the next section.

Contemporary Spoken Persian
Data for CSP stem from the Persian Language Database (PLD) online corpora, 125 Taghi's corpus, 126 and my new recordings of Tehrani speakers from Tajrish and my field notes. 127 The corpus contains a total of 60,207 words (see Table 3 for an overview). In addition, I use spontaneous speech data from Bamberg-Hamedan joint online data, 128 a variety called Hamedani Persian, and my new recordings. The main speech topics are personal accounts, education, science, and so on.

Background of Speakers
I do not know the age of the participants in the PLD corpora; I was informed only that the data was recorded from native, educated Tehrani male and female speakers who were born and lived in Tehran. The main speech topics are marriage, women's rights, tales, and free conversations, recorded in 1370/1991 and written down in Persian. I transcribed them for this work. The recorded data is about three hours long.
I use twelve texts published in Taghi. 129 These texts are recorded from two Tehrani speakers aged seventy-three and seventy-five, and written down in Persian.I transcribed them for this study.According to the information supplied by Taghi, both speakers were educated in Islamic schools (savād maktab).They were born in Tehran and lived there for their entire lives.The second speaker moved to Sweden at the age of seventy, but traveled back and forth between there and Tehran.
My data consists of recordings of biographical tales and accounts (about one hour) told by educated Tehrani speakers from Tajrish aged between forty and sixty-five years.
Regarding the Bamberg-Hamedan data, it consists of recordings, made from 2017 onwards, of male and female Hamedani speakers aged between thirty and seventy years with different backgrounds. 130 For colloquial Tehrani Persian, I complement the quantitative data with qualitative material which illustrates the various functions with authentic examples and appropriate references to context. I also refer to the results of a questionnaire-based survey with Persian speakers based on the English version of the questionnaire used for Kurdish, Balochi, Shirazi and Lori to capture authentic colloquial speech. 131 I have modified the questionnaire slightly by reducing the number of plural NPs due to the incompatibility of the K-suffix with plural nouns.
In the previous section, I gave a detailed discussion of the K-suffix -e in CWP. Now I will discuss the status of the K-suffixes -e/he/ye in CSP. The K-suffixes -e/he have been attested in different varieties of Persian, for instance, Taghi ābād, Esfahani, Hamedani, Yazdi, 132 Najaf ābādi, Qomi, Mashhadi, 133 Birjandi, Qayeni and Neshaburi. 134 Notably, the K-suffix -e/he has not been attested in Sistani Persian, which is the variety spoken in Sistan and Balochistan province. 135 Based on the data available in the Kalbasi, 136 the Taghi 137 and the online Bamberg-Hamedan corpora, 138 and my data, the status of the K-suffix -e/he is almost the same across Persian varieties: it is not obligatory but is systematically used in definite contexts. For instance, Hamedani Persian speech is similar to Tehrani Persian; the K-suffix is very sensitive to genre and setting, which means that it is not attested with scientific topics that need a formal setting. The frequency and usage of the K-suffix in anaphoric contexts (particularly its combination with demonstrative pronouns) diverge in these varieties. Therefore, another study is needed of these varieties using a larger corpus.
132 See Kalbasi's data, Towsife gunehā-ye zabānī-ye īrān.
In the present study, I will concentrate on the status of the K-suffix -e/he in the Tehrani variety of Persian, for which I already have a large corpus at my disposal. Data for this section was taken from a large contemporary spoken online corpus, the Persian Language Database (PLD), published texts of Tehrani Persian in Taghi's corpus 139 and my recordings of Persian speakers from Tajrish.
Before discussing the nature of the K-suffix, I will give an overview of the system of discourse-new nouns in this phase of Persian.
The system of discourse-new nouns, specific nouns for the singular, and plural nouns is the same as in CWP: the word ye/yek "one" precedes the noun, which may combine with a suffix =ī/e 140 on the noun to give an indefinite, singular, specific meaning, as in ye olāġ=ī "a donkey" and ye šīr "a lion." 141
Ex. (55)
mī-bīn-an ye olāġ=ī gandom bār=eš hast IMP-see.NPST-3PL one donkey=IND wheat load=PC COP.NPST.3SG "they see a donkey loaded with wheat" 142
Similar to CNP and CWP, the most common strategy in CSP for marking a referent with a definite status is to use bare nouns or a combination of nouns plus demonstratives. However, in some genres, typically in folktales and biographical tales, a new strategy has emerged that marks the definite nouns with the K-suffix -e/he systematically, but not obligatorily, in anaphoric contexts. In the next section, I will illustrate this usage of the K-suffix.

K-suffixes as Definiteness Markers
The common form of the K-suffix in Contemporary Spoken Persian is -e/he (when a word ends with a vowel), for instance kūze/kūze-he "the jug," bābā/bābā-he "the father." These suffixes have generally not been attested in standard Persian. 143 In contrast to CWP, in CSP K-suffixes are not attested with evaluative or diminutive semantics or in indefinite contexts (see Section 4). In the following subsection I will discuss the K-suffix in CSP.

Anaphoric Definiteness
In CSP, singular nouns that are anaphorically definite take a K-suffix, when the relevant structural conditions obtain.The following examples (56 and 57) illustrate K-suffixes in anaphoric definite contexts, with both human and non-human nouns.
Ex. (56) Anaphoric definite with a human noun
mī-bīn-an ye pīrmard=ī mesle ye ǰūǰe rū=ye zamīn IMP-see.NPST-3PL one old man=IND like one chick on=EZ ground
139 Taghi, A Typology and Classification of Three Literary Genres. 140 Taghi's corpus, A Typology and Classification of Three Literary Genres, 96, is the only one where the speaker introduces a new participant in the discourse with yek and e, for instance ye pīrezan-e būde "there was an old lady." I listened to the sound file of one text together with the author of the book. I can hear a short, unstressed -e. It might be another form of indefiniteness marker that so far has not been reported. This is a topic in need of further investigation with more examples of this construction. 141 Note that in Taghi's corpus, A Typology and Classification of Three Literary Genres, 290, the discourse-new nouns appear as bare nouns, as in mīre barāš kor-e asp mīxare ke sareš be īn kor-e asp-e garm beše, "he buys a foal for him in order to be busy with this foal." 142 Persian Language Database (PLD).
Similar to Shirazi Persian, the K-suffix in CSP does not appear in combination with a demonstrative pronoun in anaphoric contexts, as in the following example:
Ex. (60)
mī-res-e be ye gāv=ī va mī-bīn-e īn gāv IMP-arrive.NPST-3SG to one cow=IND and IMP-see.NPST-3SG PROX cow bast-e zamīn bind.PST-PP ground "he arrives at a cow, and he sees this cow bound to the ground" 148
However, in Taghi's data, there are a few anaphoric contexts with a combination of a demonstrative pronoun plus a K-suffix, as in example (61). I have found a combination of the K-suffix with demonstrative pronouns in anaphoric contexts outside of the storyline when the storyteller explains the situation to the audience. 149. 
( 61 The appearance of double marking of definite forms is unexpected in the traditional scenario of developing definiteness marking from a demonstrative, and these instances certainly call for further investigation. However, the construction is not unexpected on the analysis suggested here, where we assume that the definiteness marking evolved from evaluative marking via the marking of proximity and shared knowledge/familiarity, which is supported by our results here (see Section 3 on CNP) and also has occurred in Balochi and Old Shirazi. If this really is the first developmental stage, then it is not surprising that it is still available here in the speech of older speakers. For Old Shirazi, we have evidence that the K-suffix always occurs with a demonstrative in earlier stages of the language. At its current stage we observe a complete absence of the demonstratives in anaphoric contexts and a tendency not to use them in situational contexts. 153 These observations support my hypothesis that in earlier stages of the grammaticalization of the K-suffix towards definiteness, it occurred with the demonstratives and used them as supporting items/hooks before becoming a pure definiteness marker. In this respect, CSP is at an earlier stage of grammaticalization of the K-suffixes, and traces of this earlier stage can still be found in the speech of older speakers.

Bridging and the K-suffix
Under the heading of bridging definiteness, we include referents that are identifiable based on their unambiguous link to another previously mentioned referent.Generally, bridging contexts appear either with a bare NP or possessed nouns such as dar "the door" and modīr-e madrase šūn "the principal of their school," as in examples ( 63) and (64).
Ex. ( 63 "my brother is a teacher at a school in Tehran, the principal of their school is Mr. Irani" 155 There are some cases with K-suffixes, such as doktor-e "the doctor" in example (65).The doctor had not been mentioned previously in the story, but it is common knowledge that a hospital has a doctor/several doctors.
Ex. ( 65 Similarly, the singular NP dūkūndār-e "the shopkeeper" marked with the K-suffix is identifiable based on its clear connection with the shop, as it is common knowledge that every shop has a shopkeeper. Ex. (66) The K-suffix for bridging īnvar ūnvar nešūnī mī-dan belaxare yek ǰā=ī this.direction that.direction address IMP-give.NPST.3PLfinally one place=IND sang-o stone-OBJ paydā mī-kon-e dūkūndār-e mī-g-e finding IMP-do.NPST-3SG shopkeeper-DEF IMP-say.NPST-3SG "he looks here and there, finally he finds the stone in a place [a shop], the shopkeeper says […]" 157

Situational Contexts
Based on the data, in situational definiteness contexts, CSP uses two strategies: a combination of demonstrative plus K-suffix or just K-suffix.This is contrary to Koroshi Balochi, which always requires a combination of demonstrative plus a K-suffix. 158The following passage displays a situational definiteness context in which the demonstrative combines with a K-suffix with īn māšīn-e "this car."The car has not been mentioned previously in the story.The driver points to the car and explains to the mechanic that this car transports passengers from Kerman to Tehran.
Ex. (67) The K-suffix for situational definiteness with K-suffix īn māšīn-e tū masīr-e tehrūn kār mī-kon-e āġā PROX car-DEF in way=EZ Tehran work IMP-do.NPST-3SG sir "this car works the Tehran line, sir" 159 Example (68) displays a situational definiteness context in which the speaker does not combine a demonstrative with the K-suffix.The basket was previously introduced in line 3 of the story.In the example below (line 4 of the narrative) the speaker points to the basket and says, "give me this basket." 155Ibid. 156Ibid. 157Taghi, A Typology and Classification of Three Literary Genres, 238. 158See Nourzaei, "Definiteness Marking," examples 35-38. 159Nourzaei, Unpublished texts, recorded between 2018 and 2021.

Structural Constraints on K-suffix with Anaphoric Definiteness in CSP
As previously mentioned, anaphorically definite nouns are marked with a K-suffix in CSP.However, the presence of the K-suffix is systematically inhibited under certain conditions.In the following subsections I will describe the main systematic structural constraints on use of the K-suffix with anaphoric definiteness.

Plural
Nouns marked with a plural marker never take a K-suffix regardless of their definiteness status, as in the following examples.

Possessed Nouns
In addition to the independent pronouns, there are person-marking clitics (PC), which are used with all functions of the oblique case, direct and indirect objects, and as possessive pronouns. The K-suffix is systematically absent from possessed nouns formed with a clitic possessive pronoun, e.g., "his cow," "your son," and pronouns, e.g., baxt-e doxtar-e mā "the fate of our daughter." However, it appears with other possessed constructions formed with ezafe constructions, e.g., xūneye pedar-e "the father's house." This system is similar to Shirazi Persian 163 and is contrary to Koroshi. In Koroshi, the K-suffix does not appear with all types of possessive constructions. 164
Ex. (71) Absence of the K-suffix with a possessed noun
mādar=etūn hazer=e pesar=etūn masalan mother=PC.2SG ready=COP.NPST.3SG son=PC.2SG for example pesar-e barā zan=eš kār be-kon-e son-DEF for wife=PC.3SG work SUBJV-do.NPST-3SG "is your mother ready, your son for example, the boy should work for his wife" 165
Ex. (72)
bad az arūsī=šūn īn bar gašt xūne=ye pedar-e after wedding=PC.3PL PROX PREV turn.PST.3SG house=EZ father-DEF "after their wedding, the girl (lit. this) returned to the father's house" 166

Proper Nouns and Titles
Generally, the K-suffix is absent from titles and proper nouns, as in examples (73) and (74). 167 It is notable that, as in Central Kurdish 168 and Koroshi, 169 king and mullah are considered proper nouns in Persian. 170 In Shirazi data, mullah is not considered a proper noun and is marked with a K-suffix -ū, e.g., āxūnd-ū "the mullah," unlike pādšāh/pādošāh "king." 171. ( 73 Note that in fairy tales the K-suffix is attested with a title in āġā dīv-e "Mr. Demon." 174 However, both the titles Mrs./Madam and Mr./Sir are marked with the K-suffix when they are used alone, as in example (75).

Some Prepositions
The data demonstrate that the K-suffix is absent in some combinations with prepositions in the corpus data: sorāġ "after," az "from," be "to," az bālā "above," as in examples ( 78)-( 81).Note that there is great variation among the speakers.

Ex. (78)
xod=eš raft sorāġ-e pīrezan REFL=PC.3SG go.PST.3SG after=EZ old lady "he went after the lady" 179
Ex. (79)
goft-an yek=ī-ro dād-īm be pīrezan say.PST-3PL one=PC.3SG-OBJ give.PST-1PL to old.lady "they said, one of them we gave to the old lady" 180
Ex. (80)
alān be šīr mī-res-īm now to lion IMP-arrive.NPST-1PL "now we will arrive at the lion" 181

Particle ham/am
The data show a significant variation across the speakers regarding the absence of the K-suffix before the particle ham/am. The same speaker systematically does not apply the K-suffix before this particle, as in the following examples.

Unexpected Absence
I have already discussed the attested constraints of the K-suffix in anaphoric contexts.However, there nevertheless remains a residue of nouns in definiteness contexts that lack the K-suffix.Hence the term "unexpected absence" of K-suffix is used. 187The number of such unmarked definite NPs varies considerably across different speakers in our corpus (see below), indicating considerable inter-speaker variation.
In the following passage, the lion, as the main character in the tale, appears without marking with the K-suffix in the definite contexts.In both examples, the lion and the girl are the main characters in the story, and after several mentions with a K-suffix, they appear without a K-suffix.See also the NP gorbe, "cat" in example (56), which lacks a K-suffix despite the cat being one of the important characters in this tale.

Ex. (85)
šīr harče dast va pā mī-zan-e lion Whatever hand and feet IMP-beat.NPST-3SG "the lion is trying a lot (lit. is beating its hands and feet) […]" 188 183 PLD. 184Ibid. 185To be certain, I have checked some passages with the K-suffix in this type of environment with fifteen native speakers.I found the same variation across the speakers.The same observations hold regarding the prepositions. 186Taghi, A Typology and Classification of Three Literary Genres, 237. 187See also more passages with unexpected absence of K-suffixes in Kalbasi, Towsife gunehā-ye zabānī-ye īrān, 227-28, such as the NPs kūze "jug" and zan "the woman." 188Taghi, A Typology and Classification of Three Literary Genres, 237.

Summary
The K-suffixes in CSP are associated with definiteness contexts, usually anaphoric, and very rarely appear in bridging contexts. They are systematically excluded from indefiniteness contexts and are not associated with obvious evaluative or diminutive semantics. In this sense, we can speak of a definiteness function of the K-suffix in CSP, which distinguishes CSP from CWP. However, in CSP definiteness is a necessary but not sufficient condition for the K-suffix: there are still many notionally definite NPs in our corpus that do not take a K-suffix. First of all, we noted certain structural conditions that inhibit the presence of a K-suffix: (a) plural marking of the noun, (b) combination with clitic pronouns and the copula, (c) construal of the noun as a title or proper noun, (d) occurrence after some prepositions, (e) occurrence before the particle ham/am, (f) occurrence after some nouns, (g) combination with demonstrative pronouns.
The extent of the residue of definite but unmarked items varies from speaker to speaker and according to genre and speech situation.In the next section, we explore the quantitative data from our corpus to shed light on the nature of the changes that have occurred in Persian.
6. The Emergence of Definiteness: Evidence from the Corpus and the Questionnaire
While the grammaticalization of definite markers has been a central issue in grammaticalization theory, researchers usually cite cases (the languages of Western Europe) in which the source of the definite article is some form of deictic element (a "D-element" according to Himmelmann 192), and this has become the primary paradigm for understanding the diachronic development of definiteness marking cross-linguistically. However, in our ongoing survey of Western New Iranian languages, and Persian in particular, the definiteness suffix has an entirely different source construction, as it comes from an evaluative suffix. Thanks to the existence of data from earlier phases of Persian, we can formulate some initial hypotheses regarding the developmental sequence that led to the current situation. We can see here that the definiteness marker in Persian does not originate from a demonstrative source. And in particular, its combination with the demonstrative pronoun rules out a demonstrative origin.
An overview of the corpora for CNP, CWP and CSP is provided in Table 3.
A second source of data is a questionnaire conducted between 2018 and 2021 with fourteen Tehrani speakers, which is discussed below. But first I consider two metrics from the narrative corpus: the overall frequency of the K-suffix and the distribution of the K-suffix across the corpora for these three phases.

Overall Frequency of K-suffixes
Overall frequency is counted as the number of occurrences of K-suffixes across all texts in the corpus per orthographic word,193 normalized to a value of frequency per 1,000 words, to enable comparison across texts of different lengths.Consideration must be given to the fact that a value of zero is not particularly significant in a small text, while zero occurrences in a larger text is much more significant.Nine texts have fewer than 700 words overall, and in many of them, the number of K-suffixes is high; I left them out of this calculation.The results for the three phases are demonstrated in Fig. 2. The vertical axis represents mean values and the bars give the data for each corpus.
There are some points of interest here. First, the hypothesis that overall frequency would increase with a shift towards a definiteness function is confirmed. In CSP, the mean value of K-suffixes per 1,000 words is 3.2, sixteen times higher than in CWP (0.2), and just over three times more than in CNP (1.0). However, it is also clear that the higher frequency of K-suffixes in CSP is largely the result of three data outliers, with 10.0, 8.0, and 7.0 K-suffixes per 1,000 words, respectively, more than twice the figure for any other texts having a K-suffix, while eight texts still have no items marked with K-suffixes. 194 Thus, CSP is not characterized by the consistently high level of K-suffixes that one would expect if the forms were uniformly grammaticalized as definiteness markers in this language. Overall frequency is, at best, a very crude measure of grammaticalization, however. 195 Note that this is the opposite of our Shirazi results, in which the K-suffix can be found across all the texts.
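The normalization and the ratios discussed above can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the `MIN_WORDS` constant, and the sample counts are assumptions introduced here and are not the study's data, and averaging per-text rates is one possible reading of "mean value". Only the three phase means (1.0, 0.2, 3.2) are the figures reported above.

```python
from statistics import mean

MIN_WORDS = 700  # assumed cut-off: shorter texts are left out of the calculation


def per_thousand(k_count, word_count):
    """K-suffix occurrences normalized to a rate per 1,000 orthographic words."""
    return 1000 * k_count / word_count


def mean_frequency(texts):
    """Mean per-1,000-word rate over texts with at least MIN_WORDS words.

    `texts` is a list of (k_suffix_count, word_count) pairs.
    """
    rates = [per_thousand(k, n) for k, n in texts if n >= MIN_WORDS]
    return mean(rates)


# Hypothetical texts (invented counts, not the study's data).
sample = [(10, 1000), (0, 2000), (4, 500)]  # the 500-word text is excluded
sample_mean = mean_frequency(sample)

# Ratios between the mean values reported for the three phases.
reported = {"CNP": 1.0, "CWP": 0.2, "CSP": 3.2}
csp_vs_cwp = reported["CSP"] / reported["CWP"]
csp_vs_cnp = reported["CSP"] / reported["CNP"]
print(round(sample_mean, 1), round(csp_vs_cwp, 1), round(csp_vs_cnp, 1))
```

Because a single mean hides outliers and zero-count texts, it would be natural to inspect the list of per-text rates alongside it.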
Recall that the qualitative investigation of these three phases demonstrates that in CNP and CWP, K-suffixes are used with evaluative meaning in most instances of use.Given that K-suffixes in these phases are not associated with a predictable and commonly recurring function, we would not expect a uniform frequency of use.Indeed, frequency of evaluative usage may simply be a matter of genre.
In CSP, on the other hand, K-suffixes are not associated with evaluative and diminutive semantics, but are associated with definiteness.However, the association is not fully regular because, as previously mentioned, structural conditions inhibit the K-suffix.Some definite nouns also lack the expected K-suffix for reasons that are not fully understood.It is highly restricted with regard to inter-speaker, inter-setting, and inter-genre factors.
The second remark concerns the decrease and increase in frequency exhibited by the K-suffix in Persian.On the one hand, we can see a significant drop in the frequency of the K-suffixes in CWP.This decrease may be due to the fact that their syntactic domain is becoming increasingly restricted, which means they can only appear with a handful of singular nouns in informal and colloquial settings.Their semantic domain (polyfunctional evaluative notions) is becoming bleached, and the suffix is moving towards definiteness.
Recall that we can find no restrictions on the K-suffix in CNP.It can be found in all parts of speech, apart from verbs and pronouns, throughout the texts.I have noticed the same result in our ongoing survey in Shirazi and Balochi. 196 It needs to be checked in Kurdish and Lori as well, which are currently being analyzed.
The third point of interest concerns the massive inter-writer/speaker and inter-genre differences found in CWP and CSP, but not in CNP. We observe that the K-suffixes are attested in all the CNP texts studied. What is significant in CNP is the region from which the author of a work comes. We find that works written in the east of Iran have a higher frequency of K-suffixes than ones written in the north. Indications that the K-suffix is developing towards a definiteness marker (see examples 32-33) are also attested in two works titled Tārikh-e Beyhaqi and Nowruznāme, the authors of which come from Khorasan. This might be connected to Lazard's observation that New Persian originated from Khorasan in eastern Iran. 197 The variety of Persian spoken in Khorasan was influenced by Semitic language earlier than Persian varieties spoken in the north of Iran.
The data from CSP demonstrates that only specific kinds of texts contain K-suffix marking. The texts with a high frequency of K-suffixes in the CSP corpus comprise three traditional folktales and two biographical tales. We cannot find the K-suffix with topics such as education, science, human rights, or the coronavirus, which require formal style. This suggests that genre is the decisive factor in CSP. Development of definiteness marking within a specific genre has been reported for the Finnish language. Overall, the data does not show a simple picture of spread from an assumed anaphoric usage, which is commonly taken as prototypical for definiteness marking, as suggested in grammaticalization theory for Persian.200 In the following section I will examine the results of the questionnaire data.

Presentation of Questionnaire Data
In addition to the corpus data, I tested data from a questionnaire answered by fourteen speakers. The questionnaire used a set of 102 items built into six "mini-narratives," each representing short episodes of approximately ten sentences. In order to capture authentic colloquial speech, we circulated the English form of the questionnaire among participants and asked them to translate it orally into colloquial Persian. Their narratives were recorded with a mobile phone, and the relevant NPs were coded for presence vs. absence of K-suffixes and a number of other features. The results here are from the initial pilot in colloquial Persian based on fourteen speakers (nine female and five male), all of whom come from Tehran.
Fig. 3 presents the percentage of nouns carrying a K-suffix in the respective contexts: first mention (indefinite), bridging, anaphoric, demonstratives, possessed, personal nouns, unique references, and non-referential/generic (as in negated existentials, such as "in those days there were no cars"). When considering the questionnaire data, we find that more than half of the nouns in anaphoric contexts do not take K-suffixes. Other nouns in these contexts are bare nouns or were in the plural, and such cases are not counted here.
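As a minimal sketch of how percentages of this kind could be derived from coded NP tokens, the following Python fragment computes, for each context, the share of NPs carrying a K-suffix per speaker and then the rounded mean across speakers. The token layout, the speaker labels, the context names, and the function names are all assumptions made for illustration; the rows are invented and are not the questionnaire data, and the exclusion criteria used in the study are not modelled here.

```python
from collections import defaultdict

# Hypothetical coded tokens: (speaker, context, has_k_suffix).
# These rows are invented for illustration only; they are not the study's data.
TOKENS = [
    ("S1", "anaphoric", True),
    ("S1", "anaphoric", False),
    ("S1", "first_mention", False),
    ("S2", "anaphoric", False),
    ("S2", "anaphoric", False),
    ("S2", "first_mention", False),
]


def per_speaker_percentages(tokens):
    """Return {context: {speaker: percent of NPs carrying a K-suffix}}."""
    # counts[context][speaker] holds [marked, total]
    counts = defaultdict(lambda: defaultdict(lambda: [0, 0]))
    for speaker, context, has_k in tokens:
        cell = counts[context][speaker]
        cell[0] += int(has_k)
        cell[1] += 1
    return {
        context: {
            spk: 100.0 * marked / total
            for spk, (marked, total) in speakers.items()
        }
        for context, speakers in counts.items()
    }


def mean_percentages(tokens):
    """Average the per-speaker percentages within each context, then round."""
    per_speaker = per_speaker_percentages(tokens)
    return {
        context: round(sum(values.values()) / len(values))
        for context, values in per_speaker.items()
    }
```

Averaging per-speaker percentages, rather than pooling all tokens, gives each speaker equal weight regardless of how many NPs that speaker produced; whether the figures were computed in exactly this way is an assumption.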
As presented in Figs. 3 and 4, overall and across all speakers, we find massive interspeaker differences in the marking of anaphoric definiteness. Only three speakers use the K-suffix in bridging contexts. The most common forms in bridging contexts are bare nouns or possessed nouns, as we observe in the corpus data.
Moreover, we find consistent observance of the structural constraint against use of K-suffixes with plural markers, possessed nouns formed with person-marking clitics, and generic nouns, along with a complete absence of K-suffixes in the indefinite. Furthermore, we find a consistent lack of K-suffixes with personal names. On the whole, this is the system that was found with the corpus data, as discussed previously. In the following section I will comment on the origin of the various K-suffixes in light of the present data.

K-suffix -ak
In general, the K-suffixes developing towards a definiteness marker in our New Western Iranian languages survey appear to be derived from *-ka-, presumably with the diminutive (and perhaps) pejorativ[e] formations. The K-suffix -ak in CNP might derive from Middle Persian -g, as in Pusar-ag<pesar-ak "boy" and duxtag<doxtar-ak "girl." The K-suffix -ak is attested in Persian varieties such as Shirazi Persian as an evaluative suffix, alongside the K-suffix -ū used as a definiteness marker.201

K-suffix -e/he 202
The etymological origin of the K-suffix -e/he is not yet clear to me, and I leave it as an open question. However, I can offer the following two hypotheses: (1) The K-suffix -e/he might be a short form of the -ak suffix in CNP. The sound k has been dropped, and the a sound has changed to the e sound. This type of sound shift is widespread among Iranian languages, as in dastag>daste "handle." In addition, a natural development from Middle Persian to New Persian is the change of Middle Persian -ag to -e, as is apparent in setārag>setāre, and in the participle -ag>-e, as in kardag>kard-e, as well.
Across the CWP corpus, however, I found many nouns with a combination of both -ak and -e suffixes, for instance, zan-ak-e "woman," mard-ak-e "man," and the following interesting variation of this combination with the same noun "demon." In its first mention in the story, it appears as yek dīb-ak=e sīyā, "a black demon," and then subsequently as dīb-e "demon," dīb-ak-e "demon," and dīb-ak "demon."203 If we assume that the K-suffix -e is a short form of -ak, we should not find both suffixes combined on the same noun. The co-existence of both suffixes -ak and -e in this scenario seems to be awkward.
(2) The K-suffix -e might have originated from another source instead of being directly connected to the -ak suffix in CNP. However, both of them (the -ak and -e/he suffixes) are related originally to the same semantic notions, that is, evaluative ones (K-suffixes).
An ongoing study by Hashabeiky on Persian (from the sixteenth to eighteenth centuries) shows that only one form of the K-suffix -ak with evaluative sense has been written in an informal style, in two of her manuscripts.205 However, Nadimi Harandi and Atayi Kachooyi provide evidence of the K-suffix -e in poetry much earlier (poet, ʿAtar-e Neshaburi, thirteenth century).206 This finding suggests that the K-suffix -e has been used by Persian speakers (in informal settings) but has not been registered in earlier texts.
Similar to the K-suffix -ū in Shirazi Persian, available data with the K-suffix -e in Persian shows that this suffix mostly appears with singular nouns and in informal registers. We do not have evidence of its final phonological form. For Shirazi -ū, we can trace this suffix back to -ūk, used as an evaluative suffix in other Iranian languages such as Bami, Kermani, and Sangsari,207 while the etymological origin of the Persian -e suffix remains a puzzle for the time being.
In this regard, similar to my observation in Shirazi208 of two K-suffixes -ak and -ū originally used as evaluative suffixes, I would suggest that there have been different K-suffixes in Persian with an evaluative meaning (-ak, -īk, -ūk/*ek). Whether or not they are related to the same origin is irrelevant here; what matters is that they show similar (evaluative) semantics. These various forms are most probably a matter of Persian dialectal variation, for which we do not have recorded material of the earlier stages. The K-suffix -e has been grammaticalized as a definiteness marker, and the -ak suffix continued to carry evaluative semantics regardless of genre in written, spoken, formal and informal language settings. However, its evaluative senses, such as endearment when used with proper nouns, have to a large extent been bleached,209 and its pejorative meanings have become colorless.
Note that the short form of the K-suffix -īk as -ī can still be found in Persian speech, such as in māmī (my lovely mother) and xāharī (my lovely sister), but it is not so frequent. This suffix is very productive as a marker of endearment in other Iranian languages, including Balochi Sistani.210 Note that in Sistani Persian, the K-suffixes -ak/ok are still very productive on proper nouns and reflect endearment and pejorative meanings.

Considerations of Sources and Paths of Development
The CNP, CWP and CSP corpora studied here exhibit three different types of development of the K-suffix (the reflexes of cognate and originally evaluative morphemes), which can be interpreted as comprising a scale. In CNP, the most conservative stage in the present study, the K-suffix functions as a polyfunctional evaluative morpheme covering a typical array of functions generally associated with diminutives cross-linguistically,211 which are not constrained by definiteness and not subject to structural constraints. However, already at this stage we find some passages with singular nouns in deictic and recognitional contexts. CNP thus lies at one end of the scale.
Located in the middle, CWP shows a pre-grammaticalization stage of definiteness marking. The original evaluative meaning of the K-suffix is maintained at its highest usage, but the suffix is subject to structural constraints (i.e., it occurs mostly with singular nouns). It shares deictic and recognitional usages of the K-suffix with CNP. In these usages the suffix is still very immature, and is only sporadically and unsystematically used, even by the same writer, with a handful of nouns.
CSP is found at the other end of the scale. The evaluative usages are not attested, and the suffix is not compatible with indefinite contexts. It shares the constraint regarding singular nouns with CWP, but increases in frequency and becomes more closely associated with definiteness contexts. The system does not show a uniform spread across speakers and genres. In the narrative texts investigated, we found a few speakers of CSP who had taken this usage (marking of the NPs with a K-suffix) a step further and now used the K-suffix systematically as a distinct marker of anaphoric definiteness, especially in folktales, biographical genres and informal settings.
This comparison between different stages sheds light on a developmental path from evaluative morpheme to definiteness marker in Persian, as summed up in Table 4. The grammaticalization path is similar to what I already have suggested for other New Western Iranian languages, including Balochi and Shirazi.212 These findings suggest that the development of definiteness marking can proceed down a new pathway that is entirely distinct from the one generally presented (demonstrative-based) from a typological perspective. Despite the different pathways, however, the endpoints may be fairly similar. Here the starting point is an evaluative marker. In the first stage of the development, evaluative usage is compatible with deictic and recognitional usage, which often occurs with demonstrative pronouns. The latter are anchored to a concrete and interactive speech context involving some form of "attention direction" on the part of the speaker. In the second stage, evaluative usages may bleach or disappear entirely. In contrast, the deictic and recognitional usages are extended to include anaphoric tracking, which would be more independent of setting and not necessarily dependent on immediate interactions. In the final stages, the K-suffix is systematically associated with anaphoric definiteness contexts, although the system continues to co-exist with inherited unmarked definite strategies (bare noun and demonstrative plus noun). Thus, the basic system of definiteness marking with a K-suffix is similar to the more familiar article-based system, of which anaphoric definiteness is generally the core function.
Several differences can also still be discerned, in particular the constraint that prevents definiteness marking in combination with plural marking and possessed nouns formed with a person-marking clitic. In a recent cross-linguistic study on definiteness,213 Becker found no typological evidence for the compatibility of definiteness markers with plural number (although there is clear evidence for incompatibility between indefiniteness markers and plural number). Thus, the Persian constraints (along with Shirazi and Balochi) remain somewhat of a puzzle, compared to definiteness markers in Lori Bakhtiyari and Central Kurdish based on the same K-suffix, for which no such constraints exist. I leave this as an open question, but assume that the constraint might be due to the following facts: (a) these two suffixes (the plural marker and the K-suffix -e/he) are compatible morphologically (since both the plural marker -hā and short form of -e are new in the language); (b) they are compatible semantically, because the plural marker -hā already has a definiteness function, and it does not need to be marked again with another element (-e/he);214 and (c) the starting point of an evaluative marker in deictic and recognitional contexts in CNP is singular nouns, which suggests a possible scenario, similar to that of the intrusion of the object marker (-rā) into the nominal system with singular nouns, for example in Balochi,215 where the singular nouns are initially attracted more to the K-suffix than to the plural nouns. I have also noticed a tendency of using the K-suffix with the plural marker in Lori spoken in Fars. This is a topic for future study.
211 Ponsonnet, "A Preliminary Typology."
212 Nourzaei, "Definiteness Marking"; Nourzaei, "History of the Suffix -ū in Shirazi."
213 Becker, "Articles in the World's Languages."
Finally, concerning the development of the definiteness marker in Persian, I would suggest that internal developments, for example the reduction of the case system in Persian, may have favored the emergence of an additional nominal category such as definiteness. So far in our survey, languages/dialects with a reduced case system exhibit the development of the definiteness marker, for example, Shirazi, Koroshi, Lori, and Central Kurdish. On the other hand, one should not overlook language contact (possible earlier Persian contacts with the Semitic languages); see also Haig and Khan.216 The ongoing project suggests that several New Western Iranian languages have developed some nascent form of definiteness marking based on evaluative morphology.
Due to the extensive documented material from its earlier phases, the Persian case presented here will provide a benchmark for future studies of Iranian languages, and will broaden the database for our understanding of the development of definiteness cross-linguistically.

Figure 1. Location of the data for Contemporary Spoken Persian.

Ex
CLM door=EZ house=EZ father.PC.3SG-OBJ open do.PST-PP būd be.PST.3SG
"when the older son has opened the father's door of the house" 108

Absence of the K-suffix with a proper noun
to masalan bā sīyāvoš rāh mī-raft-ī
PN.2SG for instance with Siyāvosh way IMP-go.PST-2SG
"for instance, you walked with Siyāvosh […]" 172

Ex. (74) Absence of the K-suffix with a title
xub āġā=e doktor dar har sūrat
well Mr=EZ doctor in each face
"well, Mr. Doctor at any rate […]" 173

come.PST.3SG arrive.PST.3SG to one water and tree=IND girl-DEF from bālā=ye deraxt faryād zad above=EZ tree shout beat.PST.3SG
"he came [and] arrived at a [body of] water and a tree […] the girl shouted from above the tree" 182 198

Figure 3. Percentage of K-suffixes, based on questionnaire (fourteen speakers, rounded mean percentages of all speakers' responses).

Figure 4. Percentage of the K-suffixes, based on questionnaire (fourteen speakers, rounded mean percentages of individual speakers' responses).

Table 1. List of the critical editions from which data has been extracted.
"It was with this pencil that I wrote my meeting place [address] [and] gave it to that girl with whom I have become acquainted recently" 114

50×50-square-meter bit of cloud [coming] from those clouds appeared from the back side of the mountains, as soon as that cloud started to rain […]" 116

Table 3. An overview of the corpus.

"History of the Suffix -ū in Shirazi"; Nourzaei and Haig, "An Overview of Definiteness Marking"; Nourzaei and Haig, Emerging of Definiteness Markers in New Western Iranian Languages.
PP.COP.NPST.3SG IMP-come.NPST-3SG lion-DEF up=EZ head=EZ old man-DEF
"they saw an old man sitting on the ground like a chick, the lion came to the old man" 144

IMP-see.NPST-3SG CLM one lion have.NPST-3SG from far IMP-come.NPST-3SG
"it goes until [it] arrives at a cow, the cow sees that from a far distance a lion is coming" 146

OBJ IMP-put.PST.3SG into=EZ one basket basket-DEF from dast=am oftād hand=PC.1SG fall.PST.3SG
"he put the apples into a basket […] the basket fell down from my hand" 147

"anyway, the youth wanted to go on a trading journey, he asked his wife what she wants, this girl [he] asked as well" 150

A combination of the K-suffix with a demonstrative pronoun is common in other Persian varieties such as Hamedani in example (62), and in the Qomi variety of Persian.151

DEF sick become.PST.3SG
"Mr. himself, could not take care of himself, we have another case, you know, Mrs. got sick […]" 175

In the same text, example (84), the speaker uses the K-suffix before ham, as in doxtar koulī-ye ham, and does not apply it to the following clause doxtar koulī ham. Such examples certainly need more research.185

DEF
"you know, the gypsy girl took out the last needle, […], the gypsy girl, whatever she has heard from the girl falsely […]" 186

Similar to examples (84)-(87), in example (88) the old lady is one of the main characters in the story. After several mentions with a K-suffix, she appears without a K-suffix.

3SG
"the old lady said, do not worry, my daughter knows how to make it calm" 191

143 I have found one instance of the suffix in formal text, with the word pesar, as pesar-e
145 Ibid.
146 Ibid.
147 Nourzaei, Unpublished texts, recorded between 2018 and 2021.
148 LD.
149 See Taghi, A Typology and Classification of Three Literary Genres, 97.

https://doi.org/10.1017/irn.2021.27 Published online by Cambridge University Press

Table 4. Overview of grammaticalization path from evaluative to definiteness functions.