On the nature of morphological alternations in Archaic Chinese and their relevance to morphosyntax

Abstract While sound glosses from the Six Dynasties and early Tang provide direct evidence for morphological alternations in Archaic Chinese, studies on the syntax of this language generally disregard these data. This neglect is due in part to perceived unreliability of these sound glosses. In this paper, I first argue that the arguments against their reliability do not stand scrutiny, and that they are not a simple philological curiosity, but have the potential to enrich studies on Archaic Chinese syntax, and plead for more collaboration between syntacticians and historical phonologists.


Introduction
While a considerable amount of research has been devoted to the grammar of Archaic Chinese 1 since von der Gabelentz (1881), it is striking that the study of morphological alternations on the one hand, and of syntax on the other, have been led as two independent disciplines with little interaction between them, even in the work of scholars such as Zhou Fagao, Mei Tsulin, or Edwin Pulleyblank, who have contributed to both fields.
The exact amount of morphology that can be reconstructed for pre-Han Chinese remains a controversial matter: some scholars posit a rich system of affixes (Sagart 1999), while others treat Archaic Chinese as devoid of morphological alternations (Zádrapa 2011). I argue in this paper that, regardless of which system of Archaic Chinese reconstruction one adopts, the voicing and tonal alternations are incontrovertible evidence showing that this language was not purely isolating. Moreover, this morphology is not just a matter of word formation, but also has consequences for syntax, and is in particular incompatible with the notion of Archaic Chinese as a "pre categorical" language (Bisang 2008, Sun 2020).

Tonal and voicing alternations in Archaic Chinese
Given the fact that the phonological reconstruction is still a debated topic, Sagart 2014, Schuessler 2015), I discuss the question of morphological reconstruction mainly from the point of view of Middle Chinese, a language whose phonological system is well understood, transcribed using a version of Baxter's (1992) system converted to IPA. 2 Archaic Chinese reconstruction is mentioned only briefly when absolutely necessary.
In this section, I present examples of morphological alternations found in Archaic Chinese texts, and their cognates in modern Sinitic languages. I additionally show that, although various scholars have argued since the seventeenth century that these alternations were spurious and invented by Han and Six-Dynasties scholars, none of the arguments stand close scrutiny.

The Jingdian shiwen
Our main source on the morphological alternations in Archaic Chinese is the 經 典釋文 Jingdian shiwen, a collection of sound glosses on pre-Han classical texts by the Tang dynasty scholar Lu Deming 陸德明 (556-630). 3 This work records the pronunciation of rare characters, or of characters with several alternative readings, using either the fanqie 反切 method (glossing pronunciation with two characters, the first indicating the initial consonant, and the second the rhyme) 4 or directly glossing pronunciation with a homophonous character.
There are in addition a number of less common, and poorly understood, alternations involving vowels or aspiration (see Schuessler 2007: 51-78 for possible examples), but these will not be treated in the present paper.

Morphological alternations and analogy
It is undeniable that alternations originating from voicing or tonal alternations in Middle Chinese are present in standard Mandarin and most Sinitic languages for which appropriate documentation exists, as illustrated in Table 1.
Since the verbs in Table 1 belong to colloquial vocabulary, the idea that these alternations are purely scholarly inventions ( §2.3.2) cannot be accepted: it is unrealistic to suppose that alternative readings invented by scholars could have penetrated the colloquial layer when literacy was not universal.
Alternative readings shared by both Mandarin and Middle Chinese are, however, rare. Most of the alternations in the Jingdian shiwen have left no trace in Mandarin and other Sinitic languages. However, this in itself is not surprising in view of the fact that even the pairs in Table 1 are not maintained by all Mandarin speakers. Some people generalize the transitive 折 zhé "break" even to describe spontaneous breaking, or use 繫 xì for transitive uses, saying for instance 繫安全带 xì ānquándài "fasten your seat belt" instead of jì ānquándài.
This variation is due to the generalization of one of the two members of the pair (which can be either the transitive or the intransitive verb), an elementary case of analogical levelling. These fossilized alternations are comparable to irregular verbs, and their loss can be viewed as a type of regularization, by analogy with non-alternating labile verbs, which constitute the majority.
However, not all alternations found in modern Sinitic languages necessarily go back to Middle Chinese or earlier. A particularly telling example is that of 背, which has two Middle Chinese readings pwojH (in the meaning "back") and bwojH ("leave in one's rear, 7 betray"). In Mandarin, the character 背 has two readings, bèi (corresponding to the two previous nominal and verbal meanings), and also bēi "carry on the back", a pronunciation without support from Middle Chinese, and absent from Cantonese and other Sinitic languages. The expected Middle Chinese corresponding to bēi would be †pwoj, but this reading is absent from all ancient sources, and bēi is not listed in dictionaries before the 1930s (Shi 1999). 8 Table 2 provides an account of how the verb bēi "carry on the back" was created within Mandarin in the early twentieth century: 背 bèi "back" was reanalysed as an instrumental nominalization like 鑽 zuàn "drill (n)", and a verb bēi "carry on the back" with level tone was invented by backformation, following the pattern of the alternation between 鑽 zuān "drill (v)" and 鑽 zuàn.
Thus, analogical levelling can have taken place between Middle Chinese and Mandarin, and similar phenomena may have existed between Archaic and Middle Chinese. Nevertheless, for analogical extensions of alternations and backformation such as that illustrated above to take place, a kernel of inherited alternations must have existed in the first place, and the alternations found in Middle Chinese cannot have been entirely made up out of nothing.

Arguments against the reliability of the Jingdian shiwen
The neglect of the Jingdian Shiwen in studies on Archaic Chinese syntax can be partly explained by a perceived lack of reliability of the readings contained in this source (Zádrapa 2011: 71-6). In this section, I present the main arguments that have appeared in print against the reliability of the Jingdian shiwen, and provide counter-arguments.

Absence in modern Sinitic languages
One line of argument against the value of Jingdian shiwen comes from the fact that few of the variant readings it contains are attested in Mandarin, and that even among those that are recorded, they are mere dictionary pronunciations, and do not exist in the spoken language. As an example of this phenomenon, Zádrapa (2011: 74) cites the alternation between the noun 衣 yī "clothes" (from Middle Chinese ʔjɨj) and the transitive verb yì "wear" (from ʔjɨjH): the latter is obsolete and rarely distinguished even by scholars when reading Classical Chinese. 鑽 zuān "drill (v), bore into" 鑽 zuàn "drill (n)" ∅ ⇒ bēi "carry on the back" 背 bèi "back" However, the obsolescence of this reading is expected in view of the fact that the verb 衣 ʔjɨjH "wear" has been replaced in the spoken language by 著 ʈjak (since the Han dynasty, still in use in Cantonese) and by 穿 tɕʰwen (since the Tang dynasty). While the reading ʔjɨjH is not living in any mainstream Sinitic language, it has been preserved by Bai (Gong 2015: 2). For instance, in Jianchuan Bai we have ji̠ 21 "wear" vs ji 55 "clothes" with the regular tonal correspondences.
This remarkable archaism in the Bai language, regardless of the controversy regarding the phylogenetic status of this language, 9 shows that the verb 衣 ʔjɨjH cannot be a scholarly invention. The verb 衣 ʔjɨjH "wear" is attested already in the Shijing, and occurs in a figura etymologica with its cognate noun, as in (1).
(1) 衣錦褧衣 ʔjɨjH kimX kʰweŋX ʔjɨj wear brocade slip-over clothes "(Lady Wei Jiang) is dressed in a brocade robe and an unlined slip over robe." (Shijing, 57) The same construction still exists in Bai, where the collocation ji̠ 21 ji 55 "wear clothes" is attested. This example illustrates the fact that Mandarin is not particularly conservative when it comes to preserving traces of morphological alternations, and that a broader survey of alternations in other Sinitic languages (including Bai), or in borrowings from Chinese into Hmong-Mien, Kra-Dai, or Vietic languages, would be needed to evaluate the proportion of readings in the Jingdian shiwen which are indeed not confirmed by data from modern languages.

Doubts about the reality of some of the alternations in the writings of early philologists
It is a well-known fact that Qing dynasty philologists such as Gu Yanwu and Qian Daxin believed that the alternations recorded in the Jingdian shiwen came into existence after the Han dynasty (see for instance Shen 2007: 104). However, such a hypersceptical view does not explain how these alternations could have come into being, and it is not compatible with the existence of traces of such alternations in Bai ( §2.3.1).
Moreover, there are cases when tonal and voicing alternations are reflected by distinct characters, such as 受 dʑuwX "receive" and its causative counterpart 授 dʑuwH "offer", a distinction already attested in the Oracle bone inscriptions (Takashima 2013, pace Djamouri 2013.
Among the reasons for the scepticism of Qing scholars, we find an oft-quoted passage by the Northern Qi scholar Yan Zhitui: 9 Regardless of whether Bai is a non-Sinitic language whose native vocabulary has been almost entirely replaced by several layers of Chinese (Lee and Sagart 2008), or an outlier Sinitic language (Starostin 1995, Gong 2015, the fact that it is not a literary language means that if 衣 ʔjɨjH was an entirely artificial reading, it could not have succeeded to become the main verb used in that language. Scholars from the Jiangnan area, when they read the Zuozhuan, have a made-up rule that was orally transmitted, whereby every time one's army is defeated, the verb "defeat" 敗 read baejH (蒲 bu+邁 maejH), and whenever one defeats an enemy army, it is read paejH (補 puX+敗 baejH). However, in all transmitted documents, I have never seen the reading paejH. In Xu Xianmin (Xu Miao)'s reading of the Zuozhuan, there is only one example of this reading, and it has nothing to do with the difference between "be defeated" and "defeat someone", (showing that the reading rule of Jiangnan scholars) is far-fetched and implausible. 10 In this passage however, Yan Zhitui does not express doubts on the system of alternations as a whole, but more specifically on the reading of the character 敗 in the tradition of the Jiangnan area. His stated reason for refusing to accept their tradition is the absence of such alternation in Xu Miao's work, and presumably in his own speech. However, rather than a testimony of the unreliability of the Jingdian shiwen, this passage shows that the Jiangnan dialect had preserved a distinction lost in Yan Zhitui and Xu Miao's dialects (both scholars were from present-day Shandong, though the latter lived nearly centuries before the former), a phenomenon no more surprising than the obsolescence of the reading 衣 yì "wear" in Mandarin mentioned above ( §2.3.1).
A second line of argument by Gu Yanwu against the reality of the alternations concerns the example of 惡 ʔak "evil" and 惡 ʔuH "hate". Gu Yanwu points out that the character 惡, used as a noun "evil, wrongdoings" in the Chuci (2) unexpectedly rhymes with 固 kuH and 寤 ŋuH. It is clear that in this passage 惡 is a noun, since it serves as the object of 稱 tɕʰiŋ "proclaim" and is opposed to the noun 美 mijX "beauty, good quality". 11 (2) 世溷濁而嫉賢兮,好蔽美而稱惡 ɕejH ɣwonH.ɖaewk ɲi dzit ɣen ɣej, xawH world murky LNK envy worthy INTERJ like pjiejH mijX ɲi tɕʰiŋ ʔak/ʔuH hide good.quality LNK proclaim defect "For the world is impure and envious of the able, Eager to hide men's good and make much of their ill." (Lisao, Hawkes 1985) For the rhyme to work, a Middle Chinese reading ʔuH would be expected, running counter to the function of this reading in the Shiwen. Gu Yanwu then concludes that "From this we know that the difference between the qusheng and rusheng readings (of the character 惡) is just a matter of weak and strong pronunciation, and there is no difference between them". 12 This statement can be 10 江南學士讀《左傳》，口相傳述，自為凡例，軍自敗曰敗，打破人軍曰敗。諸記 傳未 見補敗反，徐仙民讀《左傳》，唯一處有此音，又不言自敗、敗人之別，此 為穿鑿耳。 11 For philological arguments against another one of Gu Yanwu's examples, see also Sun (2007: 412), which, however, does not address (2). 12 乃知去入之别, 不過發言輕重之間,而非有此疆爾界之分也。 interpreted as the hypothesis that these two readings were in free variation in Archaic Chinese. This example (and other rhymes from the Han dynasty adduced by Gu Yanwu), however, does not prove that the alternation between ʔak "evil" and ʔuH "hate" in the Shiwen is spurious.
The reading of 惡 implied by the rhyme in (2) is not outlandish when one carefully investigates the glosses of the Shiwen. In example (3) from the Liji, for example, 惡 means "bad quality, defect" and is clearly opposed to 美 mijX "beauty, good quality" as in (2) above. However, the sound gloss in the Shiwen is 烏路反又如字, which contains the reading ʔuH (based on the fanqie spelling ʔu+luH) and the additional note 又如字 "also (read by some people as) the default reading", in this case ʔak.
(3) 君子知至學之難易，而知其美惡，然後能博喻 kjun.tsiX ʈje tɕijH tɕi nan jeH, ɲi ʈje noble.man know attaint GEN difficult easy LNK know gi mijX ʔuH/ʔak ɲen.ɣuwX noŋ pak hjuH its good.quality defect then can vary teaching "When a man of talents and virtue knows the difficulty (on the one hand) and the facility (on the other) in the attainment of learning, and knows (also) the good and the bad qualities (of his pupils), he can vary his methods of teaching." (Liji, Xueji 11, translation by Legge) Of the two alternative readings proposed by the Shiwen, the first, 惡 ʔuH, has the qusheng reading, and is clearly in nominal function, exactly like the rhyming word in (2). Thus, the Chuci rhyme in (2) supports, rather than discredits, the value of the Shiwen as preserving traces of morphological alternations. What these examples demonstrate is that the function of the qusheng alternation in the case of 惡 ʔuH is not limited to verbalizing denominalization.
Rather, (3) indicates that there were actually two abstract nouns meaning "defect, ugliness", derived from the adjective "be bad, be ugly", one derived by zero-derivation, and the other by qusheng alternation (originating from a *-s nominalization suffix through a *-h stage, according to Haudricourt 1954), as represented in Figure 1. The noun ʔuH "ugliness, defect", however, disappeared to the advantage of the former.

Absence of mention in pre-Han texts
Another possible objection against the existence of morphological alterations in Archaic Chinese is the absence of any mention of these phenomena in pre-Han texts. This argument (raised, but not strongly endorsed, by Branner 2002Branner , 2003 is, however, the weakest of all: there is likewise no mention of a distinction between main vs subordinate clauses in these texts, even if nobody would doubt that Archaic Chinese had complement and relative clauses.

Temporal gap
Chinese is not the only language with a gap of more than one millennium between a given corpus of texts written in a defective script and the creation of a full system of phonetic annotations. The Masoretic vocalization was created eight centuries after the death of Hebrew as a spoken language, and the time gap between the redaction of the earliest parts of the Tanakh and the vocalization is comparable to that between Western Zhou texts such as the Shijing and the sound glosses in the Shiwen. Yet, despite some scepticism on the value of the Masoretic text by some early-twentieth-century Bible scholars, the present consensus is that this distrust is misguided, and that the Masoretic vocalization, while distinct from the pronunciation of Hebrew in antiquity, regularly evolved from an authentic form of Hebrew (Suchard 2019: 21-23).
While some degree of healthy scepticism on the value of the readings is necessary, as mentioned above, neglecting the Shiwen when studying Archaic Chinese syntax is no different from attempting to analyse Hebrew syntax without the vocalization.

Non-specificity
Another argument to minimize the value of morphological alternations is the perceived non-specificity of the qusheng derivation, since it can derive nouns from verbs, verbs from nouns, intransitive verbs from transitive ones, and transitive verbs from intransitive ones. As Downer (1959: 262) put it: The present writer holds the opinion that with our present knowledge of Classical Chinese, it is better to regard chiuhsheng derivation not as a remnant of a former inflectional system of the Indo-European type, but simply as a system of derivation and nothing more. When new words were needed, they were created by pronouncing the basic word in the chiuhsheng. The grammatical regularity found in many cases would then be in a way fortuitous, being the result not of grammatical inflection, but of the need to create new words.
A major drawback in Downer's approach, however, is to treat Archaic Chinese morphology as an isolated problem, without any reference to non-Chinese Sino-Tibetan languages.
In Hebrew, one of the arguments in favour of the genuineness of the Masoretic vocalization comes precisely from Semitic comparative phonology and morphology (Suchard 2019: 23). In a similar way, a satisfactory assessment Downer wrote these words before Forrest (1960) showed, based on Haudricourt's (1954) theory of tonogenesis, that the nominalization function of the qusheng could be compared to the -s nominalizing suffix in Tibetan (on which, see Zhang 2009, Hill 2014).
In addition, nominalization is not the only function of the qusheng derivation which has possible comparanda in other Sino-Tibetan languages. As shown in Jacques (2016), sibilant or dental stop suffixes with functions similar or identical to that of the qusheng derivation are found in morphologically richer languages of the Sino-Tibetan family, especially Rgyalrongic, Kiranti, Nungish, and West-Himalayish (Table 3).
The idea that the qusheng could originate in part from dental stops, first suggested by Schuessler (2007: 42), is based on two observations. First, qusheng in Middle Chinese is much too frequent to originate exclusively from *-s: in Archaic Chinese reconstruction systems which adopt this hypothesis, the quantity of *-s far exceeds anything found in the rest of the Trans-Himalayan family, including Rgyalrongic and Tibetan. Second, from a typological perspective, the dental stop coda alternates with -s when following consonants in many languages of the family, including Tibetan (due to a sound change *-d → -s / {m,b,ŋ,g}_#, Coblin 1976), Kiranti (in particular Khaling and Dumi, see the internal reconstruction in Jacques et al. 2012), and West Himalayish (Martinez 2021).
Rather than a vague system of derivation insensitive to parts of speech, what these comparisons imply is that the remnants of morphology preserved in Middle Chinese glosses is the result of the convergence of unrelated suffixes due to the drastic phonetic attrition that has occurred between Archaic Chinese and Middle Chinese.
While Chinese is not phylogenetically close to either Gyalrongic or Kiranti, sharing with them only a limited number of cognates (Zhang et al. 2019, Sagart et al. 2019, the exuberant verbal morphology of these languages offers a framework against which hypotheses on the interpretation of morphological alternations can be tested (Gong 2017, Zhang 2022, and makes it possible to go beyond the circularity of Chinese-internal analysis.

The interaction of morphology and syntax in Archaic Chinese
Although the alternations described in section 2 involve only a minority of nouns and verbs, they occur in a significant part of the most common vocabulary of Archaic Chinese, and their use is systematic. I focus in this section on two of the most common functions of the qusheng alternation: nominalization and passivization, and then conclude that syntactic parameters such as transitivity and the noun-verb distinction are not as irrelevant to Archaic Chinese syntax as might appear at first glance.

Nominalization
Among the functions of the qusheng derivation described by Downer (1959), one of the most prominent is nominalization, as mentioned above in §2.1. For instance, the verb 守 ɕuwX "defend, guard" (4) has the reading ɕuwH 13 when nominalized, either as agent nominalization "guard" (5), or as patient nominalization "area that is being guarded by the country" (6).
(4) 晉侯使賈華伐屈。夷吾不能守 However, most verbs are nominalized without any tonal alternation. This is in particular the case with verbs already in qusheng tone, which have nothing to alternate with. For instance, the transitive verb 患 ɣwaenH "be worried about, be troubled by" (7) has the nominal forms "trouble, disaster" and "person in distress" as in (8), which are also pronounced ɣwaenH. Zero conversion of verbs into nouns in Archaic Chinese (动词活用为名 词, Chen 1922) is one line of argument in favour of the notion of Archaic Chinese as a "pre-categorial" language (Bisang 2008, Zádrapa 2011, Sun 2020. However, we find in Archaic Chinese not only verbs turned into nouns, but also entire phrases. For instance, in (9), 有禮 hjuwX lejX does not mean "X has ritual property", but rather "person having ritual property". hjwejH jaeX protect COP "I have heard that to submit to those who have ritual propriety is protection for the altars of the domain." (Zuozhuan,Xi,33) This noun phrase, which is frequent in the Zuozhuan, is a zero-marked relative clause, resulting from the elision of the nominalizer 者 tɕaeX, a phenomenon pointed out by von der Gabelentz (1881: §477), but relatively neglected in the literature. The full relative without relativizer elision happens also to be attested (10). "He himself is without ritual propriety, and he chastises those who do have ritual propriety." (Zuozhuan,Wen,15) This type of unmarked relative can be detected in particular when a negation is present, as in the case of 不服 pjuw bjuwk "(those who) do not submit" in (11). The occurrence of this phrase in post-verbal position, as the object of 討 tʰawX, shows that 不服 pjuw bjuwk is a relative clause.
(11) 會于溫，討不服 ɣwajH hju ʔwon tʰawX pjuw bjuwk meet LOC Wen chastise NEG submit "They met at Wen: this was to chastise those who would not submit." (Zuozhuan, Xi, 28) The problem of zero-nominalization can thus be turned around: there is no difficulty in analysing agent and abstract nominalizations as minimal relative clauses with elided 者 tɕaeX nominalizer. Tonal alternation can thus serve as a criterion for true nominalization, as opposed to zero-relativization.
In other languages of the Sino-Tibetan family with productive affixal nominalization such as Tibetan or Japhug, zero nominalization (Hill 2019) or finite relative clauses with no overt mark of nominalization (Jacques 2021b: §23.2.2, 1258) have been documented, without any need to appeal to a notion of pre-categoriality.

Passivization, relativization, and complementation
Another well-attested function of the qusheng, mentioned in §2.1 above, is turning a transitive verb into an intransitive one. Among these intransitive derivations, we find a few examples of antipassive (Jacques 2021a) and passive derivations.
For instance, the verb 使 ʂiX "send, despatch", itself a denominal verb coming from the root reflected by the noun 吏 liH "official" has the passive form 使 ʂiH "be sent on a mission". 14 (12) 其孫箴尹克黃使於齊，還及宋，聞亂。 gi swon tɕim-jwinX kʰok-ɣwaŋ his grandson deputy.for.remonstrance Kehuang ʂiH ʔjo dzej, ɣwaen gip sowŋH mjun lwanH send:PASS LOC Qi return reach Song hear troubles "His grandson, the deputy for remonstrance Kehuang, was sent on a mission to Qi. On his way back, by the time he reached Song, he had heard of the rebellion." (Zuozhuan,Xuan,04) This passive form is the one employed to derive the lexicalized relative clause 使 者 ʂiH tɕaeX "envoy", as illustrated by (13 The marker 者 tɕaeX cannot be used to relativize objects on its own (Aldridge 2013a). Thus passivization here is a strategy to make the patient accessible to subject relativization in 者 tɕaeX (Keenan and Comrie 1977).
In (14), there is co-reference between the transitive subject of 請 tsʰjeŋX and the intransitive subject of the passive 使 ʂiH "be sent". By contrast, in (15), there is no subject conference when the verb of the complement clause 使 ʂiX "send, let" is not passivized.

Consequences for the syntax of Archaic Chinese
These examples show that the morphology of Archaic Chinese is not simply a matter of word formation, but that it is directly relevant to syntactic analysis.
It is true that many verbs lack a distinct passive form expressed by tonal or voicing alternation. For instance, the verb 拘 "retain, seize", has only one reading kju, even in the passive relative clause 拘者 kju tɕaeX "those who had been retained", as in example (16) (see Luo and Wu 1983: 137 (16), there is but one clear example of relative clause of the same type in the Zuozhuan. 16 Zero-passivization in Chinese is an instance of lability, a phenomenon that is also widely observed in languages with overt marking of transitivity. In the Sino-Tibetan family for instance, although Limbu has a very elaborate polypersonal indexation system, a certain number of frequent verbs, such as khutt-"steal", can be conjugated either transitively or intransitively without any additional derivational morphology, as illustrated by (17) and (18)  Archaic Chinese syntax differs from that of Limbu in degree rather than nature. The fact that a majority of verbs are labile does not imply that transitivity is an irrelevant parameter in Archaic Chinese syntax, but rather that the morphosyntactic contexts where transitivity is relevant are fewer than in Limbu. Focusing precisely on the syntax of verbs whose passive and antipassive forms are recorded in the Shiwen, and studying their syntax in a systematic way, could bring new light on the whole field of Archaic Chinese syntax, and restore the distorted image of this language brought about by the habit of reading it in standard Mandarin.

Phonology for a better philology: a proposal for good practices in Archaic Chinese syntax and textual editions
One of the main reasons for the neglect of tonal alternations in the studies on Archaic Chinese syntax is accessibility: the Jingdian shiwen in its present form indicates readings in a way that is opaque to non-specialists in Chinese historical phonology, and omits readings when a character should be read following the default pronunciation (如字) or when the alternative reading is considered obvious enough.
In order to democratize access to the Shiwen, historical phonologists should produce text editions of all classics with full annotation of all characters in Middle Chinese, including all the variant readings indicated in the Shiwen, as well as explicitly marking the characters with variant readings that are not glossed. As indicated above, glossing Archaic Chinese in Middle Chinese transcription is clearly anachronistic, but this way of doing things is sufficient to encode the morphological alternations directly relevant to syntax (at least those that can be recovered philologically), and in any case it is a necessary step if one aims at encoding whole texts in Archaic Chinese reconstruction. 17 In the following, I present two examples of how to make this data more readily available.
In example (19) from the Zuozhuan, sound glosses are added to the relevant character in the Chinese text, and are underlined in the Middle Chinese line. Characters that have several readings, but are not glossed in the Shiwen, are marked in bold, and the default reading is chosen. Some sound glosses have scope over several sentences: for instance the mention 下 同 "same below" indicates that the character 壞 is to be read kwɛjH (音怪) not only in (19d), but also a few sentences later in (19e). In this short passage, we observe several non-trivial morphological phenomena indicated by glosses: (i) the transitive verb 被 bjeH "be covered with" derived from the noun 被 bjeX "covering" by denominal qusheng derivation, (ii) the 17 Given, moreover, that no reconstruction system of Archaic Chinese can ever be considered final, and that morphological contrasts not represented in Middle Chinese glosses cannot be directly recovered from texts, conversion to Archaic Chinese is less useful, and will always have been subjected to revision, while conversion between Middle Chinese reconstructions is bijective. transitive form 壞 kwɛjH "destroy" (from which the passive reading 壞 ɣwɛjH "be destroyed" is derived by voicing alternation), and (iii) the antipassive form 覺 kaewH "wake up" from 覺 kaewk "perceive, make clear, realize". Example (20) from the Shijing illustrates that the density of glosses in this text is much higher than in the Zuozhuan. It is also significant in having several variant readings ascribed to the same character, with sources sometimes indicated. In the case of 干 (meaning here "protection" or "shield"), the first phonetic gloss is the default reading (如字) kan, but Zheng Xuan glosses it as ɣanH (戶旦反) and Shen Xuan as kanH (音幹). Here, a rigorous text edition should not choose between the variant readings, but rather list them all and convert them to Middle Chinese reconstruction. 18 Superscript letters such as Z for Zheng Xuan or X for Xu Miao can be used as abbreviations to specify the source of the reading.
赳赳 居黝反 武夫、 kjiwX kjiwX mjuX pju That stalwart, martial man 公侯干 如字鄭戶旦反沈音幹 城。 kuwŋ ɣuw kan/ɣanH Z /kanH S dʑeŋ Might be shield and wall to his prince." (7, Legge) A systematic conversion of the complete corpus of the texts glossed in the Shiwen (and their commentaries) into Middle Chinese cannot be entirely automated: the alignment of the glosses to the text is not trivial, notably because some sound glosses strand over several sentences (as in (19e) above), and also because readings judged "obvious" by the compiler of the Shiwen have not been systematically indicated.

Conclusion
The general practice in Archaic Chinese syntax to transcribe transmitted texts in Mandarin pronunciation, followed even by eminent specialists of historical phonology (Pulleyblank 1995), masks a non-negligible amount of information relevant to syntactic analysis, and presents a highly misleading image of Chinese as a "pre-categorical" language (Bisang 2008, Sun 2020).
18 Since the phonological system of Six Dynasties scholars was distinct from that of early Tang, directly transcribing these fanqie into standard Middle Chinese may introduce some inadequacies: systematic, but minor, adjustments may be necessary. This topic, however, must be deferred to a future article.
Using sound glosses in a systematic way in the study of Archaic Chinese texts (at least those that are glossed in the Shiwen) can not only shed new light on the syntax of this language, but also, by making Chinese less outlandish from the perspective of the morphology-rich Sino-Tibetan languages, set a firmer basis for a comparative Sino-Tibetan grammar.
An edition of all the texts glossed in the Jingdian Shiwen with Middle Chinese transcription (and possibly Archaic Chinese reconstruction) in the lines of the model presented in §4 would thus be of considerable use to linguists and philologists specializing in Archaic Chinese. This endeavour would add a new dimension to the study of Chinese syntax and bring together two sister disciplines that have remained separated from each other for too long.

Conflicts of interest
None