Hostname: page-component-89b8bd64d-7zcd7 Total loading time: 0 Render date: 2026-05-10T01:01:38.621Z Has data issue: false hasContentIssue false

A new local indicator of spatial autocorrelation identifies clusters of high rendaku frequency in Japanese place names

Published online by Cambridge University Press:  16 February 2023

Thomas Pellard*
Affiliation:
CRLAO, EHESS-Inalco-CNRS, Paris, France
Akiko Takemura
Affiliation:
IFRAE, Inalco-Université Paris Cité-CNRS, Paris, France
Hyun Kyung Hwang
Affiliation:
Tsukuba University, Tsukuba, Japan
Timothy J. Vance
Affiliation:
NINJAL, Tachikawa, Japan
*
Author for correspondence: Thomas Pellard, Email: thomas.pellard@cnrs.fr
Rights & Permissions [Opens in a new window]

Abstract

The methods of spatial statistics have been successfully applied to the study of linguistic variation, especially for detecting the existence of spatial patterns in the geographical distribution of linguistic features. However, the use of local indicators of spatial autocorrelation for detecting spatial clusters have been limited to continuous variables, and we propose to apply the new method of Anselin and Li (2019) for categorical variables to linguistic data. We illustrate this method with the case of Japanese rendaku, or sequential voicing, whose dialectal variation is still poorly documented. Focusing on regional differences in the frequency of rendaku, we examined the occurrence of rendaku for four lexemes in 4,921 place names from all Japan. A statistical analysis of local spatial association and an unsupervised density-based cluster analysis revealed the existence of two cluster areas of high rendaku frequency centered around Wakayama and Fukushima-Yamagata prefectures. This suggests that rendaku is more frequent in those dialects, and we recommend that further studies in the dialectal variation of rendaku start by looking at those areas.

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Table 1. Rendaku rate by lexeme (original data of Takemura et al., 2019)

Figure 1

Map 1. Map of the 4,921 locations investigated.

Figure 2

Table 2. Rendaku rate by lexeme (filtered and geocoded data)

Figure 3

Map 2. Presence versus absence of rendaku in all place names.

Figure 4

Map 3. Presence versus absence of rendaku by lexeme.

Figure 5

Figure 1. Density distribution of the number per location of neighbors within a 50 km distance; vertical white lines indicate the quartiles (Q1 = 79, Q2 = 116, Q3 = 155, $$\bar x$$ = 128.8).

Figure 6

Table 3. Global join count statistics

Figure 7

Map 4. Cores of rendaku clusters.

Figure 8

Map 5. Cores (full colors) of rendaku clusters and their neighbors (dimmed colors).

Figure 9

Map 6. Cores (full colors) of individual rendaku cluster areas and their neighbors (dimmed colors).

Figure 10

Table 4. Number of cores and neighbors by area and prefecture

Figure 11

Table 5. Number of cores and neighbors by area and lexeme