Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-07T13:10:08.873Z Has data issue: false hasContentIssue false

An exploratory investigation of functional variation in South Asian online Englishes

Published online by Cambridge University Press:  01 April 2024

MUHAMMAD SHAKIR*
Affiliation:
Englisches Seminar University of Münster Johannisstr. 12–20 48143 Münster Germany muhammad.shakir@uni-muenster.de
Rights & Permissions [Opens in a new window]

Abstract

This article conducts an exploratory multidimensional (MD) analysis of four interactive online registers, namely newspaper comments, tweets, web forums and text messages, originating from four South Asian countries (Bangladesh, India, Pakistan and Sri Lanka) and two Inner Circle (Kachru 1985) English-speaking countries (UK and USA). A principal component analysis (PCA) has been performed on the interactive registers using linguistic features tagged by a modified version of the MFTE tagger (Le Foll 2021a). The dimensions resulting from the PCA show that nominal, literate and informational features are generally more common in the South Asian data – which represent varieties belonging to the Outer Circle (Kachru 1985). Additionally, different features are used for expressing persuasion or opinion compared to the two reference varieties.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2024. Published by Cambridge University Press
Figure 0

Table 1. Description of corpus categories in the main corpus with texts included in this study (adapted from Shakir & Deuber 2023: 123)

Figure 1

Figure 1. Scree plot of the data (FA and PCA)

Figure 2

Figure 2. Distribution of registers on Dim1 ‘Oral personal (top) versus literate impersonal discourse (bottom)’Note: Country names are two-letter ISO codes: IN is India; BD is Bangladesh; PK is Pakistan; LK is Sri Lanka; UK is United Kingdom; and US is United States of America. Register codes are as follows: CMT for comments; TWT for tweets; WBF for web forums; and TXM for text messages.

Figure 3

Table 2. Dimensions and features of MD analysisa

Figure 4

Figure 3. Distribution of registers on Dim2 ‘Oral elaboration (top) versus informational concerns (bottom)’

Figure 5

Figure 4. Distribution of registers on Dim3 ‘Persuasion- and help-oriented (top) versus informational and past-oriented discussions (bottom)’

Figure 6

Figure 5. Distribution of registers on Dim4 ‘Focus on humans and mental processes (top) versus non-abstract things and activities (bottom)’

Figure 7

Figure 6. Distribution of registers on Dim5 ‘Addressee involved opinion and description (top)’

Figure 8

Table 3. Regional variation in South Asian text messages

Figure 9

Figure 7. Phylogram of the countries based on mean dimension scores of the interactive registers (excluding text messages) (see Appendix)

Figure 10

Table A1. Mean dimension scores used in figure 7 (excluding text messages)