Hostname: page-component-5db58dd55d-688nx Total loading time: 0 Render date: 2026-06-02T17:05:42.473Z Has data issue: false hasContentIssue false

Resource building and classification of Mizo folk songs

Published online by Cambridge University Press:  23 May 2024

Esther Ramdinmawii*
Affiliation:
Department of of Computer Science & Engineering. Tezpur University, Napaam, Sonitpur, Assam, India
Sanghamitra Nath
Affiliation:
Department of of Computer Science & Engineering. Tezpur University, Napaam, Sonitpur, Assam, India
*
Corresponding author: Esther Ramdinmawii; Email: esther.rdmchhakchhuak@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Folk culture represents the social, ethnic, and traditional livelihood of people belonging to a certain tribe or community and is important in keeping their culture and tradition alive. The Mizo people are a Tibeto-Burmese ethnic group, native to the Indian state of Mizoram and neighboring regions of Northeast India. Mizo folk culture is an amalgamation of festivity, celebration, liveliness, kinship, brotherhood, and merriment, and above all, preserves the ethnicity of this tribal community that is fundamentally entrenched. Unfortunately, the Mizos are fast giving up their old customs and adopting the new mode of life that is greatly influenced by the western culture. This makes it all the more crucial to preserve the intangible cultural heritage of this ethnic tribe whose folk cultures are vanishing day by day. To the best of our knowledge, this work is the first attempt at preservation and classification of Mizo folk songs. The first part of this paper presents a literature survey on preservation, analysis, and classification of folk songs. The second part presents the methodology for preliminary classification of Mizo folk songs. Three categories of Mizo folk songs—Hunting chants (Hlado), Children’s songs (Pawnto hla), and Elderly songs (Pi pu zai)—are used in this study. A total of 29 acoustic features are used. A long short-term memory network using custom attention layer has been proposed for classification, whose results are compared with four supervised models (Support Vector Machines, K-Nearest Neighbor, Naive Baye’s, and Ensemble). Experimental results from the proposed model are promising, with an implication of scope for future research in acoustic analysis and classification of Mizo folk songs using recent unsupervised methods.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Speaker population of Mizo languagea (marked in green dots).

Figure 1

Figure 2. Block diagram of a typical MIR system.

Figure 2

Figure 3. Methodology for classification of Mizo folk songs.

Figure 3

Table 1. Categories of Mizo folk song dataset used in this study

Figure 4

Algorithm 1. Attention mechanism for LSTM

Figure 5

Figure 4. Summary of the proposed LSTM-attn model with custom attention layer.

Figure 6

Table 2. Macro-averages of long short-term memory with attention layer model for classification of three categories of Mizo folk songs, with 20% testing data

Figure 7

Figure 5. Accuracy plots for LSTM-attn with 10 epochs for different feature sets (Accuracy $\times$ 100).

Figure 8

Table 3. Classifier performances for three categories of Mizo folk songs, with different combinations of acoustic features (5-fold cross validation with 20% data for testing)

Figure 9

Figure 6. Confusion matrices of the four ML models and LSTM-attn with different feature sets.

Figure 10

Table 4. Classifier performance compared with existing studies of under-resourced folk songs