TY - JOUR
T1 - A bi-stage approach to North Indian raga distinction
AU - Basu, Debjyoti
AU - Mukherjee, Himadri
AU - Marciano, Matteo
AU - Sen, Shibaprasad
AU - Singh, Sajai Vir
AU - Obaidullah, Sk Md
AU - Roy, Kaushik
N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.
PY - 2024/5
Y1 - 2024/5
N2 - Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga, which governs its melodic framework. Thus, the delineation of ragas assumes enormous significance as a preliminary step preceding deeper and more intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this work, a machine learning-based approach is proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, a mel-frequency cepstral coefficient (MFCC) based feature extraction technique was applied, and its output was further processed to generate second-level statistical features. This reduced the original feature dimension through an effective representation of the raw features. Several classification techniques were employed, and a new bi-stage raga distinction technique is proposed. The first stage classifies ragas as dawn/dusk, while the second stage performs finer classification within each group separately to identify the exact raga. Experiments were performed with over 57K clips from 11 ragas belonging to the 2 time periods, and a performance improvement of 0.7% was obtained for the dusk ragas using the bi-stage approach over the single-shot classification technique. The highest accuracy of 96.47% was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.
AB - Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga, which governs its melodic framework. Thus, the delineation of ragas assumes enormous significance as a preliminary step preceding deeper and more intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this work, a machine learning-based approach is proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, a mel-frequency cepstral coefficient (MFCC) based feature extraction technique was applied, and its output was further processed to generate second-level statistical features. This reduced the original feature dimension through an effective representation of the raw features. Several classification techniques were employed, and a new bi-stage raga distinction technique is proposed. The first stage classifies ragas as dawn/dusk, while the second stage performs finer classification within each group separately to identify the exact raga. Experiments were performed with over 57K clips from 11 ragas belonging to the 2 time periods, and a performance improvement of 0.7% was obtained for the dusk ragas using the bi-stage approach over the single-shot classification technique. The highest accuracy of 96.47% was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.
KW - Bi-stage classification
KW - MFCC
KW - Music information retrieval
KW - Sandhi Prakash raga
UR - http://www.scopus.com/inward/record.url?scp=85174706569&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85174706569&partnerID=8YFLogxK
U2 - 10.1007/s11042-023-17322-5
DO - 10.1007/s11042-023-17322-5
M3 - Article
AN - SCOPUS:85174706569
SN - 1380-7501
VL - 83
SP - 45163
EP - 45183
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 15
ER -