Music genre classification with modified residual learning and dual neural network
Ashraf, Mohsin; Abid, Fazeel; Raza, Muhammad Owais; Rasheed, Jawad; Alsubai, Shtwai; Asuroglu, Tunc (2025-10)
Ashraf, Mohsin
Abid, Fazeel
Raza, Muhammad Owais
Rasheed, Jawad
Alsubai, Shtwai
Asuroglu, Tunc
10 / 2025
PLoS ONE
e0333808
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-2025111010487
https://urn.fi/URN:NBN:fi:tuni-2025111010487
Kuvaus
Peer reviewed
Tiivistelmä
Music Genre is an abstract property of music that can identify shared traditions and conventions. In the recent past, music genre classification has shown a significant role in MIR that has attracted the research community to draw attention all around the world. The subjective aspect of the genre makes it challenging to define, as it relies on listeners’ interpretation. Deep Neural architectures can be used to address the efficiency and accuracy issues of traditional music systems. This paper proposes an approach to improve the music genre classification tasks with modified residual learning and hybrid convolutional neural networks. This architecture exploits the Mel-Spectrograms as input, which compute the signals as perceived by humans. We use identical layers of CNN with different pooling techniques to give rich hidden information for classification. We trained our model with Mel-Spectrograms generated from music files and obtained an accuracy of 87.80% and 68.50% for the GTZAN and FMA datasets, respectively. Our results show that the performance of the proposed model is also comparable with the other state-of-the-art models.
Kokoelmat
- TUNICRIS-julkaisut [24610]
