Klasifikasi Instrumen Musik dari Sinyal Audio menggunakan ResNet
DOI:
https://doi.org/10.61132/mars.v2i5.1213Keywords:
Musical Instrument Classification, ResNet-18, Transfer learning, Convolutional Neural Network, Deep LearningAbstract
This study presents the implementation of Transfer learning using the ResNet-18 architecture for classifying 10 musical instrument categories based on visual representations of audio signals. The audio waveform is transformed into image-like inputs appropriate for CNN processing, accompanied by data augmentation and ImageNet-standard normalization. ResNet-18 is utilized due to its efficient feature extraction capability enabled by residual blocks, which help overcome vanishing gradient issues. The model was trained for 10 Epochs using the AdamW optimizer and Cross-Entropy Loss. Experimental results show that the model achieved a maximum validation accuracy of 77.35%, with a stable downward trend in training loss, indicating effective feature learning. However, several misclassification cases were observed, particularly among instruments with similar spectral characteristics, such as drum–violin and tabla–sitar. These findings demonstrate that while ResNet-18 performs reliably for musical instrument classification, further improvements remain possible through deeper architectures like ResNet-50, more comprehensive hyperparameter optimization, and the use of richer audio representations such as Mel-Spectrograms. This research provides an essential foundation for developing automated music analysis systems powered by Deep Learning.
References
Alberto, J., & Hermanto, D. (2023). Klasifikasi jenis burung menggunakan metode CNN dan arsitektur ResNet-50. Jurnal Teknik Informatika dan Sistem Informasi, 10(3), 34–46.
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770–778).
Nainggolan, C. E. S., Nasir, M., Fatoni, & Udariansyah, D. (2024). Perbandingan klasifikasi jenis sampah menggunakan Convolutional Neural Network dengan arsitektur ResNet18 dan ResNet50. CSRID Journal, 16(1), 76–90.
Saputra, F. A., & Nudin, S. R. (2021). Pengembangan sistem rekomendasi pada pemutar musik menggunakan face emotion detection dan ResNet berbasis website. Artikel ilmiah, detail publikasi tidak tersedia.
Setyawati, E. (2018). Aplikasi pengenalan jenis keris tradisional dengan menggunakan augmented reality berbasis Android (pp. 590–595).
Tsiakmaki, M., Kostopoulos, G., Kotsiantis, S., & Ragos, O. (2020). Transfer learning from deep neural networks for predicting student performance. Applied Sciences, 10(6). https://doi.org/10.3390/app10062145
Wani, M. A., Bhat, F. A., Afzal, S., & Khan, A. I. (2019). Advances in deep learning (Vol. 57).
Wijaya, A. E., Swastika, W., & Kelana, O. H. (2021). Implementasi transfer learning pada Convolutional Neural Network untuk diagnosis Covid-19 dan pneumonia pada citra X-ray. Sainsbertek: Jurnal Ilmiah Sains dan Teknologi, 2(1), 10–15. https://doi.org/10.33479/sb.v2i1.125
Yang, C., Gan, X., Peng, A., & Yuan, X. (2023). ResNet based on multi-feature attention mechanism for sound classification in noisy environments. Sustainability, 15(14), 10762. https://doi.org/10.3390/su151410762
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Mars : Jurnal Teknik Mesin, Industri, Elektro Dan Ilmu Komputer

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



