
Applications of deep neural networks on music emotion recognition
- 1 Boston University, Boston, MA, 02215,United States
* Author to whom correspondence should be addressed.
Abstract
Music Emotion Recognition (MER) is a subfield of Music Information Retrieval (MIR) that focuses on finding a relationship between music and human emotions by applying machine learning and signal processing techniques. In recent years, neural networks have achieved great success in a large number of areas, such as speech recognition and image processing, sparking many attempts to utilize neural networks in the MER task. Although new models for MER are constantly emerging, there are few systematic reviews in this field that involves the latest models and datasets. Therefore, in this paper, we provide a detailed review of this task. Our work first expounds on the practical significance and research status of the MER problem. Then, we encapsulate the background research and contributions of predecessors in both machine learning and psychology fields. Our work also includes a thorough analysis of several important datasets and their mathematical principles. Finally, we summarize four novel models from an application perspective and conclude a few potential challenges for the task.
Keywords
Music Emotion Recognition, Neural Networks, Nature Language Processing
[1]. Loreto Parisi, Simone Francia, Silvio Olivastri, Maria Stella Tavella. Exploiting Synchronized Lyrics And Vocal Features For Music Emotion Detection. arXiv:1901.04831, 2019.
[2]. Jacek Grekow. Music emotion recognition using recurrent neural networks and pretrained models. Journal of Intelligent Information Systems (2021) 57:531–546.
[3]. Richard Orjesek, Roman Jarina, Michal Chmulik. End to end music emotion variation detection using iteratively reconstructed deep features. Multimedia Tools and Applications (2022) 81:5017–5031.
[4]. Serhat Hizlisoy, Serdar Yildirim, Zekeriya Tufekci. Music emotion recognition using convolutional long short term memory deep neural networks. Engineering Science and Technology, an International Journal 24 (2021) 760–767.
[5]. DEAM dataset - The MediaEval database for emotional analysis of music. http://cvml.unige.ch/datab ases/DEAM. Accessed 30 Sept 2020
[6]. Fika Hastarita Rachman, Riyanarto Sarno, Chastine Fatichah. Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion. International Journal of Electrical and Computer Engineering (IJECE), Vol. 8, No. 3, June 2018, pp. 1720~1730.
[7]. Russell, J. A. A circumplex model of affect. Journal of Personality and Social Psychology, 39(6), 1161–1178, 1980.
[8]. K. Hevner. Experimental studies of the elements of expression in music. The American Journal of Psychology, pages 246–268, 1936.
[9]. Gers, F. A., Schmidhuber, J., & Cummins, F.A. Learning to forget: Continual prediction with LSTM. Neural Computation, 12, 2451–2471, 2000.
[10]. Witten, I. H., Frank, E., Hall, M. A., & Pal, C.J. Data mining fourth edition: Practical machine learning tools and techniques, 4th edn. USA: Morgan Kaufmann Publishers Inc, 2016.
Cite this article
Du,Y. (2023). Applications of deep neural networks on music emotion recognition. Applied and Computational Engineering,5,316-322.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of the 3rd International Conference on Signal Processing and Machine Learning
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).