Deep Neural Networks for Emotion Recognition

The paper investigates the problem of recognizing human emotions by voice using deep learning methods. Deep convolutional neural networks and recurrent neural networks with bidirectional LSTM memory cell were used as models of deep neural networks. On their basis, an ensemble of neural networks is proposed. We carried out computer experiments on using the constructed neural networks and popular machine learning algorithms for recognizing emotions in human speech contained in the RAVDESS audio record database. The computational results showed a higher efficiency of neural network models compared to machine learning algorithms. Accuracy estimates for individual emotions obtained using neural networks were 80%. The directions of further research in the field of recognition of human emotions are proposed. © 2020, Springer Nature Switzerland AG.

Authors
Shchetinin E.Y.1 , Sevastianov L.A. 2, 3 , Kulyabov D.S. 2, 3 , Ayrjan E.A. 3, 4 , Demidova A.V. 2
Language
English
Pages
365-379
Status
Published
Volume
12563 LNCS
Year
2020
Organizations
  • 1 Government of the Russian Federation, Financial University, Moscow, Russian Federation
  • 2 Peoples’ Friendship University of Russia (RUDN University), Moscow, Russian Federation
  • 3 Joint Institute for Nuclear Research, Dubna, Russian Federation
  • 4 Dubna State University, Dubna, Russian Federation
Keywords
BLSTM model; Convolutional neural networks; Deep learning; Emotion recognition; Paralinguistic model; Recurrent neural networks
Share

Other records

Sopin E., Botvinko A., Darmolad A., Bixalina D., Daraseliya A.
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 12563 LNCS. 2020. P. 77-86