Emotion recognition in human speech with deep learning models
The paper investigates deep neural network architectures for recognizing human emotions from speech. Convolutional neural networks and recurrent neural networks with LSTM memory cells were used as the deep models, and an ensemble of neural networks was built on their basis. Computer experiments were conducted with the proposed deep learning models and baseline machine learning algorithms on the emotional speech recordings of the RAVDESS audio database. The results demonstrate the high efficiency of the neural network models, with accuracy for some emotion classes reaching 80%.
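As a minimal illustration of the LSTM memory cell mentioned in the abstract, the sketch below implements a single forward step of such a cell in plain NumPy. All dimensions, weight shapes, and the per-frame feature input are illustrative assumptions, not the paper's actual configuration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_cell_step(x, h_prev, c_prev, W, b):
    """One forward step of an LSTM memory cell.

    x:       input feature vector for the current frame, shape (input_dim,)
    h_prev:  previous hidden state, shape (hidden,)
    c_prev:  previous cell (memory) state, shape (hidden,)
    W:       stacked gate weights, shape (4*hidden, input_dim + hidden)
    b:       stacked gate biases, shape (4*hidden,)
    """
    hidden = h_prev.shape[0]
    # All four gates computed in one matrix multiply over [x; h_prev].
    z = W @ np.concatenate([x, h_prev]) + b
    i = sigmoid(z[0 * hidden:1 * hidden])   # input gate
    f = sigmoid(z[1 * hidden:2 * hidden])   # forget gate
    o = sigmoid(z[2 * hidden:3 * hidden])   # output gate
    g = np.tanh(z[3 * hidden:4 * hidden])   # candidate memory content
    c = f * c_prev + i * g                  # memory cell update
    h = o * np.tanh(c)                      # new hidden state
    return h, c

# Illustrative usage: run the cell over a short sequence of
# random "acoustic feature" frames (e.g. 13 MFCC coefficients).
rng = np.random.default_rng(0)
input_dim, hidden, n_frames = 13, 8, 5
W = rng.standard_normal((4 * hidden, input_dim + hidden)) * 0.1
b = np.zeros(4 * hidden)
h = np.zeros(hidden)
c = np.zeros(hidden)
for _ in range(n_frames):
    x = rng.standard_normal(input_dim)
    h, c = lstm_cell_step(x, h, c, W, b)
# The final hidden state h would feed a classifier over emotion classes.
```

In a full model, the final hidden state (or the sequence of hidden states) is passed to a dense softmax layer over the emotion classes; the convolutional branch and the ensemble described in the abstract are separate components not shown here.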