Speech Emotion Recognition Based on Speech Segment Using LSTM with Attention Model

Bagus Tris Atmaja, Masato Akagi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

50 Citations (Scopus)

Abstract

Automatic speech emotion recognition has become popular as it enables natural interaction between human-machine interaction. One modality of recognizing emotion is speech. However, the speech also contains silence that may not relevant to emotion. Two ways to improve performance is by removing silence and/or paying more attention to speech segment while ignoring the silence. In this paper, we propose both, a combination of silence removal and attention model to improve speech emotion recognition performance. The results show that utilizing combination silence removal and attention model outperforms the use of either noise removal only or attention model only.

Original languageEnglish
Title of host publicationProceedings - 2019 IEEE International Conference on Signals and Systems, ICSigSys 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages40-44
Number of pages5
ISBN (Electronic)9781728121772
DOIs
Publication statusPublished - Jul 2019
Externally publishedYes
Event2019 IEEE International Conference on Signals and Systems, ICSigSys 2019 - Bandung, Indonesia
Duration: 16 Jul 201918 Jul 2019

Publication series

NameProceedings - 2019 IEEE International Conference on Signals and Systems, ICSigSys 2019

Conference

Conference2019 IEEE International Conference on Signals and Systems, ICSigSys 2019
Country/TerritoryIndonesia
CityBandung
Period16/07/1918/07/19

Keywords

  • attention model
  • silence removal
  • speech emotion recognition
  • voice segments

Fingerprint

Dive into the research topics of 'Speech Emotion Recognition Based on Speech Segment Using LSTM with Attention Model'. Together they form a unique fingerprint.

Cite this