Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decomposition and Deep Neural Network | BOHR International Journal of Internet of things, Artificial Intelligence and Machine Learning

This is an outdated version published on 2023-10-18. Read the most recent version.


How to Cite

Pan, S. T., Chen, C.-F., & Hong, C.-C. (2023). Emotion Recognition Based on Speech Signals by Combining Empirical Mode Decomposition and Deep Neural Network. BOHR International Journal of Internet of Things, Artificial Intelligence and Machine Learning, 2(1), 85–92. https://doi.org/10.54646/bijiam.2023.11

Published: Oct 18, 2023

Versions:

2023-10-18 (2)

2023-10-18 (1)

DOI: https://doi.org/10.54646/bijiam.2023.11


Keywords:

Speech emotion recognition, empirical mode decomposition, deep neural network, Mel-scale Frequency Cepstral Coefficients, hidden Markov model

Authors

Shing Tai Pan

Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung 811, Taiwan

Ching-Fa Chen

Department of Electronic Engineering, Kao Yuan University, Kaohsiung 821, Taiwan

Chuan-Cheng Hong

Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung 811, Taiwan

Abstract

This paper proposes a novel method for speech emotion recognition. Empirical mode decomposition (EMD) is used to extract emotional features from speech, and a deep neural network (DNN) is used to classify the emotions. EMD is combined with Mel-Scale Frequency Cepstral Coefficients (MFCCs), an acoustic feature, to enhance the emotional components of the speech signal and thereby improve the recognition rate of the DNN classifier. EMD first decomposes the speech signal, which contains emotional components, into multiple intrinsic mode functions (IMFs); emotional features are then derived from the IMFs and computed as MFCCs. These features are used to train the DNN model. Finally, a trained model that could recognize the emotional sig
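The decomposition step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a fixed number of sifting iterations and piecewise-linear envelopes (standard EMD normally uses cubic-spline envelopes and a convergence-based stopping rule), and the names `local_extrema`, `envelope`, `sift_imf`, and `emd` are hypothetical. The MFCC and DNN stages are omitted, since their settings are not given in this excerpt.

```python
import math

def local_extrema(x):
    """Return indices of interior local maxima and minima of a sequence."""
    maxima, minima = [], []
    for i in range(1, len(x) - 1):
        if x[i - 1] < x[i] >= x[i + 1]:
            maxima.append(i)
        elif x[i - 1] > x[i] <= x[i + 1]:
            minima.append(i)
    return maxima, minima

def envelope(x, idx):
    """Piecewise-linear envelope through x at idx, pinned at both endpoints.

    Assumption: linear interpolation stands in for the usual cubic spline.
    """
    knots = [0] + idx + [len(x) - 1]
    env = []
    for a, b in zip(knots, knots[1:]):
        for i in range(a, b):
            w = (i - a) / (b - a)
            env.append((1 - w) * x[a] + w * x[b])
    env.append(x[-1])
    return env

def sift_imf(x, n_sift=10):
    """Extract one IMF candidate by repeatedly subtracting the envelope mean."""
    h = list(x)
    for _ in range(n_sift):
        maxima, minima = local_extrema(h)
        if len(maxima) < 2 or len(minima) < 2:
            break
        upper, lower = envelope(h, maxima), envelope(h, minima)
        h = [v - 0.5 * (u + l) for v, u, l in zip(h, upper, lower)]
    return h

def emd(x, max_imfs=5):
    """Decompose x into a list of IMFs plus a residue.

    By construction, the IMFs and the residue sum back to x.
    """
    residue = [float(v) for v in x]
    imfs = []
    for _ in range(max_imfs):
        maxima, minima = local_extrema(residue)
        if len(maxima) < 2 or len(minima) < 2:
            break  # too few oscillations left to sift
        imf = sift_imf(residue)
        imfs.append(imf)
        residue = [r - c for r, c in zip(residue, imf)]
    return imfs, residue

# Synthetic two-tone test signal (a stand-in for a speech frame).
t = [i / 800 for i in range(800)]
signal = [math.sin(2 * math.pi * 40 * u) + 0.5 * math.sin(2 * math.pi * 5 * u)
          for u in t]
imfs, residue = emd(signal)
```

In a pipeline like the one the abstract outlines, each IMF (or a selected subset) would then be passed to an MFCC extractor, and the resulting feature vectors used to train the classifier.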


Issue

Vol. 2 No. 1 (2023): BOHR International Journal of Internet of things, Artificial Intelligence and Machine Learning (BIJIAM)

Section

Articles
