Speech emotion recognition with combined short and long term features

Journal of Tsinghua University(Science and Technology)（2008）

引用 16|浏览18

暂无评分

摘要

Utterance-based global statistics and frame-based temporal features have been widely used in speech emotion recognition systems,but these features can not effectively describe all of the emotional information.In this research,segment-based features are extracted and applied with a best segment length for emotion recognition for each emotional state.Further more,a novel neural network model named Global control Elman is proposed to combine the utterance-based features and segment-based features together.Experiments show that the performance of combined features may reach a recognition rate of 66.0%,much higher than obtained by utterance-based features or segment-based features.The recognition rate may be improved by 5.9% and 1.7% respectively,and the confusion between emotional state is also effectively reduced.

查看译文

关键词

Elman neural network,Emotion feature,Pattern recognition,Speech emotion recognition

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要