标题: 论情绪辨识架构中时频调变特征参数的强健性
On Robustness of Spectro-Temporal Modulation Features in an Emotion Recognition Framework
作者: 许晋诚
Hsu, Chin-Cheng
冀泰石
Chi, Tai-Shih
电信工程研究所
关键字: 情绪;辨识;时频调变特征参数;强健性;Emotion;Recognition;Spectro-Temporal Modulation Features;Robustness
公开日期: 2011
摘要: 杂讯无论在情绪辨识或其他任何应用中,都是相当困扰的问题。当前最常见的做法是采取匹配训练(matched condition)来对抗杂讯;相反的,本文考虑不匹配条件、只训练单一分类器来对抗各种情况下的杂讯。实验结果显示:就算在最严格的不匹配条件下,本文采用的时频调变特征参数组也有极为稳健的表现。文中亦讨论该时频调变特征参数的特性以及它如何受杂讯干扰所影响。本实验包含四项变因:两组资料库(Berlin Emotional Speech Database、Aibo Emotional Speech)、两种杂讯(white noise、babble noise)、两组特征参数(spectro-temporal modulation features、INTERSPEECH 2009 Emotion Challenge features)、两种训练-测试条件(slack or strict mismatched condition)。针对资料失衡的问题,本文则提出结合效度的样本合成方案来改善。
Noise is an annoying problem either in emotion recognition or in other applications. Previous research has considered matched condition to counter it. This article, on the contrary, considers mismatched condition which trains only one classifier that confronts all kinds of situation. Experiments show that the proposed feature set, which contains spectro-temporal modulation information, is robust, indicating that the mismatched training/testing condition is feasible. This paper also discussed the properties of the proposed features and how noise affected the features. The experiments included four variables: two databases (Aibo Emotion Corpus and Berlin Emotional Speech Database), two types of noise (additive white Gaussian noise and babble noise), two feature sets (spectro-temporal modulation features and INTERSPEECH 2009 Emotion Challenge features), and two conditions (slack and strict mismatched conditions). As for the issue of data imbalance, a synthetic method based on emotion validity was proposed to deal with it.
URI: http://140.113.39.130/cdrfb3/record/nctu/#GT079813533
http://hdl.handle.net/11536/47019
显示于类别:Thesis


文件中的档案:

  1. 353301.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.