标题: 基于不同曲风训练资料之音乐情绪分类与演绎系统比较及应用于声景情绪辨识与混音分析
Comparison of Music Emotion Classification and Interpretation System Based on Different Genre of Training Data and Applied to Soundscape Emotion Recognition and Mixing Audio Analysis
作者: 许丞恺
郑泗东
Hsu, Cheng-Kai
Cheng, Stone
工学院声音与音乐创意科技硕士学位学程
关键字: 音乐情绪辨识;高斯混合模型;声景;Music Emotion Recognition;Gaussian Mixture Model (GMM);Soundscape
公开日期: 2017
摘要: 本研究融合类别式情绪分类法与二维情绪平面作为情绪辨识模型,搭配机器学习技术和音乐讯号处理,建立即时性音乐情绪轨迹追踪系统,将音乐讯号诱发的情感成份分类,并以视觉化平面呈现乐曲演译情绪变动的轨迹,本研究亦以此系统分析声景(Soundscape)所唤起的人类情绪感受,设计混音乐曲,及分析混音后之情绪变动轨迹。实验过程中搜集预判情绪标记“Pleasant”、“Solemn”、“Agitated”、“Exuberant”的古典音乐与流行音乐风格的样本各192首作为两套训练资料,从中萃取音量、音乐事件密度、调性、和声不和谐度和音色以代表音乐样本的特征,计算音讯特征与情绪辨识之关联性,透过情绪分数计算程序,并使用高斯混合模型(GMM)作为分类器划定四种情绪类别的边界,以建立图像化情绪辨识介面,追踪由音乐所诱发的人类情绪感受变化。实验结果证实不同的训练资料将导致两个情绪辨识平面的边界差异。声景即为人类日常活动场域的听觉环境,对于人类的情绪状态、生活品质皆有影响,本研究侧重于针对各种场域之商业目的或听觉环境气氛营造等需求,提供一套基于情绪辨识与心理声学的环境声音设计依据,应用音乐情绪辨识系统至声景情绪分析之方法,为评估声景录音档与音乐讯号混音后的声音情绪轨迹变化,以模拟真实场域中,藉由播放背景音乐并善用其情感特性来帮助人类达到情绪状态改变、转换心境,进而影响人类的商业行为与决策之研究。
This study presents an approach to analyze the inherent emotional ingredients in the polyphonic music signals, and applied to the soundscape emotion analysis. The proposed real-time music emotion trajectory tracking systems are established by maching learning techniques, music signal processing, and the integration of two-dimensional emotion plane and categorical taxonomy as emotion recognition model. Two sets of training data are collected, one is consisted of popular music and the other is consisted of western classical music, each set contains 192 emotion-predefined music clips respectively. Volume, onset density, mode, dissonance, and timbre are extracted to serve as the characteristics of a music excerpt. After emotion score counting process, Gaussian mixture model (GMM) is used to demarcate the margins between four emotion states. A graphical interface with mood locus on emotion plane is established to trace the alteration of music-evoked human emotions. Experimental result verified that different sets of training data would lead to the variation of boundaries among two emotion recognition models. Soundscape specifies the auditory environment of human daily activities, which can affect emotion states and living quality of human beings. This study proposed an access to environmental sound designing based on emotion recognition and psychoacoustic, especially focusing on the needs of various fields for commercial purpose or auditory atmosphere creation. The soundscape study is conducted by evaluating the effectiveness of emotion locus variation of selected urban soundscape sets blending with music signals. The simulation of playing background music in authentic field makes good use of music emotional characteristics to help people alter the emotion states and the state of mind, and further affect human behavior and decision-making.
URI: http://etd.lib.nctu.edu.tw/cdrfb3/record/nctu/#GT070451901
http://hdl.handle.net/11536/142363
显示于类别:Thesis