标题: 可供普及沟通使用之“富含讯息视讯”之建构与应用
Construction and Applications of Message-rich Videos for Pervasive Communication
作者: 吴季儒
Wu, Chi-Ju
蔡文祥
Tsai, Wen-Hsiang
多媒体工程研究所
关键字: 资讯隐藏;普及沟通;富含讯息多媒体;message-rich video;data hiding;pervasive communication
公开日期: 2013
摘要: 在日常生活中,存于环境周遭的许多物体都可用来容纳资讯,达到普及沟通(pervasive communication)的目的。而这些物体广义来说均可视为多媒体,会以不同的形式出现在生活周遭,如广告看板、海报、在电视萤幕上所放映的视讯等等。讯息若可藉由资讯隐藏的技术嵌入在这些物体之中,而隐藏的讯息亦可被人们用讯息读取器(如:手机、平版电脑)抽取出来,则此种嵌入讯息的多媒体即称为“富含讯息多媒体(message-rich multimedia)”。
本研究尝试设计一新方法来制作一种新型的视讯,可不必透过直接传输数位档案的方式来交换讯息。此为一新型的富含讯息多媒体,称之为“富含讯息视讯”,可藉由资讯隐藏的方式实现;而被嵌入在视讯之中的讯息可以藉由“放映与再取像(display-and-imaging) ”的方式抽取出来。为了达到这个目标,本论文设计了一个讯息嵌入程序和一个对应的讯息抽取程序。
在讯息嵌入程序中,本研究采用了嵌入“对角形圆讯号”在视讯画面频谱中(video-frame spectrum)之方式,让视讯中的每一画面皆可藏入一个二位元讯号。另一方面,在放映影片进行再次取像时,所翻拍之影片几乎不可能与原有影片有相同的起始点。此外,视讯有许多不同格式也必须考虑。为了解决以上的议题,本论文提出一藉由特殊设计之起始及结束信号及一与影片格式特性符合之同步机制,让讯息嵌入程序与讯息抽取程序之间不需传输讯息即可达成同步。
在讯息抽取程序中,本论文提出一藉由“矩量保持门槛化(moment-preserving thresholding)”技术与一使用特殊遮罩之讯号侦测方式,让讯息抽取程序可自适性的侦测每一画面中的讯号,并达到很高的辨识率。接着,本论文设计了一线性资料结构,用以重现被嵌入于视讯中的讯息,此结构可使讯号更易于分析与转换,使得所提抽取程序可成功的抽取出被嵌入的讯号。最后,本论文提出视讯取样与画面缩小之方式,来加速讯息抽取程序,来提高所提方法的实用性。
以上所提出系统为一创新之作,深入的理论分析及实验结果显示该系统具有可行性及实用性。
In people’s daily life, many kinds of identities existing in the environment can be utilized to accommodate information for the purpose of pervasive communication. These identities may all be, in a broad sense, regarded as multimedia. They can appear in our daily life in many forms like hard copies, advertisements, posters, displays on monitors or TVs, etc. Messages may be injected into such identities by data hiding techniques. Such messages can be extracted by people using message readers such as smart phones, tablets. Such multimedia with messages embedded are called message-rich multimedia.
Specifically, it is tried in this study to design a new technique for creating a new type of video, by which, differently from traditional ways of information transmission, people can exchange information without sending directly digital files such as videos and images. That is, a new type of message-rich multimedia, called message-rich video, implemented by hiding messages in videos is proposed. Moreover, the message embedded in the video can be extracted in a display-and-imaging fashion. To achieve this purpose, a message embedding process and a corresponding message extraction process are designed.
For the message embedding process, the use of diagonal circular signals in the frequency domain is proposed, by which the proposed system can embed a 2-bit signal in each video frame. Next, it is almost impossible to conduct imaging of a displayed video exactly from its starting point. Furthermore, videos have various formats which must be considered as well in the design of the embedding process. To overcome these issues, a synchronization mechanism by uses of certain specially-designed initial and ending signals based on different video formats is proposed, by which the system can synchronize the message extraction work with the embedded message signals without transmitting any information.
For the message extraction process, a signal detection method based on the technique of moment-preserving thresholding using certain specially-designed masks is proposed, by which the system can detect signals adaptively with a high recognition rate. Also, a linear structure is designed for representing the message signals by which signals can be extracted, analyzed, and converted into ASCII codes. Furthermore, the extraction process is speeded up by frame resizing and down-sampling techniques.
The feasibility and effectiveness of all the proposed methods are demonstrated by theoretical analyses and good experimental results.
URI: http://140.113.39.130/cdrfb3/record/nctu/#GT070256606
http://hdl.handle.net/11536/74982
显示于类别:Thesis