Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Hsu, Chih-Fan | en_US |
dc.contributor.author | Chen, Yu-Cheng | en_US |
dc.contributor.author | Wang, Yu-Shuen | en_US |
dc.contributor.author | Lei, Chin-Laung | en_US |
dc.contributor.author | Chen, Kuan-Ta | en_US |
dc.date.accessioned | 2019-04-02T06:04:24Z | - |
dc.date.available | 2019-04-02T06:04:24Z | - |
dc.date.issued | 2018-01-01 | en_US |
dc.identifier.uri | http://dx.doi.org/10.1145/3204949.3209618 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/150966 | - |
dc.description.abstract | Retaining eye contact of remote users is a critical issue in video conferencing systems because of parallax caused by the physical distance between a screen and a camera. To achieve this objective, we present a real-time gaze redirection system called Flx-gaze to post-process each video frame before sending it to the remote end. Specifically, we relocate and relight the pixels representing eyes by using a convolutional neural network (CNN). To prevent visual artifacts during manipulation, we minimize not only the L2 loss function but also four novel loss functions when training the network. Two of them retain the rigidity of eyeballs and eyelids, and the other two prevent color discontinuity on the eye peripheries. By leveraging the CPU and the GPU resources, our implementation achieves real-time performance (i.e., 31 frames per second). Experimental results show that the gazes redirected by our system are of high quality under this strict time constraint. We also conducted an objective evaluation of our system by measuring the peak signal-to-noise ratio (PSNR) between the real and the synthesized images. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Gaze Manipulation | en_US |
dc.subject | Convolutional Neural Network | en_US |
dc.title | Realizing the Real-time Gaze Redirection System with Convolutional Neural Network | en_US |
dc.type | Proceedings Paper | en_US |
dc.identifier.doi | 10.1145/3204949.3209618 | en_US |
dc.identifier.journal | PROCEEDINGS OF THE 9TH ACM MULTIMEDIA SYSTEMS CONFERENCE (MMSYS'18) | en_US |
dc.citation.spage | 509 | en_US |
dc.citation.epage | 512 | en_US |
dc.contributor.department | 資訊工程學系 | zh_TW |
dc.contributor.department | Department of Computer Science | en_US |
dc.identifier.wosnumber | WOS:000455343100061 | en_US |
dc.citation.woscount | 0 | en_US |
Appears in Collections: | Conference Papers |