Full metadata record
DC FieldValueLanguage
dc.contributor.authorHsu, Chih-Fanen_US
dc.contributor.authorWang, Yu-Shuenen_US
dc.contributor.authorLei, Chin-Laungen_US
dc.contributor.authorChen, Kuan-Taen_US
dc.date.accessioned2019-09-02T07:46:19Z-
dc.date.available2019-09-02T07:46:19Z-
dc.date.issued2019-06-01en_US
dc.identifier.issn1551-6857en_US
dc.identifier.urihttp://dx.doi.org/10.1145/3311784en_US
dc.identifier.urihttp://hdl.handle.net/11536/152695-
dc.description.abstractAlthough live video communication is widely used, it is generally less engaging than face-to-face communication because of limitations on social, emotional, and haptic feedback. Missing eye contact is one such problem caused by the physical deviation between the screen and camera on a device. Manipulating video frames to correct eye gaze is a solution to this problem. In this article, we introduce a system to rotate the eyeball of a local participant before the video frame is sent to the remote side. It adopts a warping-based convolutional neural network to relocate pixels in eye regions. To improve visual quality, we minimize the L2 distance between the ground truths and warped eyes. We also present several newly designed loss functions to help network training. These new loss functions are designed to preserve the shape of eye structures and minimize color changes around the periphery of eye regions. To evaluate the presented network and loss functions, we objectively and subjectively compared results generated by our system and the state-of-the-art, DeepWarp, in relation to two datasets. The experimental results demonstrated the effectiveness of our system. In addition, we showed that our system can perform eye-gaze correction in real time on a consumer-level laptop. Because of the quality and efficiency of the system, gaze correction by postprocessing through this system is a feasible solution to the problem of missing eye contact in video communication.en_US
dc.language.isoen_USen_US
dc.subjectEye contacten_US
dc.subjectlive video communicationen_US
dc.subjectgaze correctionen_US
dc.subjectimage processingen_US
dc.subjectconvolutional neural networken_US
dc.titleLook at Me! Correcting Eye Gaze in Live Video Communicationen_US
dc.typeArticleen_US
dc.identifier.doi10.1145/3311784en_US
dc.identifier.journalACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONSen_US
dc.citation.volume15en_US
dc.citation.issue2en_US
dc.citation.spage0en_US
dc.citation.epage0en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000477935400009en_US
dc.citation.woscount0en_US
Appears in Collections:Articles