Full metadata record
DC Field | Value | Language
dc.contributor.author | Lin, Yong-Xiang | en_US
dc.contributor.author | Tan, Daniel Stanley | en_US
dc.contributor.author | Cheng, Wen-Huang | en_US
dc.contributor.author | Hua, Kai-Lung | en_US
dc.date.accessioned | 2020-01-02T00:03:28Z | -
dc.date.available | 2020-01-02T00:03:28Z | -
dc.date.issued | 2019-01-01 | en_US
dc.identifier.isbn | 978-1-5386-9552-4 | en_US
dc.identifier.issn | 1945-7871 | en_US
dc.identifier.uri | http://dx.doi.org/10.1109/ICME.2019.00046 | en_US
dc.identifier.uri | http://hdl.handle.net/11536/153327 | -
dc.description.abstract | Training a deep neural network for semantic segmentation relies on pixel-level ground-truth labels for supervision. However, collecting large datasets with pixel-level annotations is expensive and time-consuming. One workaround is to use synthetic data, which can be generated in potentially unlimited quantities together with corresponding ground-truth labels. Unfortunately, networks trained on synthetic data perform poorly on real images due to the domain-shift problem. Domain adaptation techniques have shown potential in transferring knowledge learned from synthetic data to real-world data. Prior works have mostly relied on adversarial training to perform a global alignment of features. However, we observed that background objects vary less across domains than foreground objects do. Using this insight, we propose a domain adaptation method that models and adapts foreground objects and background objects separately. Our approach starts with a fast style transfer to match the appearance of the inputs. This is followed by a foreground adaptation module that learns a foreground mask, which our gated discriminator uses to adapt the foreground and background objects separately. Our experiments demonstrate that our model outperforms several state-of-the-art baselines in terms of mean intersection over union (mIoU). | en_US
dc.language.iso | en_US | en_US
dc.subject | Semantic segmentation | en_US
dc.subject | Domain adaptation | en_US
dc.subject | Gated-convolution | en_US
dc.title | ADAPTING SEMANTIC SEGMENTATION OF URBAN SCENES VIA MASK-AWARE GATED DISCRIMINATOR | en_US
dc.type | Proceedings Paper | en_US
dc.identifier.doi | 10.1109/ICME.2019.00046 | en_US
dc.identifier.journal | 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | en_US
dc.citation.spage | 218 | en_US
dc.citation.epage | 223 | en_US
dc.contributor.department | 電子工程學系及電子研究所 (Department of Electronics Engineering and Institute of Electronics) | zh_TW
dc.contributor.department | Department of Electronics Engineering and Institute of Electronics | en_US
dc.identifier.wosnumber | WOS:000501820600038 | en_US
dc.citation.woscount | 0 | en_US
Appears in Collections: Conferences Paper
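The abstract above describes a gated discriminator that uses a learned foreground mask to adapt foreground and background objects separately. As a rough illustration of that gating idea only (this is not the authors' implementation; the function name and the toy feature map are invented, and plain Python stands in for a deep-learning framework), a soft mask in [0, 1] can split one feature map into a foreground stream and a background stream, which separate discriminator branches would then score:

```python
# Toy sketch of mask-based gating: a foreground mask m in [0, 1] splits a
# feature map f into a foreground stream (f * m) and a background stream
# (f * (1 - m)). In the paper's setting, each stream would feed its own
# adversarial discriminator branch; here we only show the gating itself.

def gate_features(features, mask):
    """Split a 2-D `features` grid into (foreground, background) streams
    by element-wise multiplication with `mask` and its complement."""
    fg = [[f * m for f, m in zip(f_row, m_row)]
          for f_row, m_row in zip(features, mask)]
    bg = [[f * (1.0 - m) for f, m in zip(f_row, m_row)]
          for f_row, m_row in zip(features, mask)]
    return fg, bg

# Example: a 2x2 "feature map" with a soft foreground mask.
features = [[1.0, 2.0],
            [3.0, 4.0]]
mask = [[1.0, 0.0],
        [0.5, 0.0]]
fg, bg = gate_features(features, mask)
# fg -> [[1.0, 0.0], [1.5, 0.0]]   (foreground-weighted features)
# bg -> [[0.0, 2.0], [1.5, 4.0]]   (background-weighted features)
```

Note that with a soft mask the two streams overlap where the mask is fractional (here the 0.5 entry contributes to both), which is what lets the two adaptation paths be trained jointly end to end.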