Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yang, Hsiao-Chien | en_US |
dc.contributor.author | Chen, Po-Heng | en_US |
dc.contributor.author | Chen, Kuan-Wen | en_US |
dc.contributor.author | Lee, Chen-Yi | en_US |
dc.contributor.author | Chen, Yong-Sheng | en_US |
dc.date.accessioned | 2020-10-05T01:59:42Z | - |
dc.date.available | 2020-10-05T01:59:42Z | - |
dc.date.issued | 2020-01-01 | en_US |
dc.identifier.issn | 1057-7149 | en_US |
dc.identifier.uri | http://dx.doi.org/10.1109/TIP.2020.2991883 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/154831 | - |
dc.description.abstract | Both structural and contextual information are essential and widely used in image analysis. However, current multi-view stereo (MVS) approaches usually use a single common pre-trained model as a pixel descriptor to extract features, which mixes structural and contextual information together and thus increases the difficulty of correspondence matching. In this paper, we propose FADE (feature aggregation for depth estimation), which treats spatial and context information separately and focuses on aggregating features for efficient learning of the MVS problem. Spatial information includes image details such as edges and corners, whereas context information comprises object features such as shapes and traits. To aggregate these multi-level features, we use an attention mechanism to select important features for matching. We then build a plane sweep volume by using a homography backward warping method to generate matching candidates. Furthermore, we propose a novel cost volume regularization network that aims to minimize the noise in the matching candidates. Finally, we take advantage of a 3D stacked hourglass and regression to produce high-quality depth maps. With these well-aggregated features, FADE can efficiently perform dense depth reconstruction, achieving state-of-the-art accuracy while requiring the fewest model parameters. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Three-dimensional displays | en_US |
dc.subject | Feature extraction | en_US |
dc.subject | Estimation | en_US |
dc.subject | Image reconstruction | en_US |
dc.subject | Cameras | en_US |
dc.subject | Visualization | en_US |
dc.subject | Computational modeling | en_US |
dc.subject | Multi-view stereo | en_US |
dc.subject | depth estimation | en_US |
dc.subject | feature aggregation | en_US |
dc.subject | attention mechanism | en_US |
dc.subject | homography | en_US |
dc.subject | plane sweep algorithm | en_US |
dc.title | FADE: Feature Aggregation for Depth Estimation With Multi-View Stereo | en_US |
dc.type | Article | en_US |
dc.identifier.doi | 10.1109/TIP.2020.2991883 | en_US |
dc.identifier.journal | IEEE TRANSACTIONS ON IMAGE PROCESSING | en_US |
dc.citation.volume | 29 | en_US |
dc.citation.spage | 6590 | en_US |
dc.citation.epage | 6600 | en_US |
dc.contributor.department | 資訊工程學系 | zh_TW |
dc.contributor.department | 電子工程學系及電子研究所 | zh_TW |
dc.contributor.department | Department of Computer Science | en_US |
dc.contributor.department | Department of Electronics Engineering and Institute of Electronics | en_US |
dc.identifier.wosnumber | WOS:000545739000001 | en_US |
dc.citation.woscount | 0 | en_US |
Appears in Collections: | Journal Articles
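
The abstract mentions building a plane sweep volume via homography backward warping of source-view features into the reference view. The sketch below illustrates that general technique as commonly used in MVSNet-style pipelines; it is not the authors' FADE implementation, and the function name `plane_sweep_warp`, the tensor layout, and the pose convention (relative pose mapping reference-camera to source-camera coordinates) are assumptions for illustration only.

```python
import torch
import torch.nn.functional as F

def plane_sweep_warp(src_feat, K_ref, K_src, R_rel, t_rel, depth_values):
    """Warp source-view features onto fronto-parallel depth planes of the
    reference view (differentiable plane-sweep / homography backward warping).

    src_feat:     [B, C, H, W]          source-view feature map
    K_ref, K_src: [B, 3, 3]             intrinsics at feature-map resolution
    R_rel, t_rel: [B, 3, 3], [B, 3, 1]  relative pose (reference -> source camera)
    depth_values: [B, D]                depth hypotheses in the reference frame
    returns:      [B, C, D, H, W]       warped feature volume
    """
    B, C, H, W = src_feat.shape
    D = depth_values.shape[1]
    device = src_feat.device

    # Homogeneous pixel grid of the reference view: [B, 3, H*W]
    y, x = torch.meshgrid(
        torch.arange(H, dtype=torch.float32, device=device),
        torch.arange(W, dtype=torch.float32, device=device),
        indexing="ij")
    pix = torch.stack((x, y, torch.ones_like(x)), dim=0).view(3, -1)
    pix = pix.unsqueeze(0).expand(B, 3, H * W)

    # Back-project pixels to rays, rotate into the source frame, scale by each
    # depth hypothesis, translate, then project with the source intrinsics.
    rays = R_rel @ torch.inverse(K_ref) @ pix                       # [B, 3, H*W]
    pts = rays.unsqueeze(1) * depth_values.view(B, D, 1, 1) \
          + t_rel.view(B, 1, 3, 1)                                  # [B, D, 3, H*W]
    cam = K_src.unsqueeze(1) @ pts                                  # [B, D, 3, H*W]
    xy = cam[:, :, :2] / cam[:, :, 2:3].clamp(min=1e-6)             # [B, D, 2, H*W]

    # Normalize to [-1, 1] and bilinearly sample the source features.
    gx = 2.0 * xy[:, :, 0] / (W - 1) - 1.0
    gy = 2.0 * xy[:, :, 1] / (H - 1) - 1.0
    grid = torch.stack((gx, gy), dim=-1).view(B, D * H, W, 2)
    warped = F.grid_sample(src_feat, grid, mode="bilinear",
                           padding_mode="zeros", align_corners=True)
    return warped.view(B, C, D, H, W)
```

Stacking the warped volumes from all source views against the reference features yields the matching candidates that a cost volume regularization network (such as the 3D stacked hourglass mentioned in the abstract) then filters before depth regression.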