Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yang, Hsiao-Chien | en_US |
dc.contributor.author | Chen, Po-Heng | en_US |
dc.contributor.author | Chen, Kuan-Wen | en_US |
dc.contributor.author | Lee, Chen-Yi | en_US |
dc.contributor.author | Chen, Yong-Sheng | en_US |
dc.date.accessioned | 2020-10-05T01:59:42Z | - |
dc.date.available | 2020-10-05T01:59:42Z | - |
dc.date.issued | 2020-01-01 | en_US |
dc.identifier.issn | 1057-7149 | en_US |
dc.identifier.uri | http://dx.doi.org/10.1109/TIP.2020.2991883 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/154831 | - |
dc.description.abstract | Both structural and contextual information are essential and widely used in image analysis. However, current multi-view stereo (MVS) approaches usually use a single common pre-trained model as a pixel descriptor to extract features, which mixes structural and contextual information together and thus increases the difficulty of correspondence matching. In this paper, we propose FADE (feature aggregation for depth estimation), which treats spatial and context information separately and focuses on aggregating features for efficient learning of the MVS problem. Spatial information includes image details such as edges and corners, whereas context information comprises object features such as shapes and traits. To aggregate these multi-level features, we use an attention mechanism to select important features for matching. We then build a plane sweep volume by using a homography backward warping method to generate matching candidates. Furthermore, we propose a novel cost volume regularization network that aims to minimize the noise in the matching candidates. Finally, we take advantage of a 3D stacked hourglass and regression to produce high-quality depth maps. With these well-aggregated features, FADE can efficiently perform dense depth reconstruction, achieving state-of-the-art accuracy while requiring the fewest model parameters. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Three-dimensional displays | en_US |
dc.subject | Feature extraction | en_US |
dc.subject | Estimation | en_US |
dc.subject | Image reconstruction | en_US |
dc.subject | Cameras | en_US |
dc.subject | Visualization | en_US |
dc.subject | Computational modeling | en_US |
dc.subject | Multi-view stereo | en_US |
dc.subject | depth estimation | en_US |
dc.subject | feature aggregation | en_US |
dc.subject | attention mechanism | en_US |
dc.subject | homography | en_US |
dc.subject | plane sweep algorithm | en_US |
dc.title | FADE: Feature Aggregation for Depth Estimation With Multi-View Stereo | en_US |
dc.type | Article | en_US |
dc.identifier.doi | 10.1109/TIP.2020.2991883 | en_US |
dc.identifier.journal | IEEE TRANSACTIONS ON IMAGE PROCESSING | en_US |
dc.citation.volume | 29 | en_US |
dc.citation.spage | 6590 | en_US |
dc.citation.epage | 6600 | en_US |
dc.contributor.department | 資訊工程學系 | zh_TW |
dc.contributor.department | 電子工程學系及電子研究所 | zh_TW |
dc.contributor.department | Department of Computer Science | en_US |
dc.contributor.department | Department of Electronics Engineering and Institute of Electronics | en_US |
dc.identifier.wosnumber | WOS:000545739000001 | en_US |
dc.citation.woscount | 0 | en_US |
Appears in Collections: | Journal Articles
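
The abstract mentions building a plane sweep volume via homography backward warping of source-view features into the reference view. The sketch below illustrates that general technique as commonly used in MVSNet-style pipelines; it is not the authors' FADE implementation, and the function name `plane_sweep_warp`, the tensor layout, and the pose convention (relative pose mapping reference-camera to source-camera coordinates) are assumptions for illustration only.

```python
import torch
import torch.nn.functional as F

def plane_sweep_warp(src_feat, K_ref, K_src, R_rel, t_rel, depth_values):
    """Warp source-view features onto fronto-parallel depth planes of the
    reference view (differentiable plane-sweep / homography backward warping).

    src_feat:     [B, C, H, W]          source-view feature map
    K_ref, K_src: [B, 3, 3]             intrinsics at feature-map resolution
    R_rel, t_rel: [B, 3, 3], [B, 3, 1]  relative pose (reference -> source camera)
    depth_values: [B, D]                depth hypotheses in the reference frame
    returns:      [B, C, D, H, W]       warped feature volume
    """
    B, C, H, W = src_feat.shape
    D = depth_values.shape[1]
    device = src_feat.device

    # Homogeneous pixel grid of the reference view: [B, 3, H*W]
    y, x = torch.meshgrid(
        torch.arange(H, dtype=torch.float32, device=device),
        torch.arange(W, dtype=torch.float32, device=device),
        indexing="ij")
    pix = torch.stack((x, y, torch.ones_like(x)), dim=0).view(3, -1)
    pix = pix.unsqueeze(0).expand(B, 3, H * W)

    # Back-project pixels to rays, rotate into the source frame, scale by each
    # depth hypothesis, translate, then project with the source intrinsics.
    rays = R_rel @ torch.inverse(K_ref) @ pix                       # [B, 3, H*W]
    pts = rays.unsqueeze(1) * depth_values.view(B, D, 1, 1) \
          + t_rel.view(B, 1, 3, 1)                                  # [B, D, 3, H*W]
    cam = K_src.unsqueeze(1) @ pts                                  # [B, D, 3, H*W]
    xy = cam[:, :, :2] / cam[:, :, 2:3].clamp(min=1e-6)             # [B, D, 2, H*W]

    # Normalize to [-1, 1] and bilinearly sample the source features.
    gx = 2.0 * xy[:, :, 0] / (W - 1) - 1.0
    gy = 2.0 * xy[:, :, 1] / (H - 1) - 1.0
    grid = torch.stack((gx, gy), dim=-1).view(B, D * H, W, 2)
    warped = F.grid_sample(src_feat, grid, mode="bilinear",
                           padding_mode="zeros", align_corners=True)
    return warped.view(B, C, D, H, W)
```

Stacking the warped volumes from all source views against the reference features yields the matching candidates that a cost volume regularization network (such as the 3D stacked hourglass mentioned in the abstract) then filters before depth regression.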