Status: Published
Rethinking 3D cost aggregation in stereo matching
Gan, Wanshui (1,3); Wu, Wenhao (2); Chen, Shifeng (1); Zhao, Yuxiang (1); Wong, Pak Kin (3)
2023-03-01
Source Publication: Pattern Recognition Letters
ISSN: 0167-8655
Volume: 167; Pages: 75-81
Abstract

In stereo matching, 3D convolution networks aggregate the cost volume effectively because they model both the spatial and depth dimensions, but they do so at a high computational cost. In this letter, we revisit the 3D convolution network and its common variant, and propose the Depth Shift Module (DSM), which models the cost volume along the depth dimension and imitates the function of a 3D convolution at the computational complexity of a 2D convolution. The DSM is easy to integrate into existing 3D cost aggregation methods in stereo matching, giving less inference time and lower computational complexity with only a minor loss in precision. Moreover, we propose HybridNet, a novel, compact, and efficient stereo matching framework that combines 2D convolution layers with the DSM to aggregate the cost volume. HybridNet achieves a better trade-off between performance, computational complexity, and model size (e.g., 30% smaller than AANet and 25% smaller than PSMNet) on public datasets (e.g., Scene Flow and KITTI Stereo 2015). The code is available at https://github.com/GANWANSHUI/HybridNet.
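
To make the idea of a depth shift concrete, the following is a minimal NumPy sketch and not the authors' implementation. It assumes the shift works by moving a fraction of the feature channels one step forward or backward along the depth (disparity) axis, so that a later 2D convolution applied per depth slice sees information from neighbouring slices. The function names `depth_shift` and `conv_weight_count`, the parameter `fold_div`, and the zero filling of vacated slices are all hypothetical choices made for illustration; the actual DSM design is in the linked repository.

```python
import numpy as np


def depth_shift(volume, fold_div=8):
    """Shift part of the channels by one step along the depth axis.

    volume: cost volume of shape (C, D, H, W).
    fold_div: hypothetical parameter; C // fold_div channels move toward
    larger depth indices, the same number move toward smaller depth
    indices, and the remaining channels are left untouched.
    """
    fold = volume.shape[0] // fold_div
    out = np.zeros_like(volume)
    # First group: slice d receives slice d - 1 (slice 0 is zero filled).
    out[:fold, 1:] = volume[:fold, :-1]
    # Second group: slice d receives slice d + 1 (last slice is zero filled).
    out[fold:2 * fold, :-1] = volume[fold:2 * fold, 1:]
    # Remaining channels are copied unchanged.
    out[2 * fold:] = volume[2 * fold:]
    return out


def conv_weight_count(c_in, c_out, kernel, dims):
    """Number of weights in a dense convolution layer (bias ignored)."""
    return c_in * c_out * kernel ** dims


# Toy cost volume: 8 channels, 4 depth levels, 2x2 spatial resolution.
volume = np.arange(8 * 4 * 2 * 2, dtype=np.float32).reshape(8, 4, 2, 2)
shifted = depth_shift(volume, fold_div=4)

# Weight counts for a 32-to-32 channel layer with kernel size 3.
weights_3d = conv_weight_count(32, 32, 3, dims=3)
weights_2d = conv_weight_count(32, 32, 3, dims=2)
```

The shift itself has no learnable weights and no multiplications, so under these assumptions the cost of the aggregation layer is that of the 2D convolution that follows it. For kernel size 3, a dense 2D layer has one third of the weights of the corresponding 3D layer.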

Keywords: 3D Convolution; Disparity Estimation; Shift Operation; Stereo Matching
DOI: 10.1016/j.patrec.2023.02.011
Indexed By: SCIE
Language: English
WOS Research Area: Computer Science
WOS Subject: Computer Science, Artificial Intelligence
WOS ID: WOS:000943212600001
Publisher: ELSEVIER, RADARWEG 29, 1043 NX AMSTERDAM, NETHERLANDS
Scopus ID: 2-s2.0-85147733894
Document Type: Journal article
Collection: GRADUATE SCHOOL
Faculty of Science and Technology
DEPARTMENT OF ELECTROMECHANICAL ENGINEERING
Corresponding Author: Chen, Shifeng
Affiliation:
1. Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China
2. Department of Computer Vision Technology (VIS), Baidu Inc., China
3. Department of Electromechanical Engineering, University of Macau, Macau SAR, China
First Author Affiliation: University of Macau
Recommended Citation
GB/T 7714: Gan, Wanshui, Wu, Wenhao, Chen, Shifeng, et al. Rethinking 3D cost aggregation in stereo matching[J]. Pattern Recognition Letters, 2023, 167, 75-81.
APA: Gan, Wanshui, Wu, Wenhao, Chen, Shifeng, Zhao, Yuxiang, & Wong, Pak Kin (2023). Rethinking 3D cost aggregation in stereo matching. Pattern Recognition Letters, 167, 75-81.
MLA: Gan, Wanshui, et al. "Rethinking 3D cost aggregation in stereo matching." Pattern Recognition Letters 167 (2023): 75-81.
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.