Residential Collegefalse
Status已發表Published
Multi-stage Feature Pyramid Stereo Network based Disparity Estimation Approach for Two to Three-dimensional Video Conversion
Pan, B.; Zhang, L.; Wang, H.
2021
Source PublicationIEEE Transactions on Circuits and Systems for Video Technology
ISSN1051-8215
Pages1-14
Abstract

Disparity estimation is a popular topic in computer vision and has drawn increasing attention in recent years. In this3 article, we propose a new multi-stage network for the purpose4 of two to three-dimensional video conversion that contains5 two training stages: an initial disparity estimation as the first training stage and depth-image-based rendering (DIBR) as an extra component to form the second training stage. In the first training stage, we propose a revised end-to-end feature pyramid stereo network, in which the original non-pyramid structure is replaced by a bottom-up convolutional neural network pyramid for disparity regression. It utilizes the spatial information by concatenating different scale features to boost the performance on boundary consistency. Mirror connections between feature extraction and disparity regression on the corresponding layers are also added to improve the quality of the results. In the second stage, we propose an improved disocclusion filling technique in the DIBR branch and connect the non-neural-network method to the disparity estimation network. This two-stage training strategy can work effectively to generate the improved disparity estimation for two to three-dimensional video conversion. Exten21 sive experiments are conducted and some selected state-of-the art algorithms are compared with our proposed approach on the popular KITTI2015 and Scene Flow datasets. The results demonstrate that our estimated disparity map can generate high quality 3D images.

Keyword2d To 3d Video Conversion Neural Network Deep Learning Disparity Estimation Feature Pyramid Depth Image Based Rendering (Dibr)
DOI10.1109/TCSVT.2020.3014053
Language英語English
The Source to ArticlePB_Publication
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Recommended Citation
GB/T 7714
Pan, B.,Zhang, L.,Wang, H.. Multi-stage Feature Pyramid Stereo Network based Disparity Estimation Approach for Two to Three-dimensional Video Conversion[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2021, 1-14.
APA Pan, B.., Zhang, L.., & Wang, H. (2021). Multi-stage Feature Pyramid Stereo Network based Disparity Estimation Approach for Two to Three-dimensional Video Conversion. IEEE Transactions on Circuits and Systems for Video Technology, 1-14.
MLA Pan, B.,et al."Multi-stage Feature Pyramid Stereo Network based Disparity Estimation Approach for Two to Three-dimensional Video Conversion".IEEE Transactions on Circuits and Systems for Video Technology (2021):1-14.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Pan, B.]'s Articles
[Zhang, L.]'s Articles
[Wang, H.]'s Articles
Baidu academic
Similar articles in Baidu academic
[Pan, B.]'s Articles
[Zhang, L.]'s Articles
[Wang, H.]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Pan, B.]'s Articles
[Zhang, L.]'s Articles
[Wang, H.]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.