Status | Published |
Multi-stage Feature Pyramid Stereo Network based Disparity Estimation Approach for Two to Three-dimensional Video Conversion | |
Pan, B.; Zhang, L.; Wang, H. | |
2021 | |
Source Publication | IEEE Transactions on Circuits and Systems for Video Technology |
ISSN | 1051-8215 |
Pages | 1-14 |
Abstract | Disparity estimation is a popular topic in computer vision and has drawn increasing attention in recent years. In this article, we propose a new multi-stage network for the purpose of two to three-dimensional video conversion that contains two training stages: an initial disparity estimation as the first training stage and depth-image-based rendering (DIBR) as an extra component to form the second training stage. In the first training stage, we propose a revised end-to-end feature pyramid stereo network, in which the original non-pyramid structure is replaced by a bottom-up convolutional neural network pyramid for disparity regression. It utilizes the spatial information by concatenating different scale features to boost the performance on boundary consistency. Mirror connections between feature extraction and disparity regression on the corresponding layers are also added to improve the quality of the results. In the second stage, we propose an improved disocclusion filling technique in the DIBR branch and connect the non-neural-network method to the disparity estimation network. This two-stage training strategy can work effectively to generate the improved disparity estimation for two to three-dimensional video conversion. Extensive experiments are conducted and some selected state-of-the-art algorithms are compared with our proposed approach on the popular KITTI2015 and Scene Flow datasets. The results demonstrate that our estimated disparity map can generate high-quality 3D images. |
Keyword | 2D to 3D Video Conversion; Neural Network; Deep Learning; Disparity Estimation; Feature Pyramid; Depth Image Based Rendering (DIBR) |
DOI | 10.1109/TCSVT.2020.3014053 |
Language | English |
The Source to Article | PB_Publication |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Recommended Citation GB/T 7714 | Pan, B., Zhang, L., Wang, H. Multi-stage Feature Pyramid Stereo Network based Disparity Estimation Approach for Two to Three-dimensional Video Conversion[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2021, 1-14. |
APA | Pan, B., Zhang, L., & Wang, H. (2021). Multi-stage Feature Pyramid Stereo Network based Disparity Estimation Approach for Two to Three-dimensional Video Conversion. IEEE Transactions on Circuits and Systems for Video Technology, 1-14. |
MLA | Pan, B., et al. "Multi-stage Feature Pyramid Stereo Network based Disparity Estimation Approach for Two to Three-dimensional Video Conversion." IEEE Transactions on Circuits and Systems for Video Technology (2021): 1-14. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.