UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
ShaDocFormer: A Shadow-Attentive Threshold Detector with Cascaded Fusion Refiner for Document Shadow Removal
Chen, Weiwen1; Lei, Yingtie1; Luo, Shenghong1; Zhou, Ziyang2; Li, Mingxian2; Pun, Chi Man1
2024-09
Conference Name2024 International Joint Conference on Neural Networks, IJCNN 2024
Source PublicationProceedings of the International Joint Conference on Neural Networks
Conference Date30 June 2024 through 5 July 2024
Conference PlaceYokohama, Japan
CountryJapan
PublisherInstitute of Electrical and Electronics Engineers Inc.
Abstract

Document shadow is a common issue that arises when capturing documents using mobile devices, which significantly impacts readability. Current methods encounter various challenges, including inaccurate detection of shadow masks and estimation of illumination. In this paper, we propose ShaDoc-Former, a Transformer-based architecture that integrates traditional methodologies and deep learning techniques to tackle the problem of document shadow removal. The ShaDocFormer architecture comprises two components: the Shadow-attentive Threshold Detector (STD) and the Cascaded Fusion Refiner (CFR). The STD module employs a traditional thresholding technique and leverages the attention mechanism of the Transformer to gather global information, thereby enabling precise detection of shadow masks. The cascaded and aggregative structure of the CFR module facilitates a coarse-to-fine restoration process for the entire image. As a result, ShaDocFormer excels in accurately detecting and capturing variations in both shadow and illumination, thereby enabling effective removal of shadows. Extensive experiments demonstrate that ShaDocFormer outperforms current state-of-the-art methods in both qualitative and quantitative measurements. The code is available at https://github.com/kilito777/ShaDocFormer.

KeywordShadow Removal Text Document Images Vision Transformer
DOI10.1109/IJCNN60899.2024.10651298
URLView the original
Language英語English
Scopus ID2-s2.0-85204971600
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorPun, Chi Man
Affiliation1.University of Macau, Macau, Macao
2.Huizhou University, Huizhou, China
First Author AffilicationUniversity of Macau
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Chen, Weiwen,Lei, Yingtie,Luo, Shenghong,et al. ShaDocFormer: A Shadow-Attentive Threshold Detector with Cascaded Fusion Refiner for Document Shadow Removal[C]:Institute of Electrical and Electronics Engineers Inc., 2024.
APA Chen, Weiwen., Lei, Yingtie., Luo, Shenghong., Zhou, Ziyang., Li, Mingxian., & Pun, Chi Man (2024). ShaDocFormer: A Shadow-Attentive Threshold Detector with Cascaded Fusion Refiner for Document Shadow Removal. Proceedings of the International Joint Conference on Neural Networks.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Chen, Weiwen]'s Articles
[Lei, Yingtie]'s Articles
[Luo, Shenghong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Chen, Weiwen]'s Articles
[Lei, Yingtie]'s Articles
[Luo, Shenghong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Chen, Weiwen]'s Articles
[Lei, Yingtie]'s Articles
[Luo, Shenghong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.