Residential College: false
Status: Published
Difference-guided multi-scale spatial-temporal representation for sign language recognition
Gao, Liqing (1); Hu, Lianyu (1); Lyu, Fan (1); Zhu, Lei (2); Wan, Liang (1); Pun, Chi Man (3); Feng, Wei (1)
2023-07-30
Source Publication: Visual Computer
ISSN: 0178-2789
Volume: 39, Issue: 8, Pages: 3417-3428
Abstract

Sign language recognition (SLR) is a challenging task that requires a thorough understanding of spatial-temporal visual features in order to translate sign language into comprehensible written or spoken language. However, existing SLR methods ignore the importance of key spatial-temporal representation because of its sparsity and inconsistency in space and time. To solve this problem, we present a difference-guided multi-scale spatial-temporal representation (DMST) learning model for SLR. DMST comprises two modules: (1) key spatial-temporal representation, which extracts and enhances key spatial-temporal information using a spatial-temporal difference strategy, and (2) multi-scale sequence alignment, which perceives and fuses multi-scale spatial-temporal features and performs sequence mapping. The DMST model outperforms state-of-the-art methods on four public sign language datasets, demonstrating the superiority of the DMST model and the significance of key spatial-temporal representation for SLR.

Keywords: Key Spatial-temporal Representation; Multi-scale Sequence Alignment; Sign Language Recognition (SLR)
DOI: 10.1007/s00371-023-02979-8
Indexed By: SCIE
Language: English
WOS Research Area: Computer Science
WOS Subject: Computer Science, Software Engineering
WOS ID: WOS:001040330500003
Publisher: SPRINGER, ONE NEW YORK PLAZA, SUITE 4600, NEW YORK, NY 10004, UNITED STATES
Scopus ID: 2-s2.0-85166243912
Document Type: Journal article
Collection: Faculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding Author: Feng, Wei
Affiliation: 1. Tianjin University, Tianjin, China
2. The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China
3. University of Macau, Macao
Recommended Citation
GB/T 7714
Gao, Liqing, Hu, Lianyu, Lyu, Fan, et al. Difference-guided multi-scale spatial-temporal representation for sign language recognition[J]. Visual Computer, 2023, 39(8), 3417-3428.
APA: Gao, Liqing, Hu, Lianyu, Lyu, Fan, Zhu, Lei, Wan, Liang, Pun, Chi Man, & Feng, Wei (2023). Difference-guided multi-scale spatial-temporal representation for sign language recognition. Visual Computer, 39(8), 3417-3428.
MLA: Gao, Liqing, et al. "Difference-guided multi-scale spatial-temporal representation for sign language recognition." Visual Computer 39.8 (2023): 3417-3428.
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.