UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
GENERALIZED UNCERTAINTY-BASED EVIDENTIAL FUSION WITH HYBRID MULTI-HEAD ATTENTION FOR WEAK-SUPERVISED TEMPORAL ACTION LOCALIZATION
He, Yuanpeng1,2; Li, Lijian3; Zhan, Tianxiang3; Jiao, Wenpin1,2; Pun, Chi Man3
2024
Conference NameICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Source PublicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Pages3855-3859
Conference Date14-19 April 2024
Conference PlaceSeoul
CountrySouth Korea
PublisherInstitute of Electrical and Electronics Engineers Inc.
Abstract

Weakly supervised temporal action localization (WS-TAL) is a task of targeting at localizing complete action instances and categorizing them with video-level labels. Action-background ambiguity, primarily caused by background noise resulting from aggregation and intra-action variation, is a significant challenge for existing WS-TAL methods. In this paper, we introduce a hybrid multi-head attention (HMHA) module and generalized uncertainty-based evidential fusion (GUEF) module to address the problem. The proposed HMHA effectively enhances RGB and optical flow features by filtering redundant information and adjusting their feature distribution to better align with the WS-TAL task. Additionally, the proposed GUEF adaptively eliminates the interference of background noise by fusing snippet-level evidences to refine uncertainty measurement and select superior foreground feature information, which enables the model to concentrate on integral action instances to achieve better action localization and classification performance. Experimental results conducted on the THUMOS14 dataset demonstrate that our method outperforms state-of-the-art methods. Our code is available in https://github.com/heyuanpengpku/GUEF/tree/main.

KeywordGeneralized Uncertainty-based Evidential Fusion Hybrid Multi-head Attention Weakly-supervised Temporal Action Localization
DOI10.1109/ICASSP48485.2024.10446799
URLView the original
Language英語English
Scopus ID2-s2.0-85198563796
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorPun, Chi Man
Affiliation1.Key Laboratory of High Confidence Software Technologies (Peking University), Ministry of Education, Beijing, 100871, China
2.School of Computer Science, Peking University, Beijing, 100871, China
3.Department of Computer and Information Science, University of Macau, Macao
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
He, Yuanpeng,Li, Lijian,Zhan, Tianxiang,et al. GENERALIZED UNCERTAINTY-BASED EVIDENTIAL FUSION WITH HYBRID MULTI-HEAD ATTENTION FOR WEAK-SUPERVISED TEMPORAL ACTION LOCALIZATION[C]:Institute of Electrical and Electronics Engineers Inc., 2024, 3855-3859.
APA He, Yuanpeng., Li, Lijian., Zhan, Tianxiang., Jiao, Wenpin., & Pun, Chi Man (2024). GENERALIZED UNCERTAINTY-BASED EVIDENTIAL FUSION WITH HYBRID MULTI-HEAD ATTENTION FOR WEAK-SUPERVISED TEMPORAL ACTION LOCALIZATION. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 3855-3859.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[He, Yuanpeng]'s Articles
[Li, Lijian]'s Articles
[Zhan, Tianxiang]'s Articles
Baidu academic
Similar articles in Baidu academic
[He, Yuanpeng]'s Articles
[Li, Lijian]'s Articles
[Zhan, Tianxiang]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[He, Yuanpeng]'s Articles
[Li, Lijian]'s Articles
[Zhan, Tianxiang]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.