Residential College | false |
Status | 已發表Published |
GENERALIZED UNCERTAINTY-BASED EVIDENTIAL FUSION WITH HYBRID MULTI-HEAD ATTENTION FOR WEAK-SUPERVISED TEMPORAL ACTION LOCALIZATION | |
He, Yuanpeng1,2; Li, Lijian3; Zhan, Tianxiang3; Jiao, Wenpin1,2; Pun, Chi Man3![]() ![]() | |
2024 | |
Conference Name | ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Source Publication | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
![]() |
Pages | 3855-3859 |
Conference Date | 14-19 April 2024 |
Conference Place | Seoul |
Country | South Korea |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Abstract | Weakly supervised temporal action localization (WS-TAL) is a task of targeting at localizing complete action instances and categorizing them with video-level labels. Action-background ambiguity, primarily caused by background noise resulting from aggregation and intra-action variation, is a significant challenge for existing WS-TAL methods. In this paper, we introduce a hybrid multi-head attention (HMHA) module and generalized uncertainty-based evidential fusion (GUEF) module to address the problem. The proposed HMHA effectively enhances RGB and optical flow features by filtering redundant information and adjusting their feature distribution to better align with the WS-TAL task. Additionally, the proposed GUEF adaptively eliminates the interference of background noise by fusing snippet-level evidences to refine uncertainty measurement and select superior foreground feature information, which enables the model to concentrate on integral action instances to achieve better action localization and classification performance. Experimental results conducted on the THUMOS14 dataset demonstrate that our method outperforms state-of-the-art methods. Our code is available in https://github.com/heyuanpengpku/GUEF/tree/main. |
Keyword | Generalized Uncertainty-based Evidential Fusion Hybrid Multi-head Attention Weakly-supervised Temporal Action Localization |
DOI | 10.1109/ICASSP48485.2024.10446799 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85198563796 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Pun, Chi Man |
Affiliation | 1.Key Laboratory of High Confidence Software Technologies (Peking University), Ministry of Education, Beijing, 100871, China 2.School of Computer Science, Peking University, Beijing, 100871, China 3.Department of Computer and Information Science, University of Macau, Macao |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | He, Yuanpeng,Li, Lijian,Zhan, Tianxiang,et al. GENERALIZED UNCERTAINTY-BASED EVIDENTIAL FUSION WITH HYBRID MULTI-HEAD ATTENTION FOR WEAK-SUPERVISED TEMPORAL ACTION LOCALIZATION[C]:Institute of Electrical and Electronics Engineers Inc., 2024, 3855-3859. |
APA | He, Yuanpeng., Li, Lijian., Zhan, Tianxiang., Jiao, Wenpin., & Pun, Chi Man (2024). GENERALIZED UNCERTAINTY-BASED EVIDENTIAL FUSION WITH HYBRID MULTI-HEAD ATTENTION FOR WEAK-SUPERVISED TEMPORAL ACTION LOCALIZATION. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 3855-3859. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment