Residential College | false |
Status | 已發表Published |
Text-Driven Traffic Anomaly Detection with Temporal High-Frequency Modeling in Driving Videos | |
Liang, Rongqin1; Li, Yuanman1![]() ![]() | |
2024-04-17 | |
Source Publication | IEEE Transactions on Circuits and Systems for Video Technology
![]() |
ISSN | 1051-8215 |
Abstract | Traffic anomaly detection (TAD) in driving videos is critical for ensuring the safety of autonomous driving and advanced driver assistance systems. Previous single-stage TAD methods primarily rely on frame prediction, making them vulnerable to interference from dynamic backgrounds induced by the rapid movement of the dashboard camera. While two-stage TAD methods appear to be a natural solution to mitigate such interference by pre-extracting background-independent features (such as bounding boxes and optical flow) using perceptual algorithms, they are susceptible to the performance of first-stage perceptual algorithms and may result in error propagation. In this paper, we introduce TTHF, a novel single-stage method aligning video clips with text prompts, offering a new perspective on traffic anomaly detection. Unlike previous approaches, the supervised signal of our method is derived from languages rather than orthogonal one-hot vectors, providing a more comprehensive representation. Further, concerning visual representation, we propose to model the high frequency of driving videos in the temporal domain. This modeling captures the dynamic changes of driving scenes, enhances the perception of driving behavior, and significantly improves the detection of traffic anomalies. In addition, to better perceive various types of traffic anomalies, we carefully design an attentive anomaly focusing mechanism that visually and linguistically guides the model to adaptively focus on the visual context of interest, thereby facilitating the detection of traffic anomalies. It is shown that our proposed TTHF achieves promising performance, outperforming state-of-the-art competitors by +5.4% AUC on the DoTA dataset and achieving high generalization on the DADA dataset. |
Keyword | Traffic Anomaly Detection Multi-modality Learning High Frequency Attention |
DOI | 10.1109/TCSVT.2024.3390173 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85190722133 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Li, Yuanman |
Affiliation | 1.College of Electronics and Information Engineering, Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen, China 2.State Key Laboratory of Internet of Things for Smart City and the Department of Computer and Information Science, University of Macau, Macau, China |
Recommended Citation GB/T 7714 | Liang, Rongqin,Li, Yuanman,Zhou, Jiantao,et al. Text-Driven Traffic Anomaly Detection with Temporal High-Frequency Modeling in Driving Videos[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024. |
APA | Liang, Rongqin., Li, Yuanman., Zhou, Jiantao., & Li, Xia (2024). Text-Driven Traffic Anomaly Detection with Temporal High-Frequency Modeling in Driving Videos. IEEE Transactions on Circuits and Systems for Video Technology. |
MLA | Liang, Rongqin,et al."Text-Driven Traffic Anomaly Detection with Temporal High-Frequency Modeling in Driving Videos".IEEE Transactions on Circuits and Systems for Video Technology (2024). |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment