Residential College | false |
Status | 已發表Published |
Referring Multi-Object Tracking | |
Wu, Dongming1; Han, Wencheng2; Wang, Tiancai3; Dong, Xingping4; Zhang, Xiangyu3,5; Shen, Jianbing2![]() ![]() | |
2023-08-22 | |
Conference Name | Conference on Computer Vision and Pattern Recognition (CVPR) |
Source Publication | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
![]() |
Volume | 2023-June |
Pages | 14633-14642 |
Conference Date | 17-24 June 2023 |
Conference Place | Vancouver |
Country | Canada |
Publisher | IEEE |
Abstract | Existing referring understanding tasks tend to involve the detection of a single text-referred object. In this paper, we propose a new and general referring understanding task, termed referring multi-object tracking (RMOT). Its core idea is to employ a language expression as a semantic cue to guide the prediction of multi-object tracking. To the best of our knowledge, it is the first work to achieve an arbitrary number of referent object predictions in videos. To push forward RMOT, we construct one benchmark with scalable expressions based on KITTI, named Refer-KITTI. Specifically, it provides 18 videos with 818 expressions, and each expression in a video is annotated with an average of 10.7 objects. Further, we develop a transformer-based architecture TransRMOT to tackle the new task in an online manner, which achieves impressive detection performance and out-performs other counterparts. The Refer-KITTI dataset and the code are released at https://referringmot.github.io. |
Keyword | And Reasoning Language Vision |
DOI | 10.1109/CVPR52729.2023.01406 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence |
WOS ID | WOS:001062522106092 |
Scopus ID | 2-s2.0-85166611315 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Co-First Author | Wu, Dongming |
Corresponding Author | Shen, Jianbing |
Affiliation | 1.Beijing Institute of Technology, China 2.SKL-IOTSC, Cis, University of Macau, Macao 3.Megvii Technology, China 4.School of Computer Science, Wuhan University, China 5.Beijing Academy of Artificial Intelligence, China |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Wu, Dongming,Han, Wencheng,Wang, Tiancai,et al. Referring Multi-Object Tracking[C]:IEEE, 2023, 14633-14642. |
APA | Wu, Dongming., Han, Wencheng., Wang, Tiancai., Dong, Xingping., Zhang, Xiangyu., & Shen, Jianbing (2023). Referring Multi-Object Tracking. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023-June, 14633-14642. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment