Residential Collegefalse
Status已發表Published
Referring Multi-Object Tracking
Wu, Dongming1; Han, Wencheng2; Wang, Tiancai3; Dong, Xingping4; Zhang, Xiangyu3,5; Shen, Jianbing2
2023-08-22
Conference NameConference on Computer Vision and Pattern Recognition (CVPR)
Source PublicationProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume2023-June
Pages14633-14642
Conference Date17-24 June 2023
Conference PlaceVancouver
CountryCanada
PublisherIEEE
Abstract

Existing referring understanding tasks tend to involve the detection of a single text-referred object. In this paper, we propose a new and general referring understanding task, termed referring multi-object tracking (RMOT). Its core idea is to employ a language expression as a semantic cue to guide the prediction of multi-object tracking. To the best of our knowledge, it is the first work to achieve an arbitrary number of referent object predictions in videos. To push forward RMOT, we construct one benchmark with scalable expressions based on KITTI, named Refer-KITTI. Specifically, it provides 18 videos with 818 expressions, and each expression in a video is annotated with an average of 10.7 objects. Further, we develop a transformer-based architecture TransRMOT to tackle the new task in an online manner, which achieves impressive detection performance and out-performs other counterparts. The Refer-KITTI dataset and the code are released at https://referringmot.github.io.

KeywordAnd Reasoning Language Vision
DOI10.1109/CVPR52729.2023.01406
URLView the original
Indexed ByCPCI-S
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence
WOS IDWOS:001062522106092
Scopus ID2-s2.0-85166611315
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionTHE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Co-First AuthorWu, Dongming
Corresponding AuthorShen, Jianbing
Affiliation1.Beijing Institute of Technology, China
2.SKL-IOTSC, Cis, University of Macau, Macao
3.Megvii Technology, China
4.School of Computer Science, Wuhan University, China
5.Beijing Academy of Artificial Intelligence, China
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Wu, Dongming,Han, Wencheng,Wang, Tiancai,et al. Referring Multi-Object Tracking[C]:IEEE, 2023, 14633-14642.
APA Wu, Dongming., Han, Wencheng., Wang, Tiancai., Dong, Xingping., Zhang, Xiangyu., & Shen, Jianbing (2023). Referring Multi-Object Tracking. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023-June, 14633-14642.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wu, Dongming]'s Articles
[Han, Wencheng]'s Articles
[Wang, Tiancai]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wu, Dongming]'s Articles
[Han, Wencheng]'s Articles
[Wang, Tiancai]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wu, Dongming]'s Articles
[Han, Wencheng]'s Articles
[Wang, Tiancai]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.