Status: Published
Grounding Commands for Autonomous Vehicles via Layer Fusion with Region-specific Dynamic Layer Attention
Hou Pong Chan; Mingxi Guo; Cheng-Zhong Xu
2022
Conference Name: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Source Publication: IEEE International Conference on Intelligent Robots and Systems
Volume: 2022-October
Pages: 12464-12470
Conference Date: October 23-27, 2022
Conference Place: Kyoto, Japan
Country: Japan
Publisher: Institute of Electrical and Electronics Engineers Inc.
Abstract

Grounding a command to the visual environment is an essential ingredient for interactions between autonomous vehicles and humans. In this work, we study the problem of language grounding for autonomous vehicles, which aims to localize a region in a visual scene according to a natural language command from a passenger. Prior work only employs the top layer representations of a vision-and-language pretrained model to predict the region referred to by the command. However, such a method omits the useful features encoded in other layers, and thus results in inadequate understanding of the input scene and command. To tackle this limitation, we present the first layer fusion approach for this task. Since different visual regions may require distinct types of features to disambiguate them from each other, we further propose the region-specific dynamic (RSD) layer attention to adaptively fuse the multimodal information across layers for each region. Extensive experiments on the Talk2Car benchmark demonstrate that our approach helps predict more accurate regions and outperforms state-of-the-art methods.
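
The idea of fusing layers with region-specific attention can be illustrated with a minimal sketch. The abstract does not give the scoring function or the model architecture, so everything concrete here is an assumption for illustration: the function name `fuse_layers_per_region`, the single `query` vector standing in for learned parameters, and the dot-product scoring are hypothetical and are not the authors' implementation. The sketch only shows the general pattern: each region gets its own softmax weights over layers, and its fused feature is the weighted sum of its per-layer features.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def fuse_layers_per_region(layer_feats, query):
    """Fuse per-layer features separately for every region.

    layer_feats[l][r] is the feature vector of region r at layer l.
    query is a hypothetical scoring vector (an assumption; a trained
    model would learn its own scoring parameters).
    Returns (fused, weights): fused[r] is the fused vector of region r,
    weights[r] is that region's attention distribution over layers.
    """
    num_layers = len(layer_feats)
    num_regions = len(layer_feats[0])
    fused, weights = [], []
    for r in range(num_regions):
        # Score each layer's representation of this region.
        scores = [dot(layer_feats[l][r], query) for l in range(num_layers)]
        w = softmax(scores)  # region-specific weights over layers
        dim = len(layer_feats[0][r])
        # Weighted sum of this region's features across layers.
        vec = [
            sum(w[l] * layer_feats[l][r][d] for l in range(num_layers))
            for d in range(dim)
        ]
        fused.append(vec)
        weights.append(w)
    return fused, weights
```

Calling `fuse_layers_per_region(feats, query)` on two layers and two regions returns one weight distribution per region, so two regions can draw on different layers even though they share the same scoring vector. Using only the top layer, as the abstract says prior work does, corresponds to putting all weight on the last layer for every region.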

DOI: 10.1109/IROS47612.2022.9981515
Indexed By: CPCI-S
Funding Project: Research on Key Technologies and Platforms for Collaborative Intelligence Driven Auto-driving Cars; Efficient Integration and Dynamic Cognitive Technology and Platform for Urban Public Services
Language: English
WOS ID: WOS:000909405303123
Scopus ID: 2-s2.0-85146332389
Document Type: Conference paper
Collection: DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Faculty of Science and Technology
Corresponding Author: Cheng-Zhong Xu
Affiliation: University of Macau, Department of Computer and Information Science, Macao
First Author Affiliation: University of Macau
Corresponding Author Affiliation: University of Macau
Recommended Citation
GB/T 7714: Hou Pong Chan, Mingxi Guo, Cheng-Zhong Xu. Grounding Commands for Autonomous Vehicles via Layer Fusion with Region-specific Dynamic Layer Attention[C]: Institute of Electrical and Electronics Engineers Inc, 2022, 12464-12470.
APA: Hou Pong Chan, Mingxi Guo, & Cheng-Zhong Xu (2022). Grounding Commands for Autonomous Vehicles via Layer Fusion with Region-specific Dynamic Layer Attention. IEEE International Conference on Intelligent Robots and Systems, 2022-October, 12464-12470.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.