Residential College | false |
Status | Published |
Title | Grounding Commands for Autonomous Vehicles via Layer Fusion with Region-specific Dynamic Layer Attention |
Authors | Hou Pong Chan; Mingxi Guo; Cheng-Zhong Xu |
Year | 2022 |
Conference Name | IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) |
Source Publication | IEEE International Conference on Intelligent Robots and Systems |
Volume | 2022-October |
Pages | 12464-12470 |
Conference Date | OCT 23-27, 2022 |
Conference Place | Kyoto, JAPAN |
Country | JAPAN |
Publisher | Institute of Electrical and Electronics Engineers Inc |
Abstract | Grounding a command to the visual environment is an essential ingredient for interactions between autonomous vehicles and humans. In this work, we study the problem of language grounding for autonomous vehicles, which aims to localize a region in a visual scene according to a natural language command from a passenger. Prior work only employs the top layer representations of a vision-and-language pretrained model to predict the region referred to by the command. However, such a method omits the useful features encoded in other layers, and thus results in inadequate understanding of the input scene and command. To tackle this limitation, we present the first layer fusion approach for this task. Since different visual regions may require distinct types of features to disambiguate them from each other, we further propose the region-specific dynamic (RSD) layer attention to adaptively fuse the multimodal information across layers for each region. Extensive experiments on the Talk2Car benchmark demonstrate that our approach helps predict more accurate regions and outperforms state-of-the-art methods. |
DOI | 10.1109/IROS47612.2022.9981515 |
Indexed By | CPCI-S |
Funding Project | Research on Key Technologies and Platforms for Collaborative Intelligence Driven Auto-driving Cars ; Efficient Integration and Dynamic Cognitive Technology and Platform for Urban Public Services |
Language | English |
WOS ID | WOS:000909405303123 |
Scopus ID | 2-s2.0-85146332389 |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology |
Corresponding Author | Cheng-Zhong Xu |
Affiliation | University of Macau, Department of Computer and Information Science, Macao |
First Author Affiliation | University of Macau |
Corresponding Author Affiliation | University of Macau |
Recommended Citation GB/T 7714 | Hou Pong Chan, Mingxi Guo, Cheng-Zhong Xu. Grounding Commands for Autonomous Vehicles via Layer Fusion with Region-specific Dynamic Layer Attention[C]: Institute of Electrical and Electronics Engineers Inc, 2022, 12464-12470. |
APA | Hou Pong Chan, Mingxi Guo, & Cheng-Zhong Xu (2022). Grounding Commands for Autonomous Vehicles via Layer Fusion with Region-specific Dynamic Layer Attention. IEEE International Conference on Intelligent Robots and Systems, 2022-October, 12464-12470. |